Agentic RL Researcher – Distributed Computing

Huawei Technologies Canada Co., Ltd. • Markham, ON, CA
30+ days ago
Job type
  • Permanent
Job description

Huawei Canada has an immediate permanent opening for a Researcher.

About the team:

The Distributed Data Storage and Management Lab leads research in distributed data systems, aiming to develop next-generation cloud serverless products that encompass core infrastructure and databases. This lab addresses various data challenges, including cloud-native disaggregated databases, pay-by-query user models, and optimizing low-level data transfers via RDMA. Teams within this lab create advanced cloud serverless data infrastructure and implement cutting-edge networking technologies for Huawei's global AI infrastructure.


About the job:

  • Design and develop advanced Agentic Reinforcement Learning (RL) and Multi-Agent Reinforcement Learning (MARL) algorithms for cooperative, competitive, and mixed-agent environments, including CTDE, decentralized learning, and hierarchical agent systems.

  • Build scalable simulation and training platforms for large-scale agent systems, supporting self-play, population-based training, curriculum learning, and emergent behavior analysis.

  • Optimize multi-agent learning performance on distributed compute clusters, improving sample efficiency, credit assignment, agent coordination, communication learning, and training stability.

  • Research and prototype new approaches for multi-agent intelligence, including communication protocols, credit assignment, game-theoretic learning dynamics, meta-learning, and adaptive agent populations.

  • Translate cutting-edge research in agentic AI and MARL into production-ready systems for real-world or high-fidelity simulated environments.

  • Develop benchmarking frameworks and evaluation metrics for agent coordination, robustness, scalability, and safety.

  • Collaborate with research, infrastructure, and product teams to deploy scalable agentic learning systems in real-world applications.

  • Contribute to technical leadership and innovation through publications, patents, open-source contributions, and conference presentations.

The total target annual compensation for this position ranges from $106,000 to $156,000 depending on education, experience, and demonstrated expertise.



About the ideal candidate:

  • MS or PhD in Computer Science, Electrical Engineering, or a related field, with a focus on Reinforcement Learning, Multi-Agent Systems, Agentic AI, or Distributed AI.

  • Strong expertise in reinforcement learning algorithms, particularly in multi-agent settings (e.g., policy gradients, value-based methods, CTDE, credit assignment, and coordination in non-stationary environments).

  • Solid foundations in optimization, probability, and game theory, with the ability to design and analyze complex learning systems.

  • Experience building scalable RL training infrastructure, including distributed rollouts, large-scale simulation, and experiment pipelines.

  • Strong programming skills in Python and/or C++, with experience developing high-performance or distributed ML systems.

  • Demonstrated impact through research publications, open-source contributions, patents, or production ML systems in reinforcement learning, multi-agent learning, or large-scale AI systems.

Additional Information:

Huawei Canada is committed to a fair, inclusive, and accessible recruitment process. If you require accommodation during any stage of the hiring process, please let us know and we will work with you to meet your needs.

All applications for this position are reviewed directly by our hiring team; we do not use artificial intelligence tools to screen or select candidates.
