Talent.com
Huawei Technologies Canada Co., Ltd.
Researcher - Reinforcement LearningHuawei Technologies Canada Co., Ltd. • Edmonton, Alberta, CA
Researcher - Reinforcement Learning

Researcher - Reinforcement Learning

Huawei Technologies Canada Co., Ltd. • Edmonton, Alberta, CA
30+ days ago
Job type
  • Temporary
Job description

Job description

Huawei Canada has an immediate 12-month contract opening for a Reinforcement Learning Researcher.


About the team:

Founded in 2012, the Noah’s Ark lab has evolved into a prominent research organization with notable achievements in academia and industry. The lab’s mission focuses on advancing artificial intelligence and related fields to benefit the company and society. Driven by impactful, long-term projects, the aim is to enhance state-of-the-art research while integrating innovations into the company's products and services, including LLMs, RL, NLP, computer vision, AI theory, and Autonomous driving.

About the job:

  • Enabling Large Language Models (LLMs) to learn from experience, interaction, and environment feedback, moving beyond static fine-tuning toward continual, agentic self-improvement.

  • LLM post-training paradigms (e.g., RLHF, GRPO, reward-free methods, etc.).

  • Agentic reinforcement learning for tool-using and browsing-based LLMs trained in interactive environments.

  • Agentic evaluation and benchmarking, including design of multi-turn, verifiable reasoning tasks.

  • Your work will involve implementing and evaluating new training and evaluation pipelines for reasoning-enhanced LLMs and tool-using agents, scaling experiments on large GPU clusters, and contributing to scientific insights and publications in this emerging area.


Job requirements

About the ideal candidate:

  • PhD degree in Computer Science or related fields or master's degree with comparable experience.

  • Strong foundation in deep learning, including architectures such as Transformers and optimization techniques for large models.

  • Practical or research experience in reinforcement learning, self-supervised learning, or language model fine-tuning.

  • Proven research record in AI by having at least one paper as the first author in top tier venues, such as NeurIPS, ICML, ICLR, CVPR, ICCV, ECCV, ICRA.

  • Solid proficiency in Python and experience with PyTorch, DeepSpeed, Megatron and other distributed training frameworks.

  • Familiarity with LLM post-training pipelines (RLHF, GRPO/PPO, SFT, LoRA, MoE, etc.) is an asset.

  • Experience with multi-agent RL, tool-use / browser/coding agents, is an asset.

  • Strong communication and writing skills; enthusiasm for open research and collaborative problem-solving.

Huawei aims to support a French-speaking work environment for its employees in Quebec. We have taken steps to avoid requiring a language other than French for this position. However, proficiency in English is essential for this role for the following reasons:

The person will be required to communicate regularly with colleagues located outside Quebec, where English is the primary language used for communication between offices. In addition, the nature of the tasks related to this position, which falls within a highly specialized field of artificial intelligence, also requires knowledge of English.

Create a job alert for this search

Researcher - Reinforcement Learning • Edmonton, Alberta, CA

Similar jobs

Remote Prospect Researcher - Fundraising (Part-Time)

BullyingCanadaEdmonton, Division No. 11, CA
Remote
Part-time +1

A national charity focused on bullying prevention is seeking a part-time Fundraising Prospect Researcher to support their fundraising efforts.This remote position involves researching prospects and... Show more

 • Promoted

Search Consultant - Remote

Berkner Groupedmonton, ab, ca
Remote
Full-time

Berkner Group is a specialized search firm focused on building leadership and technical teams for companies across climate, deep tech, and other innovation-driven sectors.We work closely with found... Show more

 • Promoted

Fundraising Prospect Researcher

BullyingCanadaEdmonton, Division No. 11, CA
Part-time +1

Registered charity BullyingCanada Inc.Fundraising Prospect Researcher to join our national team for a short-term contract starting.August 9, 2021 and ending November 30, 2021.This role will play a ... Show more

 • Promoted

Search Consultant - Remote - Berkner Group

Berkner Groupedmonton, ab, ca
Remote
Full-time

Berkner Group is a specialized search firm focused on building leadership and technical teams for companies across climate, deep tech, and other innovation-driven sectors.We work closely with found... Show more

 • Promoted

RevOps Practice Lead

MergeYourDataedmonton, ab, ca
Full-time

MergeYourData is a RevOps consultancy and Top 0.HubSpot Partner globally, currently growing 150% YoY.We work with mid-market B2B companies and multi-company organizations who need their CRM to func... Show more

 • Promoted

Remote Role for Recreation Specialist in AI Innovation

MercorEdmonton, Division No. 11, CA
Remote
Full-time

Take on a Remote Recreation Specialist position, enhancing AI research through your expertise and collaborative spirit.Create impactful deliverables and work asynchronously with dedicated research ... Show more

 • Promoted

Survey Taker: Earn up to $25 per survey (Remote)

Earn HausBeaumont, AB, CA
Remote
Full-time +1

Looking for people to participate in taking online surveys for Fortune 500 brands.All you need to do is complete online surveys by sharing your opinion.You will help influence brand decisions on se... Show more

 • Promoted

Online Survey Participant: Work Remote and Earn Up To $25 Per Survey

Earn HausBeaumont, AB, CA
Remote
Full-time +1

Looking for people to participate in taking online surveys for Fortune 500 brands.All you need to do is complete online surveys by sharing your opinion.You will help influence brand decisions on se... Show more

 • Promoted

Research Director, Software Channels

IDG (International Data Group)edmonton, ab, ca
Part-time

The Research Director for Software Channels & Ecosystems is a senior role covering channels and ecosystems specific to software-centric channels and ecosystems, and also all the external factors th... Show more

 • Promoted

Clinical Research Contracts Lead- Canada Remote

ICON Strategic Solutionsedmonton, ab, ca
Remote
Full-time

ICON plc is a world-leading healthcare intelligence and clinical research organization.We’re proud to foster an inclusive environment driving innovation and excellence, and we welcome you to join u... Show more

 • Promoted

Quantitative User Researcher - Mozilla Corporation

Mozilla CorporationEdmonton, Division No. 11, CA
Full-time

Shape product strategy at Mozilla Corporation as a Senior Staff Quantitative User Researcher, focusing on user insights and data analytics.Collaborate with multidisciplinary teams to influence key ... Show more

 • Promoted

Senior UX Researcher, Connected Stores (Remote)

InstacartEdmonton, Division No. 11, CA
Remote
Full-time

A leading grocery technology company in Canada is seeking a Senior User Researcher II to drive impactful research aiming to modernize in-store operations and enhance grocery shopping experiences.Th... Show more

 • Promoted

Researcher - Reinforcement Learning

Huawei Technologies Canada Co., Ltd.Edmonton, Division No. 11, CA
Temporary

Huawei Canada has an immediate 12-month contract opening for a Reinforcement Learning Researcher.Founded in 2012, the Noah’s Ark lab has evolved into a prominent research organization with notable ... Show more

 • Promoted

Remote Research Analyst - Market Rent Analytics

AccuritycanadaEdmonton, Division No. 11, CA
Remote
Full-time

A national appraisal firm is looking for appraiser trainees and real estate professionals to support market rent analysis on a flexible schedule.This role offers per-file or hourly compensation and... Show more

 • Promoted

Clinical Research Contracts Lead- Canada Remote - ICON Strategic Solutions

ICON Strategic Solutionsedmonton, ab, ca
Remote
Full-time

ICON plc is a world-leading healthcare intelligence and clinical research organization.We’re proud to foster an inclusive environment driving innovation and excellence, and we welcome you to join u... Show more

 • Promoted

AI Implementation and Research Director

Info-Tech Research GroupEdmonton, Division No. 11, CA
Full-time

Shape the future of applied AI by directing client engagements and system prototyping.This position melds hands-on delivery with innovative research to enhance AI applications.The AI Implementation... Show more

 • Promoted

Remote Recreation Domain Expert for AI Research (Contract)

MercorEdmonton, Division No. 11, CA
Remote
Full-time

A leading AI talent agency is seeking Recreation Workers for a remote contract position lasting 3–4 weeks.Candidates should have at least 4 years of professional experience and excellent written co... Show more

 • Promoted

Research Director, Software Channels - edmonton

IDG (International Data Group)edmonton, ab, ca
Part-time

The Research Director for Software Channels & Ecosystems is a senior role covering channels and ecosystems specific to software-centric channels and ecosystems, and also all the external factors th... Show more

 • Promoted

Reinforcement Learning Engineer

Huawei CanadaEdmonton, Division No. 11, CA
Temporary

Excel in a dynamic environment as a Reinforcement Learning Engineer.Focus on designing and fine-tuning scalable ML infrastructure for cutting-edge recommendation systems and AI models.In this 12-mo... Show more

 • Promoted

Research Analyst

Pivotal Research Inc.edmonton, ab, ca
Full-time

We are seeking a Research Analyst to support a diverse portfolio of projects across public policy, evaluation, and market research.The role involves contributing across the full research process, f... Show more