Talent.com
Researcher - Reinforcement Learning
Researcher - Reinforcement LearningHuawei Technologies Canada Co., Ltd. • Edmonton, Alberta, CA
Researcher - Reinforcement Learning

Researcher - Reinforcement Learning

Huawei Technologies Canada Co., Ltd. • Edmonton, Alberta, CA
30+ days ago
Job type
  • Temporary
Job description

Job description

Huawei Canada has an immediate 12-month contract opening for a Reinforcement Learning Researcher.

About the team :

Founded in 2012, the Noah’s Ark lab has evolved into a prominent research organization with notable achievements in academia and industry. The lab’s mission focuses on advancing artificial intelligence and related fields to benefit the company and society. Driven by impactful, long-term projects, the aim is to enhance state-of-the-art research while integrating innovations into the company's products and services, including LLMs, RL, NLP, computer vision, AI theory, and Autonomous driving.

About the job :

Enabling Large Language Models (LLMs) to learn from experience, interaction, and environment feedback, moving beyond static fine-tuning toward continual, agentic self-improvement.

LLM post-training paradigms (e.g., RLHF, GRPO, reward-free methods, etc.).

Agentic reinforcement learning for tool-using and browsing-based LLMs trained in interactive environments.

Agentic evaluation and benchmarking, including design of multi-turn, verifiable reasoning tasks.

Your work will involve implementing and evaluating new training and evaluation pipelines for reasoning-enhanced LLMs and tool-using agents, scaling experiments on large GPU clusters, and contributing to scientific insights and publications in this emerging area.

Job requirements

About the ideal candidate :

PhD degree in Computer Science or related fields or master's degree with comparable experience.

Strong foundation in deep learning, including architectures such as Transformers and optimization techniques for large models.

Practical or research experience in reinforcement learning, self-supervised learning, or language model fine-tuning.

Proven research record in AI by having at least one paper as the first author in top tier venues, such as NeurIPS, ICML, ICLR, CVPR, ICCV, ECCV, ICRA.

Solid proficiency in Python and experience with PyTorch, DeepSpeed, Megatron and other distributed training frameworks.

Familiarity with LLM post-training pipelines (RLHF, GRPO / PPO, SFT, LoRA, MoE, etc.) is an asset.

Experience with multi-agent RL, tool-use / browser / coding agents, is an asset.

Strong communication and writing skills; enthusiasm for open research and collaborative problem-solving.

Huawei aims to support a French-speaking work environment for its employees in Quebec. We have taken steps to avoid requiring a language other than French for this position. However, proficiency in English is essential for this role for the following reasons :

The person will be required to communicate regularly with colleagues located outside Quebec, where English is the primary language used for communication between offices. In addition, the nature of the tasks related to this position, which falls within a highly specialized field of artificial intelligence, also requires knowledge of English.

Create a job alert for this search

Researcher Reinforcement Learning • Edmonton, Alberta, CA

Similar jobs
Remote R Engineer - AI Trainer

Remote R Engineer - AI Trainer

SuperAnnotate • Edmonton, Alberta, CA
Remote
Full-time
As a remote, hourly paid R Engineer, you will review AI-generated responses and generate high-quality R and data-analysis-focused content, evaluating the reasoning quality and step-by-step problem-...Show more
Last updated: 30+ days ago
Product Research Specialist

Product Research Specialist

LawDepot LLC. • Edmonton
Full-time +1
Join one of the fastest growing companies in Canada! LawDepot is proud to be a seven-time Growth 500 ranked organization and a major player in the Global legal solutions industry.Our mission is to ...Show more
Last updated: 15 days ago • Promoted
Director – Education, Training and Research

Director – Education, Training and Research

Albertametis • Edmonton
Full-time +1
Director – Education, Training and Research.Location : Central Office, Edmonton, AB.Closing Date : Until Suitable Candidate Found. Position Status : Full-time (40 hours / week) Permanent.Rupertsland Inst...Show more
Last updated: 17 days ago • Promoted
AI Learning Supervising Associate (18-month contract)

AI Learning Supervising Associate (18-month contract)

EY • Edmonton
Full-time +1
At EY, we’re all in to shape your future with confidence.We’ll help you succeed in a globally connected powerhouse of diverse teams and take your career wherever you want it to go.Join EY and help ...Show more
Last updated: 12 days ago • Promoted
Sr. Bilingual Clinical Research Associate - Oncology (Sponsor Dedicated)

Sr. Bilingual Clinical Research Associate - Oncology (Sponsor Dedicated)

ICON Strategic Solutions • edmonton, AB, ca
Full-time
CRA you will be joining the world’s largest & most comprehensive clinical research organisation, powered by healthcare intelligence.What you will be doing : ...Show more
Last updated: 1 day ago • Promoted
Researcher-Chercheur - FPInnovations

Researcher-Chercheur - FPInnovations

FPInnovations • edmonton, ab, ca
Full-time
Colombie Britanique, Québec, Alberta.Chez FPInnovations, nous n’offrons pas seulement des emplois, nous proposons des possibilités de carrière qui vous permettront de contribuer de façon significat...Show more
Last updated: 1 day ago • Promoted
Medical Science Liaison - Brunel

Medical Science Liaison - Brunel

Brunel • edmonton, ab, ca
Full-time
Alberta, Ontario, or Quebec (Field-Based, Remote).We are currently hiring a Medical Science Liaison (MSL) who can serve as a trusted scientific expert and strategic field partner, building strong r...Show more
Last updated: 19 days ago • Promoted
Founder - Loud Solutions

Founder - Loud Solutions

Loud Solutions • edmonton, ab, ca
Full-time
Loud has partnered with a well-capitalized, highly active VC deploying capital into AI-driven businesses across large, legacy industries. What’s missing is the right person to steer the ship.We are ...Show more
Last updated: 19 days ago • Promoted
Performance Strategist - Meta Ads (1769503904-T1-CA-S1) Remote Canada

Performance Strategist - Meta Ads (1769503904-T1-CA-S1) Remote Canada

HireHawk • Edmonton, AB, CA
Remote
Full-time
Quick Apply
Job Type : Full-time, long-term contractor.Schedule : Full-time, aligned with U.Join a high-growth beauty brand as Creative Strategist, Paid Media, where you'll own and scale performance creativ...Show more
Last updated: 1 day ago
Recruitment Research Associate

Recruitment Research Associate

Executrade – Your Recruitment Specialists • Edmonton, AB, Canada
Full-time +1
Recruitment Research Associate.Executrade is a trusted leader in professional recruitment and talent advisory services, proudly serving organizations across Western Canada for over five decades.Wit...Show more
Last updated: 15 days ago • Promoted
𝗥𝗲𝘀𝗲𝗮𝗿𝗰𝗵 𝗜𝗻𝘁𝗲𝗿𝗻𝘀 – 𝗡𝗲𝘂𝗿𝗼𝘀𝗰𝗶𝗲𝗻𝗰𝗲, 𝗣𝘀𝘆𝗰𝗵𝗼𝗹𝗼𝗴𝘆, 𝗖𝗼𝗴𝗻𝗶𝘁𝗶𝘃𝗲 𝗦𝗰𝗶𝗲𝗻𝗰𝗲, 𝗠𝗲𝗻𝘁𝗮𝗹 𝗛𝗲𝗮𝗹𝘁𝗵 & 𝗔𝗰𝗰𝗲𝘀𝘀𝗶𝗯𝗶𝗹𝗶𝘁𝘆 - Findora AI

𝗥𝗲𝘀𝗲𝗮𝗿𝗰𝗵 𝗜𝗻𝘁𝗲𝗿𝗻𝘀 – 𝗡𝗲𝘂𝗿𝗼𝘀𝗰𝗶𝗲𝗻𝗰𝗲, 𝗣𝘀𝘆𝗰𝗵𝗼𝗹𝗼𝗴𝘆, 𝗖𝗼𝗴𝗻𝗶𝘁𝗶𝘃𝗲 𝗦𝗰𝗶𝗲𝗻𝗰𝗲, 𝗠𝗲𝗻𝘁𝗮𝗹 𝗛𝗲𝗮𝗹𝘁𝗵 & 𝗔𝗰𝗰𝗲𝘀𝘀𝗶𝗯𝗶𝗹𝗶𝘁𝘆 - Findora AI

Findora AI • edmonton, ab, ca
Full-time
We’re seeking motivated 𝗥𝗲𝘀𝗲𝗮𝗿𝗰𝗵 𝗜𝗻𝘁𝗲𝗿𝗻𝘀 to contribute to projects in : .Human-computer interaction and inclusive design. Neuropsychological and cognitive test development and validatio...Show more
Last updated: 2 days ago • Promoted
Canada Impact+ Research Chairs (Impact+)

Canada Impact+ Research Chairs (Impact+)

University of Alberta • Edmonton
Full-time
The University of Alberta invites applications from outstanding, internationally based researchers for the Canada Impact+ Research Chairs (Impact+) Competition — a landmark national initiative desi...Show more
Last updated: 17 days ago • Promoted
AI Learning Programs Lead

AI Learning Programs Lead

EY • Edmonton
Full-time
A global professional services firm in Edmonton is seeking a Supervising Associate to manage AI learning programs.Your role involves designing and delivering innovative learning solutions, collabor...Show more
Last updated: 12 days ago • Promoted
Research Analyst (Finance)

Research Analyst (Finance)

Ballad Consulting Group • Edmonton, AB, Canada
Full-time
The Ballad team is comprised of business professionals, strengthening the communities in which we live and work.Our diverse team brings a wealth of knowledge, experience, and creativity to each pro...Show more
Last updated: 22 days ago • Promoted
Senior Statistical Programmer

Senior Statistical Programmer

Warman O'Brien • edmonton, ab, ca
Full-time
Senior / Principal Statistical Programmer | Small CRO | Remote.We're partnered with a small CRO who are experiencing a large amount of growth within Biometrics. As a Senior Statistical Programmer, you...Show more
Last updated: 30+ days ago • Promoted
Product Research & Growth Specialist (Hybrid)

Product Research & Growth Specialist (Hybrid)

LawDepot • Edmonton
Full-time
A leading legal solutions provider in Edmonton is looking for a Product Research Specialist to enhance web products and user experience. The role involves data analysis, project execution, and colla...Show more
Last updated: 17 days ago • Promoted
Collaborative Clinical Trials Research Coordinator

Collaborative Clinical Trials Research Coordinator

Primesiteresearch • Edmonton
Full-time
A clinical research consultancy in Canada is seeking a research coordinator to manage and oversee clinical trials.Candidates should possess a degree and have 2-3 years of relevant experience.Succes...Show more
Last updated: 8 days ago • Promoted
Director – Education, Training and Research

Director – Education, Training and Research

Rupertsland University • Edmonton
Full-time +1
Director – Education, Training and Research.Location : Central Office, Edmonton, AB.Closing Date : Until Suitable Candidate Found. Position Status : Full-time (40 hours / week) Permanent.Rupertsland Inst...Show more
Last updated: 17 days ago • Promoted