Talent.com
Researcher - Reinforcement Learning
Researcher - Reinforcement LearningHuawei Technologies Canada Co., Ltd. • Edmonton, Alberta, CA
Researcher - Reinforcement Learning

Researcher - Reinforcement Learning

Huawei Technologies Canada Co., Ltd. • Edmonton, Alberta, CA
Il y a plus de 30 jours
Type de contrat
  • Temporaire
Description de poste

Job description

Huawei Canada has an immediate 12-month contract opening for a Reinforcement Learning Researcher.


About the team:

Founded in 2012, the Noah’s Ark lab has evolved into a prominent research organization with notable achievements in academia and industry. The lab’s mission focuses on advancing artificial intelligence and related fields to benefit the company and society. Driven by impactful, long-term projects, the aim is to enhance state-of-the-art research while integrating innovations into the company's products and services, including LLMs, RL, NLP, computer vision, AI theory, and Autonomous driving.

About the job:

  • Enabling Large Language Models (LLMs) to learn from experience, interaction, and environment feedback, moving beyond static fine-tuning toward continual, agentic self-improvement.

  • LLM post-training paradigms (e.g., RLHF, GRPO, reward-free methods, etc.).

  • Agentic reinforcement learning for tool-using and browsing-based LLMs trained in interactive environments.

  • Agentic evaluation and benchmarking, including design of multi-turn, verifiable reasoning tasks.

  • Your work will involve implementing and evaluating new training and evaluation pipelines for reasoning-enhanced LLMs and tool-using agents, scaling experiments on large GPU clusters, and contributing to scientific insights and publications in this emerging area.


Job requirements

About the ideal candidate:

  • PhD degree in Computer Science or related fields or master's degree with comparable experience.

  • Strong foundation in deep learning, including architectures such as Transformers and optimization techniques for large models.

  • Practical or research experience in reinforcement learning, self-supervised learning, or language model fine-tuning.

  • Proven research record in AI by having at least one paper as the first author in top tier venues, such as NeurIPS, ICML, ICLR, CVPR, ICCV, ECCV, ICRA.

  • Solid proficiency in Python and experience with PyTorch, DeepSpeed, Megatron and other distributed training frameworks.

  • Familiarity with LLM post-training pipelines (RLHF, GRPO/PPO, SFT, LoRA, MoE, etc.) is an asset.

  • Experience with multi-agent RL, tool-use / browser/coding agents, is an asset.

  • Strong communication and writing skills; enthusiasm for open research and collaborative problem-solving.

Huawei aims to support a French-speaking work environment for its employees in Quebec. We have taken steps to avoid requiring a language other than French for this position. However, proficiency in English is essential for this role for the following reasons:

The person will be required to communicate regularly with colleagues located outside Quebec, where English is the primary language used for communication between offices. In addition, the nature of the tasks related to this position, which falls within a highly specialized field of artificial intelligence, also requires knowledge of English.

Créer une alerte emploi pour cette recherche

Researcher - Reinforcement Learning • Edmonton, Alberta, CA

Offres similaires
Market Research Insights Manager - Qualitative

Market Research Insights Manager - Qualitative

Kynetec • edmonton, ab, ca
Temps plein
Kynetec is the global leader in agricultural and animal health market insights.We have a long history of market research expertise, specialising in animal health and nutrition, crop protection, far...Voir plus
Dernière mise à jour : il y a 8 jours • Offre sponsorisée
Research Assistant

Research Assistant

ASBB Economics and Research • edmonton, ab, ca
Temps plein +1
ASBB Economics and Research Ltd is a social and economic research advisory dedicated to driving impactful public policy discussions.Founded by Mani, a seasoned economist with global experience, the...Voir plus
Dernière mise à jour : il y a 15 jours • Offre sponsorisée
Research Assistant - ASBB Economics and Research

Research Assistant - ASBB Economics and Research

ASBB Economics and Research • edmonton, ab, ca
Temps plein +1
ASBB Economics and Research Ltd is a social and economic research advisory dedicated to driving impactful public policy discussions.Founded by Mani, a seasoned economist with global experience, the...Voir plus
Dernière mise à jour : il y a 15 jours • Offre sponsorisée
AI/ML Engineer - Rivago Infotech Inc

AI/ML Engineer - Rivago Infotech Inc

Rivago Infotech Inc • edmonton, ab, ca
Temps plein
Responsible for designing, building, and deploying machine learning models and AI-driven systems within the Google Cloud ecosystem.This role bridges data science and software engineering, focusing ...Voir plus
Dernière mise à jour : il y a 2 jours • Offre sponsorisée
Survey Taker: Earn up to $25 per survey (Remote)

Survey Taker: Earn up to $25 per survey (Remote)

Earn Haus • Beaumont, AB, CA
Télétravail
Temps plein +1
Looking for people to participate in taking online surveys for Fortune 500 brands.All you need to do is complete online surveys by sharing your opinion.You will help influence brand decisions on se...Voir plus
Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
Online Survey Participant: Work Remote and Earn Up To $25 Per Survey

Online Survey Participant: Work Remote and Earn Up To $25 Per Survey

Earn Haus • Beaumont, AB, CA
Télétravail
Temps plein +1
Looking for people to participate in taking online surveys for Fortune 500 brands.All you need to do is complete online surveys by sharing your opinion.You will help influence brand decisions on se...Voir plus
Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
Remote Physics Researcher (PhD) - edmonton

Remote Physics Researcher (PhD) - edmonton

Turing • edmonton, ab, ca
Télétravail
Temps plein
Remote contract for PhDs in Physics, Applied Physics, or related fields.Work on cutting-edge projects with top AI labs while earning $50+/hour, fully remote, with flexible weekly hours.Help fine-tu...Voir plus
Dernière mise à jour : il y a 6 jours • Offre sponsorisée
Complete Online Surveys For Cash (Up to $25/per)

Complete Online Surveys For Cash (Up to $25/per)

Earn Haus • Beaumont, AB, CA
Temps plein +1
Looking for people to participate in taking online surveys for Fortune 500 brands.All you need to do is complete online surveys by sharing your opinion.You will help influence brand decisions on se...Voir plus
Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
Researcher - Reinforcement Learning

Researcher - Reinforcement Learning

Huawei Technologies Canada Co., Ltd. • Edmonton, Division No. 11, CA
Temporaire
Huawei Canada has an immediate 12-month contract opening for a Reinforcement Learning Researcher.Founded in 2012, the Noah’s Ark lab has evolved into a prominent research organization with notable ...Voir plus
Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
Organizational Wellbeing Advisor (E-Volunteer) - Spanish Required

Organizational Wellbeing Advisor (E-Volunteer) - Spanish Required

Cuso International • Beaumont, Alberta
Permanent
Online placement (E-Volunteer).Please submit a Spanish Resume and Statement of Interest.Open to Canadian Citizens and Permanent Residents of Canada only.Support the journey to becoming a caring org...Voir plus
Dernière mise à jour : il y a 2 jours • Offre sponsorisée
Market Research Insights Manager - Qualitative - Kynetec

Market Research Insights Manager - Qualitative - Kynetec

Kynetec • edmonton, ab, ca
Temps plein
Kynetec is the global leader in agricultural and animal health market insights.We have a long history of market research expertise, specialising in animal health and nutrition, crop protection, far...Voir plus
Dernière mise à jour : il y a 8 jours • Offre sponsorisée
Remote Chemistry Researcher (PhD) - Turing

Remote Chemistry Researcher (PhD) - Turing

Turing • edmonton, ab, ca
Télétravail
Temps plein
Remote contract for PhDs in Chemistry, Chemical Engineering, or related fields.Work on cutting-edge projects with top AI labs while earning up to $50+/hour, fully remote, with flexible weekly hours...Voir plus
Dernière mise à jour : il y a 6 jours • Offre sponsorisée
RL Researcher: LLMs & Agentic AI (12-Month)

RL Researcher: LLMs & Agentic AI (12-Month)

Huawei Technologies Canada Co., Ltd. • Edmonton, Division No. 11, CA
Temporaire
A leading technology firm in Canada seeks a Reinforcement Learning Researcher to advance research in artificial intelligence.The ideal candidate will hold a PhD in Computer Science or a related fie...Voir plus
Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
Equity Research Mentor - edmonton

Equity Research Mentor - edmonton

Wall Street Oasis • edmonton, ab, ca
Temps plein
Click the following link to submit your application today:.Wall Street Oasis (WSO) | Mentorship Program.Mentors | 1+ Million Students | Global Reach.K in a single week (if you qualify to become a h...Voir plus
Dernière mise à jour : il y a 13 heures • Offre sponsorisée • Nouvelle offre
Data Architecture & Governance Advisor - Spanish Required

Data Architecture & Governance Advisor - Spanish Required

Cuso International • Beaumont, Alberta
Permanent
This Volunteer Placement is Located in:.Please submit a Spanish Resume and Statement of Interest.Open to Canadian Citizens and Permanent Residents of Canada only.This volunteer placement offers a u...Voir plus
Dernière mise à jour : il y a 2 jours • Offre sponsorisée
Strategic Partnership Advisor - Spanish Required

Strategic Partnership Advisor - Spanish Required

Cuso International • Beaumont, Alberta
Permanent
This Volunteer Placement is Located in:.Please submit a Spanish Resume and Statement of Interest.Open to Canadian Citizens and Permanent Residents of Canada only.Cuso International is seeking two v...Voir plus
Dernière mise à jour : il y a 2 jours • Offre sponsorisée
Equity Research Mentor

Equity Research Mentor

Wall Street Oasis • edmonton, ab, ca
Temps plein
Click the following link to submit your application today:.Wall Street Oasis (WSO) | Mentorship Program.Mentors | 1+ Million Students | Global Reach.K in a single week (if you qualify to become a h...Voir plus
Dernière mise à jour : il y a 13 heures • Offre sponsorisée • Nouvelle offre
Study Participant - Prolific

Study Participant - Prolific

Prolific • edmonton, ab, ca
Temps plein
Prolific is not just another research platform – we are building the biggest pool of quality human research data in the world.Over 35,000 researchers, educators, and organizations use Prolific to r...Voir plus
Dernière mise à jour : il y a 22 jours • Offre sponsorisée