Talent.com
Researcher - Reinforcement Learning
Researcher - Reinforcement LearningHuawei Technologies Canada Co., Ltd. • Edmonton, Alberta, CA
Researcher - Reinforcement Learning

Researcher - Reinforcement Learning

Huawei Technologies Canada Co., Ltd. • Edmonton, Alberta, CA
Il y a plus de 30 jours
Type de contrat
  • Temporaire
Description de poste

Job description

Huawei Canada has an immediate 12-month contract opening for a Reinforcement Learning Researcher.

About the team :

Founded in 2012, the Noah’s Ark lab has evolved into a prominent research organization with notable achievements in academia and industry. The lab’s mission focuses on advancing artificial intelligence and related fields to benefit the company and society. Driven by impactful, long-term projects, the aim is to enhance state-of-the-art research while integrating innovations into the company's products and services, including LLMs, RL, NLP, computer vision, AI theory, and Autonomous driving.

About the job :

Enabling Large Language Models (LLMs) to learn from experience, interaction, and environment feedback, moving beyond static fine-tuning toward continual, agentic self-improvement.

LLM post-training paradigms (e.g., RLHF, GRPO, reward-free methods, etc.).

Agentic reinforcement learning for tool-using and browsing-based LLMs trained in interactive environments.

Agentic evaluation and benchmarking, including design of multi-turn, verifiable reasoning tasks.

Your work will involve implementing and evaluating new training and evaluation pipelines for reasoning-enhanced LLMs and tool-using agents, scaling experiments on large GPU clusters, and contributing to scientific insights and publications in this emerging area.

Job requirements

About the ideal candidate :

PhD degree in Computer Science or related fields or master's degree with comparable experience.

Strong foundation in deep learning, including architectures such as Transformers and optimization techniques for large models.

Practical or research experience in reinforcement learning, self-supervised learning, or language model fine-tuning.

Proven research record in AI by having at least one paper as the first author in top tier venues, such as NeurIPS, ICML, ICLR, CVPR, ICCV, ECCV, ICRA.

Solid proficiency in Python and experience with PyTorch, DeepSpeed, Megatron and other distributed training frameworks.

Familiarity with LLM post-training pipelines (RLHF, GRPO / PPO, SFT, LoRA, MoE, etc.) is an asset.

Experience with multi-agent RL, tool-use / browser / coding agents, is an asset.

Strong communication and writing skills; enthusiasm for open research and collaborative problem-solving.

Huawei aims to support a French-speaking work environment for its employees in Quebec. We have taken steps to avoid requiring a language other than French for this position. However, proficiency in English is essential for this role for the following reasons :

The person will be required to communicate regularly with colleagues located outside Quebec, where English is the primary language used for communication between offices. In addition, the nature of the tasks related to this position, which falls within a highly specialized field of artificial intelligence, also requires knowledge of English.

Créer une alerte emploi pour cette recherche

Researcher Reinforcement Learning • Edmonton, Alberta, CA

Offres similaires
Product Research Specialist

Product Research Specialist

LawDepot LLC. • Edmonton
Temps plein +1
Join one of the fastest growing companies in Canada! LawDepot is proud to be a seven-time Growth 500 ranked organization and a major player in the Global legal solutions industry.Our mission is to ...Voir plus
Dernière mise à jour : il y a 14 jours • Offre sponsorisée
Director – Education, Training and Research

Director – Education, Training and Research

Albertametis • Edmonton
Temps plein +1
Director – Education, Training and Research.Location : Central Office, Edmonton, AB.Closing Date : Until Suitable Candidate Found. Position Status : Full-time (40 hours / week) Permanent.Rupertsland Inst...Voir plus
Dernière mise à jour : il y a 17 jours • Offre sponsorisée
AI Learning Supervising Associate (18-month contract)

AI Learning Supervising Associate (18-month contract)

EY • Edmonton
Temps plein +1
At EY, we’re all in to shape your future with confidence.We’ll help you succeed in a globally connected powerhouse of diverse teams and take your career wherever you want it to go.Join EY and help ...Voir plus
Dernière mise à jour : il y a 11 jours • Offre sponsorisée
Bilingual Talent Sourcer / Researcher (6-month contract) - edmonton

Bilingual Talent Sourcer / Researcher (6-month contract) - edmonton

ML6 Search + Talent Advisory • edmonton, ab, ca
Temps plein
ML6 Search + Talent Advisory s’est associé à une organisation de soins de santé multisite en forte croissance dans le cadre d’un mandat confidentiel pour. Sourcer / Chercheur(se) de talents bilingue ...Voir plus
Dernière mise à jour : il y a 1 jour • Offre sponsorisée
Researcher-Chercheur - FPInnovations

Researcher-Chercheur - FPInnovations

FPInnovations • edmonton, ab, ca
Temps plein
Colombie Britanique, Québec, Alberta.Chez FPInnovations, nous n’offrons pas seulement des emplois, nous proposons des possibilités de carrière qui vous permettront de contribuer de façon significat...Voir plus
Dernière mise à jour : il y a 1 jour • Offre sponsorisée
Medical Science Liaison - Brunel

Medical Science Liaison - Brunel

Brunel • edmonton, ab, ca
Temps plein
Alberta, Ontario, or Quebec (Field-Based, Remote).We are currently hiring a Medical Science Liaison (MSL) who can serve as a trusted scientific expert and strategic field partner, building strong r...Voir plus
Dernière mise à jour : il y a 19 jours • Offre sponsorisée
Founder - Loud Solutions

Founder - Loud Solutions

Loud Solutions • edmonton, ab, ca
Temps plein
Loud has partnered with a well-capitalized, highly active VC deploying capital into AI-driven businesses across large, legacy industries. What’s missing is the right person to steer the ship.We are ...Voir plus
Dernière mise à jour : il y a 19 jours • Offre sponsorisée
Performance Strategist - Meta Ads (1769503904-T1-CA-S1) Remote Canada

Performance Strategist - Meta Ads (1769503904-T1-CA-S1) Remote Canada

HireHawk • Edmonton, AB, CA
Télétravail
Temps plein
Quick Apply
Job Type : Full-time, long-term contractor.Schedule : Full-time, aligned with U.Join a high-growth beauty brand as Creative Strategist, Paid Media, where you'll own and scale performance creativ...Voir plus
Dernière mise à jour : il y a 1 jour
Recruitment Research Associate

Recruitment Research Associate

Executrade – Your Recruitment Specialists • Edmonton, AB, Canada
Temps plein +1
Recruitment Research Associate.Executrade is a trusted leader in professional recruitment and talent advisory services, proudly serving organizations across Western Canada for over five decades.Wit...Voir plus
Dernière mise à jour : il y a 14 jours • Offre sponsorisée
Remote R Engineer - AI Trainer

Remote R Engineer - AI Trainer

SuperAnnotate • Fort Saskatchewan, Alberta, CA
Télétravail
Temps plein
As a remote, hourly paid R Engineer, you will review AI-generated responses and generate high-quality R and data-analysis-focused content, evaluating the reasoning quality and step-by-step problem-...Voir plus
Dernière mise à jour : il y a plus de 30 jours
𝗥𝗲𝘀𝗲𝗮𝗿𝗰𝗵 𝗜𝗻𝘁𝗲𝗿𝗻𝘀 – 𝗡𝗲𝘂𝗿𝗼𝘀𝗰𝗶𝗲𝗻𝗰𝗲, 𝗣𝘀𝘆𝗰𝗵𝗼𝗹𝗼𝗴𝘆, 𝗖𝗼𝗴𝗻𝗶𝘁𝗶𝘃𝗲 𝗦𝗰𝗶𝗲𝗻𝗰𝗲, 𝗠𝗲𝗻𝘁𝗮𝗹 𝗛𝗲𝗮𝗹𝘁𝗵 & 𝗔𝗰𝗰𝗲𝘀𝘀𝗶𝗯𝗶𝗹𝗶𝘁𝘆 - Findora AI

𝗥𝗲𝘀𝗲𝗮𝗿𝗰𝗵 𝗜𝗻𝘁𝗲𝗿𝗻𝘀 – 𝗡𝗲𝘂𝗿𝗼𝘀𝗰𝗶𝗲𝗻𝗰𝗲, 𝗣𝘀𝘆𝗰𝗵𝗼𝗹𝗼𝗴𝘆, 𝗖𝗼𝗴𝗻𝗶𝘁𝗶𝘃𝗲 𝗦𝗰𝗶𝗲𝗻𝗰𝗲, 𝗠𝗲𝗻𝘁𝗮𝗹 𝗛𝗲𝗮𝗹𝘁𝗵 & 𝗔𝗰𝗰𝗲𝘀𝘀𝗶𝗯𝗶𝗹𝗶𝘁𝘆 - Findora AI

Findora AI • edmonton, ab, ca
Temps plein
We’re seeking motivated 𝗥𝗲𝘀𝗲𝗮𝗿𝗰𝗵 𝗜𝗻𝘁𝗲𝗿𝗻𝘀 to contribute to projects in : .Human-computer interaction and inclusive design. Neuropsychological and cognitive test development and validatio...Voir plus
Dernière mise à jour : il y a 2 jours • Offre sponsorisée
Canada Impact+ Research Chairs (Impact+)

Canada Impact+ Research Chairs (Impact+)

University of Alberta • Edmonton
Temps plein
The University of Alberta invites applications from outstanding, internationally based researchers for the Canada Impact+ Research Chairs (Impact+) Competition — a landmark national initiative desi...Voir plus
Dernière mise à jour : il y a 17 jours • Offre sponsorisée
AI Learning Programs Lead

AI Learning Programs Lead

EY • Edmonton
Temps plein
A global professional services firm in Edmonton is seeking a Supervising Associate to manage AI learning programs.Your role involves designing and delivering innovative learning solutions, collabor...Voir plus
Dernière mise à jour : il y a 11 jours • Offre sponsorisée
Research Analyst (Finance)

Research Analyst (Finance)

Ballad Consulting Group • Edmonton, AB, Canada
Temps plein
The Ballad team is comprised of business professionals, strengthening the communities in which we live and work.Our diverse team brings a wealth of knowledge, experience, and creativity to each pro...Voir plus
Dernière mise à jour : il y a 22 jours • Offre sponsorisée
Senior Statistical Programmer

Senior Statistical Programmer

Warman O'Brien • edmonton, ab, ca
Temps plein
Senior / Principal Statistical Programmer | Small CRO | Remote.We're partnered with a small CRO who are experiencing a large amount of growth within Biometrics. As a Senior Statistical Programmer, you...Voir plus
Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
Product Research & Growth Specialist (Hybrid)

Product Research & Growth Specialist (Hybrid)

LawDepot • Edmonton
Temps plein
A leading legal solutions provider in Edmonton is looking for a Product Research Specialist to enhance web products and user experience. The role involves data analysis, project execution, and colla...Voir plus
Dernière mise à jour : il y a 17 jours • Offre sponsorisée
Collaborative Clinical Trials Research Coordinator

Collaborative Clinical Trials Research Coordinator

Primesiteresearch • Edmonton
Temps plein
A clinical research consultancy in Canada is seeking a research coordinator to manage and oversee clinical trials.Candidates should possess a degree and have 2-3 years of relevant experience.Succes...Voir plus
Dernière mise à jour : il y a 8 jours • Offre sponsorisée
Director – Education, Training and Research

Director – Education, Training and Research

Rupertsland University • Edmonton
Temps plein +1
Director – Education, Training and Research.Location : Central Office, Edmonton, AB.Closing Date : Until Suitable Candidate Found. Position Status : Full-time (40 hours / week) Permanent.Rupertsland Inst...Voir plus
Dernière mise à jour : il y a 17 jours • Offre sponsorisée