Talent.com
Researcher - Reinforcement Learning
Researcher - Reinforcement LearningHuawei • Edmonton, Division No. 11, CA
Researcher - Reinforcement Learning

Researcher - Reinforcement Learning

Huawei • Edmonton, Division No. 11, CA
Il y a plus de 30 jours
Type de contrat
  • Temporaire
Description de poste

Huawei Canada has an immediate 12-month contract opening for a Reinforcement Learning Researcher.

About the team :

Founded in 2012, the Noah’s Ark lab has evolved into a prominent research organization with notable achievements in academia and industry. The lab’s mission focuses on advancing artificial intelligence and related fields to benefit the company and society. Driven by impactful, long‑term projects, the aim is to enhance state‑of‑the‑art research while integrating innovations into the company’s products and services, including LLMs, RL, NLP, computer vision, AI theory, and Autonomous driving.

About the job :

  • Enabling Large Language Models (LLMs) to learn from experience, interaction, and environment feedback, moving beyond static fine‑tuning toward continual, agentic self‑improvement.
  • LLM post‑training paradigms (e.g., RLHF, GRPO, reward‑free methods, etc.).
  • Agentic reinforcement learning for tool‑using and browsing‑based LLMs trained in interactive environments.
  • Agentic evaluation and benchmarking, including design of multi‑turn, verifiable reasoning tasks.
  • Your work will involve implementing and evaluating new training and evaluation pipelines for reasoning‑enhanced LLMs and tool‑using agents, scaling experiments on large GPU clusters, and contributing to scientific insights and publications in this emerging area.

About the ideal candidate :

  • PhD degree in Computer Science or related fields or master’s degree with comparable experience.
  • Strong foundation in deep learning, including architectures such as Transformers and optimization techniques for large models.
  • Practical or research experience in reinforcement learning, self‑supervised learning, or language model fine‑tuning.
  • Proven research record in AI by having at least one paper as the first author in top tier venues, such as NeurIPS, ICML, ICLR, CVPR, ICCV, ECCV, ICRA.
  • Solid proficiency in Python and experience with PyTorch, DeepSpeed, Megatron and other distributed training frameworks.
  • Familiarity with LLM post‑training pipelines (RLHF, GRPO / PPO, SFT, LoRA, MoE, etc.) is an asset.
  • Experience with multi‑agent RL, tool‑use / browser / coding agents, is an asset.
  • Strong communication and writing skills; enthusiasm for open research and collaborative problem‑solving.
  • #J-18808-Ljbffr

    Créer une alerte emploi pour cette recherche

    Researcher • Edmonton, Division No. 11, CA

    Offres similaires
    Market Research Analyst - Insight Global

    Market Research Analyst - Insight Global

    Insight Global • edmonton, ab, ca
    Temporaire
    Position : Market Research Analyst.Location : Calgary, Alberta (preferred).Work type : Hybrid, 2 Days / week onsite if in Calgary. Can also be remote across Canada.Targeted Start date : January 12th, 2026...Voir plus
    Dernière mise à jour : il y a 1 jour • Offre sponsorisée
    Strategic Planning Analyst III - Manager, Research and Innovation

    Strategic Planning Analyst III - Manager, Research and Innovation

    City of Edmonton • Edmonton
    Temps plein
    Strategic Planning Analyst III - Manager, Research and Innovation.The Edmonton Police Service (EPS) requires a highly effective leader to manage the Research and Innovation Section within the Strat...Voir plus
    Dernière mise à jour : il y a 11 jours • Offre sponsorisée
    Research Ethics Associate

    Research Ethics Associate

    Alberta Innovates • Edmonton
    Temps plein +1
    Location : • • Edmonton Research Park • • •Posted : • • December 18, 2025 • • •Competition # : • • 4020We are seeking a Research Ethics Associate who will be responsible for processing research applications for...Voir plus
    Dernière mise à jour : il y a 11 jours • Offre sponsorisée
    Trigonometry Private Tutoring Jobs Beaumont (Alberta)

    Trigonometry Private Tutoring Jobs Beaumont (Alberta)

    Superprof • Beaumont (Alberta), Canada
    Temps plein +1
    Superprof is Canada's #1 tutoring platform, and we're actively recruiting passionate tutors! Whether you're a student, a professional, or simply someone who loves teaching, join the largest communi...Voir plus
    Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
    Machine Learning Resident - Client : T.rex AI (1 year term)

    Machine Learning Resident - Client : T.rex AI (1 year term)

    Alberta Machine Intelligence Institute • Edmonton, AB, Canada
    Temps plein
    If you are interested in the application of artificial intelligence (AI) and machine learning (ML) methods for Energy systems optimization, Distributed Energy Resources, and Multi-Agent RL, this is...Voir plus
    Dernière mise à jour : il y a 2 jours • Offre sponsorisée
    Market Research Manager

    Market Research Manager

    Kynetec • Edmonton, Alberta, Canada
    Temps plein
    Kynetec is the global leader in agricultural and animal health market insights.We have a long history of market research expertise, specialising in animal health and nutrition, crop protection, far...Voir plus
    Dernière mise à jour : il y a 13 heures • Offre sponsorisée • Nouvelle offre
    Research Associate

    Research Associate

    University of Alberta • Edmonton
    Temps plein
    Be among the first 25 applicants.This competition is open to all applicants however; internal candidates and applicants who were former employees of the University of Alberta will be given priority...Voir plus
    Dernière mise à jour : il y a 1 jour • Offre sponsorisée
    Lead AI Engineer, LLMs & SLMs

    Lead AI Engineer, LLMs & SLMs

    Artificial.Agency • Edmonton
    Temps plein
    You will lead a high-caliber team working on large and small language models, setting technical and research strategy, guiding fine-tuning and deployment efforts, and aligning AI innovation with pr...Voir plus
    Dernière mise à jour : il y a 11 jours • Offre sponsorisée
    (JRFP)Fellow- Jr. Researcher (Code : EU6458)

    (JRFP)Fellow- Jr. Researcher (Code : EU6458)

    European Institute of Policy Research and Human Rights SIA • Edmonton, AB, Canada
    Télétravail
    Temps plein
    Quick Apply
    European Institute of Policy Research and Human Rights.Our mission is to deliver world-class skill enhancing programs to candidates globally, equipping them with the knowledge and skills to influen...Voir plus
    Dernière mise à jour : il y a 13 heures • Nouvelle offre
    Senior Research, Policy, and Planning Analyst

    Senior Research, Policy, and Planning Analyst

    Government of Alberta • Edmonton
    Temps plein +2
    Senior Research, Policy, and Planning Analyst.Senior Research, Policy, and Planning Analyst.Service Alberta and Red Tape Reduction. Service Alberta and Red Tape Reduction is the government’s solutio...Voir plus
    Dernière mise à jour : il y a 11 jours • Offre sponsorisée
    Lead AI Engineer, LLMs & SLMs

    Lead AI Engineer, LLMs & SLMs

    Artificial Agency • Edmonton
    Temps plein
    You will lead a high-caliber team working on large and small language models, setting technical and research strategy, guiding fine-tuning and deployment efforts, and aligning AI innovation with pr...Voir plus
    Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
    Research Advisor

    Research Advisor

    Cuso International • Edmonton, Alberta
    Permanent
    This Volunteer Placement is Located in : .Please submit a Spanish Resume and Statement of Interest.Open to Canadian Citizens and Permanent Residents of Canada only. The primary aim of this role is to ...Voir plus
    Dernière mise à jour : il y a plus de 30 jours
    Machine Learning Engineer

    Machine Learning Engineer

    Hifyre • Edmonton, Alberta, Canada
    Temps plein
    Hifyre provides market intelligence for the cannabis industry, analyzing retail data to help.Our models power product recommendations, sales forecasting, and market analysis for both internal opera...Voir plus
    Dernière mise à jour : il y a 3 jours • Offre sponsorisée
    Robotics & AI Tenure-Track Professor — Autonomous Systems

    Robotics & AI Tenure-Track Professor — Autonomous Systems

    Ccwestt • Edmonton
    Temps plein
    A leading university in Canada is seeking an Assistant or Associate Professor in Robotics and AI.This full-time tenure-track position involves teaching, conducting research, and engaging in service...Voir plus
    Dernière mise à jour : il y a 11 jours • Offre sponsorisée
    Machine Learning Resident - Client : Outsyders (1 year term)

    Machine Learning Resident - Client : Outsyders (1 year term)

    Alberta Machine Intelligence Institute • Edmonton
    Temps plein
    If you are interested in leveraging Generative AI for Computer Vision in visual effects, film industry, and gaming, this is the right opportunity for you. Be a part of a team of research and machine...Voir plus
    Dernière mise à jour : il y a 4 jours • Offre sponsorisée
    Machine Learning Resident - Client : Outsyders (1 year term)

    Machine Learning Resident - Client : Outsyders (1 year term)

    Amii (Alberta Machine Intelligence Institute) • Edmonton
    Temps plein
    Machine Learning Resident - Client : Outsyders.Machine Learning Resident - Client : Outsyders.Amii (Alberta Machine Intelligence Institute). If you are interested in leveraging Generative AI for Compu...Voir plus
    Dernière mise à jour : il y a 4 jours • Offre sponsorisée
    AI Governance Lead — Ethical, Regulated Innovation

    AI Governance Lead — Ethical, Regulated Innovation

    Alberta Blue Cross • Edmonton
    Temps plein
    A healthcare coverage provider in Canada is seeking a Manager, AI Governance to lead the development of AI governance practices. This strategic role ensures responsible AI use and compliance with re...Voir plus
    Dernière mise à jour : il y a 11 jours • Offre sponsorisée
    Market Research Analyst

    Market Research Analyst

    Insight Global • edmonton, ab, ca
    Temporaire
    Position : Market Research Analyst.Location : Calgary, Alberta (preferred).Work type : Hybrid, 2 Days / week onsite if in Calgary. Can also be remote across Canada.Targeted Start date : January 12th, 2026...Voir plus
    Dernière mise à jour : il y a 1 jour • Offre sponsorisée