Talent.com
Researcher - Reinforcement Learning
Researcher - Reinforcement LearningHuawei • Edmonton, Division No. 11, CA
Researcher - Reinforcement Learning

Researcher - Reinforcement Learning

Huawei • Edmonton, Division No. 11, CA
30+ days ago
Job type
  • Temporary
Job description

Huawei Canada has an immediate 12-month contract opening for a Reinforcement Learning Researcher.

About the team :

Founded in 2012, the Noah’s Ark lab has evolved into a prominent research organization with notable achievements in academia and industry. The lab’s mission focuses on advancing artificial intelligence and related fields to benefit the company and society. Driven by impactful, long‑term projects, the aim is to enhance state‑of‑the‑art research while integrating innovations into the company’s products and services, including LLMs, RL, NLP, computer vision, AI theory, and Autonomous driving.

About the job :

  • Enabling Large Language Models (LLMs) to learn from experience, interaction, and environment feedback, moving beyond static fine‑tuning toward continual, agentic self‑improvement.
  • LLM post‑training paradigms (e.g., RLHF, GRPO, reward‑free methods, etc.).
  • Agentic reinforcement learning for tool‑using and browsing‑based LLMs trained in interactive environments.
  • Agentic evaluation and benchmarking, including design of multi‑turn, verifiable reasoning tasks.
  • Your work will involve implementing and evaluating new training and evaluation pipelines for reasoning‑enhanced LLMs and tool‑using agents, scaling experiments on large GPU clusters, and contributing to scientific insights and publications in this emerging area.

About the ideal candidate :

  • PhD degree in Computer Science or related fields or master’s degree with comparable experience.
  • Strong foundation in deep learning, including architectures such as Transformers and optimization techniques for large models.
  • Practical or research experience in reinforcement learning, self‑supervised learning, or language model fine‑tuning.
  • Proven research record in AI by having at least one paper as the first author in top tier venues, such as NeurIPS, ICML, ICLR, CVPR, ICCV, ECCV, ICRA.
  • Solid proficiency in Python and experience with PyTorch, DeepSpeed, Megatron and other distributed training frameworks.
  • Familiarity with LLM post‑training pipelines (RLHF, GRPO / PPO, SFT, LoRA, MoE, etc.) is an asset.
  • Experience with multi‑agent RL, tool‑use / browser / coding agents, is an asset.
  • Strong communication and writing skills; enthusiasm for open research and collaborative problem‑solving.
  • #J-18808-Ljbffr

    Create a job alert for this search

    Researcher • Edmonton, Division No. 11, CA

    Similar jobs
    Strategic Planning Analyst III - Manager, Research and Innovation

    Strategic Planning Analyst III - Manager, Research and Innovation

    City of Edmonton • Edmonton
    Full-time
    Strategic Planning Analyst III - Manager, Research and Innovation.The Edmonton Police Service (EPS) requires a highly effective leader to manage the Research and Innovation Section within the Strat...Show more
    Last updated: 13 days ago • Promoted
    Research Ethics Associate

    Research Ethics Associate

    Alberta Innovates • Edmonton
    Full-time +1
    Location : • • Edmonton Research Park • • •Posted : • • December 18, 2025 • • •Competition # : • • 4020We are seeking a Research Ethics Associate who will be responsible for processing research applications for...Show more
    Last updated: 13 days ago • Promoted
    Machine Learning Resident - Client : T.rex AI (1 year term)

    Machine Learning Resident - Client : T.rex AI (1 year term)

    Alberta Machine Intelligence Institute • Edmonton, AB, Canada
    Full-time
    If you are interested in the application of artificial intelligence (AI) and machine learning (ML) methods for Energy systems optimization, Distributed Energy Resources, and Multi-Agent RL, this is...Show more
    Last updated: 4 days ago • Promoted
    (SRFP)-Research Fellow Sr. (Code : EU7260)

    (SRFP)-Research Fellow Sr. (Code : EU7260)

    European Institute of Policy Research and Human Rights SIA • Edmonton, AB, Canada
    Full-time
    European Institute of Policy Research and Human Rights.Our mission is to deliver world-class skill enhancing programs to candidates globally, equipping them with the knowledge and skills to influen...Show more
    Last updated: 3 hours ago • Promoted • New!
    Research Advisor

    Research Advisor

    Cuso International • Edmonton, Alberta
    Full-time +1
    This Volunteer Placement is Located in : .Please submit a Spanish Resume and Statement of Interest.Open to Canadian Citizens and Permanent Residents of Canada only. The primary aim of this role is to ...Show more
    Last updated: 30+ days ago
    Lead AI Engineer, LLMs & SLMs

    Lead AI Engineer, LLMs & SLMs

    Artificial.Agency • Edmonton
    Full-time
    You will lead a high-caliber team working on large and small language models, setting technical and research strategy, guiding fine-tuning and deployment efforts, and aligning AI innovation with pr...Show more
    Last updated: 13 days ago • Promoted
    Sr. Research Associate

    Sr. Research Associate

    Gilead Sciences, Inc. • Edmonton
    Full-time +1
    At Gilead, we’re creating a healthier world for all people.For more than 35 years, we’ve tackled diseases such as HIV, viral hepatitis, COVID-19 and cancer – working relentlessly to develop therapi...Show more
    Last updated: 13 days ago • Promoted
    (JRFP)Fellow- Jr. Researcher (Code : EU6458)

    (JRFP)Fellow- Jr. Researcher (Code : EU6458)

    European Institute of Policy Research and Human Rights SIA • Edmonton, AB, Canada
    Full-time
    European Institute of Policy Research and Human Rights.Our mission is to deliver world-class skill enhancing programs to candidates globally, equipping them with the knowledge and skills to influen...Show more
    Last updated: 3 hours ago • Promoted • New!
    Senior Research, Policy, and Planning Analyst

    Senior Research, Policy, and Planning Analyst

    Government of Alberta • Edmonton
    Full-time +2
    Senior Research, Policy, and Planning Analyst.Senior Research, Policy, and Planning Analyst.Service Alberta and Red Tape Reduction. Service Alberta and Red Tape Reduction is the government’s solutio...Show more
    Last updated: 13 days ago • Promoted
    Machine Learning Resident - Client : Thrive Career Wellness (1 year term)

    Machine Learning Resident - Client : Thrive Career Wellness (1 year term)

    Alberta Machine Intelligence Institute • Edmonton, AB, Canada
    Full-time
    Come work with us to explore the application of sequential decision-making algorithms to address a critical challenge in career mobility. If you are an RL / ML researcher or engineer looking to apply ...Show more
    Last updated: 30+ days ago • Promoted
    Lead AI Engineer, LLMs & SLMs

    Lead AI Engineer, LLMs & SLMs

    Artificial Agency • Edmonton
    Full-time
    You will lead a high-caliber team working on large and small language models, setting technical and research strategy, guiding fine-tuning and deployment efforts, and aligning AI innovation with pr...Show more
    Last updated: 30+ days ago • Promoted
    Machine Learning Scientist – RL & Applied AI Leader

    Machine Learning Scientist – RL & Applied AI Leader

    Alberta Machine Intelligence Institute • Edmonton
    Full-time
    A leading AI research institute in Canada is looking for a Machine Learning Scientist to deliver innovative solutions and lead projects involving Reinforcement Learning. The successful candidate wil...Show more
    Last updated: 5 days ago • Promoted
    Machine Learning Engineer

    Machine Learning Engineer

    Hifyre • Edmonton, Alberta, Canada
    Full-time
    Hifyre provides market intelligence for the cannabis industry, analyzing retail data to help.Our models power product recommendations, sales forecasting, and market analysis for both internal opera...Show more
    Last updated: 5 days ago • Promoted
    Robotics & AI Tenure-Track Professor — Autonomous Systems

    Robotics & AI Tenure-Track Professor — Autonomous Systems

    Ccwestt • Edmonton
    Full-time
    A leading university in Canada is seeking an Assistant or Associate Professor in Robotics and AI.This full-time tenure-track position involves teaching, conducting research, and engaging in service...Show more
    Last updated: 13 days ago • Promoted
    Freelance Reviewer - TransPerfect

    Freelance Reviewer - TransPerfect

    TransPerfect • edmonton, ab, ca
    Full-time
    TransPerfect is currently looking for qualified freelance linguists (reviewers) with Technical, Construction, Engineering, Manufacturing expertise interested in long-term collaboration and willing ...Show more
    Last updated: 1 day ago • Promoted
    Pediatric Oncology CNS : Team Lead & Research Leader

    Pediatric Oncology CNS : Team Lead & Research Leader

    Alberta Health Services • Edmonton
    Full-time
    A healthcare service organization in Canada is seeking a Team Lead CNS specialized in Pediatric Oncology.The role involves leading a multidisciplinary team, managing clinical guidelines, and foster...Show more
    Last updated: 30+ days ago • Promoted
    Machine Learning Resident - Client : Outsyders (1 year term)

    Machine Learning Resident - Client : Outsyders (1 year term)

    Amii (Alberta Machine Intelligence Institute) • Edmonton
    Full-time
    Machine Learning Resident - Client : Outsyders.Machine Learning Resident - Client : Outsyders.Amii (Alberta Machine Intelligence Institute). If you are interested in leveraging Generative AI for Compu...Show more
    Last updated: 6 days ago • Promoted
    Computational Research Expert (Material Discovery)

    Computational Research Expert (Material Discovery)

    Aramco • Edmonton
    Full-time
    Aramco energizes the world economy.Aramco occupies a special position in the global energy industry.We are one of the world's largest producers of hydrocarbon energy and chemicals, with among the l...Show more
    Last updated: 11 days ago • Promoted