Talent.com
Researcher - Reinforcement Learning
Huawei Technologies Canada Co., Ltd. • Edmonton, Division No. 11, CA
7 days ago
Job type
  • Temporary
Job description

Huawei Canada has an immediate 12-month contract opening for a Reinforcement Learning Researcher.

About the team:

Founded in 2012, the Noah’s Ark Lab has evolved into a prominent research organization with notable achievements in academia and industry. The lab’s mission is to advance artificial intelligence and related fields to benefit the company and society. Driven by impactful, long‑term projects, the lab aims to advance state‑of‑the‑art research while integrating innovations into the company’s products and services, in areas including LLMs, RL, NLP, computer vision, AI theory, and autonomous driving.

About the job:

  • Enabling Large Language Models (LLMs) to learn from experience, interaction, and environment feedback, moving beyond static fine‑tuning toward continual, agentic self‑improvement.
  • LLM post‑training paradigms (e.g., RLHF, GRPO, reward‑free methods).
  • Agentic reinforcement learning for tool‑using and browsing‑based LLMs trained in interactive environments.
  • Agentic evaluation and benchmarking, including design of multi‑turn, verifiable reasoning tasks.
  • Your work will involve implementing and evaluating new training and evaluation pipelines for reasoning‑enhanced LLMs and tool‑using agents, scaling experiments on large GPU clusters, and contributing to scientific insights and publications in this emerging area.

About the ideal candidate:

  • PhD in Computer Science or a related field, or a master’s degree with comparable experience.
  • Strong foundation in deep learning, including architectures such as Transformers and optimization techniques for large models.
  • Practical or research experience in reinforcement learning, self‑supervised learning, or language model fine‑tuning.
  • Proven research record in AI, with at least one first‑author paper at a top‑tier venue such as NeurIPS, ICML, ICLR, CVPR, ICCV, ECCV, or ICRA.
  • Solid proficiency in Python and experience with PyTorch, DeepSpeed, Megatron, or other distributed training frameworks.
  • Familiarity with LLM post‑training pipelines (RLHF, GRPO / PPO, SFT, LoRA, MoE, etc.) is an asset.
  • Experience with multi‑agent RL or with tool‑use, browser, or coding agents is an asset.
  • Strong communication and writing skills; enthusiasm for open research and collaborative problem‑solving.
