Researcher - Reinforcement Learning
Huawei • Edmonton, Division No. 11, CA
30+ days ago
Job type
  • Temporary
Job description

Huawei Canada has an immediate 12-month contract opening for a Reinforcement Learning Researcher.

About the team:

Founded in 2012, the Noah’s Ark Lab has evolved into a prominent research organization with notable achievements in academia and industry. The lab’s mission is to advance artificial intelligence and related fields to benefit the company and society. Driven by impactful, long‑term projects, the lab aims to advance state‑of‑the‑art research while integrating innovations into the company’s products and services, in areas including LLMs, RL, NLP, computer vision, AI theory, and autonomous driving.

About the job:

  • Enabling Large Language Models (LLMs) to learn from experience, interaction, and environment feedback, moving beyond static fine‑tuning toward continual, agentic self‑improvement.
  • LLM post‑training paradigms (e.g., RLHF, GRPO, and reward‑free methods).
  • Agentic reinforcement learning for tool‑using and browsing‑based LLMs trained in interactive environments.
  • Agentic evaluation and benchmarking, including design of multi‑turn, verifiable reasoning tasks.
  • Your work will involve implementing and evaluating new training and evaluation pipelines for reasoning‑enhanced LLMs and tool‑using agents, scaling experiments on large GPU clusters, and contributing to scientific insights and publications in this emerging area.

About the ideal candidate:

  • PhD in Computer Science or a related field, or a master’s degree with comparable experience.
  • Strong foundation in deep learning, including architectures such as Transformers and optimization techniques for large models.
  • Practical or research experience in reinforcement learning, self‑supervised learning, or language model fine‑tuning.
  • Proven research record in AI, with at least one first‑author paper at a top‑tier venue such as NeurIPS, ICML, ICLR, CVPR, ICCV, ECCV, or ICRA.
  • Solid proficiency in Python and experience with PyTorch, DeepSpeed, Megatron, or other distributed training frameworks.
  • Familiarity with LLM post‑training pipelines (RLHF, GRPO / PPO, SFT, LoRA, MoE, etc.) is an asset.
  • Experience with multi‑agent RL or with tool‑use, browser, or coding agents is an asset.
  • Strong communication and writing skills; enthusiasm for open research and collaborative problem‑solving.
