Researcher - Reinforcement LearningHuawei Technologies Canada Co., Ltd. • Edmonton, Division No. 11, CA

Researcher - Reinforcement Learning

Huawei Technologies Canada Co., Ltd. • Edmonton, Division No. 11, CA

7 days ago

Job type

Temporary

Job description

Huawei Canada has an immediate 12-month contract opening for a Reinforcement Learning Researcher.

About the team :

Founded in 2012, the Noah’s Ark lab has evolved into a prominent research organization with notable achievements in academia and industry. The lab’s mission focuses on advancing artificial intelligence and related fields to benefit the company and society. Driven by impactful, long‑term projects, the aim is to enhance state‑of‑the‑art research while integrating innovations into the company’s products and services, including LLMs, RL, NLP, computer vision, AI theory, and Autonomous driving.

About the job :

Enabling Large Language Models (LLMs) to learn from experience, interaction, and environment feedback, moving beyond static fine‑tuning toward continual, agentic self‑improvement.
LLM post‑training paradigms (e.g., RLHF, GRPO, reward‑free methods, etc.).
Agentic reinforcement learning for tool‑using and browsing‑based LLMs trained in interactive environments.
Agentic evaluation and benchmarking, including design of multi‑turn, verifiable reasoning tasks.
Your work will involve implementing and evaluating new training and evaluation pipelines for reasoning‑enhanced LLMs and tool‑using agents, scaling experiments on large GPU clusters, and contributing to scientific insights and publications in this emerging area.

About the ideal candidate :

PhD degree in Computer Science or related fields or master’s degree with comparable experience.

Strong foundation in deep learning, including architectures such as Transformers and optimization techniques for large models.

Practical or research experience in reinforcement learning, self‑supervised learning, or language model fine‑tuning.

Proven research record in AI by having at least one paper as the first author in top tier venues, such as NeurIPS, ICML, ICLR, CVPR, ICCV, ECCV, ICRA.

Solid proficiency in Python and experience with PyTorch, DeepSpeed, Megatron and other distributed training frameworks.

Familiarity with LLM post‑training pipelines (RLHF, GRPO / PPO, SFT, LoRA, MoE, etc.) is an asset.

Experience with multi‑agent RL, tool‑use / browser / coding agents, is an asset.

Strong communication and writing skills; enthusiasm for open research and collaborative problem‑solving.

#J-18808-Ljbffr

Create a job alert for this search

Researcher • Edmonton, Division No. 11, CA

Similar jobs

Strategic Planning Analyst III - Manager, Research and Innovation

City of Edmonton • Edmonton

Full-time

Strategic Planning Analyst III - Manager, Research and Innovation.The Edmonton Police Service (EPS) requires a highly effective leader to manage the Research and Innovation Section within the Strat...Show more

Last updated: 7 days ago • Promoted

Remote Senior Finance Specialist - Ai Trainer

SuperAnnotate • Fort Saskatchewan, Canada

Remote

Full-time

In this hourly, remote contractor role, you will review AI-generated finance analyses and / or generate expert finance content, evaluating reasoning quality and step-by-step problem-solving while pro...Show more

Last updated: 1 day ago • Promoted

Community Research Coordinator (Maskwacis)

The University of Alberta • Edmonton

Full-time

This position is a part of the Non-Academic Staff Association (NASA).This position has a term length of 1 year plus 1 day and offers a comprehensive benefits package. This position will primarily be...Show more

Last updated: 1 day ago • Promoted

Research Advisor

Cuso International • Edmonton, Alberta

Full-time +1

This Volunteer Placement is Located in : .Please submit a Spanish Resume and Statement of Interest.Open to Canadian Citizens and Permanent Residents of Canada only. The primary aim of this role is to ...Show more

Last updated: 30+ days ago

Lead AI Engineer, LLMs & SLMs

Artificial.Agency • Edmonton

Full-time

You will lead a high-caliber team working on large and small language models, setting technical and research strategy, guiding fine-tuning and deployment efforts, and aligning AI innovation with pr...Show more

Last updated: 7 days ago • Promoted

Sr. Research Associate

Gilead Sciences, Inc. • Edmonton

Full-time +1

At Gilead, we’re creating a healthier world for all people.For more than 35 years, we’ve tackled diseases such as HIV, viral hepatitis, COVID-19 and cancer – working relentlessly to develop therapi...Show more

Last updated: 7 days ago • Promoted

Senior Research, Policy, and Planning Analyst

Government of Alberta • Edmonton

Full-time +2

Senior Research, Policy, and Planning Analyst.Senior Research, Policy, and Planning Analyst.Service Alberta and Red Tape Reduction. Service Alberta and Red Tape Reduction is the government’s solutio...Show more

Last updated: 7 days ago • Promoted

Healthcare AI Lead : NLP & Analytics

Canadian Professional Sales Association • Edmonton

Full-time

A national professional association is looking for an Artificial Intelligence Specialist to lead AI initiatives and develop models in healthcare. The role requires a Bachelor's degree in health scie...Show more

Last updated: 6 days ago • Promoted

Canada Impact+ Research Chairs (Impact+)

University of Alberta • Edmonton

Full-time

The University of Alberta invites applications from outstanding, internationally based researchers for the Canada Impact+ Research Chairs (Impact+) Competition — a landmark national initiative desi...Show more

Last updated: 1 day ago • Promoted

Computational Research Expert (Optimization and Control)

Aramco • Edmonton (West Clareview / East Londonderry), ca

Full-time

Aramco energizes the world economy.Aramco occupies a special position in the global energy industry.We are one of the world's largest producers of hydrocarbon energy and chemicals, with among the l...Show more

Last updated: 3 hours ago • Promoted • New!

Robotics & AI Tenure-Track Professor — Autonomous Systems

Ccwestt • Edmonton

Full-time

A leading university in Canada is seeking an Assistant or Associate Professor in Robotics and AI.This full-time tenure-track position involves teaching, conducting research, and engaging in service...Show more

Last updated: 7 days ago • Promoted

Machine Learning Resident - Client : Outsyders (1 year term)

Alberta Machine Intelligence Institute • Edmonton

Full-time

If you are interested in leveraging Generative AI for Computer Vision in visual effects, film industry, and gaming, this is the right opportunity for you. Be a part of a team of research and machine...Show more

Last updated: 16 hours ago • Promoted • New!

Paid ML Energy Forecasting Fellow - Mentored Research

Alberta Machine Intelligence Institute • Edmonton

Full-time

A leading AI research institute in Canada is seeking a Machine Learning Resident to focus on energy consumption modeling. The role involves designing, implementing, and evaluating machine learning m...Show more

Last updated: 7 days ago • Promoted

Machine Learning Resident - Client : Outsyders (1 year term)

Amii (Alberta Machine Intelligence Institute) • Edmonton

Full-time

Machine Learning Resident - Client : Outsyders.Machine Learning Resident - Client : Outsyders.Amii (Alberta Machine Intelligence Institute). If you are interested in leveraging Generative AI for Compu...Show more

Last updated: 16 hours ago • Promoted • New!

Managing Director / Senior Consultant - Aligned Labs

Aligned Labs • edmonton, ab, ca

Part-time

We are looking to expand our network of.AI models struggle with, as well as.Visit our website to learn more : .At Aligned, we partner with the world's leading AI labs to push the frontier of AI knowl...Show more

Last updated: 4 days ago • Promoted

AI Governance Lead — Ethical, Regulated Innovation

Alberta Blue Cross • Edmonton

Full-time

A healthcare coverage provider in Canada is seeking a Manager, AI Governance to lead the development of AI governance practices. This strategic role ensures responsible AI use and compliance with re...Show more

Last updated: 7 days ago • Promoted

Machine Learning Engineer

Just Eat Takeaway.com • Edmonton

Full-time

We’re a leading global online food delivery platform, and our vision is to empower everyday convenience.Whether it’s a Friday-night feast, a post-gym poke bowl, or grabbing some groceries, our tech...Show more

Last updated: 16 days ago • Promoted

Learning Design Specialist (2986)

NAIT (Northern Alberta Institute of Technology) • Edmonton

Full-time +1

Learning Design Specialist (2986) – NAIT.Temporary position ending on or before March 31, 2027, with the possibility of extension. Under the direction of the Manager of Learning Experience Design, y...Show more

Last updated: 7 days ago • Promoted