Talent.com
Reinforcement Learning Engineer (Full-Time) - Humanoid Robot
Reinforcement Learning Engineer (Full-Time) - Humanoid RobotAXIBO INC • CAMBRIDGE, Ontario, Canada
Reinforcement Learning Engineer (Full-Time) - Humanoid Robot

Reinforcement Learning Engineer (Full-Time) - Humanoid Robot

AXIBO INC • CAMBRIDGE, Ontario, Canada
30+ days ago
Job type
  • Full-time
Job description

About AXIBO

AXIBO is a robotics company pioneering the design, prototyping, and manufacturing of advanced robotic systems—all under one roof. We build everything in-house and take pride in delivering robust, reliable products that power automation across industries. Our fast-paced environment demands high levels of precision, organization, and execution —not just in engineering, but across all functions.

Position Overview

As a Reinforcement Learning Engineer , you will develop and deploy machine learning systems that enable intelligent behaviors in our humanoid and legged robots. You'll work at the intersection of control theory, deep learning, and robotics—helping close the loop between simulation and reality to bring adaptive behaviors into real-world machines.

Key Responsibilities

Develop reinforcement learning agents for robotic control tasks such as locomotion, manipulation, and dynamic balance

Implement learning architectures using policy gradient methods, actor-critic frameworks, and off-policy algorithms (e.g., PPO, SAC, TD3)

Build reward functions , curriculum learning strategies, and simulation environments tailored for real-world transfer

Design multi-agent training pipelines , including distributed rollouts, experience replay, and adaptive difficulty scaling

Interface with Isaac Gym, Mujoco, Brax, and custom physics simulators to run large-scale experiments

Work with hardware and firmware teams to deploy trained policies to embedded or real-time environments

Design diagnostic tools and visualization dashboards to monitor training progress and system behavior

Apply domain randomization, sim2real techniques , and sensor noise modeling to enhance policy robustness

Maintain code quality through version control, testing, and modular design

Stay current with academic literature and integrate novel RL methods as appropriate

Required Skills and Qualifications

Bachelor's or Master's degree in Computer Science, Engineering, Robotics, or a related field

2+ years of hands-on experience applying deep reinforcement learning to simulation or robotic control tasks

Strong grasp of machine learning fundamentals and control theory

Proficiency with PyTorch , JAX , or TensorFlow

Programming experience in Python and C++

Deep understanding of policy optimization , generalization, and environment design

Experience working in Linux development environments and with GPU-based training pipelines

Excellent debugging skills across ML, software, and hardware stacks

Ability to independently manage experiments and rapidly iterate on model architectures

Preferred Experience (Bonus)

Deployment of RL systems to real-world robots , especially legged or humanoid platforms

Contributions to open-source RL frameworks or robotics middleware (e.g., ROS, Isaac ROS)

Experience with imitation learning , behavior cloning , or inverse reinforcement learning

Prior research / publications in reinforcement learning, multi-agent systems, or robotic control

Familiarity with low-level robot interfaces , sensor fusion, or control loop tuning

Knowledge of real-time systems , embedded software, or custom actuator control

Job Details

Location : Cambridge, Ontario

Work Environment : In-person (on-site at our Waterloo facility)

Type : Full-time

Compensation : Competitive salary (based on experience)

Health Insurance : Provided

Growth : Regular performance evaluations with potential for salary increases and stock option participation

Create a job alert for this search

Reinforcement Learning Engineer FullTime Humanoid Robot • CAMBRIDGE, Ontario, Canada

Similar jobs
Marie-Lauren looking for a babysitter or nanny in Brantford

Marie-Lauren looking for a babysitter or nanny in Brantford

Sitly • Brantford, CA
Full-time +1
We are a loving intergenerational family in Brantford, Ontario, seeking a dedicated nanny to care for our infant.This is a full-time position, and we would love someone who can join our family and ...Show more
Last updated: 2 hours ago • Promoted • New!
Solutions Consultant

Solutions Consultant

ExaCare AI • guelph, on, ca
Full-time
We are a trailblazing health tech company on a mission to revolutionize the nursing home & post acute space.Our innovative AI software is transforming the admissions process and care delivery in th...Show more
Last updated: 2 days ago • Promoted
Cybersecurity Consultant – Azure & AI Governance ((French Bilingual)

Cybersecurity Consultant – Azure & AI Governance ((French Bilingual)

Concentrix • guelph, on, ca
Full-time
Cybersecurity Consultant – Azure & AI Governance.Microsoft ecosystem to advise enterprise customers and lead strategic AI security initiatives. Lead customer workshops to assess AI readiness, focusi...Show more
Last updated: 23 days ago • Promoted
Saviynt SME - TechDemocracy

Saviynt SME - TechDemocracy

TechDemocracy • guelph, on, ca
Full-time
Lead design and implementation of Saviynt IGA solutions (Lifecycle, Access Requests, Certifications).Integrate Saviynt with HR, AD, Azure AD, and cloud / on-prem applications.Configure workflows, pol...Show more
Last updated: 2 days ago • Promoted
Principal Maintenance Readiness - Brunel

Principal Maintenance Readiness - Brunel

Brunel • guelph, on, ca
Temporary
Principal Maintenance Readiness.Our client is looking for someone to join their team for the role of Principal Maintenance Readiness to provide technical expertise in developing mining and processi...Show more
Last updated: 15 days ago • Promoted
Hearing Performance Engineer — Hybrid & Mentorship

Hearing Performance Engineer — Hybrid & Mentorship

Sonova Group • Kitchener
Full-time
A leading hearing care solutions company in Kitchener, Canada is seeking a Hearing Performance Developer to drive innovative projects improving hearing health. This role requires an engineering degr...Show more
Last updated: 5 days ago • Promoted
Director Design - Martyn Bassett Associates

Director Design - Martyn Bassett Associates

Martyn Bassett Associates • guelph, on, ca
Full-time
Our client is focused on improving employee financial wellness, and their platform goes beyond simple on-demand pay.Their platform combines flexible payout options with financial education, rewards...Show more
Last updated: 13 days ago • Promoted
Software Engineer

Software Engineer

NPA WorldWide • Brantford Southeast, Ontario, Canada
Full-time +1
Plan, design, develop, test, implement, maintain, and document applications to meet business requirements.Modify and maintain existing applications. Provide technical support to end users for applic...Show more
Last updated: 7 days ago • Promoted
Senior Technical Recruiter

Senior Technical Recruiter

Sage Recruiting Inc. • guelph, on, ca
Full-time
Sage Recruiting is growing, and we’re looking for a seasoned Technical Recruiter who wants a role with a forward-thinking agency focused exclusively on hiring (Perm) Product and Engineering folks f...Show more
Last updated: 10 days ago • Promoted
Control Systems Software Designer

Control Systems Software Designer

Jamesway • cambridge, on, ca
Full-time
Jamesway Chick Master Incubator Company Inc.Poultry Incubation Products and Services.Jamesway is a privately held company, headquartered in Cambridge, Ontario, Canada and operates subsidiaries in t...Show more
Last updated: 2 days ago • Promoted
Earn money testing apps - Remote

Earn money testing apps - Remote

Almedia • Cambridge, Ontario, Canada
Remote
Full-time
Get paid for testing apps, games and surveys.Almedia runs a dynamic platform where users earn money online by completing tasks, playing games, and filling out surveys. Since our launch 5 years ago, ...Show more
Last updated: 30+ days ago • Promoted
Solutions Engineer

Solutions Engineer

Meld • guelph, on, ca
Full-time
Meld is a fast growing startup looking to add developer support for customers who use our API driven platform for managing their crypto related integrations. We're focused on helping money move on c...Show more
Last updated: 2 days ago • Promoted
Epicor Kinetic Implementation Specialist - Tenth Revolution Group

Epicor Kinetic Implementation Specialist - Tenth Revolution Group

Tenth Revolution Group • guelph, on, ca
Full-time
Job Description : Epicor Kinetic Implementation Consultant.Epicor Kinetic Implementation Consultant.ERP implementations for manufacturing and distribution clients. This role requires strong expertise...Show more
Last updated: 10 days ago • Promoted
Remote R Engineer - AI Trainer

Remote R Engineer - AI Trainer

SuperAnnotate • Brantford, Ontario, CA
Remote
Full-time
As a remote, hourly paid R Engineer, you will review AI-generated responses and generate high-quality R and data-analysis-focused content, evaluating the reasoning quality and step-by-step problem-...Show more
Last updated: 30+ days ago
Project Manager - Dynamics CRM

Project Manager - Dynamics CRM

Cambay Solutions • guelph, on, ca
Full-time
About Cambay Solutions : Cambay is a Microsoft Partner IT firm delivering the Microsoft Three Cloud Strategy; Microsoft 365, Microsoft Dynamics 365, Microsoft Azure by providing Managed Delivery, In...Show more
Last updated: 14 days ago • Promoted
Mid Level Developer - Retail Platform

Mid Level Developer - Retail Platform

Hifyre • guelph, on, ca
Full-time
Mid-Level Developer - Retail Platform.Hifyre has created the cannabis industry’s most advanced retail sales platform, leveraging data to deliver personalized, effective, consumer & partner engageme...Show more
Last updated: 2 days ago • Promoted
Forward Deployed Engineer (Solution Delivery) - North America

Forward Deployed Engineer (Solution Delivery) - North America

Trackunit • Kitchener
Full-time
Forward Deployed Engineer (Solution Delivery) – North America.Trackunit is looking for a Solution Delivery Engineer who ensures IrisX, our industry cloud platform, is implemented and fully adopted,...Show more
Last updated: 22 days ago • Promoted
Licensed Millwright - $3k Sign-on Bonus

Licensed Millwright - $3k Sign-on Bonus

Cargill • Waterford, ON, CA
Full-time
Week 1 : Monday, Tuesday, Friday, Saturday.Week 2 : Sunday, Wednesday, Thursday.Must hold an Ontario or Inter-Provincial 433A Millwright Certificate. As a Maintenance Millwright at Cargill, you will b...Show more
Last updated: 30+ days ago • Promoted