Reinforcement Learning Engineer (Full-Time) - Humanoid Robot
AXIBO INC • CAMBRIDGE, Ontario, Canada
30+ days ago
Job type
  • Full-time
Job description

About AXIBO

AXIBO is a robotics company pioneering the design, prototyping, and manufacturing of advanced robotic systems—all under one roof. We build everything in-house and take pride in delivering robust, reliable products that power automation across industries. Our fast-paced environment demands high levels of precision, organization, and execution—not just in engineering, but across all functions.

Position Overview

As a Reinforcement Learning Engineer, you will develop and deploy machine learning systems that enable intelligent behaviors in our humanoid and legged robots. You'll work at the intersection of control theory, deep learning, and robotics—helping close the loop between simulation and reality to bring adaptive behaviors into real-world machines.

Key Responsibilities

  • Develop reinforcement learning agents for robotic control tasks such as locomotion, manipulation, and dynamic balance

  • Implement learning architectures using policy gradient methods, actor-critic frameworks, and off-policy algorithms (e.g., PPO, SAC, TD3)

  • Build reward functions, curriculum learning strategies, and simulation environments tailored for real-world transfer

  • Design multi-agent training pipelines, including distributed rollouts, experience replay, and adaptive difficulty scaling

  • Interface with Isaac Gym, MuJoCo, Brax, and custom physics simulators to run large-scale experiments

  • Work with hardware and firmware teams to deploy trained policies to embedded or real-time environments

  • Design diagnostic tools and visualization dashboards to monitor training progress and system behavior

  • Apply domain randomization, sim2real techniques, and sensor noise modeling to enhance policy robustness

  • Maintain code quality through version control, testing, and modular design

  • Stay current with academic literature and integrate novel RL methods as appropriate

Required Skills and Qualifications

  • Bachelor's or Master's degree in Computer Science, Engineering, Robotics, or a related field

  • 2+ years of hands-on experience applying deep reinforcement learning to simulation or robotic control tasks

  • Strong grasp of machine learning fundamentals and control theory

  • Proficiency with PyTorch, JAX, or TensorFlow

  • Programming experience in Python and C++

  • Deep understanding of policy optimization, generalization, and environment design

  • Experience working in Linux development environments and with GPU-based training pipelines

  • Excellent debugging skills across ML, software, and hardware stacks

  • Ability to independently manage experiments and rapidly iterate on model architectures

Preferred Experience (Bonus)

  • Deployment of RL systems to real-world robots, especially legged or humanoid platforms

  • Contributions to open-source RL frameworks or robotics middleware (e.g., ROS, Isaac ROS)

  • Experience with imitation learning, behavior cloning, or inverse reinforcement learning

  • Prior research/publications in reinforcement learning, multi-agent systems, or robotic control

  • Familiarity with low-level robot interfaces, sensor fusion, or control loop tuning

  • Knowledge of real-time systems, embedded software, or custom actuator control

Job Details

  • Location: Cambridge, Ontario

  • Work Environment: In-person (on-site at our Waterloo facility)

  • Type: Full-time

  • Compensation: Competitive salary (based on experience)

  • Health Insurance: Provided

  • Growth: Regular performance evaluations with potential for salary increases and stock option participation
