Talent.com
Reinforcement Learning Engineer (Full-Time) - Humanoid Robot

AXIBO INC • Cambridge, ON, Canada
Posted more than 30 days ago
Job type
  • Full-time
Job Description

About AXIBO

AXIBO is a robotics company pioneering the design, prototyping, and manufacturing of advanced robotic systems, all under one roof. We build everything in-house and take pride in delivering robust, reliable products that power automation across industries. Our fast-paced environment demands high levels of precision, organization, and execution, not just in engineering but across all functions.

Position Overview

As a Reinforcement Learning Engineer, you will develop and deploy machine learning systems that enable intelligent behaviors in our humanoid and legged robots. You'll work at the intersection of control theory, deep learning, and robotics, helping close the loop between simulation and reality to bring adaptive behaviors into real-world machines.

Key Responsibilities

Develop reinforcement learning agents for robotic control tasks such as locomotion, manipulation, and dynamic balance

Implement learning architectures using policy gradient methods, actor-critic frameworks, and off-policy algorithms (e.g., PPO, SAC, TD3)

Build reward functions, curriculum learning strategies, and simulation environments tailored for real-world transfer

Design multi-agent training pipelines, including distributed rollouts, experience replay, and adaptive difficulty scaling

Interface with Isaac Gym, MuJoCo, Brax, and custom physics simulators to run large-scale experiments

Work with hardware and firmware teams to deploy trained policies to embedded or real-time environments

Design diagnostic tools and visualization dashboards to monitor training progress and system behavior

Apply domain randomization, sim2real techniques, and sensor noise modeling to enhance policy robustness

Maintain code quality through version control, testing, and modular design

Stay current with academic literature and integrate novel RL methods as appropriate

Required Skills and Qualifications

Bachelor's or Master's degree in Computer Science, Engineering, Robotics, or a related field

2+ years of hands-on experience applying deep reinforcement learning to simulation or robotic control tasks

Strong grasp of machine learning fundamentals and control theory

Proficiency with PyTorch, JAX, or TensorFlow

Programming experience in Python and C++

Deep understanding of policy optimization, generalization, and environment design

Experience working in Linux development environments and with GPU-based training pipelines

Excellent debugging skills across ML, software, and hardware stacks

Ability to independently manage experiments and rapidly iterate on model architectures

Preferred Experience (Bonus)

Deployment of RL systems to real-world robots, especially legged or humanoid platforms

Contributions to open-source RL frameworks or robotics middleware (e.g., ROS, Isaac ROS)

Experience with imitation learning, behavior cloning, or inverse reinforcement learning

Prior research or publications in reinforcement learning, multi-agent systems, or robotic control

Familiarity with low-level robot interfaces, sensor fusion, or control loop tuning

Knowledge of real-time systems, embedded software, or custom actuator control

Job Details

Location: Cambridge, Ontario

Work Environment: In-person (on-site at our Waterloo facility)

Type: Full-time

Compensation: Competitive salary (based on experience)

Health Insurance: Provided

Growth: Regular performance evaluations with potential for salary increases and stock option participation
