Talent.com
Machine Learning Engineer, Reinforcement Learning & Reward Modeling
Machine Learning Engineer, Reinforcement Learning & Reward ModelingWayve • Vancouver, BC, CA
No longer accepting applications
Machine Learning Engineer, Reinforcement Learning & Reward Modeling

Machine Learning Engineer, Reinforcement Learning & Reward Modeling

Wayve • Vancouver, BC, CA
30+ days ago
Job type
  • Full-time
  • Temporary
Job description

Join or sign in to find your next job

Join to apply for the Applied Scientist - Reward Modeling role at Wayve

Join to apply for the Applied Scientist - Reward Modeling role at Wayve

At Wayve we're committed to creating a diverse, fair and respectful culture that is inclusive of everyone based on their unique skills and perspectives, and regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, veteran status, pregnancy or related condition (including breastfeeding) or any other basis as protected by applicable law.

About Us

Founded in 2017, Wayve is the leading developer of Embodied AI technology. Our advanced AI software and foundation models enable vehicles to perceive, understand, and navigate any complex environment, enhancing the usability and safety of automated driving systems.

Our vision is to create autonomy that propels the world forward. Our intelligent, mapless, and hardware-agnostic AI products are designed for automakers, accelerating the transition from assisted to automated driving.

In our fast-paced environment big problems ignite us—we embrace uncertainty, leaning into complex challenges to unlock groundbreaking solutions. We aim high and stay humble in our pursuit of excellence, constantly learning and evolving as we pave the way for a smarter, safer future.

At Wayve, your contributions matter. We value diversity, embrace new perspectives, and foster an inclusive work environment; we back each other to deliver impact.

Make Wayve the experience that defines your career!

The role

We're looking for an experienced Applied Scientist with expertise in Reinforcement Learning and Reward Modelling to advance our training and evaluation frameworks contributing significantly to the creation of safe and reliable AI driving technology. The ideal candidate has a deep understanding of reinforcement learning, machine learning, and behavioural modelling, combined with a drive to innovate in the autonomous driving space.

Role

In this role, you will be at the forefront of designing and optimizing reward and reinforcement learning models that are powerful and resource-efficient, tailored for the unique demands of embodied AI and autonomous systems. Your work will involve but not limited to :

  • Design, develop, and refine reward models that align with safe and efficient driving objectives for autonomous vehicles.
  • Work closely with multidisciplinary teams to integrate reward models with real-world data and simulation frameworks.
  • Define a data strategy that includes efficient use of real and synthetic data, annotations, and active learning.
  • Design experiments to evaluate reward structures in diverse driving scenarios and identify areas for improvement.
  • Collaborate with world-class researchers and engineers to push the boundaries of AI, contributing significantly to the evolution of autonomous driving technology

What you’ll bring to Wayve

In order to set you up for success as an Applied Scientist at Wayve, we’re looking for the following skills and experience.

Must Haves

  • Proven expertise in reinforcement learning, including in areas like offline RL, reward modelling, RLHF, DPO, GPRO, as well as experience with machine learning.
  • Strong programming skills in Python and experience with machine learning libraries such as PyTorch.
  • Experience in working with simulation environments and real-world data for model validation and performance benchmarking.
  • Demonstrated ability to publish research and present findings to both technical and non-technical audiences at top tier conferences.
  • Excellent problem-solving skills and the ability to work independently as well as in a team environment.
  • Demonstrated ability to work collaboratively in a fast-paced, innovative, interdisciplinary team environment.
  • Desirable

  • Track record of publications at top-tier conferences like NeurIPS, CVPR, ICRA, ICLR, CoRL etc.
  • Familiarity with self-driving technologies, sensor data processing, and real-time decision-making algorithms.
  • Experience with large-scale machine learning systems, distributed training and deploying models in production environments.
  • What we offer you

  • Attractive compensation with salary and equity
  • Immersion in a team of world-class researchers, engineers and entrepreneurs
  • A unique position to shape the future of autonomy and tackle the biggest challenge of our time
  • Bespoke learning and development opportunities
  • Relocation support with visa sponsorship
  • Flexible working hours - we trust you to do your job well, at times that suit you and your time
  • Benefits such as an onsite chef, workplace nursery scheme, private health insurance, therapy, daily yoga, onsite bar, large social budgets, unlimited L&D requests, enhanced parental leave, and more!
  • This is a full-time role based in our office in Vancouver. At Wayve we want the best of all worlds so we operate a hybrid working policy that combines time together in our offices and workshops to fuel innovation, culture, relationships and learning, and time spent working from home.

    We understand that everyone has a unique set of skills and experiences and that not everyone will meet all of the requirements listed above. If you’re passionate about self-driving cars and think you have what it takes to make a positive impact on the world, we encourage you to apply.

    For more information visit Careers at Wayve.

    To learn more about what drives us, visit Values at Wayve

    DISCLAIMER : We will not ask about marriage or pregnancy, care responsibilities or disabilities in any of our job adverts or interviews. However, we do look to capture information about care responsibilities, and disabilities among other diversity information as part of an optional DEI Monitoring form to help us identify areas of improvement in our hiring process and ensure that the process is inclusive and non-discriminatory.

    Seniority level

    Seniority level

    Mid-Senior level

    Employment type

    Employment type

    Full-time

    Job function

    Job function

    Research, Analyst, and Information Technology

    Industries

    Software Development

    Referrals increase your chances of interviewing at Wayve by 2x

    Get notified about new Applied Scientist jobs in Vancouver, British Columbia, Canada .

    White Rock, British Columbia, Canada 1 month ago

    Research Scientist - Antibody Purification (12-month Contract)

    Associate Research Scientist (Instrument Control and Acquisition)

    Associate Research Scientist (Instrument Control and Acquisition)

    Burnaby, British Columbia, Canada 6 days ago

    Burnaby, British Columbia, Canada 2 weeks ago

    Burnaby, British Columbia, Canada 1 day ago

    Research Scientist - Computational Structural Biology

    AI Research Scientist : AEC. Remote US or Canada

    AI / ML / LLM Engineer (Healthcare & Edge AI)

    White Rock, British Columbia, Canada 3 weeks ago

    AI Research Scientist – Structured & Spatial Modeling

    Data Scientist, Experimentation & Incremental Measurement

    Burnaby, British Columbia, Canada 6 months ago

    Senior Applied Scientist (Remote - Canada)

    White Rock, British Columbia, Canada 1 month ago

    Richmond, British Columbia, Canada CA$60,000.00-CA$90,000.00 5 months ago

    We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

    #J-18808-Ljbffr

    Create a job alert for this search

    Machine Learning Engineer Reinforcement Learning Reward Modeling • Vancouver, BC, CA

    Similar jobs
    Senior Machine Learning Engineer

    Senior Machine Learning Engineer

    Starboard Recruitment • Vancouver, BC, Canada
    Full-time
    Follow Starboard Recruitment on LinkedIn for ongoing job opportunities, market updates and advice : .Opportunity is with one of Canada's fastest growing, well-funded, Series-B tech startups in th...Show more
    Last updated: 30+ days ago • Promoted
    Machine Learning Scientist

    Machine Learning Scientist

    Equest • Vancouver, British Columbia, Canada
    Full-time
    DarkVision is seeking a Machine Learning Scientist to join our Imaging & AI team.You will research, design, and prototype the deep learning architectures that power our automated analysis tools.You...Show more
    Last updated: 3 days ago • Promoted
    Staff AI / ML Product Manager

    Staff AI / ML Product Manager

    Quandri • Vancouver
    Full-time
    At Quandri, our mission is to unlock the world’s insurance data so brokerages and agencies can deliver the best possible service to their clients. Our Renewal Intelligence Platform is designed to he...Show more
    Last updated: 6 days ago • Promoted
    Founding SRE Engineer - Scale & Reliability Leader

    Founding SRE Engineer - Scale & Reliability Leader

    OpusClip • Burnaby
    Full-time
    A leading AI video platform in Burnaby seeks a Founding Site Reliability Engineer (SRE) to enhance platform stability and scalability. You will architect isolated processing environments, drive impr...Show more
    Last updated: 6 days ago • Promoted
    Research Associate

    Research Associate

    Impact Recruitment • Greater Vancouver Metropolitan Area, Canada
    Full-time
    Impact Recruitment has the pleasure of working once again with this national, full-service law firm located in downtown Vancouver. With a strong history of representing clients in innumerable comple...Show more
    Last updated: 2 hours ago • Promoted • New!
    Subsurface Modeling Specialist — Energy Projects & Training

    Subsurface Modeling Specialist — Energy Projects & Training

    Seequent • Vancouver
    Full-time
    A leading geoscience technology company is looking for a Project Geologist in Vancouver.This full-time role involves solving complex geoscience challenges, providing technical training on software ...Show more
    Last updated: 6 days ago • Promoted
    Engineering Technologist

    Engineering Technologist

    Advanced Cyclotron Systems Inc. • Greater Vancouver Metropolitan Area, Canada
    Full-time
    Why Join Advanced Cyclotron Systems, Inc.Advanced Cyclotron Systems, Inc.ACSI) is a world leader in the design and manufacturing of high output cyclotrons for the international nuclear medicine com...Show more
    Last updated: 2 hours ago • Promoted • New!
    PhD student : Applying Machine Learning to Building Systems

    PhD student : Applying Machine Learning to Building Systems

    International Society for Industrial Ecology • Vancouver
    Full-time
    PhD student : Applying Machine Learning to Building Systems.There have been growing efforts and advocations for integrating energy performance and carbon intensity metrics into building codes.One of...Show more
    Last updated: 7 hours ago • Promoted • New!
    Robotics Controls Co-op : AI-Driven Hands & Systems

    Robotics Controls Co-op : AI-Driven Hands & Systems

    Sanctuary AI • Vancouver
    Full-time
    A leading robotics firm in Metro Vancouver is seeking a skilled Controls Engineering co-op.The role involves developing advanced control systems for humanoid robots, integrating software with hardw...Show more
    Last updated: 6 days ago • Promoted
    Researcher AI Computing System

    Researcher AI Computing System

    Huawei Technologies Canada Co., Ltd. • Vancouver, BC, CA
    Temporary
    Huawei Canada has an immediate 12 month contract opening for a Researcher.The Advanced Computing and Storage Lab, currently a part of the Vancouver Research Centre, aims to explore adaptive computi...Show more
    Last updated: 30+ days ago
    Customer Solutions Manager

    Customer Solutions Manager

    Great Little Box Company • Richmond, BC, Canada
    Full-time
    We are seeking an experienced and dynamic.Folding Carton division in Richmond.In this leadership role, you will oversee a high‑performing team, ensure exceptional customer service, and contribute t...Show more
    Last updated: 2 hours ago • Promoted • New!
    CNC Programmer / Applications Specialist

    CNC Programmer / Applications Specialist

    Ebco Industries Ltd. • Richmond, BC, Canada
    Full-time
    The CNC Programmer / Applications Specialist is responsible for developing CNC programs, designing fixtures, optimizing machining operations, and supporting production teams to ensure high-quality ...Show more
    Last updated: 2 hours ago • Promoted • New!
    Senior Generative AI Developer

    Senior Generative AI Developer

    freelance.ca • Burnaby, Canada
    Full-time
    Senior Generative AI Developer .Key information / Informations clés.Type de poste : Contrat (consultant).Taux horaire : 70 $ – 90 $ / heure. Mandat senior en intelligence artificielle générative au ...Show more
    Last updated: 30+ days ago • Promoted
    Machine Learning Engineer

    Machine Learning Engineer

    CD PROJEKT RED • Vancouver
    Full-time
    To create revolutionary, story-driven RPGs which go straight to the hearts of gamers — this is our mission.Want to dive deeper into our company’s culture? Explore our social media and check out our...Show more
    Last updated: 6 days ago • Promoted
    Manager AI

    Manager AI

    TELUS • Vancouver
    Full-time
    Join our team and what we'll accomplish together.The Data Strategy & Enablement (DSE) team is on a continuous journey towards helping TELUS become a world-class leader in Artificial Intelligence pr...Show more
    Last updated: 6 days ago • Promoted
    Senior Learning Design Specialist : Elevate Training

    Senior Learning Design Specialist : Elevate Training

    Aritzia • Vancouver
    Full-time
    A luxury retail company is looking for a Specialist / Senior Specialist in Learning Design to create high-quality training solutions. This role supports learning across teams by applying effective des...Show more
    Last updated: 2 days ago • Promoted
    Senior Talent Acquisition Lead — Robotics & AI (Hybrid)

    Senior Talent Acquisition Lead — Robotics & AI (Hybrid)

    Novarc Technologies Inc. • Burnaby
    Full-time
    A robotics technology firm in Metro Vancouver is seeking a Talent Acquisition Manager to lead recruitment efforts.The role involves developing talent pipelines, enhancing the employer brand, and ma...Show more
    Last updated: 4 days ago • Promoted
    Security Engineer (ID#5228)

    Security Engineer (ID#5228)

    New Value Solutions • Richmond, BC, Canada
    Full-time
    New Value Solutions, a national IT consulting company, is seeking a Security Engineer to join a DevSecOps team focused on security in SDLC. This will involve secure design review, threat modelling, ...Show more
    Last updated: 2 hours ago • Promoted • New!