Machine Learning Engineer, Reinforcement Learning & Reward Modeling

Wayve • Vancouver, BC, CA

Applications are no longer being accepted

Posted more than 30 days ago

Contract type

Full-time

Temporary

Job description

At Wayve we're committed to creating a diverse, fair and respectful culture that is inclusive of everyone based on their unique skills and perspectives, and regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, veteran status, pregnancy or related condition (including breastfeeding) or any other basis as protected by applicable law.

About Us

Founded in 2017, Wayve is the leading developer of Embodied AI technology. Our advanced AI software and foundation models enable vehicles to perceive, understand, and navigate any complex environment, enhancing the usability and safety of automated driving systems.

Our vision is to create autonomy that propels the world forward. Our intelligent, mapless, and hardware-agnostic AI products are designed for automakers, accelerating the transition from assisted to automated driving.

In our fast-paced environment, big problems ignite us: we embrace uncertainty, leaning into complex challenges to unlock groundbreaking solutions. We aim high and stay humble in our pursuit of excellence, constantly learning and evolving as we pave the way for a smarter, safer future.

At Wayve, your contributions matter. We value diversity, embrace new perspectives, and foster an inclusive work environment; we back each other to deliver impact.

Make Wayve the experience that defines your career!

The role

We're looking for an experienced Applied Scientist with expertise in Reinforcement Learning and Reward Modelling to advance our training and evaluation frameworks, contributing significantly to the creation of safe and reliable AI driving technology. The ideal candidate has a deep understanding of reinforcement learning, machine learning, and behavioural modelling, combined with a drive to innovate in the autonomous driving space.

In this role, you will be at the forefront of designing and optimizing reward and reinforcement learning models that are powerful and resource-efficient, tailored to the unique demands of embodied AI and autonomous systems. Your work will include, but is not limited to:

Design, develop, and refine reward models that align with safe and efficient driving objectives for autonomous vehicles.
Work closely with multidisciplinary teams to integrate reward models with real-world data and simulation frameworks.
Define a data strategy that includes efficient use of real and synthetic data, annotations, and active learning.
Design experiments to evaluate reward structures in diverse driving scenarios and identify areas for improvement.
Collaborate with world-class researchers and engineers to push the boundaries of AI, contributing significantly to the evolution of autonomous driving technology.

What you’ll bring to Wayve

In order to set you up for success as an Applied Scientist at Wayve, we’re looking for the following skills and experience.

Must Haves

Proven expertise in reinforcement learning, including areas such as offline RL, reward modelling, RLHF, DPO, and GRPO, as well as experience with machine learning.

Strong programming skills in Python and experience with machine learning libraries such as PyTorch.

Experience in working with simulation environments and real-world data for model validation and performance benchmarking.

Demonstrated ability to publish research at top-tier conferences and to present findings to both technical and non-technical audiences.

Excellent problem-solving skills and the ability to work independently as well as in a team environment.

Demonstrated ability to work collaboratively in a fast-paced, innovative, interdisciplinary team environment.

Desirable

Track record of publications at top-tier conferences such as NeurIPS, CVPR, ICRA, ICLR, and CoRL.

Familiarity with self-driving technologies, sensor data processing, and real-time decision-making algorithms.

Experience with large-scale machine learning systems, distributed training and deploying models in production environments.

What we offer you

Attractive compensation with salary and equity

Immersion in a team of world-class researchers, engineers and entrepreneurs

A unique position to shape the future of autonomy and tackle the biggest challenge of our time

Bespoke learning and development opportunities

Relocation support with visa sponsorship

Flexible working hours: we trust you to do your job well, at times that suit you and your team

Benefits such as an onsite chef, workplace nursery scheme, private health insurance, therapy, daily yoga, onsite bar, large social budgets, unlimited L&D requests, enhanced parental leave, and more!

This is a full-time role based in our office in Vancouver. At Wayve we want the best of all worlds, so we operate a hybrid working policy that combines time together in our offices and workshops, to fuel innovation, culture, relationships and learning, with time spent working from home.

We understand that everyone has a unique set of skills and experiences and that not everyone will meet all of the requirements listed above. If you’re passionate about self-driving cars and think you have what it takes to make a positive impact on the world, we encourage you to apply.

For more information visit Careers at Wayve.

To learn more about what drives us, visit Values at Wayve.

DISCLAIMER: We will not ask about marriage or pregnancy, care responsibilities or disabilities in any of our job adverts or interviews. However, we do look to capture information about care responsibilities and disabilities, among other diversity information, as part of an optional DEI Monitoring form. This helps us identify areas of improvement in our hiring process and ensure that the process is inclusive and non-discriminatory.

Seniority level

Mid-Senior level

Employment type

Full-time

Job function

Research, Analyst, and Information Technology

Industries

Software Development
