Researcher, Agentic RL for LLMs (Contract)Huawei Canada • Markham, York Region, CA

Researcher, Agentic RL for LLMs (Contract)

Huawei Canada • Markham, York Region, CA

29 days ago

Job type

Full-time

Job description

A prominent technology company in York Region, Canada is seeking a Researcher for Reinforcement Learning. This role focuses on enhancing Large Language Models (LLMs) through innovative training techniques and evaluation methods. The ideal candidate will hold a PhD in Computer Science, with strong deep learning and reinforcement learning skills, along with proficiency in Python and experience with tools like PyTorch. This entry-level position offers a contract duration of 12 months.

#J-18808-Ljbffr

Create a job alert for this search

Researcher Agentic RL for LLMs Contract • Markham, York Region, CA

Similar jobs

ML Platform Engineer, Intelligence Accelerator

Okta, Inc. • Toronto C6A, ON, Canada

Remote

Full-time

A leading identity management company in Toronto seeks an AI / ML Engineer II to join the Intelligence Accelerator team.You'll be responsible for building scalable machine learning infrastructure, co...Show more

Last updated: 5 hours ago • Promoted • New!

GenAI / AML Solutions Architect

Vision Talent Co. • Toronto C6A, ON, Canada

Remote

Full-time +1

This range is provided by Vision Talent Co.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Vision Talent’s client is seeking a highly experience...Show more

Last updated: 12 days ago • Promoted

Construction Associate / Counsel Role - newmarket

ZSA Canada • newmarket, on, ca

Full-time

Remote (anywhere in Canada)| 2+ years.Are you looking to be part of an all-star group at a boutique firm that offers ABOVE Bay Street compensation? Our client is looking for a Construction Litigati...Show more

Last updated: 1 day ago • Promoted

Remote ML Engineer — Data Insights & Production Models

Dew Software • Toronto C6A, ON, Canada

Remote

Full-time +1

A technology company is looking for a Machine Learning Engineer to join their team in Toronto, Canada.The ideal candidate will analyze datasets to provide insights and assist in developing machine ...Show more

Last updated: 2 days ago • Promoted

Machine Learning Engineer (Contract) Hybrid, Toronto GTA

Architech Solutions Consulting Services Inc. • Toronto C6A, ON, Canada

Remote

Full-time

Join Us in Building the Future.At Architech, we don’t just ship software.We partner with North America’s leading brands to modernize legacy platforms, embed AI into real operations, and launch digi...Show more

Last updated: 24 days ago • Promoted

Remote IB / PE Finance Expert for AI Modeling

TrainMind • Toronto C6A, ON, Canada

Remote

Part-time

A financial consulting firm is seeking an Investment Banking / Private Equity Expert for a remote contract role focused on enhancing AI financial models. Candidates should have over 2 years of exper...Show more

Last updated: 15 days ago • Promoted

ML Ops Engineer - Hybrid Role, Growth & Impact

Wawanesa • Toronto C6A, ON, Canada

Remote

Full-time

A leading mutual insurance company in Toronto, Ontario, is seeking a Machine Learning Operations Engineer.This role involves developing, deploying, and monitoring machine learning models, collabora...Show more

Last updated: 18 days ago • Promoted

Licensed Real Estate Agent

Realtris Inc • Markham, ON, CA

Full-time

Quick Apply

Become a Part of Realtris Today.Welcome to Realtris, a pioneering tech-focused real estate company that is transforming the property landscape in Canada. We specialize in contemporary and innovative...Show more

Last updated: 30+ days ago

Founding CEO : AI RegTech for RIA Compliance

FutureSight • Toronto, Ontario, Canada

Full-time

A venture studio is seeking an experienced Founding CEO to lead an AI Compliance startup serving SEC and FINRA firms.You will shape the vision, raise capital, build your team, and manage initial pr...Show more

Last updated: 23 days ago • Promoted

Trigonometry Private Tutoring Jobs Georgina

Superprof • Georgina, Canada

Full-time +1

Superprof is Canada's #1 tutoring platform, and we're actively recruiting passionate tutors! Whether you're a student, a professional, or simply someone who loves teaching, join the largest communi...Show more

Last updated: 30+ days ago • Promoted

Founder - Loud Solutions

Loud Solutions • newmarket, on, ca

Full-time

Loud has partnered with a well-capitalized, highly active VC deploying capital into AI-driven businesses across large, legacy industries. What’s missing is the right person to steer the ship.We are ...Show more

Last updated: 1 day ago • Promoted

[T4] Program Coordinator – French Language Compliance - newmarket

AMISEQ • newmarket, on, ca

Full-time

This role will focus on coordinating compliance activities, supporting audits, and ensuring alignment with Quebec French language regulations while working closely with internal teams and external ...Show more

Last updated: 12 hours ago • Promoted • New!

Recruitment Resourcer - Engineering

Trindent Consulting • Toronto, Ontario, Canada

Full-time

Sourcer & Recruiter, Engineering, Oil & Gas.Join Trindent Consulting, a global management consulting firm specializing in technical augmentation in the energy sector, specifically downstream Oil & ...Show more

Last updated: 1 day ago • Promoted

Cybersecurity Consultant – Azure & AI Governance ((French Bilingual) - Concentrix

Concentrix • newmarket, on, ca

Full-time

Cybersecurity Consultant – Azure & AI Governance.Microsoft ecosystem to advise enterprise customers and lead strategic AI security initiatives. Lead customer workshops to assess AI readiness, focusi...Show more

Last updated: 1 day ago • Promoted

ML Engineer - Recommendations & Personalization

Lyft • Toronto, ON, Canada

Full-time

A ride-sharing platform based in Toronto is seeking a Machine Learning Engineer to develop and launch algorithms that drive their core services. The ideal candidate should possess at least 3 years o...Show more

Last updated: 4 days ago • Promoted

GenAI & AI-Detection ML Engineer — Impact & Equity

Motion Recruitment Partners LLC • Toronto C6A, ON, Canada

Full-time

A cutting-edge technology firm in Toronto is seeking a skilled software engineer to develop next-generation AI tools.The ideal candidate will have over 3 years of experience in Python, familiarity ...Show more

Last updated: 5 days ago • Promoted

Principal ML Engineer – Personalization & Recommendations

Tubi, Inc. • Toronto C6A, ON, Canada

Remote

Full-time

A leading streaming service provider is searching for a Principal Machine Learning Engineer for their Toronto office.This hybrid role involves leading the design and implementation of advanced reco...Show more

Last updated: 17 days ago • Promoted

ML Engineer (NLP / LLMs) – AI R&D for JusticeTech

EvenUp • Toronto, ON, Canada

Full-time

A fast-growing vertical SaaS company in Toronto is seeking a Data Scientist / Machine Learning Engineer to join their AI R&D team. You will develop and deploy models for their claims-intelligence plat...Show more

Last updated: 5 days ago • Promoted