Senior Lead Research Scientist, Agentic AI

Upwork • Toronto, Ontario, Canada
30+ days ago
Job type
  • Full-time
Job description

Upwork Inc.’s (Nasdaq: UPWK) family of companies connects businesses with global, AI-enabled talent across every contingent work type including freelance, fractional, and payrolled. This portfolio includes the Upwork Marketplace, which connects businesses with on-demand access to highly skilled talent across the globe, and Lifted, which provides a purpose-built solution for enterprise organizations to source, contract, manage, and pay talent across the full spectrum of contingent work. From Fortune 100 enterprises to entrepreneurs, businesses rely on Upwork Inc. to find and hire expert talent, leverage AI-powered work solutions, and drive business transformation. With access to professionals spanning more than 10,000 skills across AI & machine learning, software development, sales & marketing, customer support, finance & accounting, and more, the Upwork family of companies enables businesses of all sizes to scale, innovate, and transform their workforces for the age of AI and beyond.

Since its founding, Upwork Inc. has facilitated more than $30 billion in total transactions and services as it fulfills its purpose to create opportunity in every era of work. Learn more about the Upwork Marketplace at

We’re seeking a Senior Lead Research Scientist (Agentic AI) to push the frontier of autonomous, tool‑using AI and ensure that innovations make it into production. You’ll split your time between novel research (benchmarks, learning algorithms, publications, and thought leadership) and building the tools, datasets, and systems required to run rigorous experiments and ship results into our agentic platform. You will partner closely with ML engineers, product, platform, and safety teams to translate research into reliable, scalable capabilities for customers and developers on Upwork.

Responsibilities

  • 50/50 split between research and engineering/productionization.
  • Advance agentic benchmarking. Define and maintain a rigorous evaluation suite for agents (task success, reliability, recovery, safety, latency, and cost). Establish protocols, datasets, and reproducible metrics aligned to best practices in agentic evaluation; continuously harden benchmarks against loopholes and overfitting.
  • Invent and publish. Lead novel studies on agent planning, tool use, reflection/memory, safety, and multi‑agent coordination. Publish at top venues (e.g., NeurIPS/ICML/ICLR/ACL) and present learnings internally and externally.
  • Explore RLEF for agents. Develop Reinforcement Learning from Execution Feedback (RLEF) approaches that ground agent behavior in environment/run‑time signals (e.g., execution traces, tool results, test outcomes), comparing to RLHF/RLAIF on agent tasks.
  • Continuous/online learning. Design safe, measurable loops for continual improvement (data selection, drift detection, reward model updates, policy refresh), with guardrails that protect quality and cost.
  • Human‑in‑the‑loop systems. Partner on data strategy, labeling protocols, and reviewer tooling for RLHF and workflow‑level judgment; instrument quality controls and reviewer calibration.
  • Build research tooling. Stand up agents‑at‑scale experiment infrastructure: simulators, sandboxes, and orchestration for long‑horizon tasks; evaluation harnesses; offline/online A/B; and dashboards for longitudinal tracking.
  • Train & align models. Implement high‑quality pipelines for SFT, DPO, RLHF/RLAIF/RLEF; manage data provenance, safety filters, and automated red‑teaming; integrate eval signals into CI/CD.
  • Ship to production. Collaborate with platform teams to graduate prototypes into reliable services (APIs/SDKs, auth, observability, rate limiting) and to integrate agents with developer protocols (e.g., MCP) and runtime services.

What it takes to catch our eye

  • PhD or equivalent research track record with peer‑reviewed publications in relevant venues; strong empirical methodology and scientific writing/presentation skills.
  • Demonstrated contributions to agentic evaluation/benchmarks or long‑horizon reasoning (e.g., designing tasks, metrics, robust protocols).
  • Hands‑on experience adapting LLMs for tool use and multi‑step plans; fluent in prompting, function/tool calling, and memory/critique patterns.
  • Practical mastery of alignment methods (SFT, DPO, RLHF, RLAIF, and RLEF) and reward‑modeling; you know when to prefer each and how to evaluate them.
  • Proficiency in Python and one or more of PyTorch/JAX; experience with distributed training (e.g., DDP/Ray), dataset curation, experiment tracking, and reproducibility.
  • Ability to build research‑grade tools that evolve into production‑grade services (APIs/SDKs, data stores, streaming/messaging, tracing/metrics).
  • Comfortable building end‑to‑end eval pipelines (offline + online), defining pass/fail gates, and quantifying trade‑offs (quality, safety, latency, cost).
  • Experience with safety testing and red‑teaming for agents; familiarity with risk taxonomies for autonomous systems.
  • Proven success mentoring senior ICs, leading cross‑functional initiatives, and educating internal/external audiences (talks, tutorials, blog posts, open‑source).

Come change how the world works.

Upwork is establishing its first international operational hub in Lisbon, Portugal. The new office is expected to be fully operational by Q4 2026.

This position will initially be employed through a partner to ensure a seamless hiring process while we establish the hub. Once the hub is established, there may be opportunities to transition to employment with Upwork depending on business needs and other requirements. While employed by the partner, you’ll work as part of Upwork’s team, with access to our resources, culture, and growth opportunities.

Our partner will offer competitive benefits. When Upwork’s hub is established, we will be excited to offer employment and benefits directly as business needs require.

Upwork is committed to building a diverse, inclusive, and equitable workforce. Employment decisions are made without regard to race, color, religion, gender, sexual orientation, gender identity, national origin, disability, or any other status protected by applicable law.

