Talent.com
Tubi
Software Engineer, ML Infra & Distributed Systems (Staff& Principal)Tubi • Toronto, Canada
No longer accepting applications
Software Engineer, ML Infra & Distributed Systems (Staff& Principal)

Software Engineer, ML Infra & Distributed Systems (Staff& Principal)

Tubi • Toronto, Canada
13 days ago
Job type
  • Full-time
Job description
About The Role As a Staff Software Engineer on the ML Infrastructure team, you will collaborate closely with the Machine Learning and Product teams to build world‑class machine learning inference platforms. These platforms power essential services like personalized recommendations, search, and content understanding across Tubi.

A core responsibility of this team is developing and maintaining low‑latency ML model serving systems that support Deep Learning, LLM, and Search models. This involves building self‑service infrastructure and critical components such as the inference engine, feature store, vector store, and experimentation engine.

You will improve the way we deploy and operate our services and even contribute to open‑source projects. This role grants the architectural freedom to explore new frameworks, lead critical cross‑functional projects, and transform the capabilities of our ML and Product teams.

Responsibilities

Design and build scalable, high‑throughput, and low‑latency distributed systems using Scala

Build reusable components and services that serve various ML applications like Personalization, Search, Ads, and Exploration

Partner closely with ML engineers to understand their challenges and limitations and develop scalable solutions to address them. Proactively recommend solutions to keep our ML Inference stack state of the art

Take a data‑driven approach to identifying & optimizing latency, cost, and efficiency of our infra. Lead large scale cross‑functional refactorings if necessary

Mentor other engineers on the team on system design, effective incident management, interviewing, leveraging LLMs for work, etc.

Collaborate with ML, Product, and cross‑functional engineering teams to define the long‑term vision and architecture for ML Infrastructure at Tubi

Your Background

8+ years of experience designing and building scalable, distributed systems in any modern backend language (e.g., Scala, Java, Python, Go, C++); experience with Scala or JVM‑based language is a plus

Strong experience with AWS or an equivalent cloud platform

Experience building online microservices at scale with low‑latency serving

Experience with both SQL (e.g. Postgres) and NoSQL databases (e.g. Cassandra), message brokers (e.g. Kafka), and caches (e.g. Redis)

Experience with containerization technologies, such as Docker or Kubernetes

Led the response and resolution efforts for multiple major, large‑scale incidents

Bonus

Familiarity with the machine learning infrastructure like inference engines (e.g. torschserve, triton, vLLM), vector stores (e.g. LanceDB, FAISS), feature stores (e.g. Feast), ElastiCache, model training orchestration, etc.

Understanding of ML model training pipelines and model internals. Experience with Recommender Systems, Search, Autocomplete and Ads ML is a plus

Previous experience with Akka, Erlang, Elixir or Go

Proficient in data‑driven analysis of complex A/B testing results

About Tubi Tubi is a free streaming service that entertains over 100 million monthly active users. Tubi offers the world’s largest collection of Hollywood movies and TV shows, thousands of creator‑led stories and hundreds of Tubi Originals made for the most passionate fans. Headquartered in San Francisco and founded in 2014, Tubi is part of Tubi Media Group, a division of Fox Corporation.

Pursuant to local pay disclosure requirements, the pay range for this role, with final offer amount dependent on education, skills, experience, and location is as listed annually below.

Toronto, Canada

$164,600 - $235,100 CAD

This role is also eligible for an annual discretionary bonus, long‑term incentive plan, and various benefits including medical/dental/vision, insurance, vacation/paid time off and other benefits in accordance with applicable plan documents.

For all salaried employees, in lieu of the FOX Vacation policy, Tubi offers a Flexible Time Off Policy to manage all personal matters.

For all full‑time, regular employees, in lieu of FOX Paid Parental Leave, Tubi offers a generous Parental Leave Program, which allows parents twelve (12) weeks of paid bonding leave (top up in Canada) within the first year of birth, adoption, surrogacy, or foster placement of a child in addition to applicable government leave program(s) and FOX’s short‑term disability policy (if applicable). This time is 100% paid through a combination of any applicable government leaves and wage‑replacement programs in addition to contributions made by Tubi.

For all full‑time, regular employees, Tubi offers a monthly wellness reimbursement.

We are an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, gender identity, disability, protected veteran status, or any other characteristic protected by law. We will consider for employment qualified applicants with criminal histories consistent with applicable law.

#J-18808-Ljbffr
Create a job alert for this search

Software Engineer, ML Infra & Distributed Systems (Staff& Principal) • Toronto, Canada

Similar jobs

Lead Principal Software Engineer

Auxo | Growth Partnermarkham, on, ca
Full-time

SaaS products and platforms at scale.You’ll work closely with the Head of Product & Technology to shape technical direction across multiple teams.This is a high-impact IC role where you’ll influenc... Show more

 • Promoted

Staff Software Engineer, ML Platform

AfreshToronto, ON, CA
Full-time

Remote in Ontario, Canada or within U.Afresh is on a mission to eliminate food waste and make fresh food accessible to all.With our Fresh Operating System, regional and national grocery retailers h... Show more

 • Promoted

Staff ML Platform Engineer – Scale & Impact

Afresh Technologies, Inc.Toronto, ON, CA
Full-time

A leading AI firm in fresh food is looking for an ML Platform Engineer to enhance the core machine learning platform.You will work on critical infrastructure, ensuring performance and scalability o... Show more

 • Promoted

Senior Machine Engineer, ML Systems and Infrastructure

Autodesk, Inc.Toronto, ON, CA
Full-time

POSITION OVERVIEW**The work we do at Autodesk touches nearly every person on the planet.By creating software tools for making buildings, machines, and even the latest movies, we influence and empow... Show more

 • Promoted

Software Engineer, ML Infra & Distributed Systems (Staff & Principal)

TubiToronto, Ontario, Canada
Full-time

About The Role As a Staff Software Engineer on the ML Infrastructure team, you will collaborate closely with the Machine Learning and Product teams to build world‑class machine learning inference p... Show more

 • Promoted

Software Engineer & PRM Systems Lead

AmazonToronto, ON, CA
Full-time

A leading global technology company seeks a Software Development Engineer in Toronto to develop solutions for automating operational compliance.This role impacts workforce management and entails wo... Show more

 • Promoted

Software Engineer, Ml Infra & Distributed Systems (Staff & Principal)

TubiToronto, Canada
Full-time

About The Role As a Staff Software Engineer on the ML Infrastructure team, you will collaborate closely with the Machine Learning and Product teams to build world‑class machine learning inference p... Show more

 • Promoted

Senior Engineer - ASIC Infrastructure and ML Integration

NVIDIA AIToronto, ON, CA
Full-time

Join our team as a Senior ASIC Infrastructure Engineer, focusing on AI and ML integration.You will play a key role in enhancing chip design and debug processes with your technical expertise.This fu... Show more

 • Promoted

Software Engineer – AI/ML, Digital Pathology Image Management System (IMS)

University Health NetworkToronto, ON, CA
Full-time

New or Replacement Position: New.Site: Toronto General Hospital.Department: Lab Medicine Program.Reports to: Principal Investigator.Salary Range: $74,114 to $111,170 per annum.We are seeking a high... Show more

 • Promoted

Staff Software / Platform Engineer, Infrastructure- Privileged Access Management

TechBrainsToronto, ON, CA
Full-time

Staff Software / Platform Engineer, Infrastructure- Privileged Access Management.Okta is The World’s Identity Company.We free everyone to safely use any technology, anywhere, on any device or app.O... Show more

 • Promoted

Staff ML Infrastructure Engineer — Remote (Canada)

SamsaraToronto, ON, CA
Remote
Full-time

A leading technology firm is hiring a Staff / Senior Staff Machine Learning Infrastructure Engineer to design and operate a cutting-edge ML platform in Canada.This role involves collaboration with ... Show more

 • Promoted

Principal Software Engineer at Workday

WorkdayToronto, ON, CA
Full-time

Elevate your career as a Principal Software Engineer at Workday, where you will shape ML capabilities and improve product experiences.Collaborate in a dynamic environment focused on MLOps and cutti... Show more

 • Promoted

AI/ML Systems Design Engineer

Advanced Micro DevicesMarkham, York Region, CA
Full-time

WHAT YOU DO AT AMD CHANGES EVERYTHING.At AMD, our mission is to build great products that accelerate next‑generation computing experiences—from AI and data centers, to PCs, gaming and embedded syst... Show more

 • Promoted

MuleSoft Architect

OSF Digitalnewmarket, on, ca
Full-time

OSF Digital is a leading digital transformation firm with a global footprint in 30+ countries and over 1500+ employees.Our passion is in helping businesses leverage commerce, marketing, sales, serv... Show more

 • Promoted

Software Engineer – Advanced Systems

Draganfly Inc.toronto, on, ca
Full-time

Company”) has been a recognized technology leader within the commercial UAV space for over two decades.We helped establish the commercial market & adoption of multi-rotor helicopters for public saf... Show more

 • Promoted

Software Engineer – Advanced Systems - markham

Draganfly Inc.markham, on, ca
Full-time

Company”) has been a recognized technology leader within the commercial UAV space for over two decades.We helped establish the commercial market & adoption of multi-rotor helicopters for public saf... Show more

 • Promoted

AI/ML Systems Design Engineer

AMDMarkham, York Region, CA
Full-time

At AMD, our mission is to build great products that accelerate next‑generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems.Grounded in a culture of innovatio... Show more

 • Promoted

Senior / Staff Software Engineer, ML Datasets & Data Pipelines

WaabiToronto, ON, CA
Full-time

Waabi, founded by AI visionary Raquel Urtasun, is the leader in Physical AI.With a world‑class team, we’re unlocking the next era of autonomous transportation with technology that’s powering commer... Show more

 • Promoted

Senior GenAI & LLM Systems Engineer

GuidepointToronto, ON, CA
Full-time

A leading research enablement platform is seeking an experienced Data/AI Engineer for its Toronto office.This hybrid role involves building scalable AI systems and applications, optimizing Generati... Show more

 • Promoted

Lead ML Engineer

HaysGreater Toronto Area, Canada, Canada
Full-time

You’ll be joining a leading Canadian digital organization building advanced eCommerce experiences across grocery, beauty, pharmacy, loyalty, and apparel.This team handles millions of daily customer... Show more