Talent.com
NVIDIA
Senior Software Engineer, AI Inference SystemsNVIDIA • Winnipeg, Canada
No longer accepting applications
Senior Software Engineer, AI Inference Systems

Senior Software Engineer, AI Inference Systems

NVIDIA • Winnipeg, Canada
10 days ago
Job type
  • Full-time
Job description
We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. You’ll architect and implement high-performance inference stacks, optimize GPU kernels and compilers, drive industry benchmarks, and scale workloads across multi-GPU, multi-node, and multi-cloud environments. You’ll collaborate across inference, compiler, scheduling, and performance teams to push the frontier of accelerated computing for AI.

What You’ll Be Doing

Contribute features to vLLM that empower the newest models with the latest NVIDIA GPU hardware features; profile and optimize the inference framework (vLLM) with methods like speculative decoding, data/tensor/expert/pipeline-parallelism, prefill-decode disaggregation.

Develop, optimize, and benchmark GPU kernels (hand-tuned and compiler-generated) using techniques such as fusion, autotuning, and memory/layout optimization; build and extend high-level DSLs and compiler infrastructure to boost kernel developer productivity while approaching peak hardware utilization.

Define and build inference benchmarking methodologies and tools; contribute both new benchmark and NVIDIA’s submissions to the industry-leading MLPerf Inference benchmarking suite.

Architect the scheduling and orchestration of containerized large-scale inference deployments on GPU clusters across clouds.

Conduct and publish original research that pushes the pareto frontier for the field of ML Systems; survey recent publications and find a way to integrate research ideas and prototypes into NVIDIA’s software products.

What We Need To See

Bachelor’s degree (or equivalent experience) in Computer Science (CS), Computer Engineering (CE) or Software Engineering (SE) with 7+ years of experience; alternatively, Master’s degree in CS/CE/SE with 5+ years of experience; or PhD degree with the thesis and top-tier publications in ML Systems, GPU architecture, or high-performance computing.

Strong programming skills in Python and C/C++; experience with Go or Rust is a plus; solid CS fundamentals: algorithms & data structures, operating systems, computer architecture, parallel programming, distributed systems, deep learning theories.

Knowledgeable and passionate about performance engineering in ML frameworks (e.g., PyTorch) and inference engines (e.g., vLLM and SGLang).

Familiarity with GPU programming and performance: CUDA, memory hierarchy, streams, NCCL; proficiency with profiling/debug tools (e.g., Nsight Systems/Compute).

Experience with containers and orchestration (Docker, Kubernetes, Slurm); familiarity with Linux namespaces and cgroups.

Excellent debugging, problem-solving, and communication skills; ability to excel in a fast-paced, multi-functional setting.

Ways to Stand Out from the Crowd

Experience building and optimizing LLM inference engines (e.g., vLLM, SGLang).

Hands‑on work with ML compilers and DSLs (e.g., Triton, TorchDynamo/Inductor, MLIR/LLVM, XLA), GPU libraries (e.g., CUTLASS) and features (e.g., CUDA Graph, Tensor Cores).

Experience contributing to containerization/virtualization technologies such as containerd/CRI‑O/CRIU.

Experience with cloud platforms (AWS/GCP/Azure), infrastructure as code, CI/CD, and production observability.

Contributions to open-source projects and/or publications; please include links to GitHub pull requests, published papers and artifacts.

At NVIDIA, we believe artificial intelligence (AI) will fundamentally transform how people live and work. Our mission is to advance AI research and development to create groundbreaking technologies that enable anyone to harness the power of AI and benefit from its potential. Our team consists of experts in AI, systems and performance optimization. Our leadership includes world‑renowned experts in AI systems who have received multiple academic and industry research awards. If you’re excited to build systems, kernels, and tools that make large‑scale AI faster, more efficient, and easier to deploy, we’d love to hear from you.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 170,000 CAD - 220,000 CAD for Level 4, and 225,000 CAD - 275,000 CAD for Level 5.

You will also be eligible for equity and benefits.

#J-18808-Ljbffr
Create a job alert for this search

Senior Software Engineer, AI Inference Systems • Winnipeg, Canada

Similar jobs

Senior/ Lead - AI Engineer - FICO

FICOwinnipeg, mb, ca
Full-time

As a Senior Engineer on our Applied AI team, you will be at the forefront of building AI-powered software that transforms how our platform operates.You will design, build, and maintain production-g... Show more

 • Promoted

Senior Applied AI Software Engineer Remote

Photonic Inc.Winnipeg, Manitoba, Canada
Remote
Full-time

Take the lead in AI innovation with Photonic as a Senior Applied AI Software Engineer.Work remotely in Canada to apply your AI expertise in the ambitious realm of quantum engineering.In this key ro... Show more

 • Promoted

Senior Platform & Extensions Engineer (APIs & AI)

Jane.appWinnipeg, MB, CA
Full-time

A leading software company is seeking a Senior Developer to drive technical direction for integrations and extensibility.This role involves designing APIs, mentoring developers, and shaping platfor... Show more

 • Promoted

Zapier Senior Ai Systems Engineer

Zapier Inc.Winnipeg, Canada
Full-time

Become a Senior AI Systems Engineer at Zapier, where your focus will be on developing AI platforms and enhancing machine learning workflows.This role significantly impacts how teams build with AI.I... Show more

 • Promoted

Senior AI Platform Engineer

SamsaraWinnipeg, Manitoba, Canada
Full-time

Who we are Samsara (NYSE: IOT) is the pioneer of the Connected Operations™ Cloud, which is a platform that enables organizations that depend on physical operations to harness Internet of Things (Io... Show more

 • Promoted

Senior Software Engineer with AI Focus in a Remote Role

eSCRIBEWinnipeg, MB, CA
Remote
Full-time

Join a transformative engineering team as a Senior Software Engineer with a focus on AI-driven applications.This remote opportunity emphasizes leadership and innovation throughout the development c... Show more

 • Promoted

Senior Software Engineer For Ai Systems

SocureWinnipeg, Canada
Full-time

Drive innovation at Socure as a Senior Software Engineer focused on Agentic AI Foundations.Collaborate on designing foundational platforms that support agent-based applications.Your role will invol... Show more

 • Promoted

Senior/ Lead - AI Engineer

FICOwinnipeg, mb, ca
Full-time

As a Senior Engineer on our Applied AI team, you will be at the forefront of building AI-powered software that transforms how our platform operates.You will design, build, and maintain production-g... Show more

 • Promoted

Senior Software Engineer for AI Systems

SocureWinnipeg, Manitoba, Canada
Full-time

Drive innovation at Socure as a Senior Software Engineer focused on Agentic AI Foundations.Collaborate on designing foundational platforms that support agent-based applications.Your role will invol... Show more

 • Promoted

Senior Software Engineer In Ai Development

AlphaSenseWinnipeg, Canada
Full-time

Drive innovation in AI technology as a Principal Software Engineer.Lead the evolution of scalable data extraction systems while fostering a collaborative and inclusive team atmosphere.In this senio... Show more

 • Promoted

Remote Senior Software Engineer - AI Foundations

CybSafeWinnipeg, Manitoba, Canada
Remote
Full-time

Elevate your software engineering career with Socure's remote Senior Software Engineer role.Focus on developing innovative agentic platforms that drive the future of identity verification.As part o... Show more

 • Promoted

Innovative Senior Software Engineer For Ai-Driven Solutions

GrafanaWinnipeg, Canada
Full-time

Lead the development of cutting-edge AI automation tools as a Senior Software Engineer.This role focuses on enhancing team operations through intelligent workflow solutions in a fully remote settin... Show more

 • Promoted

Senior AI Engineer - Tech Lead Remote

NTT America, Inc.Winnipeg, MB, CA
Remote
Full-time

Become a Senior AI Engineer - Tech Lead with NTT DATA, working remotely in Toronto, Ontario.Shape and lead the innovation of key AI and ML systems.This role combines deep technical knowledge with l... Show more

 • Promoted

Senior Software Engineer - AI SaaS Focus

Process StreetWinnipeg, MB, CA
Full-time

Transform how software is engineered at Process Street as a Senior Software Engineer with a focus on AI SaaS.This fully remote role emphasizes creativity and technical expertise in a dynamic enviro... Show more

 • Promoted

Senior Engineer in AI-Driven Environment

ApolloWinnipeg, Manitoba, Canada
Full-time

Transform how revenue teams operate as a Senior Engineer at Apollo.Work collaboratively in a dynamic, fast-growing SaaS environment.In your role as a Senior Software Engineer at Apollo.Collaborate ... Show more

 • Promoted

Zapier Senior AI Systems Engineer

Zapier Inc.Winnipeg, Manitoba, Canada
Full-time

Become a Senior AI Systems Engineer at Zapier, where your focus will be on developing AI platforms and enhancing machine learning workflows.This role significantly impacts how teams build with AI.I... Show more

 • Promoted

Expert Senior AI Engineer - Remote Role

NTT DATA, Inc.Winnipeg, MB, CA
Remote
Full-time

Make a difference in AI technology with NTT DATA as a Remote Senior AI Engineer from Toronto, Ontario.Focus on designing AI models and enhancing existing architectures for impactful solutions.As a ... Show more

 • Promoted

Senior Ai Software Engineer — Hybrid (Toronto) - C$100,000 - C$145,000 A Year

Global Information Services CompanyWinnipeg, Canada
Full-time

Develops and maintains AI/ML features using Python and integrates machine learning models into software applications.Collaborates with teams to build AI-driven features and optimize model performance. Show more

 • Promoted

Lead Software Engineer - AI Innovation

Luxury PresenceWinnipeg, Manitoba, Canada
Full-time

Enhance your career at Luxury Presence as a Lead Software Engineer, focused on developing AI-driven solutions for real estate.Help shape our technology stack while guiding innovative projects and e... Show more

 • Promoted

Senior AI Engineer - Tech Lead Remote

NTT Data Americas, Inc.Winnipeg, MB, CA
Remote
Full-time

Become a Senior AI Engineer - Tech Lead with NTT DATA, working remotely in Toronto, Ontario.Shape and lead the innovation of key AI and ML systems.This role combines deep technical knowledge with l... Show more