Talent.com
Advanced Micro Devices, Inc
Senior Software Engineer - AI Triton KernelsAdvanced Micro Devices, Inc • VANCOUVER, British Columbia, Canada
Senior Software Engineer - AI Triton Kernels

Senior Software Engineer - AI Triton Kernels

Advanced Micro Devices, Inc • VANCOUVER, British Columbia, Canada
6 days ago
Job type
  • Full-time
Job description


WHAT YOU DO AT AMD CHANGES EVERYTHING

At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond.Together, we advance your career.




THE ROLE:

Triton is a widely adopted language and compiler for high-performance GPU kernels, powering major AI frameworks such as PyTorch, vLLM, and SGLang. As AI workloads increasingly rely on Triton-based kernels, first-class Triton support is strategically critical to AMD’s AI software roadmap.AMD GPUs are an official Triton backend; delivering industry-leading Triton performance on AMD Instinct accelerators is a top priority for AMD. The performance and usability of Triton directly impact the competitiveness of AMD hardware in large-scale AI training and inference.In this role you will author state-of-the-art performant Triton/Gluon kernels for ML kernels powering the latest and greatest AI models.You will collaborate with research, compiler, and hardware architecture teams to co-design high-performance solutions, analyze bottlenecks to make AMD GPUs the best-in-class platform for Triton-powered AI workloads.

THE PERSON:

The ideal candidate has deep expertise in SIMT programming, parallel algorithms, GPU architecture, and performance engineering. You are comfortable working across the full stack to drive e2e model performance — from vLLM/SGL down to ISA-level performance tuning — and can perform rigorous quantitative analysis to drive measurable improvements.You thrive in highly technical environments, enjoy solving complex performance problems, and are excited to collaborate across model deployment, compiler, runtime, and hardware teams. Most importantly, you are curious, hands-on, and willing to learn and work across boundaries.

KEY RESPONSIBILITIES:

  • Design, research, implement, and rigorously optimize high-performance matmul, attention (flash, paged, grouped-query), MoE, and fully fused transformer kernels using Triton, targeting large-scale LLM and multimodal workloads
  • Own and productionize critical Triton/Gluon kernels within vLLM and SGL (e.g., paged attention, extend attention, MoE, quantized kernels, etc), ensuring correctness, scalability, and peak throughput
  • Partner closely with compiler engineers to develop and maintain the Triton AMD backend across ROCm and the LLVM AMDGPU stack, targeting CDNA and next-generation architectures
  • Drive deep kernel-level optimizations across the AMD memory hierarchy (LDS, L2, HBM), wavefront execution (wave32/wave64), vectorization, MFMA utilization, occupancy tuning, and instruction scheduling to maximize hardware efficiency
  • Perform rigorous profiling and microbenchmarking led optimization on AMD Instinct GPUs using hardware counters and tracing tools; root-cause bottlenecks in memory bandwidth, latency hiding, synchronization, and register pressure
  • Debug and resolve performance and correctness issues end-to-end across PyTorch, vLLM/SGL runtimes, Triton IR/MLIR, ROCm runtime, and the LLVM AMDGPU backend
  • Contribute to open-source Triton, LLVM, and ROCm ecosystems

PREFERRED EXPERIENCE:

  • 3+ years of experience in GPU kernel development, compiler backends, or performance engineering focused on AI/ML workloads
  • Strong hands-on expertise with Triton, including writing custom matmul, attention, and fused transformer kernels and understanding Triton IR lowering to GPU backends
  • Deep understanding of modern GPU architectures (wavefront execution, memory hierarchy, scheduling, occupancy)
  • Meaningful contributions to open-source projects such as Triton, Torch, vLLM, SGLang, IREE, MLIR, LLVM, or ROCm, with a strong collaborative and upstream-first engineering mindset

PREFERRED ACADEMIC CREDENTIALS:

Bachelor’s or Master’s degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent practical experience.

This role is not eligible for visa sponsorship.


#LI-G11

#LI-HYBRID




Benefits offered are described: AMD benefits at a glance.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

AMD may use Artificial Intelligence to help screen, assess or select applicants for this position. AMD’s “Responsible AI Policy” is available here.

This posting is for an existing vacancy.

THE ROLE:

Triton is a widely adopted language and compiler for high-performance GPU kernels, powering major AI frameworks such as PyTorch, vLLM, and SGLang. As AI workloads increasingly rely on Triton-based kernels, first-class Triton support is strategically critical to AMD’s AI software roadmap.AMD GPUs are an official Triton backend; delivering industry-leading Triton performance on AMD Instinct accelerators is a top priority for AMD. The performance and usability of Triton directly impact the competitiveness of AMD hardware in large-scale AI training and inference.In this role you will author state-of-the-art performant Triton/Gluon kernels for ML kernels powering the latest and greatest AI models.You will collaborate with research, compiler, and hardware architecture teams to co-design high-performance solutions, analyze bottlenecks to make AMD GPUs the best-in-class platform for Triton-powered AI workloads.

THE PERSON:

The ideal candidate has deep expertise in SIMT programming, parallel algorithms, GPU architecture, and performance engineering. You are comfortable working across the full stack to drive e2e model performance — from vLLM/SGL down to ISA-level performance tuning — and can perform rigorous quantitative analysis to drive measurable improvements.You thrive in highly technical environments, enjoy solving complex performance problems, and are excited to collaborate across model deployment, compiler, runtime, and hardware teams. Most importantly, you are curious, hands-on, and willing to learn and work across boundaries.

KEY RESPONSIBILITIES:

  • Design, research, implement, and rigorously optimize high-performance matmul, attention (flash, paged, grouped-query), MoE, and fully fused transformer kernels using Triton, targeting large-scale LLM and multimodal workloads
  • Own and productionize critical Triton/Gluon kernels within vLLM and SGL (e.g., paged attention, extend attention, MoE, quantized kernels, etc), ensuring correctness, scalability, and peak throughput
  • Partner closely with compiler engineers to develop and maintain the Triton AMD backend across ROCm and the LLVM AMDGPU stack, targeting CDNA and next-generation architectures
  • Drive deep kernel-level optimizations across the AMD memory hierarchy (LDS, L2, HBM), wavefront execution (wave32/wave64), vectorization, MFMA utilization, occupancy tuning, and instruction scheduling to maximize hardware efficiency
  • Perform rigorous profiling and microbenchmarking led optimization on AMD Instinct GPUs using hardware counters and tracing tools; root-cause bottlenecks in memory bandwidth, latency hiding, synchronization, and register pressure
  • Debug and resolve performance and correctness issues end-to-end across PyTorch, vLLM/SGL runtimes, Triton IR/MLIR, ROCm runtime, and the LLVM AMDGPU backend
  • Contribute to open-source Triton, LLVM, and ROCm ecosystems

PREFERRED EXPERIENCE:

  • 3+ years of experience in GPU kernel development, compiler backends, or performance engineering focused on AI/ML workloads
  • Strong hands-on expertise with Triton, including writing custom matmul, attention, and fused transformer kernels and understanding Triton IR lowering to GPU backends
  • Deep understanding of modern GPU architectures (wavefront execution, memory hierarchy, scheduling, occupancy)
  • Meaningful contributions to open-source projects such as Triton, Torch, vLLM, SGLang, IREE, MLIR, LLVM, or ROCm, with a strong collaborative and upstream-first engineering mindset

PREFERRED ACADEMIC CREDENTIALS:

Bachelor’s or Master’s degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent practical experience.

This role is not eligible for visa sponsorship.


#LI-G11

#LI-HYBRID

Benefits offered are described: AMD benefits at a glance.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

AMD may use Artificial Intelligence to help screen, assess or select applicants for this position. AMD’s “Responsible AI Policy” is available here.

This posting is for an existing vacancy.

Create a job alert for this search

Senior Software Engineer - AI Triton Kernels • VANCOUVER, British Columbia, Canada

Similar jobs

Senior Software Engineer - AI Team

BrinqaVancouver, Metro Vancouver Regional District, CA
Full-time

Brinqa is the leading provider of unified cyber risk management and security analytics.Our software empowers companies to make cyber risk decisions across a wide range of platforms quickly and effi... Show more

 • Promoted

Senior Software Engineer - AI Triton Kernels

Advanced Micro DevicesVancouver, Metro Vancouver Regional District, CA
Full-time

WHAT YOU DO AT AMD CHANGES EVERYTHING.At AMD, our mission is to build great products that accelerate next‑generation computing experiences—from AI and data centers, to PCs, gaming and embedded syst... Show more

 • Promoted

Senior Software Engineer in AI Core

WorkdayVancouver, Metro Vancouver Regional District, CA
Full-time

Elevate enterprise AI with Workday as a Senior Software Engineer on the AI Core team.Drive innovation by architecting software systems for intelligent, data-driven decision-making.Join Workday's AI... Show more

 • Promoted

Senior Software Engineer - AI Platform

WorkdayVancouver, Metro Vancouver Regional District, CA
Full-time

Elevate your career as a Senior Software Engineer at Workday, focusing on AI Platform innovations from our Toronto location.Leverage your skills in backend services, distributed systems, and Python... Show more

 • Promoted

Senior Engineer for AI Systems Development

DataRobot, Inc.Vancouver, Metro Vancouver Regional District, CA
Full-time

Advance AI technology as a Senior Backend Engineer.Focus on designing robust systems for AI applications, enhancing usability, flexibility, and performance in production settings.In this pivotal ro... Show more

 • Promoted

Remote Senior Software Engineer - AI Foundations

CybSafeVancouver, Metro Vancouver Regional District, CA
Remote
Full-time

Elevate your software engineering career with Socure's remote Senior Software Engineer role.Focus on developing innovative agentic platforms that drive the future of identity verification.As part o... Show more

 • Promoted

Principal Software Engineer - AI

GoTo MeetingVancouver, Metro Vancouver Regional District, CA
Full-time

As a Principal Software Engineer, you will be a technical leader driving the design and implementation of complex backend systems at scale.You will set technical direction, guide team members, and ... Show more

 • Promoted

Senior AI Engineer for Autonomous Systems in Healthcare Applications

Toboggan LabsVancouver, Metro Vancouver Regional District, CA
Full-time

Drive transformative AI projects as a Senior AI Engineer.Engage with innovative technology to deliver autonomous systems that improve efficiency and accuracy in healthcare settings.We invite applic... Show more

 • Promoted

Senior Software Engineer, Horizon — AI-Driven VDI

Omnissa, LLCVancouver, Metro Vancouver Regional District, CA
Full-time

A leading digital work platform company is seeking a seasoned Engineer in Vancouver to lead software system design and development.The ideal candidate will have extensive experience in HTML and Typ... Show more

 • Promoted

Senior Software Engineer for AI Innovation

ToastVancouver, Metro Vancouver Regional District, CA
Full-time

Join Toast's AI software company in Vancouver as a Senior Software Engineer.Help build LLM-powered systems that transform competitive intelligence with actionable insights.As a Senior Software Engi... Show more

 • Promoted

Innovative Senior AI Engineer Opportunity

Robots and PencilsVancouver, Metro Vancouver Regional District, CA
Full-time

Join Robots & Pencils as a Senior AI Engineer and drive innovation in scalable digital products.This hands-on role tackles complex integration challenges in a vibrant production environment.You’ll ... Show more

 • Promoted

Senior AI Solutions Engineer

TYLinVancouver, Metro Vancouver Regional District, CA
Full-time

With over 3,000 employees throughout the Americas, Asia, and Europe, the firm provides support on projects of varying size and complexity.Together, we enhance conventional designs with smarter, mor... Show more

 • Promoted

Senior AI Engineer

ProcurifyVancouver, Metro Vancouver Regional District, CA
Full-time

Procurify is the AI-enhanced procurement and AP automation platform for mid-market organizations.We make it easy for organizations to take control of spend and save money.We're seeking a Senior AI ... Show more

 • Promoted

Senior Software Development Engineer: Innovate Market Solutions Using AI

AmazonVancouver, Metro Vancouver Regional District, CA
Full-time

Lead technological advancements as a Senior Software Development Engineer, enhancing marketplace solutions with Java and AI.Shape the future of selling experiences for numerous sellers worldwide.Th... Show more

 • Promoted

AI-Focused Senior Software Engineer at Rival

Rival Technologies, Inc.Vancouver, Metro Vancouver Regional District, CA
Full-time

Lead impactful projects as a Senior Full-Stack Software Engineer at Rival, specializing in AI-enabled features.Work in a hybrid model and shape the future of customer insights.At Rival, we are look... Show more

 • Promoted

Senior Software Engineer with AI Focus in a Remote Role

eSCRIBEVancouver, Metro Vancouver Regional District, CA
Remote
Full-time

Join a transformative engineering team as a Senior Software Engineer with a focus on AI-driven applications.This remote opportunity emphasizes leadership and innovation throughout the development c... Show more

 • Promoted

Transformative Senior Software Engineer in AI Governance Projects

OptroVancouver, Metro Vancouver Regional District, CA
Full-time

Embark on a dynamic career as a Senior Software Engineer focusing on AI governance projects.Work remotely to innovate and develop scalable software solutions, driving success with your programming ... Show more

 • Promoted

Senior Software Engineer for Perception Systems

AerovectVancouver, Metro Vancouver Regional District, CA
Full-time

Join AeroVect as a Remote Senior Software Engineer, focusing on revolutionary perception systems within the aviation industry.Drive autonomy in ground handling operations through innovative object ... Show more

 • Promoted

Senior Software Engineer in AI Development

AlphaSenseVancouver, Metro Vancouver Regional District, CA
Full-time

Drive innovation in AI technology as a Principal Software Engineer.Lead the evolution of scalable data extraction systems while fostering a collaborative and inclusive team atmosphere.In this senio... Show more

 • Promoted

Innovative Senior Software Engineer for AI-Driven Solutions

GrafanaVancouver, Metro Vancouver Regional District, CA
Full-time

Lead the development of cutting-edge AI automation tools as a Senior Software Engineer.This role focuses on enhancing team operations through intelligent workflow solutions in a fully remote settin... Show more