Talent.com
Software Development Engineer - SGLang and Inference Stack
Software Development Engineer - SGLang and Inference StackAdvanced Micro Devices, Inc • VANCOUVER, British Columbia, Canada
Software Development Engineer - SGLang and Inference Stack

Software Development Engineer - SGLang and Inference Stack

Advanced Micro Devices, Inc • VANCOUVER, British Columbia, Canada
Il y a 3 jours
Type de contrat
  • Temps plein
Description de poste

WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career. THE ROLE : As a core member of the team, you will play a pivotal role in optimizing and developing deep learning frameworks for AMD GPUs. Your work will be instrumental in enhancing GPU kernel performance, accelerating deep learning models, and enabling RL training and SOTA LLM and Multimodal inference at scale across multi-GPU and multi-node systems. You will collaborate across internal GPU software teams and engage with open-source communities to integrate and optimize cutting-edge compiler technologies and drive upstream contributions that benefit AMD’s AI software ecosystem. THE PERSON : Skilled engineer with strong technical and analytical expertise in GPGPU C++, Triton, TileLang or DSL development within Linux environments. The ideal candidate will thrive in both collaborative team settings and independent work, with the ability to define goals, manage development efforts, and deliver high-quality solutions. Strong problem-solving skills, a proactive approach, and a keen understanding of software engineering best practices are essential. KEY RESPONSIBILITIES : Optimize Deep Learning Frameworks : Enhance performance of frameworks like TensorFlow, PyTorch, and SGLang on AMD GPUs via upstream contributions in open-source repositories. Develop and Optimize Deep Learning Models : Profile, analyze, code change and tune large-scale training and inference models for optimal performance on AMD hardware. Day-0 supports to many SOTA models, DeepSeek 3.2, Kimi K2.5, etc. GPU Kernel Development : Design, implement, and optimize high-performance GPU kernels using HIP, Triton, TileLang or other DSLs for AI operator efficiency. Collaborate with GPU Library and Compiler Teams : Work closely with internal compiler and GPU math library teams to integrate, optimize and align kernel-level optimizations with full-stack performance goals. Initiate and help with different level codegen optimizations. Contribute to SGLang Development : Support optimization, feature development, and scaling of the SGLang framework across AMD GPU platforms for LLM, multimodal serving and RL-training. Distributed System Optimization : Tune and scale performance across both multi-GPU (scale-up) and multi-node (scale-out) environments, including inference parallelism, prefill-decode disaggregation, Wide-EP and collective communication strategies. Graph Compiler Integration : Integrate and optimize runtime execution through graph compilers such as XLA, TorchDynamo, or custom pipelines. Open-Source Collaboration : Partner with external maintainers to understand framework needs, propose optimizations, and upstream contributions effectively. Apply Engineering Best Practices : Leverage modern software engineering practices in debugging, profiling, test-driven development, and CI / CD integration. PREFERRED EXPERIENCE : Strong Programming Skills : Proficient in C++ and / or Python (PyTorch, Triton, TileLang), with demonstrated ability to code, debug, profile, and optimize performance-critical code. SGLang and LLM Optimization : Hands-on experience with SGLang or similar LLM inference frameworks is highly preferred. Compiler and GPU Architecture Knowledge : Background in compiler design or familiarity with technologies like LLVM, MLIR, or ROCm is a plus. Heterogeneous System Workloads : Experience running and scaling workloads on large-scale, heterogeneous clusters (CPU + GPU) using distributed training or inference strategies. AI Framework Integration : Experience contributing to or integrating optimizations into deep learning frameworks such as PyTorch, SGLang, vLLM, Slime, VeRL GPGPU Computing : Working knowledge of HIP, CUDA, Triton, TileLang or other GPU programming models; experience with GCN / CDNA architecture preferred. ACADEMIC CREDENTIALS : Bachelor’s and / or Master’s Degree in Computer Science, Computer Engineering, Electrical Engineering, Physics or a related field. #LI-JG1 Benefits offered are described : AMD benefits at a glance. AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and / or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process. AMD may use Artificial Intelligence to help screen, assess or select applicants for this position. AMD’s “Responsible AI Policy” is available here. This posting is for an existing vacancy.THE ROLE : As a core member of the team, you will play a pivotal role in optimizing and developing deep learning frameworks for AMD GPUs. Your work will be instrumental in enhancing GPU kernel performance, accelerating deep learning models, and enabling RL training and SOTA LLM and Multimodal inference at scale across multi-GPU and multi-node systems. You will collaborate across internal GPU software teams and engage with open-source communities to integrate and optimize cutting-edge compiler technologies and drive upstream contributions that benefit AMD’s AI software ecosystem. THE PERSON : Skilled engineer with strong technical and analytical expertise in GPGPU C++, Triton, TileLang or DSL development within Linux environments. The ideal candidate will thrive in both collaborative team settings and independent work, with the ability to define goals, manage development efforts, and deliver high-quality solutions. Strong problem-solving skills, a proactive approach, and a keen understanding of software engineering best practices are essential. KEY RESPONSIBILITIES : Optimize Deep Learning Frameworks : Enhance performance of frameworks like TensorFlow, PyTorch, and SGLang on AMD GPUs via upstream contributions in open-source repositories. Develop and Optimize Deep Learning Models : Profile, analyze, code change and tune large-scale training and inference models for optimal performance on AMD hardware. Day-0 supports to many SOTA models, DeepSeek 3.2, Kimi K2.5, etc. GPU Kernel Development : Design, implement, and optimize high-performance GPU kernels using HIP, Triton, TileLang or other DSLs for AI operator efficiency. Collaborate with GPU Library and Compiler Teams : Work closely with internal compiler and GPU math library teams to integrate, optimize and align kernel-level optimizations with full-stack performance goals. Initiate and help with different level codegen optimizations. Contribute to SGLang Development : Support optimization, feature development, and scaling of the SGLang framework across AMD GPU platforms for LLM, multimodal serving and RL-training. Distributed System Optimization : Tune and scale performance across both multi-GPU (scale-up) and multi-node (scale-out) environments, including inference parallelism, prefill-decode disaggregation, Wide-EP and collective communication strategies. Graph Compiler Integration : Integrate and optimize runtime execution through graph compilers such as XLA, TorchDynamo, or custom pipelines. Open-Source Collaboration : Partner with external maintainers to understand framework needs, propose optimizations, and upstream contributions effectively. Apply Engineering Best Practices : Leverage modern software engineering practices in debugging, profiling, test-driven development, and CI / CD integration. PREFERRED EXPERIENCE : Strong Programming Skills : Proficient in C++ and / or Python (PyTorch, Triton, TileLang), with demonstrated ability to code, debug, profile, and optimize performance-critical code. SGLang and LLM Optimization : Hands-on experience with SGLang or similar LLM inference frameworks is highly preferred. Compiler and GPU Architecture Knowledge : Background in compiler design or familiarity with technologies like LLVM, MLIR, or ROCm is a plus. Heterogeneous System Workloads : Experience running and scaling workloads on large-scale, heterogeneous clusters (CPU + GPU) using distributed training or inference strategies. AI Framework Integration : Experience contributing to or integrating optimizations into deep learning frameworks such as PyTorch, SGLang, vLLM, Slime, VeRL GPGPU Computing : Working knowledge of HIP, CUDA, Triton, TileLang or other GPU programming models; experience with GCN / CDNA architecture preferred. ACADEMIC CREDENTIALS : Bachelor’s and / or Master’s Degree in Computer Science, Computer Engineering, Electrical Engineering, Physics or a related field. #LI-JG1

Benefits offered are described : AMD benefits at a glance. AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and / or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process. AMD may use Artificial Intelligence to help screen, assess or select applicants for this position. AMD’s “Responsible AI Policy” is available here. This posting is for an existing vacancy.

Créer une alerte emploi pour cette recherche

Software Development Engineer SGLang and Inference Stack • VANCOUVER, British Columbia, Canada

Offres similaires
Intermediate Full Stack Software Engineer

Intermediate Full Stack Software Engineer

D3 Security Management Systems • Vancouver, BC, Canada
Temps plein
Location : Greater Vancouver area candidates only.D3 Security is transforming SecOps with Morpheus, our AI-driven Autonomous Security Operations Center (ASOC) platform. Morpheus automates Tier 13 anal...Voir plus
Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
Firmware & Hardware Developer

Firmware & Hardware Developer

SST Wireless • Richmond, BC, Canada
Temps plein
With several new products in the design pipeline, this is an exciting time for creative thinkers who are adept in realizing technical solutions to join us in creating products where your contributi...Voir plus
Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
Senior Software Engineer - Frugal

Senior Software Engineer - Frugal

Frugal • richmond, bc, ca
Temps plein
Frugal is an AI-powered coding agent purpose-built to tackle one of the most persistent problems in tech : runaway cloud costs. Despite years of optimization efforts, cloud expenses remain high—and w...Voir plus
Dernière mise à jour : il y a 21 heures • Offre sponsorisée • Nouvelle offre
Software Engineer - II

Software Engineer - II

FISPAN • Vancouver, BC, Canada
Permanent
FISPAN) is an Enterprise SaaS FinTech company that allows banks to deploy embedded financial products and services to create a seamless banking connection for their corporate clients.Our product ai...Voir plus
Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
Full Stack Engineer

Full Stack Engineer

Targeted Talent • Delta, BC, Canada
Temps plein
We are searching for a creative, flexible technical thinker capable of managing, planning and understanding team dynamics. Responsible for authoring, analyzing and translating User Stories into syst...Voir plus
Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
Software Development Engineer

Software Development Engineer

Connor, Clark & Lunn Investment Management (CC&L) • Vancouver
Temps plein
We are a special software team embedded in a top performing quantitative equity fund that manages over $75 billion in financial assets. This is a fantastic opportunity in the exciting intersection o...Voir plus
Dernière mise à jour : il y a 16 jours • Offre sponsorisée
Senior Core Banking Software Engineer

Senior Core Banking Software Engineer

CGI • Vancouver
Temps plein
A global IT services firm is seeking an experienced Software Developer to enhance existing applications and create new components in the Wealthview banking platform. This role requires a minimum of ...Voir plus
Dernière mise à jour : il y a 9 jours • Offre sponsorisée
Full Stack Engineer - delta

Full Stack Engineer - delta

Set 2 Close | B Corp • delta, bc, ca
Temps plein
The ideal candidate brings strong backend development experience, solid database skills, and the ability to contribute to scalable, maintainable applications. Develop and maintain backend services u...Voir plus
Dernière mise à jour : il y a 13 jours • Offre sponsorisée
Senior Generative AI Software Developer (ID#5114)

Senior Generative AI Software Developer (ID#5114)

freelance.ca • Richmond, Canada
Temps plein
This contract position follows a hybrid model and requires onsite presence in Richmond, BC a minimum of three days per week. Design and build applications using OpenAI, Azure OpenAI, and open-source...Voir plus
Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
Full Stack Software Engineer

Full Stack Software Engineer

Insight Global • Burnaby
Temps plein
Insight Global is hiring an Intermediate / Senior Developer to join the team building a Device Management Platform (ADMP). This is a high-impact role focused on designing and implementing secure, scal...Voir plus
Dernière mise à jour : il y a 16 jours • Offre sponsorisée
Senior Software Development Engineer in Test(SDET)

Senior Software Development Engineer in Test(SDET)

PDF Solutions, Inc. • Vancouver
Temps plein
At PDF Solutions, we are at the forefront of revolutionizing the semiconductor industry.Our cutting-edge technologies and data-driven solutions empower semiconductor manufacturers to achieve unprec...Voir plus
Dernière mise à jour : il y a 14 jours • Offre sponsorisée
Senior Software Backend Engineer (Django) - Changing the face of sports

Senior Software Backend Engineer (Django) - Changing the face of sports

Uplifter Inc. • Burnaby
Temps plein
Senior Software Engineer - Changing the face of sports at.Senior Software Engineer - Changing the face of sports.Hybrid – Burnaby (Vancouver), BC. Develop and maintain backend services using Python / ...Voir plus
Dernière mise à jour : il y a 11 jours • Offre sponsorisée
Sr Software Development Engineer (Full Stack) - Evisort AI

Sr Software Development Engineer (Full Stack) - Evisort AI

Latinx in AI (LXAI) • Vancouver
Temps plein
Your work days are brighter here.We’re obsessed with making hard work pay off, for our people, our customers, and the world around us. As a Fortune 500 company and a leading AI platform for managing...Voir plus
Dernière mise à jour : il y a 1 jour • Offre sponsorisée
Software Engineer (AI / Full Stack)

Software Engineer (AI / Full Stack)

TrustFlight • Vancouver
Temps plein
TrustFlight is an innovative aviation software company that specializes in developing cutting‑edge AI, digital workflow, and analytics applications for the aviation industry.Our software empowers m...Voir plus
Dernière mise à jour : il y a 16 jours • Offre sponsorisée
Senior C++ Software Engineer

Senior C++ Software Engineer

Equest • North Vancouver, British Columbia, Canada
Temps plein
DarkVision, a Koch Engineered Solutions company, is looking for a talented Senior Software Engineer to help develop our data analysis and visualization applications. This development involves writin...Voir plus
Dernière mise à jour : il y a 8 jours • Offre sponsorisée
Software Engineer - AI (Vancouver Hybrid)

Software Engineer - AI (Vancouver Hybrid)

Boomi • Vancouver
Temps plein
Boomi is a fast-growing company building an award-winning, intelligent integration and automation platform.We aim to connect everyone to everything, anywhere, and we hire trailblazers with an entre...Voir plus
Dernière mise à jour : il y a 4 jours • Offre sponsorisée
Software Development Engineer (Full Stack) - Evisort AI

Software Development Engineer (Full Stack) - Evisort AI

Workday, Inc. • Vancouver
Temps plein
Your work days are brighter here.We’re obsessed with making hard work pay off, for our people, our customers, and the world around us. As a Fortune 500 company and a leading AI platform for managing...Voir plus
Dernière mise à jour : il y a 7 jours • Offre sponsorisée
Software Development Engineer

Software Development Engineer

Connor, Clark & Lunn group • Vancouver
Temps plein
Connor, Clark & Lunn Investment Management Ltd.We are a special software team embedded in a top performing quantitative equity fund that manages over $75 billion in financial assets.This is a fanta...Voir plus
Dernière mise à jour : il y a 16 jours • Offre sponsorisée