Talent.com
Sr. AI/ML Software System Design Engineer
Sr. AI/ML Software System Design EngineerAdvanced Micro Devices, Inc • MARKHAM, Ontario, Canada
Sr. AI / ML Software System Design Engineer

Sr. AI / ML Software System Design Engineer

Advanced Micro Devices, Inc • MARKHAM, Ontario, Canada
28 days ago
Job type
  • Full-time
Job description

WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career. THE ROLE : As a Sr. AI / ML Engineer, you will lead the design and implementation of advanced AI / ML architectures across AMD’s GPU and data center platforms. This global technical leadership role focuses on defining strategies for AI-driven validation methodologies, ensuring robust system performance, scalability, and reliability. You will collaborate across silicon, firmware, hardware, and software teams to deliver optimized AI solutions for next-generation computing experiences. THE PERSON : You are passionate about AI / ML technologies and system architecture, with a strong ability to innovate and solve complex technical challenges. You thrive in a collaborative environment, influencing cross-functional teams and driving architectural decisions that shape the future of AI computing. Your curiosity and leadership will enable continuous improvement and excellence in AMD’s AI solutions. KEY RESPONSIBILITIES : Define and drive AI architecture strategies for GPU-based platforms and distributed systems. Collaborate with engineering teams to design and optimize AI / ML workloads for performance, scalability, and efficiency, while architecting AI / ML solutions that integrate into innovative validation methodologies for driver code and hardware. Develop AI-driven frameworks for automated testing, predictive analytics, and intelligent bug triage to accelerate validation cycles. Lead architectural reviews and provide guidance on design decisions for AI frameworks, drivers, and system integration. Create reference designs and benchmarks for AI workloads, ensuring alignment with industry standards. Drive automation and validation strategies for AI solutions, including cluster-scale deployments. Partner with customers and internal teams to deliver end-to-end AI solutions for data centers and edge platforms. Mentor junior engineers and foster technical innovation across teams. Provide regular updates on architectural progress and influence roadmap decisions. PREFERRED EXPERIENCE : Strong background in AI / ML frameworks such as PyTorch, TensorFlow, ONNX Runtime, and familiarity with Hugging Face for model fine-tuning and deployment. Experience with GPU computing and ROCm software stack, including libraries like MIGraphX, rocBLAS, and MIOpen. Knowledge of distributed systems and performance optimization for AI workloads. Proficiency in C / C++, Python, and Linux environments; experience with HIP for GPU programming. Familiarity with networking technologies such as RDMA and RoCE for high-performance data transfer in cluster environments. Excellent communication, leadership, and problem-solving skills. Proven track record of delivering complex, multi-functional AI solutions in fast-paced environments. ACADEMIC CREDENTIALS : Bachelor’s or Master's degree in Computer or Electrical Engineering or equivalent #LI-JG1 Benefits offered are described : AMD benefits at a glance. AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and / or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.THE ROLE : As a Sr. AI / ML Engineer, you will lead the design and implementation of advanced AI / ML architectures across AMD’s GPU and data center platforms. This global technical leadership role focuses on defining strategies for AI-driven validation methodologies, ensuring robust system performance, scalability, and reliability. You will collaborate across silicon, firmware, hardware, and software teams to deliver optimized AI solutions for next-generation computing experiences. THE PERSON : You are passionate about AI / ML technologies and system architecture, with a strong ability to innovate and solve complex technical challenges. You thrive in a collaborative environment, influencing cross-functional teams and driving architectural decisions that shape the future of AI computing. Your curiosity and leadership will enable continuous improvement and excellence in AMD’s AI solutions. KEY RESPONSIBILITIES : Define and drive AI architecture strategies for GPU-based platforms and distributed systems. Collaborate with engineering teams to design and optimize AI / ML workloads for performance, scalability, and efficiency, while architecting AI / ML solutions that integrate into innovative validation methodologies for driver code and hardware. Develop AI-driven frameworks for automated testing, predictive analytics, and intelligent bug triage to accelerate validation cycles. Lead architectural reviews and provide guidance on design decisions for AI frameworks, drivers, and system integration. Create reference designs and benchmarks for AI workloads, ensuring alignment with industry standards. Drive automation and validation strategies for AI solutions, including cluster-scale deployments. Partner with customers and internal teams to deliver end-to-end AI solutions for data centers and edge platforms. Mentor junior engineers and foster technical innovation across teams. Provide regular updates on architectural progress and influence roadmap decisions. PREFERRED EXPERIENCE : Strong background in AI / ML frameworks such as PyTorch, TensorFlow, ONNX Runtime, and familiarity with Hugging Face for model fine-tuning and deployment. Experience with GPU computing and ROCm software stack, including libraries like MIGraphX, rocBLAS, and MIOpen. Knowledge of distributed systems and performance optimization for AI workloads. Proficiency in C / C++, Python, and Linux environments; experience with HIP for GPU programming. Familiarity with networking technologies such as RDMA and RoCE for high-performance data transfer in cluster environments. Excellent communication, leadership, and problem-solving skills. Proven track record of delivering complex, multi-functional AI solutions in fast-paced environments. ACADEMIC CREDENTIALS : Bachelor’s or Master's degree in Computer or Electrical Engineering or equivalent #LI-JG1

Benefits offered are described : AMD benefits at a glance. AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and / or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

Create a job alert for this search

Sr Software Engineer • MARKHAM, Ontario, Canada

Similar jobs
AI / ML System Staff Software Engineer

AI / ML System Staff Software Engineer

Nutanix • Markham
Full-time
AI is revolutionizing how we solve complex, cross‑domain challenges—and Generative AI (GenAI) and Agentic AI is at the forefront of this transformation. As part of the AI Software team, you will con...Show more
Last updated: 3 days ago • Promoted
AI / ML System Staff Software Engineer

AI / ML System Staff Software Engineer

Qualcomm • Markham
Full-time
Engineering Group, Engineering Group > .AI / ML System Staff Software Engineer.AI is revolutionizing how we solve complex, cross-domain challenges—and Generative AI (GenAI) and Agentic AI is at the fo...Show more
Last updated: 29 days ago • Promoted
Sr. AI / ML Software System Design Engineer

Sr. AI / ML Software System Design Engineer

Advanced Micro Devices • Markham
Full-time
WHAT YOU DO AT AMD CHANGES EVERYTHING.At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded syst...Show more
Last updated: 8 days ago • Promoted
Sr. AI / ML Software System Design Engineer

Sr. AI / ML Software System Design Engineer

AMD • Markham
Full-time
AI / ML Software System Design Engineer.Be among the first 25 applicants.Get AI-powered advice on this job and more exclusive features. Your actual pay will be based on your skills and experience — ta...Show more
Last updated: 9 days ago • Promoted
Generative AI ML Engineer — Build State-of-the-Art Models

Generative AI ML Engineer — Build State-of-the-Art Models

Ideogram • Toronto, ON, Canada
Full-time
A technology firm focused on AI-driven design is seeking a Machine Learning Engineer to build and deploy state-of-the-art models. The role involves collaborating with an ambitious team on complex ch...Show more
Last updated: 30+ days ago • Promoted
Senior ML Ops Engineer for AI Systems

Senior ML Ops Engineer for AI Systems

Oncoustics • Toronto
Full-time
A leading medical technology firm in Toronto is seeking a Senior DevOps Engineer to enhance ML development pipelines.The ideal candidate will have over 5 years in DevOps for cloud systems and famil...Show more
Last updated: 29 days ago • Promoted
Sr. Engineer, SoC Design Verification – AI / ML Accelerator Chiplets

Sr. Engineer, SoC Design Verification – AI / ML Accelerator Chiplets

Tenstorrent Inc. • Toronto
Full-time +1
Engineer, SoC Design Verification – AI / ML Accelerator Chiplets.Tenstorrent is leading the industry on cutting‑edge AI technology, revolutionizing performance expectations, ease of use, and cost eff...Show more
Last updated: 9 days ago • Promoted
GenAI ML Software Engineer – Build Scalable AI Systems

GenAI ML Software Engineer – Build Scalable AI Systems

RBC • Toronto
Full-time
A leading financial institution in Canada is seeking a Machine Learning Software Engineer to develop and scale AI-driven solutions. The role involves building and managing ML / GenAI projects from end...Show more
Last updated: 9 days ago • Promoted
Senior AI / ML Engineer

Senior AI / ML Engineer

Rogue Sentinel Studios Inc. • Toronto, ON, Canada
Full-time +1
We collaborate with global clients on game development, interactive experiences, and creative technology projects.As part of our expansion into North America, our Toronto office will serve as a key...Show more
Last updated: 15 days ago • Promoted
ROCm AI System Software Architect – Lead AI Stack

ROCm AI System Software Architect – Lead AI Stack

AMD • Markham
Full-time
A leading semiconductor company is seeking an expert-level Machine Learning System Software Engineer to develop advanced AI software solutions and optimize the AI software stack across AMD products...Show more
Last updated: 29 days ago • Promoted
Product ML Engineer : Scale Generative AI for Millions

Product ML Engineer : Scale Generative AI for Millions

1851 Labs • Toronto C6A, ON, Canada
Full-time
A technology company in Toronto is seeking a Product ML Engineer to build the intelligence layer of their consumer AI platform. The ideal candidate should have proven experience in shipping producti...Show more
Last updated: 3 days ago • Promoted
Sr. Engineer, SoC Design Verification - AI / ML Accelerator Chiplets

Sr. Engineer, SoC Design Verification - AI / ML Accelerator Chiplets

Tenstorrent • Toronto
Full-time +1
Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions mu...Show more
Last updated: 9 days ago • Promoted
DL System Software Engineer - AI Platform

DL System Software Engineer - AI Platform

NVIDIA • Toronto
Full-time
DL System Software Engineer - AI Platform.We are seeking highly motivated and skilled systems engineers to join our team to help in developing an AI Platform that offers an efficient infrastructure...Show more
Last updated: 29 days ago • Promoted
DL System Software Engineer - AI Platform

DL System Software Engineer - AI Platform

NVIDIA Corporation • Toronto
Full-time
DL System Software Engineer - AI Platform page is loaded## DL System Software Engineer - AI Platformlocations : Canada, Torontotime type : Full timeposted on : Posted Yesterdayjob requisition id...Show more
Last updated: 29 days ago • Promoted
Senior ML Engineer - Scalable AI Systems & VectorDB Hybrid

Senior ML Engineer - Scalable AI Systems & VectorDB Hybrid

Electronic Arts (EA) • Toronto, ON, Canada
Full-time
A leading entertainment company is looking for a Machine Learning Engineer to optimize and scale ML systems with a focus on vectorDB integration. This mid-senior level role requires 6+ years of indu...Show more
Last updated: 30+ days ago • Promoted
Senior AI Platform Engineer – GenAI & LLM Automation

Senior AI Platform Engineer – GenAI & LLM Automation

Rivian • Toronto C6A, ON, Canada
Full-time
A leading automotive technology company in Toronto seeks a Senior Software Engineer to shape their GenAI platform.This role involves architecting intelligent agents and automating workflows using L...Show more
Last updated: 7 days ago • Promoted
Sr. Machine Learning Engineer

Sr. Machine Learning Engineer

TheAppLabb • Toronto, ON, Canada
Full-time
The AppLabb is a leading innovation company specializing in AI-powered digital solutions, mobile app development, and emerging technologies. We leverage data-driven insights to enhance digital exper...Show more
Last updated: 30+ days ago • Promoted
Sr AI / ML Applications Architect

Sr AI / ML Applications Architect

GE Vernova • Markham
Full-time
Sr AI / ML Applications Architect – GE Vernova.GE Vernova is accelerating the path to more reliable, affordable, and sustainable energy, while helping our customers power economies and deliver the el...Show more
Last updated: 29 days ago • Promoted