Talent.com
Sr. AI/ML Software System Design Engineer
Sr. AI/ML Software System Design EngineerAdvanced Micro Devices, Inc • MARKHAM, Ontario, Canada
Sr. AI / ML Software System Design Engineer

Sr. AI / ML Software System Design Engineer

Advanced Micro Devices, Inc • MARKHAM, Ontario, Canada
22 days ago
Job type
  • Full-time
Job description

WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career. THE ROLE : As a Sr. AI / ML Engineer, you will lead the design and implementation of advanced AI / ML architectures across AMD’s GPU and data center platforms. This global technical leadership role focuses on defining strategies for AI-driven validation methodologies, ensuring robust system performance, scalability, and reliability. You will collaborate across silicon, firmware, hardware, and software teams to deliver optimized AI solutions for next-generation computing experiences. THE PERSON : You are passionate about AI / ML technologies and system architecture, with a strong ability to innovate and solve complex technical challenges. You thrive in a collaborative environment, influencing cross-functional teams and driving architectural decisions that shape the future of AI computing. Your curiosity and leadership will enable continuous improvement and excellence in AMD’s AI solutions. KEY RESPONSIBILITIES : Define and drive AI architecture strategies for GPU-based platforms and distributed systems. Collaborate with engineering teams to design and optimize AI / ML workloads for performance, scalability, and efficiency, while architecting AI / ML solutions that integrate into innovative validation methodologies for driver code and hardware. Develop AI-driven frameworks for automated testing, predictive analytics, and intelligent bug triage to accelerate validation cycles. Lead architectural reviews and provide guidance on design decisions for AI frameworks, drivers, and system integration. Create reference designs and benchmarks for AI workloads, ensuring alignment with industry standards. Drive automation and validation strategies for AI solutions, including cluster-scale deployments. Partner with customers and internal teams to deliver end-to-end AI solutions for data centers and edge platforms. Mentor junior engineers and foster technical innovation across teams. Provide regular updates on architectural progress and influence roadmap decisions. PREFERRED EXPERIENCE : Strong background in AI / ML frameworks such as PyTorch, TensorFlow, ONNX Runtime, and familiarity with Hugging Face for model fine-tuning and deployment. Experience with GPU computing and ROCm software stack, including libraries like MIGraphX, rocBLAS, and MIOpen. Knowledge of distributed systems and performance optimization for AI workloads. Proficiency in C / C++, Python, and Linux environments; experience with HIP for GPU programming. Familiarity with networking technologies such as RDMA and RoCE for high-performance data transfer in cluster environments. Excellent communication, leadership, and problem-solving skills. Proven track record of delivering complex, multi-functional AI solutions in fast-paced environments. ACADEMIC CREDENTIALS : Bachelor’s or Master's degree in Computer or Electrical Engineering or equivalent #LI-JG1 Benefits offered are described : AMD benefits at a glance. AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and / or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.THE ROLE : As a Sr. AI / ML Engineer, you will lead the design and implementation of advanced AI / ML architectures across AMD’s GPU and data center platforms. This global technical leadership role focuses on defining strategies for AI-driven validation methodologies, ensuring robust system performance, scalability, and reliability. You will collaborate across silicon, firmware, hardware, and software teams to deliver optimized AI solutions for next-generation computing experiences. THE PERSON : You are passionate about AI / ML technologies and system architecture, with a strong ability to innovate and solve complex technical challenges. You thrive in a collaborative environment, influencing cross-functional teams and driving architectural decisions that shape the future of AI computing. Your curiosity and leadership will enable continuous improvement and excellence in AMD’s AI solutions. KEY RESPONSIBILITIES : Define and drive AI architecture strategies for GPU-based platforms and distributed systems. Collaborate with engineering teams to design and optimize AI / ML workloads for performance, scalability, and efficiency, while architecting AI / ML solutions that integrate into innovative validation methodologies for driver code and hardware. Develop AI-driven frameworks for automated testing, predictive analytics, and intelligent bug triage to accelerate validation cycles. Lead architectural reviews and provide guidance on design decisions for AI frameworks, drivers, and system integration. Create reference designs and benchmarks for AI workloads, ensuring alignment with industry standards. Drive automation and validation strategies for AI solutions, including cluster-scale deployments. Partner with customers and internal teams to deliver end-to-end AI solutions for data centers and edge platforms. Mentor junior engineers and foster technical innovation across teams. Provide regular updates on architectural progress and influence roadmap decisions. PREFERRED EXPERIENCE : Strong background in AI / ML frameworks such as PyTorch, TensorFlow, ONNX Runtime, and familiarity with Hugging Face for model fine-tuning and deployment. Experience with GPU computing and ROCm software stack, including libraries like MIGraphX, rocBLAS, and MIOpen. Knowledge of distributed systems and performance optimization for AI workloads. Proficiency in C / C++, Python, and Linux environments; experience with HIP for GPU programming. Familiarity with networking technologies such as RDMA and RoCE for high-performance data transfer in cluster environments. Excellent communication, leadership, and problem-solving skills. Proven track record of delivering complex, multi-functional AI solutions in fast-paced environments. ACADEMIC CREDENTIALS : Bachelor’s or Master's degree in Computer or Electrical Engineering or equivalent #LI-JG1

Benefits offered are described : AMD benefits at a glance. AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and / or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

Create a job alert for this search

Sr Software Engineer • MARKHAM, Ontario, Canada

Similar jobs
AI / ML System Staff Software Engineer

AI / ML System Staff Software Engineer

Qualcomm • Markham
Full-time
Engineering Group, Engineering Group > .AI / ML System Staff Software Engineer.AI is revolutionizing how we solve complex, cross-domain challenges—and Generative AI (GenAI) and Agentic AI is at the fo...Show more
Last updated: 22 days ago • Promoted
Sr. AI / ML Software System Design Engineer

Sr. AI / ML Software System Design Engineer

Advanced Micro Devices • Markham
Full-time
WHAT YOU DO AT AMD CHANGES EVERYTHING.At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded syst...Show more
Last updated: 1 day ago • Promoted
Sr. AI / ML Software System Design Engineer

Sr. AI / ML Software System Design Engineer

AMD • Markham
Full-time
AI / ML Software System Design Engineer.Be among the first 25 applicants.Get AI-powered advice on this job and more exclusive features. Your actual pay will be based on your skills and experience — ta...Show more
Last updated: 2 days ago • Promoted
Generative AI ML Engineer — Build State-of-the-Art Models

Generative AI ML Engineer — Build State-of-the-Art Models

Ideogram • Toronto, ON, Canada
Full-time
A technology firm focused on AI-driven design is seeking a Machine Learning Engineer to build and deploy state-of-the-art models. The role involves collaborating with an ambitious team on complex ch...Show more
Last updated: 30+ days ago • Promoted
ML Engineer - Generative AI & LLMs (Remote)

ML Engineer - Generative AI & LLMs (Remote)

Ample Insight Inc. • Toronto, ON, Canada
Remote
Full-time
You will join a world-class team of engineers and data scientists from Facebook, Uber, Amazon and Google.We are a fast growing consulting firm based in Toronto with clients ranging from leading sta...Show more
Last updated: 30+ days ago • Promoted
Sr. Engineer, SoC Design Verification – AI / ML Accelerator Chiplets

Sr. Engineer, SoC Design Verification – AI / ML Accelerator Chiplets

Tenstorrent Inc. • Toronto
Full-time +1
Engineer, SoC Design Verification – AI / ML Accelerator Chiplets.Tenstorrent is leading the industry on cutting‑edge AI technology, revolutionizing performance expectations, ease of use, and cost eff...Show more
Last updated: 2 days ago • Promoted
GenAI ML Software Engineer – Build Scalable AI Systems

GenAI ML Software Engineer – Build Scalable AI Systems

RBC • Toronto
Full-time
A leading financial institution in Canada is seeking a Machine Learning Software Engineer to develop and scale AI-driven solutions. The role involves building and managing ML / GenAI projects from end...Show more
Last updated: 2 days ago • Promoted
Senior AI / ML Engineer

Senior AI / ML Engineer

Rogue Sentinel Studios Inc. • Toronto, ON, Canada
Full-time +1
We collaborate with global clients on game development, interactive experiences, and creative technology projects.As part of our expansion into North America, our Toronto office will serve as a key...Show more
Last updated: 8 days ago • Promoted
Lead AI / ML Software Engineer

Lead AI / ML Software Engineer

Mercor • Toronto, Canada
Full-time
Lead AI / ML Software Engineer Company : Mercor.Position : Machine Learning Engineer.Design and implement scalable ML pipelines for model training, evaluation, and continuous improvement.Build and fine...Show more
Last updated: 9 hours ago • Promoted • New!
Senior ML Engineer

Senior ML Engineer

hireVouch • Toronto, ON, Canada
Full-time
As a Senior Machine Learning Engineer, you will lead efforts to build models and services that support our core timekeeping product. You’ll collaborate closely with cross-functional teams to d...Show more
Last updated: 30+ days ago • Promoted
ROCm AI System Software Architect – Lead AI Stack

ROCm AI System Software Architect – Lead AI Stack

AMD • Markham
Full-time
A leading semiconductor company is seeking an expert-level Machine Learning System Software Engineer to develop advanced AI software solutions and optimize the AI software stack across AMD products...Show more
Last updated: 22 days ago • Promoted
Sr. Engineer, SoC Design Verification - AI / ML Accelerator Chiplets

Sr. Engineer, SoC Design Verification - AI / ML Accelerator Chiplets

Tenstorrent • Toronto
Full-time +1
Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions mu...Show more
Last updated: 2 days ago • Promoted
Sr. Machine Learning Engineer

Sr. Machine Learning Engineer

Align Technology • Toronto
Full-time
Join a team that is changing millions of lives.Transforming smiles, changing lives.At Align Technology, we believe a great smile can transform a person’s life, so we create technology that gives pe...Show more
Last updated: 22 days ago • Promoted
Senior Agentic AI Engineer

Senior Agentic AI Engineer

Talent To Hire Inc. • Toronto, ON, Canada
Full-time
Senior AI Engineer - Agentic Systems / LLMOPS.AI systems, LLM orchestration, and cloud deployment.The ideal candidate has hands-on experience building, deploying, and optimizing multi-agent archite...Show more
Last updated: 30+ days ago • Promoted
Director of Software Engineering (SAAS / AI / ML)

Director of Software Engineering (SAAS / AI / ML)

TEEMA Solutions Group • Toronto
Full-time
Director of Software Engineering.Downtown Toronto, Full-Time | Hybrid 4 days.Director or experienced Senior Manager.This is a unique opportunity to help grow a. You’ll provide technical oversight, l...Show more
Last updated: 22 days ago • Promoted
Hybrid ML Platform Engineer, Senior — LLMOps & AI Systems

Hybrid ML Platform Engineer, Senior — LLMOps & AI Systems

PRICELINE CAREERS • Toronto
Full-time
A leading technology firm in Toronto seeks an experienced ML Engineer to develop machine learning models and frameworks in a hybrid work environment. Applicants must have a Masters or PhD in a relev...Show more
Last updated: 2 days ago • Promoted
Senior AI Software Engineer | Hybrid, ML & API Focus

Senior AI Software Engineer | Hybrid, ML & API Focus

Thomson Reuters • Toronto
Full-time
A major global media company in Toronto is seeking a Senior Software Engineer, AI, to develop scalable AI / ML solutions. This role involves collaboration with cross-functional teams to enhance produc...Show more
Last updated: 20 days ago • Promoted
Sr. Machine Learning Engineer

Sr. Machine Learning Engineer

TheAppLabb • Toronto, ON, Canada
Full-time
The AppLabb is a leading innovation company specializing in AI-powered digital solutions, mobile app development, and emerging technologies. We leverage data-driven insights to enhance digital exper...Show more
Last updated: 30+ days ago • Promoted