Talent.com
Rack-Level IO Diagnostics Software Engineer
Rack-Level IO Diagnostics Software EngineerAdvanced Micro Devices, Inc • MARKHAM, Ontario, Canada
Rack-Level IO Diagnostics Software Engineer

Rack-Level IO Diagnostics Software Engineer

Advanced Micro Devices, Inc • MARKHAM, Ontario, Canada
14 days ago
Job type
  • Full-time
Job description

WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career. THE ROLE : The IO Diagnostics Engineering team is responsible for delivering diagnostic test suites for AMD Instinct GPUs and Epyc CPUs, which test the leading industry high-performance interconnects IP subsystems and OS supporting IP subsystems. This role requires considerable use of initiative and independent judgment. As a rack level IO diagnostics engineer, you will be responsible for scoping the feature test requirements, creating diagnostic coverage plan documents, implementation and automation of diagnostic test content, as well as customer issues debugging, etc., for the expanding and strategic market for AMD. To accomplish this, you will need to interact with other key engineering teams such as silicon design, board engineering and manufacturing and leaders within AMD. THE PERSON : The rack level IO diagnostic engineer will have excellent communication skills, planning skills, technical expertise, and critical problem-solving skills. KEY RESPONSIBILITIES : Deliver forward thinking IP & board & rack level diagnostic strategy. Transfer high level requirements into comprehensive diagnostic test coverage with desired coverage metrics, and detailed diagnostic test items. Specific pre-silicon work would include developing all the defined diagnostics test contents with required timelines and verifying critical features on emulation environments for reducing risk of post-silicon work. Post-silicon work includes silicon bring up, test execution, issues debugging, and optimizing test time without sacrificing the test coverage. Collaborate with cross functional teams to achieve the key program milestones, such as bring up, all feature enablement, performance profiling, production support, customer issue debugging, etc. Be a driver of continuous improvement of enhancing diagnostic coverage, code quality, and development processes. Drive test innovation and strategic initiatives based on learning from prior programs and your own best practice knowledge. PREFERRED EXPERIENCE : Experience in developing automation software to enable Silicon / System / Cluster level validation. Expertise of silicon / system test methodologies. Solid understanding of HW / FW / SW interaction and system-level engineering. Proficiency in programming / scripting languages (e.g., C / C++, Perl, Ruby, Python). Deep knowledge in low-level software and system level debug and has capability to quickly identify problems and provide robust solutions. Technical leadership experience is preferred - technical subject matter experience in a large, fast paced environment. Understanding of AI infrastructure and experience in the ML / AI space is preferred. ROCm software development experience is a plus. Good understanding of x86 / ARM architecture and experience with BIOS, GPU, PCIe firmware development is a plus. Self-motivated, organized, detailed-oriented and results-oriented. Strong communication and collaboration skills. ACADEMIC CREDENTIALS : Bachelor’s or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent. LOCATION : Markham, ON #LI-AJ1 #LI-HYBRID Benefits offered are described : AMD benefits at a glance. AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and / or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.THE ROLE : The IO Diagnostics Engineering team is responsible for delivering diagnostic test suites for AMD Instinct GPUs and Epyc CPUs, which test the leading industry high-performance interconnects IP subsystems and OS supporting IP subsystems. This role requires considerable use of initiative and independent judgment. As a rack level IO diagnostics engineer, you will be responsible for scoping the feature test requirements, creating diagnostic coverage plan documents, implementation and automation of diagnostic test content, as well as customer issues debugging, etc., for the expanding and strategic market for AMD. To accomplish this, you will need to interact with other key engineering teams such as silicon design, board engineering and manufacturing and leaders within AMD. THE PERSON : The rack level IO diagnostic engineer will have excellent communication skills, planning skills, technical expertise, and critical problem-solving skills. KEY RESPONSIBILITIES : Deliver forward thinking IP & board & rack level diagnostic strategy. Transfer high level requirements into comprehensive diagnostic test coverage with desired coverage metrics, and detailed diagnostic test items. Specific pre-silicon work would include developing all the defined diagnostics test contents with required timelines and verifying critical features on emulation environments for reducing risk of post-silicon work. Post-silicon work includes silicon bring up, test execution, issues debugging, and optimizing test time without sacrificing the test coverage. Collaborate with cross functional teams to achieve the key program milestones, such as bring up, all feature enablement, performance profiling, production support, customer issue debugging, etc. Be a driver of continuous improvement of enhancing diagnostic coverage, code quality, and development processes. Drive test innovation and strategic initiatives based on learning from prior programs and your own best practice knowledge. PREFERRED EXPERIENCE : Experience in developing automation software to enable Silicon / System / Cluster level validation. Expertise of silicon / system test methodologies. Solid understanding of HW / FW / SW interaction and system-level engineering. Proficiency in programming / scripting languages (e.g., C / C++, Perl, Ruby, Python). Deep knowledge in low-level software and system level debug and has capability to quickly identify problems and provide robust solutions. Technical leadership experience is preferred - technical subject matter experience in a large, fast paced environment. Understanding of AI infrastructure and experience in the ML / AI space is preferred. ROCm software development experience is a plus. Good understanding of x86 / ARM architecture and experience with BIOS, GPU, PCIe firmware development is a plus. Self-motivated, organized, detailed-oriented and results-oriented. Strong communication and collaboration skills. ACADEMIC CREDENTIALS : Bachelor’s or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent. LOCATION : Markham, ON #LI-AJ1 #LI-HYBRID

Benefits offered are described : AMD benefits at a glance. AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and / or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

Create a job alert for this search

Software Engineer • MARKHAM, Ontario, Canada

Similar jobs
Senior Software Engineer, IoT Fleet & Safety (Remote)

Senior Software Engineer, IoT Fleet & Safety (Remote)

Samsara • Toronto C6A, ON, Canada
Remote
Full-time
A leading technology firm is seeking a Senior Software Engineer II for remote work in Canada.This role requires strong programming skills and over 8 years of software development experience.Respons...Show more
Last updated: 30+ days ago • Promoted
Senior Software Engineer II

Senior Software Engineer II

Tripledot Studios Limited • Toronto C6A, ON, Canada
Remote
Full-time +1
Tripledot is one of the largest independent mobile games companies in the world.We are a multi‑award‑winning organisation, with a global 2,500+ strong team across 12 studios.Our expanded portfolio ...Show more
Last updated: 5 days ago • Promoted
Senior Software Engineer II

Senior Software Engineer II

Tripledot Studios • Toronto C6A, ON, Canada
Full-time +1
For candidates residing in California, Colorado, Florida, Georgia, Illinois, Maryland, Massachusetts, Minnesota, Nevada, New Hampshire, New Jersey, Oregon, Rhode Island, Texas, and Washington State...Show more
Last updated: 4 days ago • Promoted
Senior Software Engineer, iOS

Senior Software Engineer, iOS

Tubi • Toronto C6A, ON, Canada
Full-time
One week ago Be among the first 25 applicants.Boldly built for every fandom, Tubi is a free streaming service that entertains over 100 million monthly active users. Tubi offers the world's largest c...Show more
Last updated: 30+ days ago • Promoted
Senior Software Engineer

Senior Software Engineer

Headstart AI • Toronto C6A, ON, Canada
Full-time
This range is provided by Headstart AI.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. At Headstart our mission is to bring companies into the a...Show more
Last updated: 23 days ago • Promoted
Senior TypeScript SDK Engineer (Remote)

Senior TypeScript SDK Engineer (Remote)

Walrus Foundation • Toronto C6A, ON, Canada
Full-time
A tech innovative firm in Toronto is seeking a Senior Software Engineer specializing in TypeScript to develop crucial SDKs and APIs. The ideal candidate will have over 4 years of experience, particu...Show more
Last updated: 23 days ago • Promoted
Senior Software Engineer, AI II

Senior Software Engineer, AI II

Thomson Reuters • Toronto C6A, ON, Canada
Remote
Full-time
Are you ready to shape the future of AI-driven content technology while leading cutting-edge innovation in a mission-critical role? Do you thrive in environments where your technical expertise can ...Show more
Last updated: 21 days ago • Promoted
Senior Software Engineer

Senior Software Engineer

Medeloop • Toronto C6A, ON, Canada
Full-time
Medeloop is creating the future of clinical operations and health research through cutting‑edge AI and big data technologies. Our unified platform, spanning AI‑powered analytics, study management, a...Show more
Last updated: 23 days ago • Promoted
Senior Software Engineer, AI Model Serving - Toronto, Canada Toronto, Canada

Senior Software Engineer, AI Model Serving - Toronto, Canada Toronto, Canada

Speechify, Inc. • Toronto C6A, ON, Canada
Remote
Full-time
Senior Software Engineer, AI Model serving - Toronto, Canada.The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-spee...Show more
Last updated: 14 days ago • Promoted
Senior Software Engineer, AI

Senior Software Engineer, AI

Superbar • Toronto C6A, ON, Canada
Full-time
Continue with Google Continue with Google.This range is provided by Superbar.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.Superbar is the AI-...Show more
Last updated: 30+ days ago • Promoted
MedTech Software Engineer III (C# / C++)

MedTech Software Engineer III (C# / C++)

Orthofix Holdings, Inc. • Toronto C6A, ON, Canada
Full-time
A medical technology company in Toronto is looking for a Software Developer III to design, support, and maintain software for navigation systems. Candidates should have a Bachelor's degree in a comp...Show more
Last updated: 30+ days ago • Promoted
Senior Software Engineer, AI Model serving - Toronto, Canada

Senior Software Engineer, AI Model serving - Toronto, Canada

Clutch Canada • Toronto C6A, ON, Canada
Full-time
PLEASE APPLY THROUGH THIS LINK : .The mission of Speechify is to make sure that reading is never a barrier to learning.Over 50 million people use Speechify’s text-to-speech products to turn whatever ...Show more
Last updated: 21 days ago • Promoted
Senior Software Engineer II

Senior Software Engineer II

Forter • Toronto C6A, ON, Canada
Full-time
Senior Software Engineer II – Forter.About the role : Forter is looking for a Senior Software Engineer II to develop and cultivate new products for fraud detection. The engineering decisions made by ...Show more
Last updated: 15 days ago • Promoted
Senior Software Engineer — AI in Healthcare

Senior Software Engineer — AI in Healthcare

Medsender • Toronto C6A, ON, Canada
Full-time
A healthcare technology company in Toronto is looking for a talented software engineer with strong programming fundamentals and a passion for improving patient care through AI.You will be joining a...Show more
Last updated: 24 days ago • Promoted
Remote Software Engineer - Agentic AI & Scalable Systems

Remote Software Engineer - Agentic AI & Scalable Systems

SCALIS • Toronto C6A, ON, Canada
Remote
Full-time
A progressive tech company in Toronto is looking for Software Engineers to develop intelligent AI products.The ideal candidate will be responsible for developing robust applications using TypeScrip...Show more
Last updated: 20 days ago • Promoted
Senior Programmer / Developer - Azure ML Architect Leak Detection

Senior Programmer / Developer - Azure ML Architect Leak Detection

Capgemini • Toronto C6A, ON, Canada
Full-time
Must have one or more Azure Certification preferably Azure ML.Good Hands On Experience of Azure AI / ML tools and Databricks. Good to have experience of Azure Data Engineering Tools.Must be good in ...Show more
Last updated: 19 days ago • Promoted
Senior Software Engineer - Azure & C++

Senior Software Engineer - Azure & C++

Hamilton-Carr • Toronto C6A, ON, Canada
Full-time
A leading consulting firm is seeking a Senior Software Engineer to join their Engineering team in Toronto.This role involves developing advanced software applications using C++, SQL, and Microsoft ...Show more
Last updated: 8 days ago • Promoted
High-Performance AI ASIC Product Engineer

High-Performance AI ASIC Product Engineer

Tenstorrent Inc. • Toronto C6A, ON, Canada
Remote
Full-time +1
Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions mu...Show more
Last updated: 22 days ago • Promoted