Talent.com
Staff Software Engineer - Data & ML Infrastructure
Staff Software Engineer - Data & ML InfrastructureAdvanced Micro Devices, Inc • MARKHAM, Ontario, Canada
Staff Software Engineer - Data & ML Infrastructure

Staff Software Engineer - Data & ML Infrastructure

Advanced Micro Devices, Inc • MARKHAM, Ontario, Canada
30+ days ago
Job type
  • Full-time
Job description

WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career. THE ROLE : We are looking for an experienced Software Engineer with a strong interest in data infrastructure, automation and applied machine learning. This role is critical to building AMD’s next-generation data pipeline and analytics infrastructure to support AI workloads and GPU validation. You will design and implement scalable, high-performance data systems that collect, process, and analyze GPU telemetry, performance, and power data. You will work closely with firmware, software, and infrastructure teams to transform raw data into actionable insights and predictive intelligence. THE PERSON : You are passionate about software development with creative and effective problem-solving skills, a motivated, self-starter who can work both independently and collaboratively in fast paced environments. You have excellent technical communication, interpersonal and leadership skills. KEY RESPONSIBILITIES : Design, develop, and maintain scalable data pipelines for collecting, preprocessing, transforming, and storing large volumes of GPU and system telemetry data. Architect data models and ETL processes for both structured and unstructured data across SQL / NoSQL ecosystems. Integrate ML-based analytics (e.g., anomaly detection, performance prediction, power efficiency modeling) into production pipelines. Collaborate with multiple engineering teams to enable model training, evaluation, and deployment workflows using real-world GPU data. REQUIRED SKILLS : Strong proficiency in Python with production-grade data pipeline experience. Solid understanding of databases (SQL & NoSQL) and distributed data systems (e.g., PostgreSQL, MongoDB, Kafka, or Databricks). Hands-on experience with ETL frameworks and orchestration tools (e.g., Airflow, Prefect, or Luigi). Familiarity with ML frameworks such as PyTorch, Scikit-learn, or TensorFlow for applied data analysis and predictive modeling. Experience with data visualization and reporting tools (e.g., Grafana, PowerBI) is a plus. Experience working with cloud-based storage and compute services e.g., Azure, AWS, or GCP. PREFERRED EXPERIENCE : Background in hardware telemetry, performance, or GPU analytics. Experience building AI-driven automation systems or data-driven decision frameworks. Familiarity with containerized environments (Docker, Kubernetes) and CI / CD workflows. ACADEMIC CREDENTIALS : Bachelor’s / master’s degree program in Computer Science, Engineering, Mathematics, Data Engineering or similar program with focus on Software Engineering. #LI-JE1 Benefits offered are described : AMD benefits at a glance. AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and / or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process. AMD may use Artificial Intelligence to help screen, assess or select applicants for this position. AMD’s “Responsible AI Policy” is available here. This posting is for an existing vacancy.THE ROLE : We are looking for an experienced Software Engineer with a strong interest in data infrastructure, automation and applied machine learning. This role is critical to building AMD’s next-generation data pipeline and analytics infrastructure to support AI workloads and GPU validation. You will design and implement scalable, high-performance data systems that collect, process, and analyze GPU telemetry, performance, and power data. You will work closely with firmware, software, and infrastructure teams to transform raw data into actionable insights and predictive intelligence. THE PERSON : You are passionate about software development with creative and effective problem-solving skills, a motivated, self-starter who can work both independently and collaboratively in fast paced environments. You have excellent technical communication, interpersonal and leadership skills. KEY RESPONSIBILITIES : Design, develop, and maintain scalable data pipelines for collecting, preprocessing, transforming, and storing large volumes of GPU and system telemetry data. Architect data models and ETL processes for both structured and unstructured data across SQL / NoSQL ecosystems. Integrate ML-based analytics (e.g., anomaly detection, performance prediction, power efficiency modeling) into production pipelines. Collaborate with multiple engineering teams to enable model training, evaluation, and deployment workflows using real-world GPU data. REQUIRED SKILLS : Strong proficiency in Python with production-grade data pipeline experience. Solid understanding of databases (SQL & NoSQL) and distributed data systems (e.g., PostgreSQL, MongoDB, Kafka, or Databricks). Hands-on experience with ETL frameworks and orchestration tools (e.g., Airflow, Prefect, or Luigi). Familiarity with ML frameworks such as PyTorch, Scikit-learn, or TensorFlow for applied data analysis and predictive modeling. Experience with data visualization and reporting tools (e.g., Grafana, PowerBI) is a plus. Experience working with cloud-based storage and compute services e.g., Azure, AWS, or GCP. PREFERRED EXPERIENCE : Background in hardware telemetry, performance, or GPU analytics. Experience building AI-driven automation systems or data-driven decision frameworks. Familiarity with containerized environments (Docker, Kubernetes) and CI / CD workflows. ACADEMIC CREDENTIALS : Bachelor’s / master’s degree program in Computer Science, Engineering, Mathematics, Data Engineering or similar program with focus on Software Engineering. #LI-JE1

Benefits offered are described : AMD benefits at a glance. AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and / or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process. AMD may use Artificial Intelligence to help screen, assess or select applicants for this position. AMD’s “Responsible AI Policy” is available here. This posting is for an existing vacancy.

Create a job alert for this search

Staff Software Engineer Data ML Infrastructure • MARKHAM, Ontario, Canada

Similar jobs
Software Engineer, ML Data Infrastructure

Software Engineer, ML Data Infrastructure

Ideogram • Toronto
Full-time
Ideogram’s mission is to make world-class design accessible to everyone, multiplying human creativity.We build proprietary generative media models and AI native creative workflows, tackling unsolve...Show more
Last updated: 10 days ago • Promoted
Staff Software Engineer, AI Infra & Production

Staff Software Engineer, AI Infra & Production

ODAIA • Toronto
Full-time
A leading financial technology firm is seeking a Staff Software Engineer to lead the development of AI / ML infrastructure. The successful candidate will have 8+ years of experience in production engi...Show more
Last updated: 8 days ago • Promoted
Staff, ML Infrastructure Engineer

Staff, ML Infrastructure Engineer

Tubi • Toronto, Canada
Full-time
Overview About Tubi : Boldly built for every fandom, Tubi is a free streaming service that entertains over 100 million monthly active users. Tubi offers the world's largest collection of Hollywood mo...Show more
Last updated: 18 days ago • Promoted
Staff ML Engineer : Scalable Search & RAG

Staff ML Engineer : Scalable Search & RAG

Zendesk • Toronto, Canada
Part-time
A leading customer service technology company is looking for a Staff Machine Learning Engineer to enhance their AI-driven search capabilities. The role involves delivering scalable AI solutions, men...Show more
Last updated: 1 day ago • Promoted
Principal Staff Engineer – AI Infrastructure - AI / ML Leader

Principal Staff Engineer – AI Infrastructure - AI / ML Leader

Andiamo • Toronto C6A, ON, Canada
Full-time +1
Principal Staff Engineer - AI Infrastructure.This role sits at the intersection of large-scale distributed systems and cutting-edge machine learning, powering the platforms that enable researchers ...Show more
Last updated: 30+ days ago • Promoted
Staff Software Engineer - Data & ML Infrastructure

Staff Software Engineer - Data & ML Infrastructure

AMD • Markham
Full-time
Staff Software Engineer - Data & ML Infrastructure.Staff Software Engineer - Data & ML Infrastructure.What you do at AMD changes everything. At AMD, our mission is to build great products that accel...Show more
Last updated: 11 days ago • Promoted
Staff ML Engineer : Content Recommendations & NLP Systems Lead

Staff ML Engineer : Content Recommendations & NLP Systems Lead

Pinterest • Toronto C6A, ON, Canada
Remote
Full-time
A leading social media platform is seeking a Staff Machine Learning Engineer to lead a team focused on content recommendations and ML system design. The ideal candidate has over 5 years of experienc...Show more
Last updated: 30+ days ago • Promoted
Staff ML Engineer – Personalization & Recommendations

Staff ML Engineer – Personalization & Recommendations

Tubi, Inc. • Toronto C6A, ON, Canada
Remote
Full-time
A leading streaming service provider in Toronto is looking for a highly skilled Staff Machine Learning Engineer.You will design and implement advanced algorithms to enhance video personalization fo...Show more
Last updated: 28 days ago • Promoted
Staff Software Engineer, Inference Infrastructure

Staff Software Engineer, Inference Infrastructure

Cohere • Toronto, Canada
Full-time
Who are we? Our mission is to scale intelligence to serve humanity.We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experienc...Show more
Last updated: 16 days ago • Promoted
Staff ML Engineer – Location Platform & Mobility

Staff ML Engineer – Location Platform & Mobility

Life360 • Toronto, Canada
Full-time
A leading technology company in Canada seeks a Staff Machine Learning Engineer to architect intelligence for their location platform. In this hands-on role, you will design ML systems enhancing user...Show more
Last updated: 29 days ago • Promoted
Staff AI Software Engineer - ML Systems & Architectures

Staff AI Software Engineer - ML Systems & Architectures

PowerToFly • Toronto, Canada
Full-time
A global leader in technology solutions is hiring a Staff Software Engineer to develop complex AI systems for enhancing workflows in Toronto. The ideal candidate will have over 7 years of software e...Show more
Last updated: 27 days ago • Promoted
Staff Software / Platform Engineer, Infrastructure- Privileged Access Management

Staff Software / Platform Engineer, Infrastructure- Privileged Access Management

TechBrains • Toronto, Canada
Full-time
Staff Software / Platform Engineer, Infrastructure- Privileged Access Management Okta 29 July 2025.Okta is The World’s Identity Company. We free everyone to safely use any technology, anywhere, on a...Show more
Last updated: 18 days ago • Promoted
Staff Software Engineer

Staff Software Engineer

Nova Credit • Toronto, Canada
Full-time
This range is provided by Nova Credit.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. At Nova Credit, we’re on a mission to power financial incl...Show more
Last updated: 30+ days ago • Promoted
Staff Software Engineer Cloud Infrastructure

Staff Software Engineer Cloud Infrastructure

Promote Project • Toronto
Full-time
Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions mu...Show more
Last updated: 11 days ago • Promoted
Staff Platform Engineer — AI / LLM Infrastructure, Hybrid Toronto

Staff Platform Engineer — AI / LLM Infrastructure, Hybrid Toronto

EvenUp • Toronto, Canada
Full-time
A technology company focused on fairness is seeking a Staff Software Engineer for their hybrid role in Toronto.The ideal candidate will have strong experience building and scaling backend systems, ...Show more
Last updated: 18 days ago • Promoted
Staff Software Engineer - Data & ML Infrastructure

Staff Software Engineer - Data & ML Infrastructure

Advanced Micro Devices • Markham
Full-time
WHAT YOU DO AT AMD CHANGES EVERYTHING.At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded syst...Show more
Last updated: 10 days ago • Promoted
Staff Data Engineer : Data Pipelines & ML (Remote)

Staff Data Engineer : Data Pipelines & ML (Remote)

Stripe • Toronto, Canada
Remote
Full-time
A financial infrastructure platform is seeking a Staff Software Engineer focused on data management in Toronto.This role involves leading the development of data pipelines and applications for Sale...Show more
Last updated: 30+ days ago • Promoted
Staff Software Engineer, Data Systems

Staff Software Engineer, Data Systems

Hive.co • Toronto, Canada
Full-time
At Hive, we’re all about creating moments that matter and helping event marketers connect with their biggest fans.Our platform powers marketing for 1,500+ iconic events, festivals, venues, and prom...Show more
Last updated: 18 days ago • Promoted