Talent.com
AI Performance Engineer (Cloud AI Engineering), Sr | Staff | Sr. Staff
Qualcomm • Markham, York Region, CA
30+ days ago
Job type
  • Full-time
Job description

AI Performance Engineer (Cloud AI Engineering) – Senior – Staff – Senior Staff role at Qualcomm.

Engineering Group, Machine Learning Engineering.

Job Summary

Qualcomm is utilizing its traditional strengths in digital wireless technologies to play a central role in the evolution of Cloud AI. We are investing in several supporting technologies including Deep Learning. The Qualcomm Cloud AI team is developing hardware and software solutions for Inference Acceleration.

Responsibilities

  • Convert, optimize and deploy models for efficient inference using PyTorch and ONNX.
  • Work at the forefront of GenAI by understanding advanced algorithms (e.g. attention mechanisms, MoEs and numerics) to identify new optimization opportunities.
  • Analyze and optimize the performance of LLM, VLM, and diffusion models for inference, and scale performance to meet throughput and latency constraints.
  • Map next-generation AI workloads onto current and future hardware designs.
  • Work closely with customers to drive solutions by collaborating with internal compiler, firmware and platform teams.
  • Analyze complex performance or stability issues to find the root cause of the underlying problems.
  • Create engineering solutions that deliver continuous insight into the performance of AI workloads and guide improvements over time.
  • Design and implement high-level kernels, e.g. in Triton, with a focus on generating efficient, low-level code.

Qualifications

  • Hands-on experience in building and optimizing language models, notably in PyTorch and ONNX, preferably in production-grade environments.
  • Deep understanding of transformer architectures, attention mechanisms and performance trade-offs.
  • Experience with workload mapping strategies that use sharding or other forms of parallelism.
  • Strong Python programming skills.
  • Proactive learning about the latest inference optimization techniques.
  • Understanding of computer architecture, ML accelerators, in-memory processing and distributed systems.
  • Strong communication and problem-solving skills, and the ability to learn and work effectively in a fast-paced, collaborative environment.
  • MS in Computer Science, Machine Learning, Computer Engineering or Electrical Engineering.

Bonus Skills

  • Background in neural network operators and mathematical operations, including linear algebra and math libraries.
  • Understanding of machine learning compilers.
  • Experience with accuracy convergence and its evaluation methods.
  • Knowledge of torch.compile or TorchDynamo.
  • PhD in Computer Science, Computer Engineering or Machine Learning.

Minimum Qualifications

  • Bachelor's degree in Computer Science, Engineering, Information Systems, or related field and 6+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.
  • Master's degree in Computer Science, Engineering, Information Systems, or related field and 5+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.
  • PhD in Computer Science, Engineering, Information Systems, or related field and 4+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.

Pay Range And Other Compensation & Benefits

$178,400.00 - $267,600.00

Qualcomm offers a competitive annual discretionary bonus program and RSU grants. Contact Qualcomm Careers for more details.

Equal Opportunity and Accessibility Statement

Qualcomm is an equal opportunity employer. If you are an individual with a disability and need an accommodation during the application / hiring process, rest assured that Qualcomm is committed to providing an accessible process. Contact disability-accomodations@qualcomm.com or call Qualcomm's toll‑free number for reasonable accommodations.

Qualcomm is an equal opportunity employer; all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or any other protected classification.
