Talent.com
Senior ML Performance Engineer

Lemurian Labs Inc. • Toronto, ON, CA
More than 30 days ago
Job type
  • Full-time
Job description

About Us

At Lemurian Labs, we're on a mission to bring the power of AI to everyone—without leaving a massive environmental footprint. We care deeply about the impact AI has on our society and planet, and we're building a solid foundation for its future, ensuring AI grows sustainably and responsibly. Innovation should help the world, not harm it.

We are building a high-performance, portable compiler that lets developers "build once, deploy anywhere." Yes, anywhere. We're talking about seamless cross-platform compatibility, so you can train your models in the cloud, deploy them to the edge, and everything in between—all while optimizing for resource efficiency and scalability.

If the idea of sustainably scaling AI motivates you and you're excited about making AI development both powerful and accessible, then we'd love to have you. Join us at Lemurian Labs, where you can have fun building the future—without leaving a mess behind.

The Role

We're looking for a Senior ML Performance Engineer to architect and lead our Performance Testing Platform from the ground up. You'll be the technical authority on how we measure, validate, and optimize the performance of large language models (Llama 3.2 70B, DeepSeek, and others) before and after compiler optimization on modern GPU architectures.

This is a high-impact role where you'll directly influence our product quality and our customers' success. You'll work at the intersection of ML systems, GPU architecture, and performance engineering—building the infrastructure that proves our compiler delivers real value.

What You'll Do

  • Design and build a comprehensive performance testing platform for evaluating LLM inference workloads across GPU clusters
  • Define and implement the benchmarking methodology, metrics, and test suites that measure latency, throughput, memory utilization, power consumption, and model accuracy
  • Establish baseline performance for unoptimized models (Llama 3.2 70B, DeepSeek, etc.) and validate post-optimization improvements
  • Develop automated testing pipelines for continuous performance validation across compiler releases and model updates
  • Investigate performance bottlenecks using profiling tools (ROCm profilers, GPU traces, system-level monitoring) and work with the compiler team to drive optimizations
  • Create dashboards and reporting that provide clear visibility into performance trends, regressions, and wins
  • Collaborate cross-functionally with compiler engineers, ML engineers, and DevOps to ensure performance testing is integrated into our development workflow
  • Document best practices for performance testing and optimization of ML workloads on GPU hardware

What You'll Bring

  • 7+ years of experience in performance engineering, benchmarking, or systems engineering roles
  • Deep understanding of ML inference workloads, particularly transformer-based models and LLMs
  • Hands-on experience with GPU programming and optimization (CUDA, ROCm, or similar)
  • Strong programming skills in Python and C/C++
  • Proven track record of building performance testing infrastructure or benchmarking platforms from scratch
  • Experience with ML frameworks (PyTorch, TensorFlow, ONNX Runtime, vLLM, TensorRT-LLM, etc.)
  • Proficiency with profiling and debugging tools for GPU workloads
  • Strong analytical skills with the ability to design experiments, analyze results, and communicate findings clearly
  • Experience with CI/CD systems and test automation frameworks

Nice to Have

  • Experience with AMD GPUs (MI200/MI300 series) and the ROCm ecosystem
  • Knowledge of compiler optimization techniques and their impact on performance
  • Experience with distributed inference and multi-GPU workloads
  • Familiarity with ML model quantization, pruning, and other optimization techniques
  • Background in high-performance computing or systems-level optimization
  • Experience with infrastructure-as-code (Kubernetes, Docker, Terraform)
  • Contributions to open-source ML or systems projects

Personal Attributes

  • Obsessive about details — you notice the 2% regression that others miss
  • Self-driven — you take ownership and don't wait for permission to solve problems
  • Collaborative mindset — you work well across teams and help others succeed
  • Passionate about sustainability — you care about making AI more efficient and environmentally responsible
  • Clear communicator — you can explain complex technical concepts to both engineers and stakeholders

Salary depends on experience and geographical location.

This salary range may be inclusive of several career levels and will be narrowed during the interview process based on a number of factors, such as the candidate’s experience, knowledge, skills, and abilities, as well as internal equity among our team.

Additional benefits for this role may include: equity; company bonus opportunities; medical, dental, and vision benefits; a retirement savings plan; and supplemental wellness benefits.

Lemurian Labs ensures equal employment opportunity without discrimination or harassment based on race, color, religion, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity or expression, age, disability, national origin, marital or domestic/civil partnership status, genetic information, citizenship status, veteran status, or any other characteristic protected by law.

EOE
