Talent.com
Performance Engineer - Inference
Performance Engineer - InferenceCerebras Systems Inc. • Toronto, ON, CA
Performance Engineer - Inference

Performance Engineer - Inference

Cerebras Systems Inc. • Toronto, ON, CA
Il y a 11 jours
Type de contrat
  • Temps plein
Description de poste

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs.

Cerebras' current customers include global corporations across multiple industries, national labs, and top-tier healthcare systems. In January, we announced a multi-year, multi-million-dollar partnership with Mayo Clinic, underscoring our commitment to transforming AI applications across various fields. In August, we launched Cerebras Inference, the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services.

About The Role

Engineers on the inference performance team operate at the intersection of hardware and software, driving end-to-end model inference speed and throughput. Their work spans low-level kernel performance debugging and optimization, system-level performance analysis, performance modeling and estimation, and the development of tooling for performanceprojection and diagnostics.

Responsibilities

  • Build performance models (kernel-level, end-to-end) to estimate the performance of state of the art and customer ML models.
  • Optimize and debug our kernel micro code and compiler algorithms to elevate ML model inference speed, throughput and compute utilization on the Cerebras WSE.
  • Debug and understand runtime performance on the system and cluster.
  • Develop tools and infrastructure to help visualize performance data collected from the Wafer Scale Engine and our compute cluster.

Requirements

  • Bachelors / Masters / PhD in Electrical Engineering or Computer Science.
  • Strong background in computer architecture.
  • Exposure to and understanding of low-level deep learning / LLM math.
  • Strong analytical and problem-solving mindset.
  • 3+ years of experience in a relevant domain (Computer Architecture, CPU / GPU Performance, Kernel Optimization, HPC).
  • Experience working on CPU / GPU simulators.
  • Exposure to performance profiling and debug on any system pipeline.
  • Comfort with C++ and Python.
  • Why Join Cerebras

    People who are serious about software make their own hardware. At Cerebras we have built a breakthrough architecture that is unlocking new opportunities for the AI industry. With dozens of model releases and rapid growth, we’ve reached an inflection point in our business. Members of our team tell us there are five main reasons they joined Cerebras :

  • Build a breakthrough AI platform beyond the constraints of the GPU.
  • Publish and open source their cutting-edge AI research.
  • Work on one of the fastest AI supercomputers in the world.
  • Enjoy job stability with startup vitality.
  • Our simple, non-corporate work culture that respects individual beliefs.
  • Apply today and become part of the forefront of groundbreaking advancements in AI!

    Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer. We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies. We try every day to build a work environment that empowers people to do their best work through continuous learning, growth and support of those around them.

    This website or its third-party tools process personal data. For more details, click here to review our CCPA disclosure notice.

    #J-18808-Ljbffr

    Créer une alerte emploi pour cette recherche

    Performance Engineer Inference • Toronto, ON, CA

    Offres similaires
    Senior Presales Engineer, Data Platform

    Senior Presales Engineer, Data Platform

    Perforce Software, Inc. • Toronto C6A, ON, Canada
    Temps plein
    A leading software firm is seeking a Senior Sales Engineer in Toronto, Ontario to partner with sales teams in presenting Delphix’s data management solutions. The role requires effective communicatio...Voir plus
    Dernière mise à jour : il y a 13 jours • Offre sponsorisée
    Full Stack Engineer

    Full Stack Engineer

    GEI Consultants • Markham, ON, Canada
    Temps plein
    The Full Stack Engineer is responsible for front-end development and back-end interconnection of solutions that support AI-powered applications and integrations across GEI.This role builds user-fac...Voir plus
    Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
    IAM Engineer

    IAM Engineer

    Dexian • toronto, on, ca
    Temps plein
    We are looking for candidates with strong technical expertise to fill this role.Below are the details of the position : . Mode of Job : Hybrid (4 days onsite).Locations : Toronto / Mississauga / Markham / Sca...Voir plus
    Dernière mise à jour : il y a 19 heures • Offre sponsorisée • Nouvelle offre
    Quality Engineer

    Quality Engineer

    freelance.ca • Toronto, Canada
    Temps plein
    If anyone is interested, please let me know.Roles & Responsibilities⦁ 4-6 years of hands-on experience in AI / ML engineering in cloud environments⦁ Strong proficiency in Python, SQL and ML framework...Voir plus
    Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
    Customer Quality Engineer

    Customer Quality Engineer

    Magna International, Inc • Markham, ON, Canada
    Permanent
    At Magna, you can expect an engaging and dynamic environment where you can help to develop industry-leading automotive technologies. We invest in our employees, providing them with the support and r...Voir plus
    Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
    Senior Platform Engineer – Ingress & Service Mesh (Istio) - richmond hill

    Senior Platform Engineer – Ingress & Service Mesh (Istio) - richmond hill

    Net2Source (N2S) • richmond hill, on, ca
    Temps plein
    Senior Platform Engineer – Istio / Ingress.Join the Boundary Services team to.Istio-based traffic routing, gateways, and reliability. Istio ingress gateway and service mesh.Set best practices for bo...Voir plus
    Dernière mise à jour : il y a 19 heures • Offre sponsorisée • Nouvelle offre
    Performance Engineer - Inference

    Performance Engineer - Inference

    Cerebras Systems • Toronto C6A, ON, Canada
    Temps plein
    Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs.Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programm...Voir plus
    Dernière mise à jour : il y a 13 jours • Offre sponsorisée
    Senior AI Full-Stack Engineer — Remote / Hybrid

    Senior AI Full-Stack Engineer — Remote / Hybrid

    AutoAlign AI • Toronto C6A, ON, Canada
    Télétravail
    Temps plein
    A tech-focused AI company in Toronto is seeking a Senior Full-Stack Application Developer.The role involves developing AI applications across platforms and integrating them with backend systems in ...Voir plus
    Dernière mise à jour : il y a 11 jours • Offre sponsorisée
    Data Analytics Engineer - richmond hill

    Data Analytics Engineer - richmond hill

    EXL • richmond hill, on, ca
    Temps plein
    Sports Analytics & Engineering Practice, and we’re hiring an.This role focuses on transforming client data into trusted, analytics-ready assets that power dashboards, insights, and decision-making....Voir plus
    Dernière mise à jour : il y a 3 heures • Offre sponsorisée • Nouvelle offre
    Director, Performance & Analytics (Hybrid)

    Director, Performance & Analytics (Hybrid)

    Canada Pension Plan Investment Board • Toronto C6A, ON, Canada
    Temps plein
    A leading investment organization in Toronto is seeking a Director of Performance to oversee the performance measurement and analysis for a globally diversified portfolio.The ideal candidate will h...Voir plus
    Dernière mise à jour : il y a 10 jours • Offre sponsorisée
    Senior Principal Research Engineer

    Senior Principal Research Engineer

    Autodesk, Inc. • Toronto C6A, ON, Canada
    Télétravail
    Temps plein
    Job Requisition ID # • •25WD94222 • • • •Learn More • • • • • •About Autodesk • •Welcome to Autodesk! Amazing things are created every day with our software – from the greenest buildings and cleanest cars to the...Voir plus
    Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
    EMS / SCADA Engineer - richmond hill

    EMS / SCADA Engineer - richmond hill

    Pacer Group • richmond hill, on, ca
    Temps plein
    Network or Transmission Application preferably Reliance.LINUX and Windows Operating Systems.Proficient in Electric Transmission EMS / SCADA / Implementation. Good knowledge of Electric SCADA applicat...Voir plus
    Dernière mise à jour : il y a 1 heure • Offre sponsorisée • Nouvelle offre
    AI-Driven Full-Stack Engineer (Remote)

    AI-Driven Full-Stack Engineer (Remote)

    PolicyMe Corp. • Toronto C6A, ON, Canada
    Télétravail
    Temps plein
    A digital insurance startup in Toronto is seeking an AI-Driven Full Stack Engineer to join their growing team.The ideal candidate will have over 5 years of experience in full stack development and ...Voir plus
    Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
    Senior Site Reliability / Infrastructure Platform Engineer

    Senior Site Reliability / Infrastructure Platform Engineer

    Nextologies Limited • Markham, ON, Canada
    Temps plein
    Senior Site Reliability / Infrastructure Platform Engineer.Virtualization, distributed systems, Linux performance, and service reliability). Act as senior escalation point for service outages, platf...Voir plus
    Dernière mise à jour : il y a 19 jours • Offre sponsorisée
    Senior Systems Engineer

    Senior Systems Engineer

    Essence Coaching Group • Markham, ON, Canada
    Temps plein
    Lindsay, Ontario, Canada (Hybrid).CAD 165,000 – 210,000 gross / year.A senior-level Systems Engineer is sought to lead aircraft- and system-level engineering activities for next-generation elec...Voir plus
    Dernière mise à jour : il y a 26 jours • Offre sponsorisée
    Performance Engineer - Inference

    Performance Engineer - Inference

    Cerebras Systems Inc. • Toronto C6A, ON, Canada
    Temps plein
    Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs.Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programm...Voir plus
    Dernière mise à jour : il y a 12 jours • Offre sponsorisée
    Machine Learning Scientist / Engineer

    Machine Learning Scientist / Engineer

    SPECTRAFORCE • Greater Toronto Area, Canada
    Temps plein
    Job Title : Machine Learning Scientist / Engineer.Length of contract- 12 Months (Possible extension).Hybrid- 2-3 days a week @ Toronto, ON. Interview- 2 rounds (first panel, 45 min).Machine Learning S...Voir plus
    Dernière mise à jour : il y a 22 heures • Offre sponsorisée • Nouvelle offre
    Analytics Engineer

    Analytics Engineer

    Bird • Toronto, ON, Canada
    Temps plein
    Now we're shaping its future.We're Bird, and we're on a mission to.Our products, services, and people share one common goal : to make cities more livable by empowering people and communi...Voir plus
    Dernière mise à jour : il y a 15 heures • Offre sponsorisée • Nouvelle offre