Full Stack LLM Engineer

Cerebras Systems Inc. • Toronto, Canada

Applications are no longer being accepted

Posted 10 days ago

Job type

Full-time

Job description

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs.

Cerebras' current customers include global corporations across multiple industries, national labs, and top-tier healthcare systems. In January, we announced a multi-year, multi-million-dollar partnership with Mayo Clinic, underscoring our commitment to transforming AI applications across various fields. In August, we launched Cerebras Inference, the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services.

About the Role

We are seeking a versatile and experienced engineer to join our Inference Core Model Bringup team. This team is responsible for rapidly bringing up state-of-the-art open-source models (such as LLaMA and Qwen) and customer-provided proprietary models on our Cerebras CSX systems. Success in this role requires a system-minded generalist who thrives in fast-paced bring-up environments and is comfortable working across the entire Cerebras software stack. Your work will play a critical role in achieving unprecedented levels of performance, efficiency, and scalability for AI applications.

Responsibilities

Contribute to the end-to-end bring-up of ML models on Cerebras CSX systems.

Work across the stack: model architecture translation, graph lowering, compiler optimizations, runtime integration, and performance tuning.

Debug performance and correctness issues spanning model code, compiler IRs, runtime behavior, and hardware utilization.

Propose and prototype improvements across tools, APIs, or automation flows to accelerate future bring-ups.

Skills & Qualifications

Bachelor’s, Master’s, or PhD in Computer Science, Engineering, or a related field.

Comfort navigating the full AI toolchain: Python modeling code, compiler IRs, performance profiling, etc.

Strong debugging skills across performance, numerical accuracy, and runtime integration.

Experience with deep learning frameworks (e.g., PyTorch, TensorFlow) and familiarity with model internals (e.g., attention, MoE, diffusion).

Proficiency in C/C++ programming and experience with low-level optimization.

Proven experience in compiler development, particularly with LLVM and/or MLIR.

Strong background in optimization techniques, particularly those involving NP-hard problems.

What We Offer

Competitive salary and benefits package.

Opportunities for professional growth and career advancement.

A dynamic and innovative work environment.

The chance to work on cutting-edge technologies and make a significant impact on the future of AI.

Why Join Cerebras

People who are serious about software make their own hardware. At Cerebras we have built a breakthrough architecture that is unlocking new opportunities for the AI industry. With dozens of model releases and rapid growth, we’ve reached an inflection point in our business. Members of our team tell us there are five main reasons they joined Cerebras:

Build a breakthrough AI platform beyond the constraints of the GPU.

Publish and open source their cutting-edge AI research.

Work on one of the fastest AI supercomputers in the world.

Enjoy job stability with startup vitality.

Enjoy a simple, non-corporate work culture that respects individual beliefs.

Equal Employment Opportunity

Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer. We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies.
