Full Stack LLM EngineerCerebras Systems Inc. • Toronto, Canada

No longer accepting applications

Full Stack LLM Engineer

Cerebras Systems Inc. • Toronto, Canada

10 days ago

Job type

Full-time

Job description

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs.

Cerebras' current customers include global corporations across multiple industries, national labs, and top-tier healthcare systems. In January, we announced a multi-year, multi-million-dollar partnership with Mayo Clinic, underscoring our commitment to transforming AI applications across various fields. In August, we launched Cerebras Inference, the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services.

About the Role

We are seeking a versatile and experienced engineer to join our Inference Core Model Bringup team. This team is responsible to rapidly bring up state-of-the-art open-source models (like LLaMA, Qwen, etc) or customer-provided proprietary models on our Cerebras CSX systems. Success in this role requires a system-minded generalist who thrives in fast-paced bringup environments and is comfortable working across the entire Cerebras software stack. Your work will play a critical role in achieving unprecedented levels of performance, efficiency, and scalability for AI applications.

Responsibilities

Contribute to the end-to-end bring up of ML models on Cerebras CSX systems.

Work across the stack : model architecture translation, graph lowering, compiler optimizations, runtime integration, and performance tuning.

Debug performance and correctness issues spanning model code, compiler IRs, runtime behavior, and hardware utilization.

Propose and prototype improvements across tools, APIs, or automation flows to accelerate future bring ups.

Skills & Qualifications

Bachelor’s, Master’s, or PhD in Computer Science, Engineering, or a related field.

Comfort navigating the full AI toolchain : Python modeling code, compiler IRs, performance profiling, etc.

Strong debugging skills across performance, numerical accuracy, and runtime integration.

Experience with deep learning frameworks (e.g., PyTorch, TensorFlow) and familiarity with model internals (e.g., attention, MoE, diffusion).

Proficiency in C / C++ programming and experience with low-level optimization.

Proven experience in compiler development, particularly with LLVM and / or MLIR.

Strong background in optimization techniques, particularly those involving NP-hard problems.

What We Offer

Competitive salary and benefits package.

Opportunities for professional growth and career advancement.

A dynamic and innovative work environment.

The chance to work on cutting-edge technologies and make a significant impact on the future of AI.

Why Join Cerebras

People who are serious about software make their own hardware. At Cerebras we have built a breakthrough architecture that is unlocking new opportunities for the AI industry. With dozens of model releases and rapid growth, we’ve reached an inflection point in our business. Members of our team tell us there are five main reasons they joined Cerebras :

Build a breakthrough AI platform beyond the constraints of the GPU.

Publish and open source their cutting-edge AI research.

Work on one of the fastest AI supercomputers in the world.

Enjoy job stability with startup vitality.

Our simple, non-corporate work culture that respects individual beliefs.

Equal Employment Opportunity

Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer. We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies.

J-18808-Ljbffr

Create a job alert for this search

Full Stack LLM Engineer • Toronto, Canada

Similar jobs

LLM Engineer — Hybrid Toronto, Agentic AI & Pipelines

Mindlance • Toronto, ON, CA

Temporary

A leading tech staffing company in Toronto is seeking an experienced LLM Engineer.This hybrid role involves building next-generation AI agents and working with large-scale AI models, requiring stro...Show more

Last updated: 30+ days ago • Promoted

Full Stack Developer - AFTIA Solutions

AFTIA Solutions • markham, on, ca

Full-time

At AFTIA, our Full Stack Developer plays a key role in designing, developing, and maintaining enterprise-grade eDocument management platforms used by major clients, including large financial instit...Show more

Last updated: 13 hours ago • Promoted • New!

MLOps Engineer

Quantum World Technologies Inc. • toronto, on, ca

Full-time

We're looking for a MLOps Engineer with:.Strong software engineering experience in Python (clean architecture, API design, testing, packaging, performance tuning).Hands-on experience building and d...Show more

Last updated: 19 hours ago • Promoted • New!

Staff ML Engineer - Contract

Signify Technology • newmarket, ON, ca

Full-time

Signify has partnered with a key client who is currently hiring a Staff ML Engineer to drive the productionisation and scaling of ML systems.Staff ML Engineer (Contract)Remote | 3–6 months | ASAP S...Show more

Last updated: 14 hours ago • Promoted • New!

Senior Full Stack Engineer - Tundra Technical Solutions

Tundra Technical Solutions • markham, on, ca

Full-time

About Tundra Managed Solutions.Tundra Managed Solutions (TMS) is the solutions arm of Tundra Technical Solutions, delivering high-impact services across four core pillars: Digital, Security, Data &...Show more

Last updated: 5 days ago • Promoted

Kubernetes Platform Engineer - Capgemini Engineering

Capgemini Engineering • richmond hill, on, ca

Full-time

Job Title: Kubernetes Platform Engineer.At Capgemini Engineering, the world leader in engineering services, we bring together a global team of engineers, scientists, and architects to help the worl...Show more

Last updated: 21 hours ago • Promoted • New!

MCP (Model Context Protocol) Engineer

BayOne Solutions • Greater Toronto Area, Canada, Canada

Full-time

Strong hands on experience with Python.AI agents to interact with enterprise systems.REST API and GraphQL integrations.Integrate MCP capabilities with internal AI agent frameworks such as.Collabora...Show more

Last updated: 12 hours ago • Promoted • New!

Forward Deployed Engineer

ForgeSight • richmond hill, on, ca

Full-time

MVPs and pilot use cases to enterprise-wide deployments, optimization, and ongoing support.We are dedicated to helping organizations achieve measurable results and maximize the value of their inves...Show more

Last updated: 5 days ago • Promoted

Sr. MLOps Engineer

E-Solutions • toronto, ON, ca

Full-time

MLOps EngineerCharles Street West, Toronto (Hybrid) Role OverviewWe are seeking a Machine Learning Developer to design, build, and deploy ML solutions that turn data into measurable business impact...Show more

Last updated: 1 hour ago • Promoted • New!

MLOps Engineer

Arkhya Tech. Inc. • Toronto, ON, Canada

Full-time

Charles Street West, Toronto (Hybrid).We are seeking a Machine Learning Developer to design, build, and deploy ML solutions that turn data into measurable business impact.This is a hands-on enginee...Show more

Last updated: 12 hours ago • Promoted • New!

Sr. MLOps Engineer - E-Solutions

E-Solutions • toronto, on, ca

Full-time

Last updated: 19 hours ago • Promoted • New!

Full Stack LLM Engineer

Cerebras Systems Inc. • Toronto, ON, CA

Full-time

Last updated: 30+ days ago • Promoted

MLOps Engineer - toronto

Quantum World Technologies Inc. • toronto, on, ca

Full-time

Last updated: 19 hours ago • Promoted • New!

IAM Engineer (Entra ID Automation) - Lorven Technologies Inc.

Lorven Technologies Inc. • markham, on, ca

Full-time

Role - Cloud Identity Engineer (Entra ID Automation) –.Salary - CAD125k + Benefits Annually.In this role, you will be a key member of the team that manages user identities and provides appropriate ...Show more

Last updated: 7 days ago • Promoted

Full Stack LLM Engineer

Cerebras • Toronto, ON, CA

Full-time

Cerebras Systems builds the world’s largest AI chip, 56 times larger than GPUs.Our novel wafer‑scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programm...Show more

Last updated: 30+ days ago • Promoted

AWS ML engineer- 4 days a week Onsite at Toronto, ON

Q1 Technologies, Inc. • toronto, ON, ca

Full-time

Role: AWS ML engineerToronto, ON- 3-4 days a weekLong Term contract(6 Months to start with)Primary Skill: SageMaker, Glue, S3, and Lambda...Show more

Last updated: 22 hours ago • Promoted • New!

Lead ML Engineer

Hays • Greater Toronto Area, Canada, Canada

Full-time

You’ll be joining a leading Canadian digital organization building advanced eCommerce experiences across grocery, beauty, pharmacy, loyalty, and apparel.This team handles millions of daily customer...Show more

Last updated: 2 days ago • Promoted

AI/ML Engineer - Rivago Infotech Inc

Rivago Infotech Inc • markham, on, ca

Full-time

Responsible for designing, building, and deploying machine learning models and AI-driven systems within the Google Cloud ecosystem.This role bridges data science and software engineering, focusing ...Show more

Last updated: 2 days ago • Promoted