Talent.com
Cerebras Systems Inc.
Full Stack LLM EngineerCerebras Systems Inc. • Toronto, Canada
No longer accepting applications
Full Stack LLM Engineer

Full Stack LLM Engineer

Cerebras Systems Inc. • Toronto, Canada
10 days ago
Job type
  • Full-time
Job description
Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs.

Cerebras' current customers include global corporations across multiple industries, national labs, and top-tier healthcare systems. In January, we announced a multi-year, multi-million-dollar partnership with Mayo Clinic, underscoring our commitment to transforming AI applications across various fields. In August, we launched Cerebras Inference, the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services.

About the Role We are seeking a versatile and experienced engineer to join our Inference Core Model Bringup team. This team is responsible to rapidly bring up state-of-the-art open-source models (like LLaMA, Qwen, etc) or customer-provided proprietary models on our Cerebras CSX systems. Success in this role requires a system-minded generalist who thrives in fast-paced bringup environments and is comfortable working across the entire Cerebras software stack. Your work will play a critical role in achieving unprecedented levels of performance, efficiency, and scalability for AI applications.

Responsibilities

Contribute to the end-to-end bring up of ML models on Cerebras CSX systems.

Work across the stack: model architecture translation, graph lowering, compiler optimizations, runtime integration, and performance tuning.

Debug performance and correctness issues spanning model code, compiler IRs, runtime behavior, and hardware utilization.

Propose and prototype improvements across tools, APIs, or automation flows to accelerate future bring ups.

Skills & Qualifications

Bachelor’s, Master’s, or PhD in Computer Science, Engineering, or a related field.

Comfort navigating the full AI toolchain: Python modeling code, compiler IRs, performance profiling, etc.

Strong debugging skills across performance, numerical accuracy, and runtime integration.

Experience with deep learning frameworks (e.g., PyTorch, TensorFlow) and familiarity with model internals (e.g., attention, MoE, diffusion).

Proficiency in C/C++ programming and experience with low-level optimization.

Proven experience in compiler development, particularly with LLVM and/or MLIR.

Strong background in optimization techniques, particularly those involving NP-hard problems.

What We Offer

Competitive salary and benefits package.

Opportunities for professional growth and career advancement.

A dynamic and innovative work environment.

The chance to work on cutting-edge technologies and make a significant impact on the future of AI.

Why Join Cerebras People who are serious about software make their own hardware. At Cerebras we have built a breakthrough architecture that is unlocking new opportunities for the AI industry. With dozens of model releases and rapid growth, we’ve reached an inflection point in our business. Members of our team tell us there are five main reasons they joined Cerebras:

Build a breakthrough AI platform beyond the constraints of the GPU.

Publish and open source their cutting-edge AI research.

Work on one of the fastest AI supercomputers in the world.

Enjoy job stability with startup vitality.

Our simple, non-corporate work culture that respects individual beliefs.

Equal Employment Opportunity Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer. We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies.

#J-18808-Ljbffr
Create a job alert for this search

Full Stack LLM Engineer • Toronto, Canada

Similar jobs

LLM Engineer

MindlanceToronto, Ontario, Canada
Full-time

Direct message the job poster from Mindlance.Hiring: LLM Engineers in Toronto! Be part of a fast-scaling team building Smarter, Next-Generation AI Agents alongside World-Leading AI Labs.Location: T... Show more

 • Promoted

AI Full‑Stack Engineer for Pharma LLM Apps

Zs AssociatesToronto, ON, CA
Full-time

A leading management consulting firm in Toronto is seeking a Sr.Associate Engineer – AI Full Stack Developer to design, develop, and support LLM-based solutions in Pharma R&D.This role requires str... Show more

 • Promoted

Remote AI Engineer - LLM Specialist

PulsoraToronto, ON, CA
Remote
Full-time

Discover an exciting career as a Remote AI Engineer specializing in Large Language Models.Play a key role in developing AI-driven software solutions in a fully remote setting.Your expertise in AI/M... Show more

 • Promoted • New!

Innovative Site Reliability Engineer for Cloud and AI Solutions

Themesoft Inc.Toronto, ON, CA
Full-time

Lead the charge in site reliability engineering focusing on cloud systems and AI-driven observability.Leverage your strong Python scripting and experience with tools like PagerDuty and Moogsoft.In ... Show more

 • Promoted

Lead Full-Stack AI-Driven Software Engineer (Remote)

Elation HealthToronto, ON, CA
Remote
Full-time

A healthcare technology company is looking for a Full Stack Engineer to enhance AI-driven product experiences.The ideal candidate will have 5+ years in software development, strong skills in web pr... Show more

 • Promoted

Staff ML Infrastructure Engineer — Remote (Canada)

SamsaraToronto, ON, CA
Remote
Full-time

A leading technology firm is hiring a Staff / Senior Staff Machine Learning Infrastructure Engineer to design and operate a cutting-edge ML platform in Canada.This role involves collaboration with ... Show more

 • Promoted

Senior Principal AI Framework Engineer: LLM & RL Systems

HuaweiMarkham, York Region, CA
Full-time

A leading technology company is seeking a Senior Principal Engineer in Markham, Canada, focused on optimizing open-source frameworks in Large Language Models and reinforcement learning.The ideal ca... Show more

 • Promoted

Senior GenAI & LLM Systems Engineer

GuidepointToronto, Ontario, Canada
Full-time

A leading research enablement platform is seeking an experienced Data/AI Engineer for its Toronto office.This hybrid role involves building scalable AI systems and applications, optimizing Generati... Show more

 • Promoted

Full Stack LLM Engineer

CerebrasToronto
Full-time

We are seeking a versatile and experienced engineer to join our Inference Core Model Bringup team.This team is responsible for rapidly bringing up state‑of‑the‑art open‑source models (like LLaMA, Q... Show more

 • Promoted

Senior AI Full-Stack Engineer – LLM Apps

ZSToronto, ON, CA
Full-time

A leading management consulting firm in Toronto is seeking a Sr.Associate Engineer – AI Full Stack Developer.In this role, you will be tasked with designing and deploying sophisticated AI solutions... Show more

 • Promoted

Senior Full-Stack Engineer - Hybrid, Scale & ML

Onico SolutionsRichmond Hill, York Region, CA
Permanent

Onico Solutions is seeking a Full Stack Software Developer to join an innovative team in downtown Toronto.This permanent hybrid position offers a competitive salary between $110,000.Ideal candidate... Show more

 • Promoted

AI Engineer: LLM Systems, RAG & Cloud Deployments

JoleraToronto, ON, CA
Full-time

A leading IT solutions provider in Toronto is seeking a practical AI Engineer to design and deploy scalable AI systems.In this hands-on role, you will work closely with the CTO to build LLM-powered... Show more

 • Promoted

Full-Stack Engineer II – Remote, Growth-Focused

AffirmToronto, ON, CA
Remote
Full-time

A leading financial technology company in Canada is seeking a Full Stack Developer to join their Card Acquisition team.This role involves collaborating with product and engineering managers to enha... Show more

 • Promoted

LLM Engineer — Hybrid Toronto, Agentic AI & Pipelines

MindlanceToronto
Full-time +1

A leading tech staffing company in Toronto is seeking an experienced LLM Engineer.This hybrid role involves building next-generation AI agents and working with large-scale AI models, requiring stro... Show more

 • Promoted

Senior Engineer for LLM Retrieval Workflows

KlueToronto, ON, CA
Full-time

Become a key player at Klue as a Senior Software Engineer in Toronto, focusing on LLM-powered retrieval and agentic workflows.Design innovative systems that enhance business insights and improve cu... Show more

 • Promoted

LLM Serving Engineer (Cloud AI Engineering), Senior / Staff Engineer

QualcommMarkham, Ontario, Canada
Full-time

Company Qualcomm Technologies, Inc.Job Area Engineering Group, Engineering Group >.LLM Serving Engineer (Cloud AI Engineering) Qualcomm is utilizing its traditional strengths in digital wireless te... Show more

 • Promoted

Staff ML Infra & Distributed Systems Engineer

TubiToronto, ON, CA
Full-time

A leading streaming service is seeking a Staff Software Engineer for its ML Infrastructure team in Toronto.The role focuses on designing low-latency distributed systems and optimizing machine learn... Show more

 • Promoted

Senior LLMOps Engineer -Cloud / AI Infrastructure

TEEMA Solutions GroupToronto, ON, CA
Full-time

Ready to build what powers the next generation of AI?.You’ll be the driving force behind taking trained models from lab to production—scaling efficiently across multi-GPU clusters and pushing the b... Show more

 • Promoted

Remote Full Stack Engineer in Fintech

VeemToronto
Remote
Full-time

Join the fintech revolution as a Remote Full Stack Engineer.Design and optimize backend systems while collaborating on enterprise solutions and API integrations.This fully remote role is tailored f... Show more

 • Promoted

MLOps Engineer

Mastech DigitalToronto, Ontario, Canada
Full-time

We are seeking a Senior DevSecOps / MLOps Engineer to drive security, compliance, and governance best practices across our AI/ML platforms on Azure and Databricks.This role will focus on embedding ... Show more