Talent.com
Full Stack LLM Engineer
Full Stack LLM EngineerCerebras Systems Inc. • Toronto, Canada
No longer accepting applications
Full Stack LLM Engineer

Full Stack LLM Engineer

Cerebras Systems Inc. • Toronto, Canada
10 days ago
Job type
  • Full-time
Job description

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs.

Cerebras' current customers include global corporations across multiple industries, national labs, and top-tier healthcare systems. In January, we announced a multi-year, multi-million-dollar partnership with Mayo Clinic, underscoring our commitment to transforming AI applications across various fields. In August, we launched Cerebras Inference, the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services.

About the Role

We are seeking a versatile and experienced engineer to join our Inference Core Model Bringup team. This team is responsible to rapidly bring up state-of-the-art open-source models (like LLaMA, Qwen, etc) or customer-provided proprietary models on our Cerebras CSX systems. Success in this role requires a system-minded generalist who thrives in fast-paced bringup environments and is comfortable working across the entire Cerebras software stack. Your work will play a critical role in achieving unprecedented levels of performance, efficiency, and scalability for AI applications.

Responsibilities

Contribute to the end-to-end bring up of ML models on Cerebras CSX systems.

Work across the stack : model architecture translation, graph lowering, compiler optimizations, runtime integration, and performance tuning.

Debug performance and correctness issues spanning model code, compiler IRs, runtime behavior, and hardware utilization.

Propose and prototype improvements across tools, APIs, or automation flows to accelerate future bring ups.

Skills & Qualifications

Bachelor’s, Master’s, or PhD in Computer Science, Engineering, or a related field.

Comfort navigating the full AI toolchain : Python modeling code, compiler IRs, performance profiling, etc.

Strong debugging skills across performance, numerical accuracy, and runtime integration.

Experience with deep learning frameworks (e.g., PyTorch, TensorFlow) and familiarity with model internals (e.g., attention, MoE, diffusion).

Proficiency in C / C++ programming and experience with low-level optimization.

Proven experience in compiler development, particularly with LLVM and / or MLIR.

Strong background in optimization techniques, particularly those involving NP-hard problems.

What We Offer

Competitive salary and benefits package.

Opportunities for professional growth and career advancement.

A dynamic and innovative work environment.

The chance to work on cutting-edge technologies and make a significant impact on the future of AI.

Why Join Cerebras

People who are serious about software make their own hardware. At Cerebras we have built a breakthrough architecture that is unlocking new opportunities for the AI industry. With dozens of model releases and rapid growth, we’ve reached an inflection point in our business. Members of our team tell us there are five main reasons they joined Cerebras :

Build a breakthrough AI platform beyond the constraints of the GPU.

Publish and open source their cutting-edge AI research.

Work on one of the fastest AI supercomputers in the world.

Enjoy job stability with startup vitality.

Our simple, non-corporate work culture that respects individual beliefs.

Equal Employment Opportunity

Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer. We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies.

J-18808-Ljbffr

Create a job alert for this search

Full Stack LLM Engineer • Toronto, Canada

Similar jobs
LLM Engineer — Hybrid Toronto, Agentic AI & Pipelines

LLM Engineer — Hybrid Toronto, Agentic AI & Pipelines

Mindlance • Toronto, ON, CA
Temporary
A leading tech staffing company in Toronto is seeking an experienced LLM Engineer.This hybrid role involves building next-generation AI agents and working with large-scale AI models, requiring stro...Show more
Last updated: 30+ days ago • Promoted
Full Stack Developer - AFTIA Solutions

Full Stack Developer - AFTIA Solutions

AFTIA Solutions • markham, on, ca
Full-time
At AFTIA, our Full Stack Developer plays a key role in designing, developing, and maintaining enterprise-grade eDocument management platforms used by major clients, including large financial instit...Show more
Last updated: 13 hours ago • Promoted • New!
MLOps Engineer

MLOps Engineer

Quantum World Technologies Inc. • toronto, on, ca
Full-time
We're looking for a MLOps Engineer with:.Strong software engineering experience in Python (clean architecture, API design, testing, packaging, performance tuning).Hands-on experience building and d...Show more
Last updated: 19 hours ago • Promoted • New!
Staff ML Engineer - Contract

Staff ML Engineer - Contract

Signify Technology • newmarket, ON, ca
Full-time
Signify has partnered with a key client who is currently hiring a Staff ML Engineer to drive the productionisation and scaling of ML systems.Staff ML Engineer (Contract)Remote | 3–6 months | ASAP S...Show more
Last updated: 14 hours ago • Promoted • New!
Senior Full Stack Engineer - Tundra Technical Solutions

Senior Full Stack Engineer - Tundra Technical Solutions

Tundra Technical Solutions • markham, on, ca
Full-time
About Tundra Managed Solutions.Tundra Managed Solutions (TMS) is the solutions arm of Tundra Technical Solutions, delivering high-impact services across four core pillars: Digital, Security, Data &...Show more
Last updated: 5 days ago • Promoted
Kubernetes Platform Engineer - Capgemini Engineering

Kubernetes Platform Engineer - Capgemini Engineering

Capgemini Engineering • richmond hill, on, ca
Full-time
Job Title: Kubernetes Platform Engineer.At Capgemini Engineering, the world leader in engineering services, we bring together a global team of engineers, scientists, and architects to help the worl...Show more
Last updated: 21 hours ago • Promoted • New!
MCP (Model Context Protocol) Engineer

MCP (Model Context Protocol) Engineer

BayOne Solutions • Greater Toronto Area, Canada, Canada
Full-time
Strong hands on experience with Python.AI agents to interact with enterprise systems.REST API and GraphQL integrations.Integrate MCP capabilities with internal AI agent frameworks such as.Collabora...Show more
Last updated: 12 hours ago • Promoted • New!
Forward Deployed Engineer

Forward Deployed Engineer

ForgeSight • richmond hill, on, ca
Full-time
MVPs and pilot use cases to enterprise-wide deployments, optimization, and ongoing support.We are dedicated to helping organizations achieve measurable results and maximize the value of their inves...Show more
Last updated: 5 days ago • Promoted
Sr. MLOps Engineer

Sr. MLOps Engineer

E-Solutions • toronto, ON, ca
Full-time
MLOps EngineerCharles Street West, Toronto (Hybrid) Role OverviewWe are seeking a Machine Learning Developer to design, build, and deploy ML solutions that turn data into measurable business impact...Show more
Last updated: 1 hour ago • Promoted • New!
MLOps Engineer

MLOps Engineer

Arkhya Tech. Inc. • Toronto, ON, Canada
Full-time
Charles Street West, Toronto (Hybrid).We are seeking a Machine Learning Developer to design, build, and deploy ML solutions that turn data into measurable business impact.This is a hands-on enginee...Show more
Last updated: 12 hours ago • Promoted • New!
Sr. MLOps Engineer - E-Solutions

Sr. MLOps Engineer - E-Solutions

E-Solutions • toronto, on, ca
Full-time
Charles Street West, Toronto (Hybrid).We are seeking a Machine Learning Developer to design, build, and deploy ML solutions that turn data into measurable business impact.This is a hands-on enginee...Show more
Last updated: 19 hours ago • Promoted • New!
Full Stack LLM Engineer

Full Stack LLM Engineer

Cerebras Systems Inc. • Toronto, ON, CA
Full-time
Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs.Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programm...Show more
Last updated: 30+ days ago • Promoted
MLOps Engineer - toronto

MLOps Engineer - toronto

Quantum World Technologies Inc. • toronto, on, ca
Full-time
We're looking for a MLOps Engineer with:.Strong software engineering experience in Python (clean architecture, API design, testing, packaging, performance tuning).Hands-on experience building and d...Show more
Last updated: 19 hours ago • Promoted • New!
IAM Engineer (Entra ID Automation) - Lorven Technologies Inc.

IAM Engineer (Entra ID Automation) - Lorven Technologies Inc.

Lorven Technologies Inc. • markham, on, ca
Full-time
Role - Cloud Identity Engineer (Entra ID Automation) –.Salary - CAD125k + Benefits Annually.In this role, you will be a key member of the team that manages user identities and provides appropriate ...Show more
Last updated: 7 days ago • Promoted
Full Stack LLM Engineer

Full Stack LLM Engineer

Cerebras • Toronto, ON, CA
Full-time
Cerebras Systems builds the world’s largest AI chip, 56 times larger than GPUs.Our novel wafer‑scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programm...Show more
Last updated: 30+ days ago • Promoted
AWS ML engineer- 4 days a week Onsite at Toronto, ON

AWS ML engineer- 4 days a week Onsite at Toronto, ON

Q1 Technologies, Inc. • toronto, ON, ca
Full-time
Role: AWS ML engineerToronto, ON- 3-4 days a weekLong Term contract(6 Months to start with)Primary Skill: SageMaker, Glue, S3, and Lambda...Show more
Last updated: 22 hours ago • Promoted • New!
Lead ML Engineer

Lead ML Engineer

Hays • Greater Toronto Area, Canada, Canada
Full-time
You’ll be joining a leading Canadian digital organization building advanced eCommerce experiences across grocery, beauty, pharmacy, loyalty, and apparel.This team handles millions of daily customer...Show more
Last updated: 2 days ago • Promoted
AI/ML Engineer - Rivago Infotech Inc

AI/ML Engineer - Rivago Infotech Inc

Rivago Infotech Inc • markham, on, ca
Full-time
Responsible for designing, building, and deploying machine learning models and AI-driven systems within the Google Cloud ecosystem.This role bridges data science and software engineering, focusing ...Show more
Last updated: 2 days ago • Promoted