Talent.com
Cerebras Systems Inc.
Full Stack LLM EngineerCerebras Systems Inc. • Toronto, Canada
No longer accepting applications
Full Stack LLM Engineer

Full Stack LLM Engineer

Cerebras Systems Inc. • Toronto, Canada
11 days ago
Job type
  • Full-time
Job description
Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs.

Cerebras' current customers include global corporations across multiple industries, national labs, and top-tier healthcare systems. In January, we announced a multi-year, multi-million-dollar partnership with Mayo Clinic, underscoring our commitment to transforming AI applications across various fields. In August, we launched Cerebras Inference, the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services.

About the Role We are seeking a versatile and experienced engineer to join our Inference Core Model Bringup team. This team is responsible to rapidly bring up state-of-the-art open-source models (like LLaMA, Qwen, etc) or customer-provided proprietary models on our Cerebras CSX systems. Success in this role requires a system-minded generalist who thrives in fast-paced bringup environments and is comfortable working across the entire Cerebras software stack. Your work will play a critical role in achieving unprecedented levels of performance, efficiency, and scalability for AI applications.

Responsibilities

Contribute to the end-to-end bring up of ML models on Cerebras CSX systems.

Work across the stack: model architecture translation, graph lowering, compiler optimizations, runtime integration, and performance tuning.

Debug performance and correctness issues spanning model code, compiler IRs, runtime behavior, and hardware utilization.

Propose and prototype improvements across tools, APIs, or automation flows to accelerate future bring ups.

Skills & Qualifications

Bachelor’s, Master’s, or PhD in Computer Science, Engineering, or a related field.

Comfort navigating the full AI toolchain: Python modeling code, compiler IRs, performance profiling, etc.

Strong debugging skills across performance, numerical accuracy, and runtime integration.

Experience with deep learning frameworks (e.g., PyTorch, TensorFlow) and familiarity with model internals (e.g., attention, MoE, diffusion).

Proficiency in C/C++ programming and experience with low-level optimization.

Proven experience in compiler development, particularly with LLVM and/or MLIR.

Strong background in optimization techniques, particularly those involving NP-hard problems.

What We Offer

Competitive salary and benefits package.

Opportunities for professional growth and career advancement.

A dynamic and innovative work environment.

The chance to work on cutting-edge technologies and make a significant impact on the future of AI.

Why Join Cerebras People who are serious about software make their own hardware. At Cerebras we have built a breakthrough architecture that is unlocking new opportunities for the AI industry. With dozens of model releases and rapid growth, we’ve reached an inflection point in our business. Members of our team tell us there are five main reasons they joined Cerebras:

Build a breakthrough AI platform beyond the constraints of the GPU.

Publish and open source their cutting-edge AI research.

Work on one of the fastest AI supercomputers in the world.

Enjoy job stability with startup vitality.

Our simple, non-corporate work culture that respects individual beliefs.

Equal Employment Opportunity Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer. We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies.

#J-18808-Ljbffr
Create a job alert for this search

Full Stack LLM Engineer • Toronto, Canada

Similar jobs

AI Full‑Stack Engineer for Pharma LLM Apps

Zs AssociatesToronto
Full-time

A leading management consulting firm in Toronto is seeking a Sr.Associate Engineer – AI Full Stack Developer to design, develop, and support LLM-based solutions in Pharma R&D.This role requires str... Show more

 • Promoted

LLM Engineer

MindlanceToronto, Ontario, Canada
Full-time

Direct message the job poster from Mindlance.Hiring: LLM Engineers in Toronto! Be part of a fast-scaling team building Smarter, Next-Generation AI Agents alongside World-Leading AI Labs.Location: T... Show more

 • Promoted

AI Engineer: LLM Systems, RAG & Cloud Deployments

JoleraToronto
Full-time

A leading IT solutions provider in Toronto is seeking a practical AI Engineer to design and deploy scalable AI systems.In this hands-on role, you will work closely with the CTO to build LLM-powered... Show more

 • Promoted

Innovative Site Reliability Engineer for Cloud and AI Solutions

Themesoft Inc.Toronto, ON, CA
Full-time

Lead the charge in site reliability engineering focusing on cloud systems and AI-driven observability.Leverage your strong Python scripting and experience with tools like PagerDuty and Moogsoft.In ... Show more

 • Promoted

Lead Full-Stack AI-Driven Software Engineer (Remote)

Elation HealthToronto, ON, CA
Remote
Full-time

A healthcare technology company is looking for a Full Stack Engineer to enhance AI-driven product experiences.The ideal candidate will have 5+ years in software development, strong skills in web pr... Show more

 • Promoted

Senior Software Engineer - Secure LLM Infra (Remote)

LLMToronto, ON, CA
Remote
Full-time

A technology solutions provider in Canada is seeking a Senior Software Engineer to architect backend systems for secure LLM deployment.The ideal candidate should have over 5 years of experience in ... Show more

 • Promoted

Senior Principal AI Framework Engineer: LLM & RL Systems

HuaweiMarkham, York Region, CA
Full-time

A leading technology company is seeking a Senior Principal Engineer in Markham, Canada, focused on optimizing open-source frameworks in Large Language Models and reinforcement learning.The ideal ca... Show more

 • Promoted

Full Stack LLM Engineer

CerebrasToronto
Full-time

We are seeking a versatile and experienced engineer to join our Inference Core Model Bringup team.This team is responsible for rapidly bringing up state‑of‑the‑art open‑source models (like LLaMA, Q... Show more

 • Promoted

Senior AI Full-Stack Engineer – LLM Apps

ZSToronto, ON, CA
Full-time

A leading management consulting firm in Toronto is seeking a Sr.Associate Engineer – AI Full Stack Developer.In this role, you will be tasked with designing and deploying sophisticated AI solutions... Show more

 • Promoted

Full-Stack Engineer II – Remote, Growth-Focused

AffirmToronto, ON, CA
Remote
Full-time

A leading financial technology company in Canada is seeking a Full Stack Developer to join their Card Acquisition team.This role involves collaborating with product and engineering managers to enha... Show more

 • Promoted

Senior Engineer for LLM Retrieval Workflows

KlueToronto, ON, CA
Full-time

Become a key player at Klue as a Senior Software Engineer in Toronto, focusing on LLM-powered retrieval and agentic workflows.Design innovative systems that enhance business insights and improve cu... Show more

 • Promoted

Full-Stack Software Engineer - LLM

fiveonefour incToronto, Ontario, Canada
Full-time

But in our experience, to do anything exciting at scale with data/analytics, you have to leave the warm, cozy world of software development and enter the specialized workflows, tools, and technolog... Show more

 • Promoted

Senior GenAI & LLM Systems Engineer

GuidepointToronto, ON, CA
Full-time

A leading research enablement platform is seeking an experienced Data/AI Engineer for its Toronto office.This hybrid role involves building scalable AI systems and applications, optimizing Generati... Show more

 • Promoted

Staff ML Infra & Distributed Systems Engineer

TubiToronto, ON, CA
Full-time

A leading streaming service is seeking a Staff Software Engineer for its ML Infrastructure team in Toronto.The role focuses on designing low-latency distributed systems and optimizing machine learn... Show more

 • Promoted

Senior LLMOps Engineer -Cloud / AI Infrastructure

TEEMA Solutions GroupToronto, ON, CA
Full-time

Ready to build what powers the next generation of AI?.You’ll be the driving force behind taking trained models from lab to production—scaling efficiently across multi-GPU clusters and pushing the b... Show more

 • Promoted

Senior Full-Stack Engineer - Hybrid, Scale & ML

Onico SolutionsRichmond Hill
Full-time +1

Onico Solutions is seeking a Full Stack Software Developer to join an innovative team in downtown Toronto.This permanent hybrid position offers a competitive salary between $110,000.Ideal candidate... Show more

 • Promoted

Remote Full Stack Engineer in Fintech

VeemToronto
Remote
Full-time

Join the fintech revolution as a Remote Full Stack Engineer.Design and optimize backend systems while collaborating on enterprise solutions and API integrations.This fully remote role is tailored f... Show more

 • Promoted

Backend Engineer for LLM Integration at Stripe

MonographToronto, ON, CA
Full-time

Drive user engagement as a Backend Engineer with Stripe, focusing on LLM and API orchestration.Collaborate with cross-functional teams to build dynamic user experiences and tools.In this full-time ... Show more

 • Promoted

Software Engineer - FinTech: Scale & ML Infra (Remote)

Hunter BondToronto, ON, CA
Remote
Full-time

A leading FinTech company in Montreal seeks a Software Engineer with language-agnostic skills and a passion for technology.The role involves building advanced software solutions and robust ETL pipe... Show more

 • Promoted

CLM Engineer

Quantum World Technologies Inc.Toronto
Full-time

Experience: 6+ years managing PKI environments and digital certificates.Venafi Expertise: Hands‑on experience with Venafi Trust Protection Platform.Technical Skills: Strong understanding of X.TLS/S... Show more