Talent.com
Software Engineer- Model Performance Tooling
Software Engineer- Model Performance ToolingBaseTen Labs, Inc. • Vancouver, Metro Vancouver Regional District, CA
Software Engineer- Model Performance Tooling

Software Engineer- Model Performance Tooling

BaseTen Labs, Inc. • Vancouver, Metro Vancouver Regional District, CA
16 days ago
Job type
  • Full-time
Job description

ABOUT BASETEN

Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $150M Series D, backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products.

THE OPPORTUNITY

We are looking for early-career Software Engineers to join our team in Vancouver, BC. This is a specialized role sitting at the intersection of high-performance computing (HPC) and Large Language Model (LLM) engineering. You will be responsible for building the automated "speedometer and diagnostic" suite for our next-generation AI infrastructure.

In this role, you won’t just be using models; you will be tearing them apart to see how they run on the metal. You will build tools that measure GPU FLOPS, stress-test InfiniBand clusters, and define the benchmarks that ensure our systems are production-ready.

RESPONSIBILITIES

Performance Benchmarking : Run and automate standard LLM quality benchmarks (GSM8K, MMLU) alongside custom performance suites for specific workloads (e.g., long-context window, KV cache reuse).

Infrastructure Validation : Create automated acceptance tests for new GPU clusters across x86 and ARM systems, measuring GPU memory bandwidth, networking throughput, and multi-node networking performance.

Model Dev Experience : Develop and maintain internal GPU-enabled development environments (similar to GitHub Codespaces). You will ensure the team has seamless, high-performance "dev machines" optimized for model experimentation.

Tool Development : Build and contribute to tools such as InferenceMAX and genai-bench to automate model evaluation and optimization.

Deep Hardware Profiling : Use PyTorch Profiler and NVIDIA Nsight Systems to collect performance profiles, identify bottlenecks, and debug the NVIDIA compute / networking stack.

Monitoring & Observability : Develop real-time dashboards and alerts to monitor system health, model startup times, and runtime performance.

Continuous Integration : Automate performance testing via CI / CD pipelines to catch regressions in model setups before they hit production.

Optimization Automation : Build tools to find the "Pareto frontier"—identifying the absolute best configuration (latency vs. cost vs. quality) for a given model and workload.

WHAT WE'RE LOOKING FOR

This is a fresher-friendly role. We care more about your trajectory, curiosity, and technical depth than your years of experience. We want to talk to you if you have :

A Love for Systems & Hardware : You aren’t just interested in the AI; you want to understand GPU memory subsystems, InfiniBand, and how data moves across a cluster.

An Automation Mindset : You believe that if a task has to be done twice, it should be scripted. You have a passion for stress-testing and fuzzy testing to find the "breaking point" of a system.

Mathematical Curiosity : A desire to understand the underlying math of Transformers and how it translates into FLOPs and memory requirements.

Interest in Optimization : You are excited to learn about (or already play with) quantization, speculative decoding, disaggregated serving, and kernel-level optimizations.

Technical Toolkit : Familiarity with Python, and an eagerness to master the NVIDIA software stack. C++ familiarity is good to have.

WHY THIS ROLE

Direct Impact : Your tools will be the gatekeeper for what defines "good" performance for our customers.

Deep Learning (Literally) : You will gain world-class expertise in GPU orchestration and LLM inference that few engineers in the industry possess.

High Ownership : As a small team of freshers led by experts, you will have the autonomy to build tools from scratch and contribute to open-source projects.

BENEFITS

Competitive compensation, including meaningful equity.

100% coverage of medical, dental, and vision insurance for employee and dependents

Generous PTO policy including company wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!)

Paid parental leave

Company-facilitated 401(k)

Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.

Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you.

At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.

#J-18808-Ljbffr

Create a job alert for this search

Software Engineer Model Performance Tooling • Vancouver, Metro Vancouver Regional District, CA

Similar jobs
Senior Software Engineer

Senior Software Engineer

Starboard Recruitment • Vancouver, BC, Canada
Full-time
On behalf of our client, Starboard Recruitment is searching for multiple Senior Software Engineers in Vancouver, BC who are experience with. Our client is a US-based, Series-B with over $35M USD in ...Show more
Last updated: 30+ days ago • Promoted
Intermediate Full Stack Software Engineer

Intermediate Full Stack Software Engineer

D3 Security Management Systems • Vancouver, BC, Canada
Full-time
Location : Greater Vancouver area candidates only.D3 Security is transforming SecOps with Morpheus, our AI-driven Autonomous Security Operations Center (ASOC) platform. Morpheus automates Tier 13 anal...Show more
Last updated: 30+ days ago • Promoted
Senior Software Engineer (Product)

Senior Software Engineer (Product)

owl.co • Vancouver, BC, CA
Full-time
Quick Apply
AI systems for high‑stakes, real‑world decisions.Our platform ingests and reasons over large, messy data to surface evidence with hard constraints around fairness, auditability, and low bias.The si...Show more
Last updated: 30+ days ago
Firmware & Hardware Developer

Firmware & Hardware Developer

SST Wireless • Richmond, BC, Canada
Full-time
With several new products in the design pipeline, this is an exciting time for creative thinkers who are adept in realizing technical solutions to join us in creating products where your contributi...Show more
Last updated: 30+ days ago • Promoted
Qualcomm Chipset Firmware Engineer (Contractor)

Qualcomm Chipset Firmware Engineer (Contractor)

MistyWest • Vancouver, BC, Canada
Full-time
MistyWest is expanding its contractor pool and is actively seeking a Qualcomm-specific Bluetooth Audio Firmware Engineer to support current and upcoming headset and audio programs.This role is hand...Show more
Last updated: 30+ days ago • Promoted
Software Engineer - II

Software Engineer - II

FISPAN • Vancouver, BC, Canada
Permanent
FISPAN) is an Enterprise SaaS FinTech company that allows banks to deploy embedded financial products and services to create a seamless banking connection for their corporate clients.Our product ai...Show more
Last updated: 30+ days ago • Promoted
Software QA Lead

Software QA Lead

Delta Intelligent Building Technologies (Canada) Inc. • Surrey, BC, Canada
Full-time +1
About Delta Intelligent Building Technologies (Canada) Inc.Delta Intelligent Building Technologies (Canada) Inc.Delta Electronics) is a leading building controls manufacturer with over 300 partners...Show more
Last updated: 11 days ago • Promoted
Senior Software Engineer

Senior Software Engineer

Spring Financial Inc. • Vancouver, BC, Canada
Full-time +1
Salary : $115,000-$140,000+yearly salary + benefits (See below for more details).Spring Financialis a Canadianfinancial technology companyfocused on making every day financial servicessimpler, faste...Show more
Last updated: 30+ days ago • Promoted
Senior Software Engineer - Aplos

Senior Software Engineer - Aplos

Velora • Vancouver, BC, Canada
Full-time
We're excited to share that Aplos, Raisely, and Keela have come together to form one unified company,.While we continue to offer the products you know and love, we now operate as one team, dedi...Show more
Last updated: 19 days ago • Promoted
Senior Generative AI Software Developer (ID#5114)

Senior Generative AI Software Developer (ID#5114)

freelance.ca • Richmond, Canada
Full-time
This contract position follows a hybrid model and requires onsite presence in Richmond, BC a minimum of three days per week. Design and build applications using OpenAI, Azure OpenAI, and open-source...Show more
Last updated: 30+ days ago • Promoted
Sr. Machine Learning Engineer, Off-board Perception

Sr. Machine Learning Engineer, Off-board Perception

Serve Robotics • Vancouver, BC, Canada
Full-time
At Serve Robotics, we’re reimagining how things move in cities.Our personable sidewalk robot is our vision for the future. It’s designed to take deliveries away from congested streets, m...Show more
Last updated: 4 days ago • Promoted
Mechatronics Engineer

Mechatronics Engineer

Red Rabbit Robotics • Burnaby, BC, CA
Full-time
Quick Apply
At Red Rabbit Robotics, we are on a mission to solve global labor shortage and create a future of abundance.We aim to deploy one million humanoid robots in the next 10 years, eventually producing a...Show more
Last updated: 30+ days ago
Product Design Development Engineer

Product Design Development Engineer

The Peak Group of Companies • Richmond, BC, Canada
Full-time
The PEAK Group of Companies is a leader in home improvement, delivering innovative products across Canada, the United States, Australia, and New Zealand. As a trusted partner of The Home Depot (THD)...Show more
Last updated: 2 hours ago • Promoted • New!
Full Stack Engineer

Full Stack Engineer

Targeted Talent • Richmond, BC, Canada
Full-time
We are searching for a creative, flexible technical thinker capable of managing, planning and understanding team dynamics. Responsible for authoring, analyzing and translating User Stories into syst...Show more
Last updated: 30+ days ago • Promoted
Senior Systems & Graphics Engineer

Senior Systems & Graphics Engineer

Parallelz • Vancouver, BC, CA
Full-time
Quick Apply
Parallelz enables developers to instantly port their existing mobile apps / games to the web, without any SDKs, code changes, or engineering efforts. Developers can improve user acquisition, organic v...Show more
Last updated: 30+ days ago
Software Engineer

Software Engineer

pubGENIUS • Vancouver, BC, CA
Remote
Full-time
Quick Apply
We are looking for stellar developers to join our agency team to build websites and apps for clients in the US and Europe. We specialize in AI, decentralized finance (Defi crypto / NFT / blockchain), fi...Show more
Last updated: 3 days ago
Validation Engineer

Validation Engineer

Summa Linguae Technologies • Vancouver, BC, Canada
Full-time
DATAmundi builds advanced software solutions that power our localization and data services.Our team is hiring! Are you looking for a new-age working environment with a team of fun, creative, and pa...Show more
Last updated: 30+ days ago • Promoted
Controls and Automation Engineer

Controls and Automation Engineer

Saltworks Technologies • Richmond, BC, CA
Full-time
Quick Apply
Saltworks Technologies is a global leader in advanced industrial desalination and lithium refining.Our innovative machines produce clean water from high-strength industrial discharges and refine li...Show more
Last updated: 30+ days ago