Talent.com
Software Engineer- Model Performance Tooling
Software Engineer- Model Performance ToolingBaseTen Labs, Inc. • Vancouver, Metro Vancouver Regional District, CA
Software Engineer- Model Performance Tooling

Software Engineer- Model Performance Tooling

BaseTen Labs, Inc. • Vancouver, Metro Vancouver Regional District, CA
Il y a 16 jours
Type de contrat
  • Temps plein
Description de poste

ABOUT BASETEN

Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $150M Series D, backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products.

THE OPPORTUNITY

We are looking for early-career Software Engineers to join our team in Vancouver, BC. This is a specialized role sitting at the intersection of high-performance computing (HPC) and Large Language Model (LLM) engineering. You will be responsible for building the automated "speedometer and diagnostic" suite for our next-generation AI infrastructure.

In this role, you won’t just be using models; you will be tearing them apart to see how they run on the metal. You will build tools that measure GPU FLOPS, stress-test InfiniBand clusters, and define the benchmarks that ensure our systems are production-ready.

RESPONSIBILITIES

Performance Benchmarking : Run and automate standard LLM quality benchmarks (GSM8K, MMLU) alongside custom performance suites for specific workloads (e.g., long-context window, KV cache reuse).

Infrastructure Validation : Create automated acceptance tests for new GPU clusters across x86 and ARM systems, measuring GPU memory bandwidth, networking throughput, and multi-node networking performance.

Model Dev Experience : Develop and maintain internal GPU-enabled development environments (similar to GitHub Codespaces). You will ensure the team has seamless, high-performance "dev machines" optimized for model experimentation.

Tool Development : Build and contribute to tools such as InferenceMAX and genai-bench to automate model evaluation and optimization.

Deep Hardware Profiling : Use PyTorch Profiler and NVIDIA Nsight Systems to collect performance profiles, identify bottlenecks, and debug the NVIDIA compute / networking stack.

Monitoring & Observability : Develop real-time dashboards and alerts to monitor system health, model startup times, and runtime performance.

Continuous Integration : Automate performance testing via CI / CD pipelines to catch regressions in model setups before they hit production.

Optimization Automation : Build tools to find the "Pareto frontier"—identifying the absolute best configuration (latency vs. cost vs. quality) for a given model and workload.

WHAT WE'RE LOOKING FOR

This is a fresher-friendly role. We care more about your trajectory, curiosity, and technical depth than your years of experience. We want to talk to you if you have :

A Love for Systems & Hardware : You aren’t just interested in the AI; you want to understand GPU memory subsystems, InfiniBand, and how data moves across a cluster.

An Automation Mindset : You believe that if a task has to be done twice, it should be scripted. You have a passion for stress-testing and fuzzy testing to find the "breaking point" of a system.

Mathematical Curiosity : A desire to understand the underlying math of Transformers and how it translates into FLOPs and memory requirements.

Interest in Optimization : You are excited to learn about (or already play with) quantization, speculative decoding, disaggregated serving, and kernel-level optimizations.

Technical Toolkit : Familiarity with Python, and an eagerness to master the NVIDIA software stack. C++ familiarity is good to have.

WHY THIS ROLE

Direct Impact : Your tools will be the gatekeeper for what defines "good" performance for our customers.

Deep Learning (Literally) : You will gain world-class expertise in GPU orchestration and LLM inference that few engineers in the industry possess.

High Ownership : As a small team of freshers led by experts, you will have the autonomy to build tools from scratch and contribute to open-source projects.

BENEFITS

Competitive compensation, including meaningful equity.

100% coverage of medical, dental, and vision insurance for employee and dependents

Generous PTO policy including company wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!)

Paid parental leave

Company-facilitated 401(k)

Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.

Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you.

At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.

#J-18808-Ljbffr

Créer une alerte emploi pour cette recherche

Software Engineer Model Performance Tooling • Vancouver, Metro Vancouver Regional District, CA

Offres similaires
Senior Software Engineer

Senior Software Engineer

Starboard Recruitment • Vancouver, BC, Canada
Temps plein
On behalf of our client, Starboard Recruitment is searching for multiple Senior Software Engineers in Vancouver, BC who are experience with. Our client is a US-based, Series-B with over $35M USD in ...Voir plus
Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
Senior Software Engineer (Product)

Senior Software Engineer (Product)

owl.co • Vancouver, BC, CA
Temps plein
Quick Apply
AI systems for high‑stakes, real‑world decisions.Our platform ingests and reasons over large, messy data to surface evidence with hard constraints around fairness, auditability, and low bias.The si...Voir plus
Dernière mise à jour : il y a plus de 30 jours
Software Engineering Consultant

Software Engineering Consultant

E-Solutions • richmond, bc, ca
Temps plein
ServiceNow Administrator – Mid / L2–L3.Location : Mississauga, On and Vancouver, BC.Owns configuration, platform stability, and enhancement support across multiple ServiceNow modules.Administer user...Voir plus
Dernière mise à jour : il y a 1 heure • Offre sponsorisée • Nouvelle offre
Firmware & Hardware Developer

Firmware & Hardware Developer

SST Wireless • Richmond, BC, Canada
Temps plein
With several new products in the design pipeline, this is an exciting time for creative thinkers who are adept in realizing technical solutions to join us in creating products where your contributi...Voir plus
Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
Director of Software Development - AI / ML

Director of Software Development - AI / ML

Autodesk • Vancouver, Metro Vancouver Regional District, Canada
Temps plein
Autodesk is looking for an innovative and dynamic AI and ML Engineering Director to define the AI technical strategy for our AIR organization. This role involves leading teams of AI engineers to dev...Voir plus
Dernière mise à jour : il y a 24 jours • Offre sponsorisée
Software Engineer - II

Software Engineer - II

FISPAN • Vancouver, BC, Canada
Permanent
FISPAN) is an Enterprise SaaS FinTech company that allows banks to deploy embedded financial products and services to create a seamless banking connection for their corporate clients.Our product ai...Voir plus
Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
Full Stack Engineer

Full Stack Engineer

Targeted Talent • Delta, BC, Canada
Temps plein
We are searching for a creative, flexible technical thinker capable of managing, planning and understanding team dynamics. Responsible for authoring, analyzing and translating User Stories into syst...Voir plus
Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
Full Stack Engineer

Full Stack Engineer

Vedan Technologies • richmond, bc, ca
Temps plein
Job Title : Full Stack Engineer.We are looking for a Full Stack Engineer who can design, build, and ship high-quality, production-ready features across the stack. You will work on modern front-end ap...Voir plus
Dernière mise à jour : il y a 1 heure • Offre sponsorisée • Nouvelle offre
Senior Generative AI Software Developer (ID#5114)

Senior Generative AI Software Developer (ID#5114)

freelance.ca • Richmond, Canada
Temps plein
This contract position follows a hybrid model and requires onsite presence in Richmond, BC a minimum of three days per week. Design and build applications using OpenAI, Azure OpenAI, and open-source...Voir plus
Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
Senior Software Engineer - Keela

Senior Software Engineer - Keela

Velora • Vancouver, BC, Canada
Temps plein
We're excited to share that Aplos, Raisely, and Keela have come together to form one unified company,.While we continue to offer the products you know and love, we now operate as one team, dedi...Voir plus
Dernière mise à jour : il y a 14 jours • Offre sponsorisée
Sr. Machine Learning Engineer, Off-board Perception

Sr. Machine Learning Engineer, Off-board Perception

Serve Robotics • Vancouver, BC, Canada
Temps plein
At Serve Robotics, we’re reimagining how things move in cities.Our personable sidewalk robot is our vision for the future. It’s designed to take deliveries away from congested streets, m...Voir plus
Dernière mise à jour : il y a 4 jours • Offre sponsorisée
Mechatronics Engineer

Mechatronics Engineer

Red Rabbit Robotics • Burnaby, BC, CA
Temps plein
Quick Apply
At Red Rabbit Robotics, we are on a mission to solve global labor shortage and create a future of abundance.We aim to deploy one million humanoid robots in the next 10 years, eventually producing a...Voir plus
Dernière mise à jour : il y a plus de 30 jours
Implementation Engineer

Implementation Engineer

Querentia • richmond, bc, ca
Temps plein
Saviynt and identity integration technologies.This role will focus on implementing and integrating identity and access management (IAM) solutions across enterprise environments, ensuring secure and...Voir plus
Dernière mise à jour : il y a 1 heure • Offre sponsorisée • Nouvelle offre
Senior Software Engineer - Credit

Senior Software Engineer - Credit

Marqeta, Inc. • Vancouver, Toronto, Metro Vancouver Regional District, Ontario, Canada
Télétravail
Temps plein
As a Senior Software Engineer on Marqeta’s Credit team you will play a pivotal role in shaping how credit is accessed, evaluated, and delivered at scale, directly impacting the financial lives of m...Voir plus
Dernière mise à jour : il y a 11 jours • Offre sponsorisée
Software Development Engineer 1

Software Development Engineer 1

Actalent • Vancouver, BC, Canada
Temps plein
HIRING ASAP! If interested in more information / direct feedback, please reach out to me directly at .FEFF;Below are some details about the position : . PAY : $40-43 an hour depending on experi...Voir plus
Dernière mise à jour : il y a 12 jours • Offre sponsorisée
Technical Product Engineer

Technical Product Engineer

Progressive Automations • Richmond, BC, Canada
Temps plein
Progressive Automations is one of the top manufacturers and distributors of linear actuators and home / office automation. We have over a decade of experience in the industry and are quickly growing.O...Voir plus
Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
Design Engineer (Global Remote)

Design Engineer (Global Remote)

vidIQ • Vancouver, BC, CA
Télétravail
Temps plein
Product Managers did in 2019? AI can do it now.Software Engineers did? AI can do that too.The future isn’t about choosing one — it’s about becoming them all. At vidIQ, we call them Design Engineers....Voir plus
Dernière mise à jour : il y a 21 jours
Delivery Manager (Agile Software Projects) - HaknaSoft

Delivery Manager (Agile Software Projects) - HaknaSoft

HaknaSoft • delta, bc, ca
Temps plein
HaknaSoft is looking for an experienced.PI Planning, sprints, agile ceremonies) and will be responsible for managing and coordinating a. Your role is to deliver value to the end users by driving fea...Voir plus
Dernière mise à jour : il y a 1 heure • Offre sponsorisée • Nouvelle offre