AI/ML Model Compression & Quantization Engineer - I Machines, Inc.

I Machines, Inc. • Oshawa, ON, CA
17 hours ago
Job type
  • Full-time
Job description

About Our Company

We’re a fast-paced, fabless semiconductor startup redefining the boundaries of AI through a cutting-edge, AI-infused multipurpose compute architecture. Our mission is to deliver scalable, efficient, and intelligent silicon solutions for the next generation of edge AI, robotics, autonomous systems, and mobile devices. Our leadership team brings together decades of experience in semiconductor innovation, spanning chip architecture, system design, and global business operations. The team includes pioneers behind several generations of groundbreaking compute architectures; experts in software-hardware co-design, SoC, and AI development, with hundreds of patents in our portfolio; and leaders of multi-billion-dollar business units at top-tier technology companies.


Position Overview

This is a great opportunity to join a highly skilled AI/ML software team working at the intersection of HW/SW co-design. In this role, you will design and execute end-to-end model compression pipelines, including sensitivity analysis, quantization, pruning, and hybrid optimization techniques, across large-scale transformer architectures.


Key Responsibilities and Duties

Build and own the end-to-end compression pipeline

  • Baseline benchmarking and instrumentation
  • Sensitivity analysis
  • Transformation mapping (quantization, sparsity, low-rank)

Implement layerwise sensitivity scoring frameworks

Design and apply quantization strategies

  • PTQ, QAT, mixed-precision quantization
  • INT8, INT4, FP8, FP4 exploration

Develop mixed-precision policies

  • Per-layer/tensor precision assignment
  • Dynamic range calibration and scaling strategies

Implement and evaluate pruning techniques

Apply hybrid compression methods

  • Sparse + quantized pipelines
  • Low-rank decomposition

Run post-transformation recovery

  • QAT, LoRA-based recovery, distillation

Benchmark across:

  • Accuracy degradation
  • Latency / throughput
  • Memory footprint

Optimize for iMachine Architecture


Qualifications and Skills

Successful candidates should possess the following qualifications and skills:

Required Qualifications (You must possess these qualifications to be considered for the position)

Bachelor of Science Degree in Electrical Engineering, Computer Science, Computer Engineering, or related field

1+ year of experience with PyTorch / JAX / TensorFlow

Understanding of:

  • Transformer architectures (LLM, VLM, VLA)
  • Numerical precision and quantization theory

Hands-on experience with:

  • TensorRT, ONNX Runtime, or similar inference stacks

Familiarity with:

  • Sparse representations (CSR, COO, RLC)
  • Low-rank approximation methods (SVD, factorization)

Ability to analyze:

  • Activation distributions
  • Gradient statistics
  • Numerical stability issues

Preferred Qualifications

MS or PhD in Electrical Engineering, Computer Engineering, Computer Science, or related field

Experience with:

  • FP8 / FP4 pipelines
  • Hardware-aware optimization

Prior work on:

  • Multimodal models (vision-language, robotics policies)

Knowledge of:

  • Compiler stacks (TVM, Triton, XLA)

Expectations

  • Deliver production-ready compressed models with minimal accuracy loss
  • Achieve quantifiable performance gains (latency, memory, throughput)
  • Provide clear layerwise transformation justifications
  • Build reusable tooling and automation pipelines
  • Iterate quickly using data-driven decision making


Why Join Us

  • Get in early at a breakthrough deep-tech startup reshaping AI compute
  • Work closely with industry innovators and experienced leaders where your work will have a direct impact on the success of the company
  • Be part of a mission-driven team building foundational technology for the future
  • We balance sharp execution with continuous innovation to push the boundaries
  • Competitive compensation, equity, and growth opportunities

Benefits and Perks

At I Machines, Inc., we offer competitive salaries and a comprehensive benefits package, including:

  • Health, dental, and vision insurance
  • Retirement savings plans
  • Paid time off and holidays
  • Professional development opportunities
  • Flexible schedule


Equal Opportunity Employer

I Machines, Inc., is an equal opportunity employer and does not discriminate based on race, color, religion, gender, national origin, age, disability, or any other legally protected status. All qualified applicants will be considered for employment.
