No longer accepting applications
Lead Inference Platform Support Engineer - AI I

PowerToFly • Winnipeg, Canada
5 days ago
Job type
  • Full-time
Job description
About the Role

As a Lead Inference Platform Engineer, you will:

Optimize LLMs and ML models for high-performance inference using techniques such as quantization, pruning, distillation, and hardware-specific tuning

Deploy and scale inference workloads on GPUs across AWS, Azure, GCP, and internal Kubernetes clusters, ensuring predictable performance during peak traffic, especially in business hours

Implement routing and failover strategies for OpenAI/Anthropic/Vertex AI traffic

Integrate models into production-grade APIs supporting TR products and enterprise workflows.

Develop highly optimized environments and eliminate performance bottlenecks to reduce latency

Collaborate with Platform Engineering teams (Landing Zones, Network, Storage, Compute, AI) to ensure inference workloads align with TR’s cloud-native patterns (AWS, Azure, GCP, OCI)

Build and optimize containerized inference pipelines using Kubernetes for large‑scale distributed workloads

Ensure compliance with TR’s AI standards for deployment, monitoring, governance, and drift detection

Profile inference performance, identify GPU/CPU bottlenecks, and optimize compute utilization across heterogeneous hardware

Implement observability and health monitoring for inference pipelines, ensuring reliability of enterprise AI services.

Collaborate with platform teams to enhance capacity forecasting for AI workloads

Work with Product, Data Science, Architecture, and Enterprise AI teams to onboard new research models into production

Collaborate closely with AI engineers to invent new quantization techniques, improve numerical precision, and explore non‑standard architectures.

Partner with Cloud Engineers (Azure, AWS, GCP) to develop guardrails and automation that support inference workloads.

Support the scale-out of AI infrastructure during critical releases and global product rollouts.

About You

You are a potential fit for the role of Lead Inference Platform Engineer if your background includes:

Required Skills & Qualifications

Strong understanding of ML/LLM fundamentals and inference optimization techniques.

Hands‑on experience with GPU programming (CUDA preferred), inference runtimes (TensorRT, ONNX Runtime), and deep learning frameworks (PyTorch/TensorFlow)

Proficiency in Python and at least one systems language (C++ strongly preferred for performance-critical inference paths)

Experience deploying AI workloads to AWS/GCP/Azure and Kubernetes

Familiarity with vector search systems (OpenSearch vectors) and retrieval-augmented generation pipelines

Knowledge of distributed systems, microservices, CI/CD, and cloud-native architecture

Experience with neural network architectures such as CNNs, transformers, and diffusion models, and their performance characteristics

Understanding of GPUs, multithreading, and/or other accelerators with vectorized instructions

Specialized experience in one or more of the following machine learning/deep learning domains: model compression, hardware-aware model optimization, hardware accelerator architecture, GPU/ASIC architecture, machine learning compilers, high-performance computing, performance optimization, numerics, and SW/HW co-design.

Preferred Qualifications

3+ years of production experience deploying ML/LLM models at scale

Experience managing GPU fleets or inference clusters across public cloud and container platforms.

Experience supporting enterprise-grade AI workloads in regulated or compliance-heavy environments.

What’s in it For You?

Hybrid Work Model: We’ve adopted a flexible hybrid working environment (2-3 days a week in the office depending on the role) for our office-based roles while delivering a seamless experience that is digitally and physically connected.

Flexibility & Work-Life Balance: Flex My Way is a set of supportive workplace policies designed to help manage personal and professional responsibilities, whether caring for family, giving back to the community, or finding time to refresh and reset. This builds upon our flexible work arrangements, including work from anywhere for up to 8 weeks per year, empowering employees to achieve a better work-life balance.

Career Development and Growth: By fostering a culture of continuous learning and skill development, we prepare our talent to tackle tomorrow’s challenges and deliver real-world solutions. Our Grow My Way programming and skills-first approach ensures you have the tools and knowledge to grow, lead, and thrive in an AI-enabled future.

Industry Competitive Benefits: We offer comprehensive benefit plans that include flexible vacation, two company-wide Mental Health Days off, access to the Headspace app, retirement savings, tuition reimbursement, employee incentive programs, and resources for mental, physical, and financial wellbeing.

Culture: Globally recognized, award-winning reputation for inclusion and belonging, flexibility, work-life balance, and more. We live by our values: Obsess over our Customers, Compete to Win, Challenge (Y)our Thinking, Act Fast / Learn Fast, and Stronger Together.

Social Impact: Make an impact in your community with our Social Impact Institute. We offer employees two paid volunteer days off annually and opportunities to get involved with pro-bono consulting projects and Environmental, Social, and Governance (ESG) initiatives.

Making a Real-World Impact: We are one of the few companies globally that helps its customers pursue justice, truth, and transparency. Together, with the professionals and institutions we serve, we help uphold the rule of law, turn the wheels of commerce, catch bad actors, report the facts, and provide trusted, unbiased information to people all over the world.

Thomson Reuters complies with local laws that require upfront disclosure of the expected pay range for a position. The base compensation range varies across locations. For Ontario, Canada, the base compensation range for this role is $140,000 CAD - $175,000 CAD. Base pay is positioned within the range based on several factors including an individual’s knowledge, skills and experience with consideration given to internal equity. Base pay is one part of a comprehensive Total Reward program which also includes flexible and supportive benefits and other wellbeing programs. This role may also be eligible for an Annual Bonus based on a combination of enterprise and individual performance.

As a global business, we rely on the unique backgrounds, perspectives, and experiences of all employees to deliver on our business goals. To ensure we can do that, we seek talented, qualified employees in all our operations around the world regardless of race, color, sex/gender, including pregnancy, gender identity and expression, national origin, religion, sexual orientation, disability, age, marital status, citizen status, veteran status, or any other protected classification under applicable law. Thomson Reuters is proud to be an Equal Employment Opportunity Employer providing a drug-free workplace.

We also make reasonable accommodations for qualified individuals with disabilities and for sincerely held religious beliefs in accordance with applicable law. More information on requesting an accommodation here.
