Talent.com
PowerToFly
No longer accepting applications
Senior Inference Engineer - AI

PowerToFly • Toronto, Canada
11 days ago
Salary
CA$110,000.00 yearly
Job type
  • Full-time
Job description
New Position: This position is open due to an existing vacancy to support our evolving business needs.

Thomson Reuters is seeking a Senior Inference Engineer, AI. This person will collaborate with platform teams to enhance capacity forecasting for AI workloads and work with Product, Data Science, Architecture, and Enterprise AI teams to onboard new research models into production.

About the Role

Within Platform Engineering and Enterprise AI Services, an AI Inference Engineer is responsible for productionizing, optimizing, and scaling the AI and LLM workloads that power TR’s AI-driven products.

This role ensures that our trained models, from classical ML to generative AI, run efficiently across TR’s multi-cloud footprint (AWS, Azure, GCP, OCI), meet strict enterprise reliability requirements, and integrate seamlessly with our data backbone (Snowflake, OpenSearch vector search, API-managed model routing).

The successful candidate will help build the next generation of TR’s AI infrastructure, working alongside cloud engineering, data engineering, product teams, and AI Services.

As a Senior Inference Engineer, AI, you will:

Optimize LLMs and ML models for high-performance inference using techniques such as quantization, pruning, distillation, and hardware specific tuning

Deploy and scale inference workloads on GPUs across AWS, Azure, GCP, and internal Kubernetes clusters, ensuring predictable performance during peak traffic, especially during business hours

Implement routing and failover strategies for OpenAI/Anthropic/Vertex AI traffic

Integrate models into production grade APIs supporting TR products and enterprise workflows.

Develop highly optimized environments and eliminate performance bottlenecks to reduce latency

Collaborate with Platform Engineering teams (Landing Zones, Network, Storage, Compute, AI) to ensure inference workloads align with TR’s cloud native patterns (AWS, Azure, GCP, OCI)

Build and optimize containerized inference pipelines using Kubernetes for large-scale distributed workloads

Ensure compliance with TR’s AI standards for deployment, monitoring, governance, and drift detection

Profile inference performance, identify GPU/CPU bottlenecks, and optimize compute utilization across heterogeneous hardware

Implement observability and health monitoring for inference pipelines, ensuring reliability of enterprise AI services

Collaborate closely with AI engineers to invent new quantization techniques, improve numerical precision, and explore non‑standard architectures

Support the scale-out of AI infrastructure during critical releases and global product rollouts

Partner with Cloud Engineers (Azure, AWS, GCP) to develop guardrails and automation that support inference workloads

About You

You are a potential fit for the role of Senior Inference Engineer, AI, if your background includes:

5+ years of relevant experience

Strong understanding of ML/LLM fundamentals and inference optimization techniques.

Hands‑on experience with GPU programming (CUDA preferred), inference runtimes (TensorRT, ONNX Runtime), and deep learning frameworks (PyTorch/TensorFlow)

Proficiency in Python and at least one systems language (C++ strongly preferred for performance critical inference paths)

Experience deploying AI workloads to AWS/GCP/Azure and Kubernetes

Familiarity with vector search systems (OpenSearch vectors) and retrieval augmented generation pipelines

Knowledge of distributed systems, microservices, CI/CD, and cloud native architecture

What’s in it For You?

Hybrid Work Model: We’ve adopted a flexible hybrid working environment (2-3 days a week in the office depending on the role) for our office‑based roles while delivering a seamless experience that is digitally and physically connected.

Flexibility & Work-Life Balance: Flex My Way is a set of supportive workplace policies designed to help manage personal and professional responsibilities, whether caring for family, giving back to the community, or finding time to refresh and reset. This builds upon our flexible work arrangements, including work from anywhere for up to 8 weeks per year, empowering employees to achieve a better work‑life balance.

Career Development and Growth: By fostering a culture of continuous learning and skill development, we prepare our talent to tackle tomorrow’s challenges and deliver real‑world solutions. Our Grow My Way programming and skills‑first approach ensures you have the tools and knowledge to grow, lead, and thrive in an AI‑enabled future.

Industry Competitive Benefits: We offer comprehensive benefit plans to include flexible vacation, two company‑wide Mental Health Days off, access to the Headspace app, retirement savings, tuition reimbursement, employee incentive programs, and resources for mental, physical, and financial wellbeing.

Culture: Globally recognized, award‑winning reputation for inclusion and belonging, flexibility, work‑life balance, and more. We live by our values: Obsess over our Customers, Compete to Win, Challenge (Y)our Thinking, Act Fast / Learn Fast, and Stronger Together.

Social Impact: Make an impact in your community with our Social Impact Institute. We offer employees two paid volunteer days off annually and opportunities to get involved with pro‑bono consulting projects and Environmental, Social, and Governance (ESG) initiatives.

Making a Real-World Impact: We are one of the few companies globally that helps its customers pursue justice, truth, and transparency. Together, with the professionals and institutions we serve, we help uphold the rule of law, turn the wheels of commerce, catch bad actors, report the facts, and provide trusted, unbiased information to people all over the world.

Our use of AI within the recruitment process

Thomson Reuters utilizes Artificial Intelligence (AI) to support parts of our global recruitment process. Unless you opt out, our AI system will assess the information you provide, compare it to the requirements listed for the role, and present the result to our recruitment personnel for further review. The AI system acts as a supporting tool, but a human always decides whether you will be considered for the role.

In the United States, Thomson Reuters offers a comprehensive benefits package to our employees. Our benefit package includes market-competitive health, dental, vision, disability, and life insurance programs, as well as a competitive 401k plan with company match. In addition, Thomson Reuters offers market-leading work-life benefits with competitive vacation, sick and safe paid time off, paid holidays (including two company mental health days off), parental leave, and sabbatical leave. These benefits meet or exceed the requirements of paid time off in accordance with any applicable state or municipal laws. Finally, Thomson Reuters offers the following additional benefits: optional hospital, accident and sickness insurance paid 100% by the employee; optional life and AD&D insurance paid 100% by the employee; Flexible Spending and Health Savings Accounts; fitness reimbursement; access to Employee Assistance Program; Group Legal Identity Theft Protection benefit paid 100% by employee; access to 529 Plan; commuter benefits; Adoption & Surrogacy Assistance; Tuition Reimbursement; and access to Employee Stock Purchase Plan.

Thomson Reuters complies with local laws that require upfront disclosure of the expected pay range for a position. The base compensation range varies across locations. For any eligible US locations, unless otherwise noted, the base compensation range for this role is $110,000 USD - $204,200 USD. For Ontario, Canada, the base compensation range for this role is $100,000 CAD - $145,000 CAD. Base pay is positioned within the range based on several factors including an individual’s knowledge, skills and experience with consideration given to internal equity. Base pay is one part of a comprehensive Total Reward program which also includes flexible and supportive benefits and other wellbeing programs. This role may also be eligible for an Annual Bonus based on a combination of enterprise and individual performance.

About Us

Thomson Reuters informs the way forward by bringing together the trusted content and technology that people and organizations need to make the right decisions. We serve professionals across legal, tax, accounting, compliance, government, and media. Our products combine highly specialized software and insights to empower professionals with the data, intelligence, and solutions needed to make informed decisions, and to help institutions in their pursuit of justice, truth, and transparency. Reuters, part of Thomson Reuters, is a world-leading provider of trusted journalism and news.

We are powered by the talents of 26,000 employees across more than 70 countries, where everyone has a chance to contribute and grow professionally in flexible work environments. At a time when objectivity, accuracy, fairness, and transparency are under attack, we consider it our duty to pursue them. Sound exciting? Join us and help shape the industries that move society forward.

As a global business, we rely on the unique backgrounds, perspectives, and experiences of all employees to deliver on our business goals. To ensure we can do that, we seek talented, qualified employees in all our operations around the world regardless of race, color, sex/gender, including pregnancy, gender identity and expression, national origin, religion, sexual orientation, disability, age, marital status, citizen status, veteran status, or any other protected classification under applicable law. Thomson Reuters is proud to be an Equal Employment Opportunity Employer providing a drug‑free workplace.

Thomson Reuters makes reasonable accommodations for applicants with disabilities, including veterans with disabilities, and for sincerely held religious beliefs in accordance with applicable law. If you reside in the United States and require an accommodation in the recruiting process, you may contact our Human Resources Department at HR.Leave-Expert@thomsonreuters.com. Disability accommodations in the recruiting process may include things like a sign language interpreter, making interview rooms accessible, providing assistive technology, or other relevant accommodations. Please note this email is not intended for general recruitment questions and we will promptly respond to inquiries regarding accommodations. More information on requesting an accommodation here.

Learn more on how to protect yourself from fraudulent job postings here.

More information about Thomson Reuters can be found on thomsonreuters.com
