Talent.com
Themesoft Inc.
No longer accepting applications
Reliability Engineer with AI Automation Focus

Themesoft Inc. • Toronto, Canada
13 days ago
Job type
  • Full-time
Job description
Shape the future of operations as a Reliability Engineer specializing in AI. Utilize tools like Dynatrace and Moogsoft to create intelligent automation solutions that enhance system resilience.

In this role, you will merge Site Reliability Engineering with AI Ops practices to optimize production environments. Focus on implementing monitoring solutions, designing automated operational workflows, and fostering self-healing systems. Your expertise will play a critical role in reducing alerts, improving performance, and driving system reliability.

Key Responsibilities:
  • Optimize AI-driven observability and monitoring tools
  • Engineer automated workflows for remediation and analysis
  • Configure PagerDuty for efficient incident management
  • Apply SRE frameworks to enhance system stability
  • Develop automation logic through Python scripting

Requirements:
  • Background in cloud infrastructure and distributed systems
  • Proficient in Dynatrace, Moogsoft, and Ansible
  • Experience with CI/CD integrations via Git
  • Strong understanding of performance metrics and reliability practices
  • Passion for continuous improvement in operational processes

Lead the transformation of operations with AI-based solutions that promote efficiency and reliability.

Similar jobs

AI-Focused Senior Site Reliability Engineer

Tech Insights • Toronto, ON, CA
Full-time

Advance your career at TechInsights as a Senior Site Reliability Engineer with a focus on AI operations. Shape AI infrastructure reliability and lead innovative solutions remotely in Canada. In this ...

RPA & Agentic AI Engineer - Reliable Automations

Vancity • Toronto, ON, CA
Full-time

A leading financial cooperative is seeking an RPA Automation Engineer to develop and deploy automation solutions using UiPath. This role blends engineering and collaboration with teams, ensuring rob...

Lead Site Reliability Engineer Innovating AI Tools and Standards

Coalition Inc • Toronto, ON, CA
Full-time

Shape the future of AI in site reliability engineering as a Staff SRE. Drive impactful standards, tooling, and integrations while ensuring reliable development practices in a remote-first culture. As...

Innovative AI Automation Engineer for Remote Product Development

Valsoft Corp • Toronto, ON, CA
Remote
Full-time

Transform business operations as an AI Automation Engineer. Design and deploy AI agents that drive efficiency and generate revenue in a fully remote role. This position focuses on developing AI solut...

AI Systems Reliability Engineer

Tenstorrent Inc. • Toronto, ON, CA
Full-time

Be a part of pioneering AI technology as an AI Systems Reliability Engineer. Ensure operational health and system reliability across varied environments in a hybrid working scenario. In this role, yo...

Lead Platform Reliability Engineer, Global AI Platform & Solutions

Manulife • Toronto, ON, CA
Full-time

The Lead Platform Reliability Engineer (PRE) ensures the stability, performance, and scalability of the shared platform that supports internal AI solution development. It combines software engineeri...

Production Kubernetes Engineer—Reliability & Automation

Paymentus • Richmond Hill, York Region, CA
Full-time

A leading payments technology company is seeking an Operations Engineer in Richmond Hill, Canada, to ensure production uptime. The role requires managing critical operations tasks and optimizing Kub...

AI & Automation Leader for Enterprise Productivity

Choice Properties • Toronto, ON, CA
Full-time

Clair Avenue East, Toronto, Ontario, M4T 2S5. Choice Properties is looking for a Manager, AI & Automation to join our team! The Manager, AI & Automation must be a self-starter, a strategic thinker, ...

Lead Platform Reliability Engineer – Global AI Platform (Hybrid)

Manulife Financial • Toronto, ON, CA
Full-time

A leading international financial services provider is seeking a Lead Platform Reliability Engineer to ensure the stability and performance of their shared platform. This role involves defining reli...

Remote Applied AI Engineer: Automation

North Eastern Services • Toronto, ON, CA
Remote
Full-time

A leading AI solutions company is seeking an Applied AI Engineer in Toronto to design and implement high-impact AI and automation solutions for clients. This mid-to-senior role demands excellent Pyt...

AI Systems Reliability Engineer Position

Tenstorrent • Toronto, ON, CA
Full-time

Become a Site Reliability Engineer to support cutting-edge AI technologies. Ensure system reliability and operational effectiveness utilizing your Linux and automation skills in a hybrid setup. In th...

Senior AI/ML Engineer - Site Reliability Engineering

RBC • Toronto, ON, CA
Full-time

Join RBC's Site Reliability Engineering team as a founding member building the bank's first‑ever Agentic AI platform for software reliability and resiliency. You'll pioneer intelligent automation sy...

Senior Solution Engineer – AI & Automation (Remote)

Automation Anywhere Inc. • Toronto, ON, CA
Remote
Full-time

A leader in innovative automation solutions, based in Toronto, is seeking a Sr. Solution Engineer to drive Agentic Process Automation. You'll be responsible for generating high-value automation ideas...

Site Reliability Engineer with Automation Focus

Yelp • Toronto, ON, CA
Full-time

Join a collaborative, remote SRE team dedicated to ensuring service reliability. In this role, leverage your expertise in automation and systems management to support a platform serving millions. You...

SaaS Operations Engineer | Reliability & Automation

Guidepoint • Toronto, ON, CA
Full-time

A leading data intelligence firm is seeking an Operations Engineer to manage SaaS platform operations and provide tier-2 support. You will be responsible for system administration, incident response...

Senior AI Engineer - LLM & Reliability Lead (Equity)

Rootly • Toronto
Full-time

A cutting-edge tech company in Toronto is seeking a Senior AI Engineer to lead the development of an AI-powered reliability assistant. This is a unique role focused on LLM engineering, allowing for ...

Platform Reliability Engineer for AI Solutions and Development

Société Financière Manuvie • Toronto, ON, CA
Full-time

Enhance the functionality of AI technology as a Lead Platform Reliability Engineer. Focus on performance metrics, automated solutions, and collaborative efforts to project success and stability. In t...

Site Reliability Engineer, AI/ML Infrastructure

Boson AI • Toronto, ON, CA
Full-time

We're looking for a Senior Site Reliability Engineer to help us run one of the most exciting GPU clusters around: our Toronto datacenter packed with NVIDIA H100 and A100 GPUs, over 20PB of Ceph stor...

Remote AI Automation Engineer — Build and Ship AI Agents

Valsoft Corporation • Toronto, ON, CA
Remote
Full-time

A growing technology firm in Canada is seeking an AI Automation Engineer to design and build AI agents for operational tasks and new AI products. This role involves collaborating with teams, automat...

Remote Applied AI Engineer: Automation & LLMs

Fusemachines • Toronto, ON, CA
Remote
Full-time

A leading AI services provider in Toronto is seeking an Applied AI Engineer (Automation) to create and deploy impactful AI solutions. This role involves collaborating with various stakeholders to int...