Talent.com

Reliability engineer Jobs in Etobicoke, ON

Last updated: 3 days ago
Site Reliability Engineer
Scotiabank • Toronto, ON, CA
Full-time
Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture. We’re looking for an SRE with deep experience in production observability and incident response...
Last updated: 30+ days ago

Senior Site Reliability Engineer (SRE) - Production Management, Operate
Deloitte • Toronto, ON, CA
Temporary
At Deloitte, our Purpose is to make an impact that matters. We exist to inspire and help our people, organizations, communities, and countries to thrive by building a better future. Our work underpin...
Last updated: 4 days ago

Reliability Engineer
Alstom • Toronto, ON, CA
Full-time
University degree in an Engineering discipline or technologist course relevant to job/equipment function. System/Vehicle Engineer preferably in Rail Transit industries and a track record of at least...
Last updated: 10 days ago

Site Reliability Engineer - Tangerine
Tangerine • Toronto, ON, CA
Permanent
Manages the team workflow to maximize business and technical efficiencies. Develop and guide the team members in enhancing their technical capabilities and increasing productivity. Supervises IT Supp...
Last updated: 6 days ago

Site Reliability Engineer – Dynatrace
Astra North Infoteck Inc. • Toronto, ON, CA
Permanent
Job Description: Skills: Dynatrace, Observability, Monitoring Engineering, SRE Practices. We are seeking a highly skilled Dynatrace Monitoring Engineer / Site Reliability Engineer (SRE) responsible ...
Last updated: 3 days ago

OPENSHIFT SRE {Site Reliability Engineer}
Randstad Canada • Mississauga, Ontario, CA
Permanent
We’re looking for an OpenShift Engineer / Administrator to design, implement, and maintain secure, resilient OpenShift/Kubernetes clusters. This is not a DevOps or app deployment role; it’s a true en...
Last updated: 30+ days ago

Site Reliability Engineer/ Entertainment/ Toronto
Motion Recruitment • Toronto, ON, Canada
Full-time
An innovative interactive entertainment technology organization is seeking a Software Engineer with a strong foundation in software development and a passion for building the infrastructure that po...
Last updated: 10 days ago

Ansible Engineer
Opticca • Toronto, ON, CA
Full-time
Opticca Consulting is seeking a Senior Ansible Automation Engineer involving the deployment and migration to Red Hat Ansible Automation Platform (AAP) on Microsoft Azure. The consultant will play a ...
Last updated: 11 days ago

Data Engineer
Manulife • CAN, Ontario, Toronto, 200 Bloor Street East
Full-time
This Data Engineer role is essential for driving efficiency and innovation within our organization. By developing and maintaining automation, this position helps streamline operations and improve ef...
Last updated: 30+ days ago

Rotating Engineer – Onshore Reliability
Hudson Manpower • Toronto, ON, CA
Full-time
Rotating Engineer – Onshore Reliability. Bachelor’s Degree in Mechanical Engineering. Oil & Gas / Refinery (Onshore). The Rotating Engineer – Onshore Reliability will be responsible for ensuring the o...
Last updated: 5 days ago

Engineer
Toronto Hydro • Toronto, ON, CA
Full-time
Target Variable Performance Pa. The salary range shown above reflects the expected compensation for this position. The final salary offered will be determined based on a holistic assessment of the ca...
Last updated: 30+ days ago

Senior Site Reliability Engineer
Royal Bank of Canada • Toronto, Canada
Full-time
This role will be responsible for the development, implementation, and support of Site Reliability Engineering (SRE) solutions for applications supported by the Digital Branch SRE organization. As t...
Last updated: 14 days ago

Site Reliability Engineer
iManage • Toronto, ON, CA
Full-time
SRE is part of a global organization that leverages the latest technology to communicate with our colleagues across the globe. We organize ourselves into distributed teams -- SRE teams are anchored ...
Last updated: 30+ days ago

Senior Site Reliability Engineer
Royal Bank of Canada • Toronto, Ontario, Canada
Full-time
This role will be responsible for the development, implementation, and support of Site Reliability Engineering (SRE) solutions for applications supported by the Digital Branch SRE organization. As t...
Last updated: 14 days ago

Sales Engineer
NetDynamic Consulting Incorporated • Mississauga, Ontario, CA
Full-time
NetDynamic Consulting is a certified SuiteSuccess partner and official NetSuite Alliance partner. We work across multiple industries, serving clients with tailored ERP solutions and a proven record ...
Last updated: 26 days ago

Process Engineer
Martinrea International Inc. • Mississauga, ON, CAN
Full-time
Lightweight Structures and Propulsion Systems. We employ approximately 19,000 skilled and currently operate in 59 locations in Canada, the United States, Mexico, Brazil, Germany, Slovakia, Spain, Ch...
Last updated: 3 days ago

Flutter Engineer
CMiC • Toronto, ON, CA
Full-time
We believe that working with Flutter is pure joy for any person that appreciates the intricacies of architecture and engineering. The principle of widget composition is genius. It not only gives a fr...
Last updated: 30+ days ago

Security Engineer - Crypto Engineer
The Toronto-Dominion Bank (Canada) • Toronto, Ontario
Full-time
The Crypto Solution Validation team is responsible for certifying data protection enterprise solution or new use cases for the bank. We are tasked with Lab build, test planning and design, test exec...
Last updated: 12 days ago

Geotechnical Engineer
TalentSphere • Mississauga, ON, Canada
Full-time
Location: Mississauga, Ontario. Salary: $120,000-$150,000 (based on experience). Our client is a Canadian-owned and operated engineering consulting firm specialized in geotechnical and environmental ...
Last updated: 30+ days ago

Site Reliability Engineer

Scotiabank • Toronto, ON, CA
Last updated: 30+ days ago
Job type
  • Full-time
Job description

Requisition ID: 251796

Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture.

We’re looking for an SRE with deep experience in production observability and incident response to raise the reliability and transparency of our customer-facing services. You will own the end-to-end observability stack across Dynatrace, Splunk, Power BI, and Google Cloud (GCP) Monitoring, drive proactive detection and reduction of toil, and lead major incident response. This role focuses on operational excellence and service health, NOT on platform engineering or DevOps provisioning.

Is this role right for you? In this role you will:

  • Design and maintain end-to-end monitoring for critical services using Dynatrace (APM, Real User Monitoring, Synthetic, Davis AI, Smartscape) and GCP Cloud Monitoring (metrics, alerting policies, SLOs/SLIs, uptime checks, dashboards).
  • Build service maps, dependency models, and problem detection in Dynatrace; tune Davis AI problem rules and reduce alert noise through thresholds, baselining, and tagging.
  • Implement SLOs/SLIs with error budgets; continuously review burn rates and align alerting to customer impact.
  • Partner with application teams to instrument code paths (e.g., Dynatrace OneAgent), trace distributed transactions, and validate golden signals (latency, traffic, errors, saturation).
  • Create and optimize Splunk data models, indexes, sourcetypes, ingestion pipelines, and SPL searches; build actionable dashboards for NOC/SRE/Engineering.
  • Develop operational analytics and executive reporting in Power BI (data modeling, DAX/Measures, scheduled refresh) to track reliability KPIs, incident trends, MTTR/MTTD, SLO compliance, and capacity signals.
  • Establish governance for data quality, field extractions, and retention to ensure fast, accurate investigations.
  • Lead incident response (Sev1/Sev2): run bridges, coordinate SMEs, communicate status/timelines, drive mitigation and customer updates.
  • Maintain runbooks, decision trees, and standard operating procedures; ensure blameless post-incident reviews (PIRs) with clear RCA, corrective actions, and preventative measures.
  • Track and close problem tickets tied to recurring failure modes; verify effectiveness of fixes via metrics and error budgets.
  • Use light coding/scripting to automate recurring tasks: alert tuning, data enrichment, log parsing, playbook triggers, service health checks.
  • Build small utilities or bots for on-call workflows (e.g., auto-triage, context gathering, incident timelines).
  • Contribute to observability standards and best practices (naming, tags, SLIs, alert policies), and mentor teams on instrumenting for reliability.

Note: This role does NOT manage CI/CD, infrastructure provisioning, or platform build (Terraform/Kubernetes cluster ops). Collaboration with those teams is expected, but ownership remains on monitoring, analytics, incident response, and reliability outcomes.

Do you have the skills that will enable you to succeed in this role? We’d love to work with you if you have:

  • 5+ years in SRE/Production Operations/Observability with Dynatrace and Splunk in high-availability environments.
  • Hands-on with GCP operations: Cloud Monitoring, Cloud Logging, Alerting Policies, Uptime Checks, SLOs/SLIs; familiarity with Error Reporting/Trace is a plus.
  • Strong SPL (Splunk) and Dynatrace (APM/RUM/Synthetic) expertise—including alert design, dashboards, and noise reduction.
  • Power BI proficiency: data modeling, DAX measures, role-level security, and scheduled refresh for operational/Exec reporting.
  • Proven incident commander experience for Sev1/Sev2 with clear comms, stakeholder management, and PIR facilitation.
  • Coding/scripting for automation and data manipulation (e.g., Python or PowerShell; Go/Bash a plus).
  • Solid understanding of service reliability concepts: golden signals, SLOs/error budgets, capacity and saturation, graceful degradation.
  • Strong analytical mindset with a bias to measurable outcomes (MTTD/MTTR, alert volume, SLO compliance).

What's in it for you?

  • Diversity, Equity, Inclusion & Allyship - We strive to create an inclusive culture where every employee is empowered to reach their fullest potential, respected for who they are, and embraced through bias-free practices and inclusive values across Scotiabank. We embrace diversity and provide opportunities for all employees to learn, grow & participate through our various Employee Resource Groups (ERGs) that span across diverse gender identities, ethnicity, race, age, ability & veterans.
  • Accessibility and Workplace Accommodations - We value the unique skills and experiences each individual brings to the Bank, and are committed to creating and maintaining an inclusive and accessible environment for everyone. Scotiabank continues to locate, remove and prevent barriers so that we can build a diverse and inclusive environment while meeting accessibility requirements.
  • Upskilling through online courses, cross-functional development opportunities, and tuition assistance.
  • Competitive Rewards program including bonus, flexible vacation, personal and sick days, and benefits that start on day one.
  • Dynamic Ecosystem - Free tea & coffee, universal washrooms, and lots of space for team collaboration.
  • Community Engagement - No matter where you choose to work from, we offer opportunities for community engagement & belonging through our various programs.

Location(s): Canada : Ontario : Toronto

Scotiabank is a leading bank in the Americas. Guided by our purpose: "for every future", we help our customers, their families and their communities achieve success through a broad range of advice, products and services, including personal and commercial banking, wealth management and private banking, corporate and investment banking, and capital markets.

At Scotiabank, we value the unique skills and experiences each individual brings to the Bank, and are committed to creating and maintaining an inclusive and accessible environment for everyone. If you require accommodation (including, but not limited to, an accessible interview site, alternate format documents, ASL Interpreter, or Assistive Technology) during the recruitment and selection process, please let our Recruitment team know. Candidates must apply directly online to be considered for this role. We thank all applicants for their interest in a career at Scotiabank; however, only those candidates who are selected for an interview will be contacted.