Talent.com
Site Reliability Engineer

HCLTech • Toronto, Canada
Posted 25 days ago
Contract type
  • Full-time
Job description

Join our SRE squad supporting ~1000 AWS-hosted services for BMO. You’ll own operational reliability, rapid triage, and proactive maintenance across production and non-prod, partnering closely with Cloud Engineering, SOC, and application teams.

Key Responsibilities

Deliver 24×7 monitoring, incident response, and problem management; drive MTTA / MTTR reduction and SLO / SLI adherence.

Perform preventive health checks; analyze ticket trends to implement continual service improvements and automation to reduce toil.

Execute blameless postmortems and high-quality RCA; maintain SOPs / runbooks and reliability dashboards.

Configure / tune observability (Dynatrace, CloudWatch, ELK); enable self-healing workflows and workload optimizations.

Support change / service requests within agreed SLAs; collaborate during transitions and onboard new AWS services.

Core Skills & Tools

AWS: Lambda, ECS / Fargate / EC2, API Gateway, SNS / SQS, Kinesis, RDS; IAM / KMS foundations.

Observability & ITSM: Dynatrace, CloudWatch, ELK; ServiceNow for incidents / changes; SLI / SLO dashboards.

Reliability Practices: Error budgets, capacity / performance benchmarking, automation / runbook execution, FinOps awareness.

Qualifications

5+ years SRE / DevOps or L2 operations for cloud-native stacks; strong AWS production experience.

Proven incident / change / problem management in 24×7 environments; adept at RCA and postmortems.

Hands‑on with observability tooling and operational automation; excellent collaboration and documentation skills.

Shift Coverage & Locations

Follow-the-sun model with overlapping handoffs across Canada / India to ensure continuous support. Success is measured by uptime, MTTR / MTTD, change failure rate, error‑budget consumption, SLO adherence, RCA quality, and CSI throughput.

