Talent.com
Astra North Infoteck Inc.
SRE Observability - Kubernetes, DynatraceAstra North Infoteck Inc. • Markham, Ontario, CA
SRE Observability - Kubernetes, Dynatrace

SRE Observability - Kubernetes, Dynatrace

Astra North Infoteck Inc. • Markham, Ontario, CA
6 days ago
Job type
  • Full-time
Job description

Required Skills:

· Kubernetes

· Site Reliability Engineering (SRE)

· Dynatrace


Role Description:

· Responsible for developing and leading the company’s enterprise observability and reliability capability.

· The SRE and Observability Lead will collaborate across multiple teams to ensure comprehensive monitoring of all environmental components.

· This role will designate Dynatrace as the system of record for platform health and apply SRE practices to improve availability, performance, and incident outcomes across applications, infrastructure, and integrations.

· Own enterprise observability using Dynatrace across cloud, on-prem, ERP, WMS, eCommerce, APIs, and integrations.

· Design service topology, dashboards, alerts, and health indicators that reflect business impact.

· Apply SRE principles (SLIs, SLOs, error budgets where appropriate) to reduce incidents and improve resilience.

· Accelerate incident detection and root-cause analysis lead post-incident reviews focused on systemic fixes.

· Identify reliability, performance, and capacity risks before they impact the business.

· Define observability and SRE standards and enable teams to use them effectively.

· Must have 5 years in infrastructure, platform, operations, or reliability engineering.

· Must demonstrate hands-on experience implementing and operating Dynatrace.

· Must have a strong understanding of distributed systems, cloud hybrid environments, and integrations.

· Must have practical experience with SRE or reliability engineering concepts.

· Must be comfortable operating in high-impact incident and production environments



Create a job alert for this search

SRE Observability - Kubernetes, Dynatrace • Markham, Ontario, CA

Similar jobs

DEVOPS – Ingénierie de la fiabilité des sites (SRE)

Asteknewmarket, on, ca
Full-time

Organisation technologique d’envergure, nous accompagnons des équipes travaillant sur des plateformes critiques à forte valeur ajoutée.L’innovation, la collaboration et l’excellence opérationnelle ... Show more

 • Promoted

SRE Role with Expertise in Dynatrace

Pacer GroupToronto, ON, CA
Full-time

Advance your career as an SRE specializing in Dynatrace and observability solutions.This position focuses on leveraging Python and AWS in a financial services setting.We require a knowledgeable SRE... Show more

 • Promoted

Senior SRE Leader: AI-Powered Reliability & Observability

Tubi, Inc.Toronto, ON, CA
Full-time

A leading streaming service in Toronto seeks a Senior Manager for Site Reliability Engineering to lead a team, ensuring the availability and performance of services.You will define technical strate... Show more

 • Promoted

SRE-DevSecOps Engineer

High Tech GenesisToronto, Ontario, Canada
Full-time

High Tech Genesis Allowed Staffing Countries: Canada, Costa Rica, Mexico or Brazil, (Remote) Term: Contract High Tech Genesis is seeking a 3-month contractor who can hit the ground running to suppo... Show more

 • Promoted

Senior SRE Leader: Scale Reliability & Observability

RootlyToronto, ON, CA
Full-time

A fast-growing tech startup in Toronto is seeking an experienced Site Reliability Engineer.The role involves enhancing service performance, owning CI/CD pipelines, and building automation tools.Ide... Show more

 • Promoted

DEVOPS – Ingénierie de la fiabilité des sites (SRE) - richmond hill

Astekrichmond hill, on, ca
Full-time

Organisation technologique d’envergure, nous accompagnons des équipes travaillant sur des plateformes critiques à forte valeur ajoutée.L’innovation, la collaboration et l’excellence opérationnelle ... Show more

 • Promoted

Dynatrace SRE — End-to-End Observability (Hybrid)

DexianToronto, ON, CA
Full-time

A leading staffing and IT solutions provider in Toronto is seeking a Site Reliability Engineer with strong Dynatrace expertise.This role focuses on ensuring the reliability, performance, and observ... Show more

 • Promoted

Senior SRE – Observability & Real-Time Telemetry

Framework VenturesToronto, ON, CA
Full-time

A prominent blockchain company is seeking a Senior Site Reliability Engineer (SRE) in Canada, British Columbia.This role involves building and orchestrating a modern observability platform, ensurin... Show more

 • Promoted

Senior SRE – Kubernetes Platform & CRE Scaling

Chainlink LabsToronto, ON, CA
Full-time

A global blockchain technology company is seeking an experienced Infrastructure Engineer to design and build foundational infrastructure for its decentralized oracle networks.The ideal candidate wi... Show more

 • Promoted

Senior SRE

ViafouraToronto, ON, CA
Full-time

Senior Site Reliability Engineer.Viafoura is a leading audience engagement platform that powers real-time conversations and community experiences for digital publishers and brands worldwide.We're s... Show more

 • Promoted

AI-Driven SRE: Observability & Cloud Reliability Engineer

Themesoft Inc.Toronto, ON, CA
Full-time

A technology solutions firm located in Toronto is seeking candidates with hands-on experience in observability tools and strong Python scripting skills.The ideal applicant will have knowledge of di... Show more

 • Promoted

Cloud SRE – Automation, Observability & Resilience

iManageToronto, ON, CA
Full-time

A leading SaaS company in Toronto is seeking a Site Reliability Engineer to join their rapidly growing cloud platform team.The role involves creating cloud-native solutions and reducing operational... Show more

 • Promoted

Senior SRE: Automation, Observability & Batch Performance

KyndrylToronto, Ontario, Canada
Full-time

A global technology services provider is seeking a Site Reliability Engineer in Toronto to enhance the reliability and efficiency of critical batch workloads.This mid-senior level contract role emp... Show more

 • Promoted

SRE Developer I — Flexible, Remote/Hybrid, Observability Focus

Vena SolutionsToronto, ON, CA
Remote
Full-time

A tech-driven solutions provider is seeking a Site Reliability Developer in Toronto.This flexible position allows for in-office, hybrid, or remote working.Responsibilities include supporting IT pro... Show more

 • Promoted

Remote Senior SRE: Scale AWS & Kubernetes

Third-Party Job PostsToronto, ON, CA
Remote
Full-time

A leading hospitality technology firm is seeking a Sr.Site Reliability Engineer to ensure the reliability and performance of its platform.This position involves architecting scalable AWS solutions ... Show more

 • Promoted

SRE Observability Engineer

Tata Consultancy ServicesToronto, ON, CA
Full-time

Tata Consultancy Services (TCS) is an equal opportunity employer, and embraces diversity in race, nationality, ethnicity, gender, age, physical ability, neurodiversity, and sexual orientation, to c... Show more

 • Promoted

Senior SRE: Global SaaS Platform & Kubernetes

Kong Inc.Toronto, ON, CA
Full-time

A leading developer of API technologies is seeking a Senior Site Reliability Engineer to join the global Platform SRE team in Toronto.The role involves building, operating, and scaling a multi-regi... Show more

 • Promoted

SRE, Streaming: Alpaca Careers

AlpacaToronto, ON, CA
Full-time

Elevate your engineering career as a Site Reliability Engineer at Alpaca.Focus on enhancing system reliability and scalability in a fully remote setup.In the SRE role at Alpaca, you’ll be responsib... Show more