Site Reliability Engineer (Intermediate)

Bitcomplete • Toronto, Canada

No longer accepting applications

11 days ago

Job type

Full-time

Job description

Join us as an Intermediate Site Reliability Engineer helping build reliable, scalable cloud infrastructure. You’ll work alongside senior engineers to own projects, deepen platform skills, and support teams operating large distributed systems.

You’ll focus on one of three streams: Kubernetes, Observability, or Developer Experience.

What you'll be doing

Improve infrastructure reliability, scale, and security across cloud-native systems.

Deliver features and upgrades through infrastructure-as-code.

Collaborate with product teams on debugging, migrations, and operational readiness.

Support incident response, capacity planning, and performance improvements.

Automate repeatable workflows to reduce operational load across engineering.

Stream Focus Areas

Kubernetes

You’ll help operate and evolve shared Kubernetes platforms used by many product teams.

Typical work:

Maintain and upgrade clusters, networking, ArgoCD, and IaC patterns.

Build or extend reusable infra modules (XRDs, Helm, Terraform) to standardize onboarding.

Partner with teams to plan and execute migrations safely.

Handle inbound maintenance, patching, and legacy stack stability work.

Observability Platform

You’ll help deliver a modern telemetry platform powering metrics, logs, and traces for engineering teams.

Typical work:

Build and operate OTEL-based telemetry pipelines across environments.

Support migrations to VictoriaMetrics and maintain data accuracy during transitions.

Improve SLOs, alerting strategies, and reliability of observability systems.

Contribute to IaC automation for observability deployments.

Ideal tools: OTEL, Prometheus, VictoriaMetrics, vmalert, Grafana, Terraform, GitHub Actions.

Developer Experience / CI/CD

You’ll help maintain and strengthen the CI/CD ecosystem powering builds, tests, and deployments.

Typical work:

Maintain pipelines, update dependencies, and improve the reliability of GitHub Actions.

Migrate workloads away from legacy tooling to a new Tailscale/OIDC-based platform.

Triage support requests, follow runbooks, and assist product teams during migrations.

Reduce operational load by standardizing patterns and supporting migrations.

Ideal tools: GitHub Actions, Docker, Tailscale, Terraform, and container registry best practices.

Your Background

3-5 years of experience as an SRE. At least 1 year as a software engineer.

Keen to deepen your software engineering skills and play a bigger role in how our systems are built and operated.

Comfortable writing and debugging code in Go, Python, or a similar language.

Curious about platform reliability and excited to learn deeper system internals over time.

Communicate clearly with engineers across teams and time zones.

Focus on automation, reproducibility, and practical reliability over “heroics.”

Bring some experience in cloud infrastructure and want to grow into owning larger systems.

About Us

CAD $117,610 - $158,240 annually.

Our ranges include base salary and a conservative bonus target.

Interested?

We're excited about working with you, so get in touch! Submit your application here.

We believe people from diverse backgrounds, with different identities and experiences, make our company better. No matter your background, we'd love to hear from you! Alignment with our values is just as important as experience. Also, please let us know if there are ways we can make our interview process better for you - we're always happy to listen and accommodate where possible.

