Talent.com
Bitcomplete
Intermediate Site Reliability Engineer (Canada)Bitcomplete • Toronto, Canada
No longer accepting applications
Intermediate Site Reliability Engineer (Canada)

Intermediate Site Reliability Engineer (Canada)

Bitcomplete • Toronto, Canada
12 days ago
Job type
  • Full-time
Job description
Join us as an Intermediate Site Reliability Engineer helping build reliable, scalable cloudinfrastructure. You’ll work alongside senior engineers to own projects, deepen platform skills, and support teams operating large distributed systems.

You’ll focus on one of three streams:

Kubernetes, Observability, or Developer Experience .

What you'll be doing

Improve infrastructure reliability, scale, and security across cloud-native systems.

Deliver features and upgrades through infrastructure-as-code.

Collaborate with product teams on debugging, migrations, and operational readiness.

Support incident response, capacity planning, and performance improvements.

Automate repeatable workflows to reduce operational load across engineering.

Stream Focus Areas You’ll help operate and evolve shared Kubernetes platforms used by many product teams.

Typical work:

Maintain and upgrade clusters, networking, ArgoCD, and IaC patterns.

Build or extend reusable infra modules (XRDs, Helm, Terraform) to standardize onboarding.

Partner with teams to plan and execute migrations safely

Handle inbound maintenance, patching, and legacy stack stability work.

Observability Platform

You’ll help deliver a modern telemetry platform powering metrics, logs, and traces for engineering teams.

Typical work:

Build and operate OTEL-based telemetry pipelines across environments.

Support migrations to VictoriaMetrics and maintain data accuracy during transitions.

Improve SLOs, alerting strategies, and reliability of observability systems.

Contribute to IaC automation for observability deployments.

Ideal tools: OTEL, Prometheus, VictoriaMetrics, VM Alert, Grafana, Terraform, GitHub Actions.

Developer Experience / CI/CD

You’ll help maintain and strengthen the CI/CD ecosystem powering builds, tests, and deployments.

Typical work:

Maintain pipelines, update dependencies, and improve the reliability of GitHub Actions.

Migrate workloads away from legacy tooling to a new Tailscale / OIDC-based platform.

Triage support requests, follow runbooks, and assist product teams during migrations.

Reduce operational load by standardizing patterns and supporting migrations.

Ideal tools: GitHub Actions, Docker, Tailscale, Terraform, and container registry best practices.

Your Background

3 - 5 years of experience as an SRE. Minimum 1+ years as a software engineer.

Keen to deepen your software engineering skills and play a bigger role in how our systems are built and operated.

Comfortable writing and debugging code in Go, Python, or a similar language.

Curious about platform reliability, excited to learn deeper system internals over time.

Communicate clearly with engineers across teams and time zones.

Focus on automation, reproducibility, and practical reliability over “heroics.”

Bring some experience in cloud infrastructure and want to grow into owning larger systems.

About Us CAD $117,610 - $158,240 annually. Our ranges include base salary and conservative bonus target.

Interested? We're excited about working with you, so get in touch! Submit your application here .

We believe people from diverse backgrounds, with different identities and experiences, make our company better. No matter your background, we'd love to hear from you! Alignment with our values is just as important as experience. Also, please let us know if there are ways we can make our interview process better for you - we're always happy to listen and accommodate where possible.

#J-18808-Ljbffr
Create a job alert for this search

Intermediate Site Reliability Engineer (Canada) • Toronto, Canada

Similar jobs

Site Reliability Engineer - toronto

HCLTechtoronto, on, ca
Full-time

Hands-on experience with at least one major public cloud platform (Azure, AWS, or GCP).Strong understanding of cloud infrastructure and application runtime components, including compute, storage, n... Show more

 • Promoted

Site Reliability Engineer

KyndrylToronto, ON, CA
Full-time +1

Join to apply for the Site Reliability Engineer role at Kyndryl.Direct message the job poster from Kyndryl.Recruitment & Strategic Staffing @Kyndryl | Partnering with IT Consultants in Financial Se... Show more

 • Promoted

Impactful Site Reliability Engineer Fostering Reliability and Performance

RootlyToronto, ON, CA
Full-time

Join as an impactful Site Reliability Engineer, shaping the technical future and enhancing system reliability.Tackle rewarding challenges in a collaborative startup atmosphere.As a key player, you’... Show more

 • Promoted

Senior Site Reliability Engineer

SimCorpToronto, ON, CA
Full-time

Senior Site Reliability Engineer page is loaded## Senior Site Reliability Engineerlocations: Torontotime type: Full timeposted on: Posted Todayjob requisition id: R-211168Job Advertisement*... Show more

 • Promoted

Site Reliability Engineer, Observability

PricelineToronto, ON, CA
Full-time

This role is eligible for our hybrid work model: Two days in-office.Site Reliability Engineer, Observability.Our Technology team is the backbone of our company: constantly creating, testing, learni... Show more

 • Promoted

Sr. Site Reliability Engineer I

Axon EnterpriseToronto, ON, CA
Full-time

At Axon, we’re on a mission to Protect Life.We’re explorers, pursuing society’s most critical safety and justice issues with our ecosystem of devices and cloud software.Like our products, we work b... Show more

 • Promoted

Senior Site Reliability Engineer

Apptoza Inc.Toronto, ON, CA
Full-time

Job Title: Senior Platform Engineer / Senior SRE Developer – Observability (Dynatrace).Work Style: Hybrid (2 days per week in-person at Toronto office preferred).Skills: Digital : Python~Digital : ... Show more

 • Promoted

Senior Site Reliability Engineer II - Remote, Scale-Focused

InstacartToronto, ON, CA
Remote
Full-time

A leading grocery delivery service is seeking a Senior Site Reliability Engineer II in Calgary, Alberta.You will ensure optimal performance and reliability of the platform while establishing incide... Show more

 • Promoted

Senior Site Reliability Engineer

RootlyToronto, ON, CA
Full-time

At Rootly, we are on a mission to be the go‑to way companies respond when things go wrong, helping every organization be more reliable.We do this by building an industry‑leading incident management... Show more

 • Promoted

Toronto Site Reliability Engineer Needed

PheedLoop Inc.Toronto, ON, CA
Full-time

Be a part of PheedLoop as a Site Reliability Engineer in Toronto, ON, where you can leverage your skills in system automation and reliability.Work at the forefront of event technology.In this full-... Show more

 • Promoted • New!

Site Reliability Engineer

HCLTechtoronto, on, ca
Full-time

Hands-on experience with at least one major public cloud platform (Azure, AWS, or GCP).Strong understanding of cloud infrastructure and application runtime components, including compute, storage, n... Show more

 • Promoted

Senior Site Reliability Engineer

CaptivateIQToronto, ON, CA
Full-time

The Site Reliability Engineering team in CaptivateIQ operates across the engineering organization, supporting our development teams by providing them with the tools and processes they need to get t... Show more

 • Promoted

Senior Site Reliability Engineer Role

ITRidersToronto, ON, CA
Full-time

Elevate your career as a Senior Site Reliability Engineer at our company.Craft observability-as-code solutions using Terraform while optimizing system reliability across diverse environments.We see... Show more

 • Promoted

Site Reliability Engineer

Insight GlobalToronto, ON, CA
Full-time

Insight Global is looking for a Site Reliability Engineer/Implementation Lead to support a CCaaS transformation program.The role will focus on implementing monitoring solutions across a distributed... Show more

 • Promoted

Senior Site Reliability Engineer Focused on Kubernetes Infrastructure

Chainlink LabsToronto, ON, CA
Full-time

Elevate decentralized architecture as a Senior Site Reliability Engineer.Spearhead Kubernetes-based infrastructure for decentralized applications, driving scalability, security, and operational eff... Show more

 • Promoted

Site Reliability Engineer

LongbridgeToronto, ON, CA
Full-time

Longbridge is a fast-growing online brokerage platform on a mission to make investing smarter, simpler, and more accessible for everyone.As part of our global expansion, we’re looking for a.Site Re... Show more

 • Promoted • New!

Intermediate Site Reliability Engineer (Canada)

BitcompleteToronto, Ontario, Canada
Full-time

Join us as an Intermediate Site Reliability Engineer helping build reliable, scalable cloudinfrastructure.You’ll work alongside senior engineers to own projects, deepen platform skills, and support... Show more

 • Promoted

Site Reliability Engineer - Canada Wide - Remote

NewtonToronto, ON, CA
Remote
Full-time

Say hello to Newton! We're changing how Canadians trade crypto.Our goal? To make financial freedom something everyone can achieve.We give our customers the tools and knowledge they need to navigate... Show more

 • Promoted

Senior Site Reliability Engineer- Remote

ClickHouseToronto, ON, CA
Remote
Full-time

Senior Site Reliability Engineer- Remote.Recognized on the 2025 Forbes Cloud 100 list, ClickHouse is one of the most innovative and fast-growing private cloud companies.With more than 3,000 custome... Show more

 • Promoted

Site Reliability Engineer

Momentum Financial Services GroupToronto
Full-time

At Momentum Financial Services Group, we help people move forward by reimagining how money works for those who need it most.With more than 40 years of experience, we’re the team behind Money Mart—C... Show more