Talent.com
Site Reliability Engineer
Site Reliability EngineerTecsys Inc. • Toronto, Canada
Site Reliability Engineer

Site Reliability Engineer

Tecsys Inc. • Toronto, Canada
Il y a plus de 30 jours
Type de contrat
  • Permanent
Description de poste

Having recognized the advantages of remote work, including employee morale, productivity, reduced commuting on employee wellbeing and the environment, we are proud to be a digital-first company. The technologies and programs in which we invested have provided a fantastic foundation to this end. Our digital-first work environment, together with our conveniently located offices and collaborative workspaces, provide our team with the freedom and flexibility to work in the way that makes our employees most productive.

About us

Tecsys is a fast-growing innovator offering supply chain solutions to industry leading healthcare systems, hospitals, and pharmacy businesses to distributors, retailers, and 3PLs. We work with industry leaders to transform their supply chains through technology. If you thrive on tackling interesting challenges with continuous learning opportunities, then Tescys could be a good fit for you!

About the Role

We are looking for a Site Reliability Engineer to join our Network and Security Operations Center (NOC), a team at the heart of platform reliability for mission-critical SaaS environments. You will help

maintain, optimize, and ensure the reliability and performance

of the systems that power our cloud infrastructure across AWS and Kubernetes, with a strong focus on automation, observability, and continuous improvement. This role blends reliability engineering with incident command, giving you real ownership over uptime, performance, and innovation. You will be part of a highly skilled team that values creative problem-solving, operational excellence, and continuous improvement through automation and resilience engineering.

Your responsibilities

Collaborate with other Engineering teams to support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.

Innovate relentlessly : Identify pain points, propose creative solutions, and drive initiatives that simplify, scale, and strengthen the platform.

Maintain services once they are live by measuring and monitoring availability, latency and overall system health.

Own observability : Enhance and expand monitoring and alerting using Datadog; define SLOs / SLIs and create actionable dashboards that drive reliability outcomes.

Drive automation : Develop and improve internal tooling, IaC frameworks, and pipelines (Terraform, GitLab CI / CD) to reduce manual intervention and enable self-healing systems.

Scale systems sustainably through automation and evolve systems by pushing for changes that improve reliability and velocity.

Be on‑call.

Practice sustainable incident response and blameless postmortems. Lead post‑incident reviews (RCAs) and identify long‑term fixes that improve stability, reliability, and developer experience.

Implement monitoring, Logging, alerting, and SLA Reporting.

Create and maintain technical documentation.

Implement, maintain and mature SRE best practices.

Lead incidents : Act as Incident Commander for Incidents; coordinate cross‑team response, manage communications, and ensure rapid service restoration.

Provide support for our planning and deployment teams to enable stability, predictability, and scale in our continued growth.

Collaborate with members of the Platform Engineering team to implement and support far‑reaching strategic efforts, provide constructive feedback, and foster a collaborative environment.

Work cross‑functionally with internal teams and vendors to manage our growth around the globe, with a strong focus on maintaining the high level of performance, availability, and reliability for our users.

5+ years in Site Reliability, Cloud, or DevOps Engineering, ideally in SaaS or large‑scale production environments.

Experience designing and deploying large scale systems, multi‑vendor platforms and globally distributed infrastructure.

Proven experience managing cloud infrastructure in AWS (multi‑account, VPC, EC2, EKS) and Kubernetes at scale.

Strong hands‑on experience with IaC and automation (Terraform, Ansible, or similar).

Familiarity with CI / CD pipelines and release automation (GitLab preferred, Jenkins acceptable).

Deep understanding of monitoring and observability using Datadog (or equivalent), including metric design, log pipelines, alerting, and dashboards.

Experience with incident management, on‑call participation, escalation, and structured postmortems.

Scripting skills in Python, Bash, Java or equivalent for automation and diagnostics.

Curiosity, ownership, and a bias for action; you see a problem, you solve it, and you share the lessons learned.

Experience with Fedramp (The Federal Risk and Authorization Management Program) compliance is a strong asset.

Basic knowledge of Java‑ or .Net‑based development required.

Strong English communication skills, both written and spoken, are essential for effective correspondence with customers, business partners and colleagues beyond the province of Quebec.

Additional requirements :

Escalation on‑call rotation

Occasional travel (quarterly offsites, conferences – less than 10%)

At Tecsys, we are committed to fostering a diverse and inclusive workplace where all employees feel valued, respected, and empowered. We believe that diversity drives innovation and strengthens our ability to deliver exceptional solutions. We welcome and encourage applicants from all backgrounds, experiences, and perspectives to join our team.

Tecsys is an equal opportunity employer. Accommodation is available for applicants selected for an interview.

NB : if you are applying to this position, you must be a Canadian Citizen or a Permanent Resident of Canada,

OR , have a valid Canadian work permit.

#J-18808-Ljbffr

Créer une alerte emploi pour cette recherche

Site Reliability Engineer • Toronto, Canada

Offres similaires
Technical Consultant - SyteLine / CSI

Technical Consultant - SyteLine / CSI

PwC Canada • Greater Toronto Area, Canada
Temps plein
SyteLine / CSI hands on implementation Experience for their private sector client.Implementation of enhancements to the Advanced Planning & Scheduling module within SyteLine / CSI, for a Manufacturing ...Voir plus
Dernière mise à jour : il y a 18 heures • Offre sponsorisée • Nouvelle offre
Site Lead

Site Lead

St. Alban's Boys And Girls Club • Toronto C6A, ON, Canada
Temps plein
Humber Boulevard South (Humber Children's TCHC).BGC Weston Mount Dennis & Lawrence Heights Club serves children and youth in the Weston Mount Dennis / Lawrence Heights and surrounding communities pr...Voir plus
Dernière mise à jour : il y a 17 jours • Offre sponsorisée
Remote Senior Property Engineer — Wildfire Expert

Remote Senior Property Engineer — Wildfire Expert

Allianz Commercial • Toronto C6A, ON, Canada
Télétravail
Temps plein
A leading global insurance provider is seeking a Senior Property Engineer – Wildfire Expert to support clients in risk evaluation and management. This remote role requires approximately 30% travel f...Voir plus
Dernière mise à jour : il y a 11 jours • Offre sponsorisée
Test Development Engineer

Test Development Engineer

Actalent • Newmarket, ON, Canada
Temps plein
The ideal candidate for the Test Development Engineering position thrives at the hardware / software boundary and is motivated by solving complex problems in a fast-paced environment.They are looking...Voir plus
Dernière mise à jour : il y a 13 jours • Offre sponsorisée
Staff Site Reliability Engineer, Database

Staff Site Reliability Engineer, Database

Alpaca • Toronto, ON, Canada
Temps plein
Alpaca is a US-headquartered self-clearing broker-dealer and brokerage infrastructure for stocks, ETFs, options, crypto, fixed income, 24 / 5 trading, and more. Our recent Series C funding round broug...Voir plus
Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
Tooling Manager

Tooling Manager

AppleOne Employment Services • Newmarket, ON, Canada
Temps plein
We are seeking a seasoned leader to oversee new build projects from inception to completion.This role acts as the bridge between design, procurement, and the shop floor, ensuring that every tool is...Voir plus
Dernière mise à jour : il y a 18 heures • Offre sponsorisée • Nouvelle offre
AWS data engineer DWIDC5725169

AWS data engineer DWIDC5725169

Compunnel Inc. • Greater Toronto Area, Canada
Temps plein
Data Engineer with AWS, Glue, Lambda, SQL, Python, Redshift.Must have working knowledge in designing and implementing data pipelines on any of the cloud providers (AWS is preferred).Must be able to...Voir plus
Dernière mise à jour : il y a 18 heures • Offre sponsorisée • Nouvelle offre
Lead Systems Engineer, Launch Program

Lead Systems Engineer, Launch Program

The Wohl Group - Recruitment Made Easy! • Markham, ON, Canada
Temps plein
The Lead Systems Engineer owns cross-functional planning and execution for the launch program at a systems engineering level, ensuring work meets the company’s technical requirements, risk po...Voir plus
Dernière mise à jour : il y a 11 jours • Offre sponsorisée
Solutions Engineer, Canada

Solutions Engineer, Canada

Procore • Toronto, ON, Canada
Temps plein
You’ll partner with the Canadian Account Executive and Account Management team, working with mid to large size companies in Canada, helping to articulate Procore’s overall value proposi...Voir plus
Dernière mise à jour : il y a 3 jours • Offre sponsorisée
Site Reliability Engineer (SRE)

Site Reliability Engineer (SRE)

Accelerate Her Future® • Toronto C6A, ON, Canada
Temps plein +1
Tangerine is Canada’s leading direct bank.We offer flexible and accessible banking options, innovative products, and award-winning Client service. The reason why Tangerine employees come to work eac...Voir plus
Dernière mise à jour : il y a 5 jours • Offre sponsorisée
Site Superintendent

Site Superintendent

SSA Recruitment (CA) • Greater Toronto Area, ON, Canada
Temps plein
ICI Construction Site Superintendent.Salary : $100,000 – $130,000 (based on experience).Our client is a well-established General Contractor with a strong pipeline of.They are currently seeking...Voir plus
Dernière mise à jour : il y a 7 jours • Offre sponsorisée
Senior Site Reliability / Infrastructure Platform Engineer

Senior Site Reliability / Infrastructure Platform Engineer

Nextologies Limited • Markham, ON, Canada
Temps plein
Senior Site Reliability / Infrastructure Platform Engineer.Virtualization, distributed systems, Linux performance, and service reliability). Act as senior escalation point for service outages, platf...Voir plus
Dernière mise à jour : il y a 10 jours • Offre sponsorisée
Database Reliability Engineer IV

Database Reliability Engineer IV

PagerDuty • Toronto, ON, Canada
Temps plein
NYSE : PD) is a global leader in digital operations management.Trusted by nearly half of both the Fortune 500 and the Forbes AI 50, as well as approximately two-thirds of the Fortune 100, PagerDuty i...Voir plus
Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
Staff Software Engineer – Developer Tooling

Staff Software Engineer – Developer Tooling

icon. • Greater Toronto Area, Canada
Temps plein
Staff Software Engineer – Developer Tooling & IDE Platforms.Toronto, ON | Full-Time | On-Site.We are building a new class of desktop-first developer tooling designed for highly technical engineers ...Voir plus
Dernière mise à jour : il y a 18 heures • Offre sponsorisée • Nouvelle offre
Senior Systems Engineer

Senior Systems Engineer

Essence Coaching Group • Markham, ON, Canada
Temps plein
Lindsay, Ontario, Canada (Hybrid).CAD 165,000 – 210,000 gross / year.A senior-level Systems Engineer is sought to lead aircraft- and system-level engineering activities for next-generation elec...Voir plus
Dernière mise à jour : il y a 17 jours • Offre sponsorisée
Site Administrator

Site Administrator

FirstService Residential • Stouffville, ON, Canada
Temps partiel
FirstService Residential is owned by FirstService Corporation, a proudly Canadian company and one of Canada’s great business success stories. FirstService Residential transforms the property m...Voir plus
Dernière mise à jour : il y a 11 jours • Offre sponsorisée
DevOps Software Development Engineer

DevOps Software Development Engineer

TekWissen ® • Markham, ON, Canada
Temporaire
Position : DevOps Software Development Engineer.Job Type : Temporary Assignment.TekWissen is a global workforce management provider headquartered in Ann Arbor, Michigan that offers strategic talent s...Voir plus
Dernière mise à jour : il y a 18 heures • Offre sponsorisée • Nouvelle offre
Distribution System Construction Manager

Distribution System Construction Manager

Valard Construction • Toronto, ON, Canada
Temps plein
Our Distribution team is currently seeking.Reporting to the Vice President, Distribution Operations, these positions plan and direct all crew activities, lead the site work force, coordinate daily ...Voir plus
Dernière mise à jour : il y a 20 jours • Offre sponsorisée