Talent.com
Principal Site Reliability Engineering Specialist (SRE)
Principal Site Reliability Engineering Specialist (SRE)CGI • Vancouver, Canada
Principal Site Reliability Engineering Specialist (SRE)

Principal Site Reliability Engineering Specialist (SRE)

CGI • Vancouver, Canada
Il y a 16 jours
Type de contrat
  • Temps plein
Description de poste

Position Description :

Location : Edmonton

Open to other locations within proximity to a CGI Office

Hybrid work model

We are hiring a Senior Site Reliability Engineer (SRE) with a strong foundation in building and operating reliable, scalable, and resilient cloud platforms. You bring a reliability and performance engineering mindset to everything you do—balancing operational stability with modernization and automation. In this role, you will apply core SRE practices—including SLIs / SLOs, observability, incident management, and operational automation—while temporarily supporting a regional support strategy engagement focused on assessing and strengthening large-scale operational environments. You will work closely with platform, operations, and architecture teams to evaluate current-state practices, identify reliability and support gaps, and contribute to the definition of future-state operating models and implementation roadmaps. Beyond this engagement, the role is designed for ongoing, hands-on SRE delivery, where you will lead and implement monitoring, reliability engineering, automation, and tooling across cloud and hybrid environments. You will collaborate with cross-functional teams to design, build, and continuously improve platform reliability, engineering standards, and operational excellence practices for mission-critical services. This position places you in a client-facing, high-impact environment, where your technical depth, operational judgment, and ability to translate reliability principles into practical outcomes will directly influence service stability, modernization efforts, and future cloud initiatives. If you are a proven SRE who thrives in complex environments and values both hands-on engineering and operational leadership, this role offers the opportunity to make a meaningful and lasting impact.

Your future duties and responsibilities :

Who are You?

You are a senior Site Reliability Engineer who thrives on solving complex reliability and operational challenges at scale. You are curious, collaborative, and continuously focused on improving how platforms, infrastructure, and services are operated and supported. Your strength lies in applying sound engineering judgment to real-world operational problems, balancing reliability, performance, and maintainability. You are equally comfortable working hands-on with tools and systems and stepping back to assess how operational practices, support models, and workflows impact service reliability. You can engage confidently in technical discussions with engineers while also communicating clearly with operational leaders and stakeholders to explain risks, trade-offs, and improvement opportunities.

With a mindset grounded in continuous improvement and learning, you champion modernization, automation, and pragmatic reliability practices. You are trusted for your ability to identify root causes rather than symptoms, to raise concerns early, and to translate reliability principles into practical, actionable outcomes. Your peers value your technical depth and calm leadership in complex environments, and teams rely on you to elevate operational maturity and execution quality. At CGI, we recognize strong SRE practitioners and provide the environment and support for them to grow, contribute, and make a meaningful impact across engagements.

Responsibilities

  • Develop, operate, and evolve monitoring, logging, and alerting capabilities across cloud and hybrid environments, while temporarily contributing SRE expertise to assess and rationalize existing operational monitoring practices as part of a regional support strategy initiative.
  • Define, implement, and continuously improve SLIs, SLOs, and SLAs for platform and service reliability, applying these principles during the engagement to evaluate current-state service outcomes and inform future-state reliability targets.
  • Lead and participate in incident response, problem investigation, and root cause analysis, leveraging hands-on SRE experience to identify systemic reliability issues and recurring operational failure patterns observed across regional support operations.
  • Design and automate reliability and operational processes, including integration with CI / CD pipelines and operational workflows, while contributing insights into where automation and tooling can reduce manual effort and improve support consistency across regions.
  • Collaborate closely with DevOps, platform engineering, architecture, and application teams, providing SRE leadership during this engagement and transitioning seamlessly to tool- and platform-heavy delivery roles on future projects.
  • Analyze and document current operational workflows, support models, and escalation paths, translating frontline operational insights into actionable reliability and service improvement recommendations.
  • Contribute to the definition of future-state operating models and implementation roadmaps by applying SRE and operational excellence principles to improve reliability, supportability, and scalability.
  • Provide regular status updates and risk assessments, highlighting operational risks, dependencies, and reliability impacts to support informed decision-making.

Required qualifications to be successful in this role :

  • 5+ years of experience in Site Reliability Engineering, platform engineering, or infrastructure operations, with demonstrated ability to apply reliability principles across both delivery and operational contexts.
  • Strong proficiency with observability and monitoring platforms such as Grafana, Prometheus, ELK, New Relic, or equivalent, with the ability to assess, design, and improve monitoring strategies in complex environments.
  • Hands-on experience operating cloud platforms (Azure, AWS, and / or GCP), including production support, reliability engineering, and operational troubleshooting.
  • Strong automation and scripting skills using tools such as Python, Bash, Ansible, or equivalent, with a mindset focused on reducing toil and improving operational efficiency.
  • Excellent communication skills in English (French considered an asset), with the ability to clearly articulate technical concepts to both technical and non-technical stakeholders.
  • Proven track record of improving system reliability, availability, and operational stability, including measurable reductions in incident frequency or impact.
  • Experience analyzing and documenting operational workflows, support models, and escalation paths within IT or platform operations environments.
  • Ability to facilitate technical and operational workshops with engineers, operations teams, and service stakeholders to validate findings and align on improvements.
  • Working knowledge of ITSM / ITIL practices (Incident, Problem, Change), particularly as they relate to reliability, supportability, and operational maturity.
  • Experience working in regulated, enterprise, or public-sector environments where documentation quality, security classification, and auditability are required.
  • CGI is providing a reasonable estimate of the pay range for this role. The determination of this range includes factors such as skill set level, geographic market, experience and training, and licenses and certifications. Compensation decisions depend on the facts and circumstances of each case. A reasonable estimate of the current range is $90,–$,. This role is a future opportunity.

    #LI-AB19

    Use of the term ‘engineering’ in this job posting refers to the technical sense related to Information Technology (IT) and does not imply that the individual practices engineering or possesses the requisite license as prescribed by the applicable provincial or territorial engineering regulator. We are seeking individuals with expertise in IT engineering-related functions, but licensure from an engineering regulator is not a prerequisite for this position. Engineering is a regulated profession in Canada which is restricted in terms of use of titles and designation.

    Skills :

  • Finance&Ops Apps Solution Arch
  • Créer une alerte emploi pour cette recherche

    Principal Site Reliability Engineering Specialist SRE • Vancouver, Canada

    Offres similaires
    Rope Access Technician (L3 IRATA / SPRAT Certified)

    Rope Access Technician (L3 IRATA / SPRAT Certified)

    Cleantech Service Group • Richmond, BC, Canada
    Temps plein +1
    Join Our Team as a Rope Access Technician (L3 IRATA / SPRAT Certified)!.Are you passionate about safety and excellence in high-rise building maintenance? Have you ever wondered why your current emplo...Voir plus
    Dernière mise à jour : il y a 13 heures • Offre sponsorisée • Nouvelle offre
    Third Engineer

    Third Engineer

    Bridgemans Services • Garibaldi Highlands, BC, Canada
    Temps plein
    On MV Isabelle X / Saga-Company Vessels alongside Squamish, BC.Bridgemans Crew Management Ltd.Business Address : 2512 Yukon St, Vancouver, BC V5Y 0H2. Rotational schedule for three years with the poss...Voir plus
    Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Relay • Vancouver, BC, Canada
    Temps plein
    Relay is a digital banking platform that gives self-made business owners the tools and know-how to be great with money—bringing clarity, confidence, and control to every dollar earned, so the...Voir plus
    Dernière mise à jour : il y a 8 jours • Offre sponsorisée
    Engineering Manager - High-Impact S3 Initiative

    Engineering Manager - High-Impact S3 Initiative

    Amazon Web Services (AWS) • Vancouver
    Temps plein
    A leading cloud services provider in Vancouver is seeking a Software Development Manager to lead engineering teams focused on improving internal communication protocols for one of their foundationa...Voir plus
    Dernière mise à jour : il y a 9 heures • Offre sponsorisée • Nouvelle offre
    Site Superintendent

    Site Superintendent

    TalentSphere • Vancouver, BC, Canada
    Temps plein
    Key Responsibilities as the Site Superintendent : .Work with the Project Manager, develop and maintain master project schedule with input from subcontractors. Assess commissioning and O&M requirem...Voir plus
    Dernière mise à jour : il y a 4 jours • Offre sponsorisée
    Senior Site Reliability Engineer - Distributed Systems & Platforms

    Senior Site Reliability Engineer - Distributed Systems & Platforms

    Apple • Vancouver
    Temps plein
    A leading tech company in Metro Vancouver is seeking Site Reliability Engineers to develop processes and tools for managing distributed systems. The role involves building scalable services and coll...Voir plus
    Dernière mise à jour : il y a 17 jours • Offre sponsorisée
    Staff Site Reliability Engineer (Staff SRE)

    Staff Site Reliability Engineer (Staff SRE)

    Walt Disney Animation Studios • Vancouver
    Temps plein
    Staff Site Reliability Engineer (Staff SRE).Walt Disney Animation Studios’ world‑class filmmakers, artists, and technical collaborators create the magic of animation. Bring your unique talents, pass...Voir plus
    Dernière mise à jour : il y a 17 jours • Offre sponsorisée
    SRE Specialist

    SRE Specialist

    Fortinet, Inc. • Burnaby
    Temps plein
    We are the SSP (Support Systems and Processes).Fortinet and passionate about building, improving, and maintaining various information systems that serve our employees worldwide, as well as consumer...Voir plus
    Dernière mise à jour : il y a 17 jours • Offre sponsorisée
    Team Lead, Systems Engineering

    Team Lead, Systems Engineering

    OSI Maritime Systems Ltd. • Burnaby
    Temps plein
    Posted Thursday, October 16, 2025 at 10 : 00 a.At OSI Maritime Systems, we pride ourselves on delivering world-class navigation and bridge systems. With decades of experience serving military customer...Voir plus
    Dernière mise à jour : il y a 17 jours • Offre sponsorisée
    Structural Field Review Specialist — On-Site, Employee-Owned

    Structural Field Review Specialist — On-Site, Employee-Owned

    Read Jones Christoffersen Ltd. • Vancouver
    Temps plein
    A prominent engineering firm is seeking a Construction Field Review Representative in Metro Vancouver.This full-time role involves reviewing on-site work for varied projects, engaging with clients ...Voir plus
    Dernière mise à jour : il y a 17 jours • Offre sponsorisée
    Project Sustainability Lead

    Project Sustainability Lead

    Targeted Talent • Richmond, BC, Canada
    Temps plein
    A cutting-edge sustainability consultancy in Vancouver, guiding architects, developers, and public agencies toward high-performance, low-carbon buildings. We blend deep technical skill with real-wor...Voir plus
    Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
    Site Superintendent (Tenant Improvement & Capital Projects)

    Site Superintendent (Tenant Improvement & Capital Projects)

    Edge Construction • Vancouver, BC, Canada
    Temps plein
    Salary : $110,000 - $130,000 per year + Bonuses.At EDGE Construction, we build more than projects.We build trust, relationships, and environments that inspire. Our team is passionate about craftsmans...Voir plus
    Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
    Site Reliability Engineer

    Site Reliability Engineer

    BNB Chain • Vancouver
    Temps plein
    Founded in 2021, LayerZero’s vision is to create a community of cross-chain developers, building dApps that are no longer constrained by individual blockchain capabilities.With LayerZero's simple, ...Voir plus
    Dernière mise à jour : il y a 1 jour • Offre sponsorisée
    TECH Specialist

    TECH Specialist

    London Drugs Limited • Squamish, BC, Canada
    Temps plein
    Now hiring for TECH Specialist.Are you passionate about learning? Do your friends and family members always ask you for tech advice? Are you up-to-date with the latest Computer, Audio / Video and Pho...Voir plus
    Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
    Reliability and Integrity Engineer

    Reliability and Integrity Engineer

    Pacific Energy Canada • Squamish, BC, Canada
    Temps plein
    Project is located approximately 7 km west-southwest of Squamish, British Columbia.It involves the construction and operation of a liquefied natural gas (LNG) export facility on the previous Woodfi...Voir plus
    Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
    Site Reliability Engineer

    Site Reliability Engineer

    ScalePad • Vancouver
    Temps plein
    ScalePad is a market‑leading SaaS company headquartered in Vancouver, Toronto, Montreal and Phoenix, AZ.With a global employee reach, we serve over 12,000 MSPs worldwide, helping them increase clie...Voir plus
    Dernière mise à jour : il y a 17 jours • Offre sponsorisée
    Senior Application Reliability Engineer (SRE)

    Senior Application Reliability Engineer (SRE)

    Global Relay • Vancouver
    Temps plein
    A technology company specializing in data communications, located in Vancouver, is seeking an Application Support Engineer. This role focuses on ensuring the reliability and availability of services...Voir plus
    Dernière mise à jour : il y a 17 jours • Offre sponsorisée
    LNG Reliability & Integrity Engineer – Onsite

    LNG Reliability & Integrity Engineer – Onsite

    Woodfibre LNG • Squamish
    Temps plein
    A leading energy firm is seeking a Reliability & Integrity Engineer to support operations at the LNG plant in Squamish, BC. This role involves ensuring technical integrity, managing risks, and leadi...Voir plus
    Dernière mise à jour : il y a 17 jours • Offre sponsorisée