Talent.com

Reliability Jobs in Vancouver, BC

Reliability • vancouver bc

Last updated: 4 days ago
Principal Site Reliability Engineering Specialist (SRE)

CGI • Vancouver, Canada
Full-time
Open to other locations within proximity to a CGI Office. We are hiring a Senior Site Reliability Engineer (SRE) with a strong foundation in building and operating reliable, scalable, and resilient ...
Last updated: 16 days ago
Jacob looking for a babysitter or nanny in Vancouver

Sitly • Vancouver, CA
Part-time
We're excited to join this platform and are looking for a caring, responsible, and engaging babysitter who enjoys working with kids. Our family values kindness, good communication, and reliability, a...
Last updated: 15 days ago
Senior Site Reliability Engineer

Relay • Vancouver, BC, Canada
Full-time
Relay is a digital banking platform that gives self-made business owners the tools and know-how to be great with money, bringing clarity, confidence, and control to every dollar earned, so the...
Last updated: 8 days ago
Mid-Senior Petrochemical Professionals

Hire Resolve.com • Vancouver, BC, Canada
Full-time
Hire Resolve is assisting petrochemical organizations in hiring experienced petrochemical professionals. This is a multi-role opportunity spanning several functions within the sector, including plan...
Last updated: 30+ days ago
Data Engineer In Test

Two Circles • Vancouver, British Columbia, Canada
Full-time
We are a Sports & Entertainment Marketing business. We grow audiences and revenues. We do that by knowing fans best. We work with clients to help them understand & influence what their fans ar...
Last updated: 16 days ago
Staff Site Reliability Engineer (Staff SRE)

Walt Disney Animation Studios • Vancouver, Canada
Full-time
Walt Disney Animation Studios’ world-class filmmakers, artists, and technical collaborators create the magic of animation. Bring your unique talents, passion and ideas to our team and prepare to pla...
Last updated: 30+ days ago
Product Reliability Engineer

Motorola Solutions • Vancouver, British Columbia, Canada
Full-time
At Motorola Solutions we believe that everything starts with our people. We're a global close-knit community united by the relentless pursuit to help keep people safer everywhere. Our critical communi...
Last updated: 29 days ago
Site Reliability Engineer

66degrees • Vancouver, British Columbia, Canada
Full-time
AI transformation partner that guides enterprises from complex business challenges to clear quantifiable outcomes. Our company is the culmination of several successful firms each a leader in its own...
Last updated: 9 days ago
Site Reliability Engineer

Boeing • Canada, Richmond, CAN
Full-time
Defence & Government Services team. This position will focus on supporting the Boeing Global Services (BGS) business organization. This new SRE role will bridge the gap between traditional software e...
Last updated: 4 days ago
Lead Site Reliability Engineer

Royal Bank of Canada • Vancouver, Canada
Full-time
City National Bank (CNB), an RBC company, is seeking a Lead Site Reliability Engineer, who will be responsible for supporting CNB digital and corporate applications along with the implementation of...
Last updated: 8 days ago
Senior Site Reliability Engineer (SRE) – CloudVision as a Service (CVaaS)

Arista Networks • Vancouver, BC, CA
Full-time
We’re looking for Site Reliability Engineers to join Arista’s growing CloudVision-as-a-Service (CVaaS) global SRE team. SREs at Arista combine strong software engineering background, systems arc...
Last updated: 30+ days ago
Senior Site Reliability Engineer

Targeted Talent • Burnaby, BC, Canada
Permanent
We are looking for an experienced Senior Site Reliability Engineer. Our client is a global enterprise company with a product that you've likely used. Experience with coding / software development, ...
Last updated: 30+ days ago
Principal Software Architect

EviSmart • Vancouver, Metro Vancouver Regional District, Canada
Full-time
EviSmart™ is a global leader in AI-powered dental workflow automation and CAD design outsourcing. Trusted in 26+ countries, our mission is to make dental care smarter, faster, and better, powered by ...
Last updated: 30+ days ago
General Supervisor, Equipment Reliability

Seaspan • North Vancouver, BC, Canada
Full-time +1
Reporting to the Manager, Operations Facilities, the General Supervisor, Equipment Reliability directly oversees the consistent performance, maintenance, and optimization of welding, cutting, and l...
Last updated: 30+ days ago
Cloud Operation Engineer

E-Solutions • Richmond, BC
Full-time
Role: Senior Site Reliability Engineer. Location: Richmond, BC, VX G, Canada (Hybrid). Senior Site Reliability Engineering Job Requirements. The Sr Cloud Engineer / Sr Site Reliability Engineer is a m...
Last updated: 30+ days ago
Reliability and Integrity Engineer

Pacific Energy Canada • Vancouver, BC, CA
Full-time
Project is located approximately 7 km west-southwest of Squamish, British Columbia. It involves the construction and operation of a liquefied natural gas (LNG) export facility on the previous Woodfi...
Last updated: 30+ days ago
AI Implementation Engineer (Azure AI, Agents & Automation)

Tribe Property Technologies • Vancouver, BC, Canada
Full-time +1
AI Implementation Engineer (Azure AI, Agents & Automation). Location: Vancouver, BC (Hybrid). Tribe Property Technologies is modernizing one of Canada’s most traditional industries ...
Last updated: 23 days ago
Intermediate / Senior DevOps, Site Reliability (Linux)

Global Relay • Vancouver, BC, Canada
Full-time
For over 25 years, Global Relay has set the standard in enterprise information archiving with industry-leading cloud archiving, surveillance, eDiscovery, and analytics solutions. We securely capture...
Last updated: 30+ days ago
Principal Site Reliability Engineering Specialist (SRE)

CGI • Vancouver, Canada
16 days ago
Job type
  • Full-time
Job description

Position Description :

Location : Edmonton

Open to other locations within proximity to a CGI Office

Hybrid work model

We are hiring a Senior Site Reliability Engineer (SRE) with a strong foundation in building and operating reliable, scalable, and resilient cloud platforms. You bring a reliability and performance engineering mindset to everything you do, balancing operational stability with modernization and automation. In this role, you will apply core SRE practices (including SLIs / SLOs, observability, incident management, and operational automation) while temporarily supporting a regional support strategy engagement focused on assessing and strengthening large-scale operational environments. You will work closely with platform, operations, and architecture teams to evaluate current-state practices, identify reliability and support gaps, and contribute to the definition of future-state operating models and implementation roadmaps.

Beyond this engagement, the role is designed for ongoing, hands-on SRE delivery, where you will lead and implement monitoring, reliability engineering, automation, and tooling across cloud and hybrid environments. You will collaborate with cross-functional teams to design, build, and continuously improve platform reliability, engineering standards, and operational excellence practices for mission-critical services.

This position places you in a client-facing, high-impact environment, where your technical depth, operational judgment, and ability to translate reliability principles into practical outcomes will directly influence service stability, modernization efforts, and future cloud initiatives. If you are a proven SRE who thrives in complex environments and values both hands-on engineering and operational leadership, this role offers the opportunity to make a meaningful and lasting impact.

Your future duties and responsibilities :

Who are You?

You are a senior Site Reliability Engineer who thrives on solving complex reliability and operational challenges at scale. You are curious, collaborative, and continuously focused on improving how platforms, infrastructure, and services are operated and supported. Your strength lies in applying sound engineering judgment to real-world operational problems, balancing reliability, performance, and maintainability. You are equally comfortable working hands-on with tools and systems and stepping back to assess how operational practices, support models, and workflows impact service reliability. You can engage confidently in technical discussions with engineers while also communicating clearly with operational leaders and stakeholders to explain risks, trade-offs, and improvement opportunities.

With a mindset grounded in continuous improvement and learning, you champion modernization, automation, and pragmatic reliability practices. You are trusted for your ability to identify root causes rather than symptoms, to raise concerns early, and to translate reliability principles into practical, actionable outcomes. Your peers value your technical depth and calm leadership in complex environments, and teams rely on you to elevate operational maturity and execution quality. At CGI, we recognize strong SRE practitioners and provide the environment and support for them to grow, contribute, and make a meaningful impact across engagements.

Responsibilities

  • Develop, operate, and evolve monitoring, logging, and alerting capabilities across cloud and hybrid environments, while temporarily contributing SRE expertise to assess and rationalize existing operational monitoring practices as part of a regional support strategy initiative.
  • Define, implement, and continuously improve SLIs, SLOs, and SLAs for platform and service reliability, applying these principles during the engagement to evaluate current-state service outcomes and inform future-state reliability targets.
  • Lead and participate in incident response, problem investigation, and root cause analysis, leveraging hands-on SRE experience to identify systemic reliability issues and recurring operational failure patterns observed across regional support operations.
  • Design and automate reliability and operational processes, including integration with CI / CD pipelines and operational workflows, while contributing insights into where automation and tooling can reduce manual effort and improve support consistency across regions.
  • Collaborate closely with DevOps, platform engineering, architecture, and application teams, providing SRE leadership during this engagement and transitioning seamlessly to tool- and platform-heavy delivery roles on future projects.
  • Analyze and document current operational workflows, support models, and escalation paths, translating frontline operational insights into actionable reliability and service improvement recommendations.
  • Contribute to the definition of future-state operating models and implementation roadmaps by applying SRE and operational excellence principles to improve reliability, supportability, and scalability.
  • Provide regular status updates and risk assessments, highlighting operational risks, dependencies, and reliability impacts to support informed decision-making.

Required qualifications to be successful in this role :

  • 5+ years of experience in Site Reliability Engineering, platform engineering, or infrastructure operations, with demonstrated ability to apply reliability principles across both delivery and operational contexts.
  • Strong proficiency with observability and monitoring platforms such as Grafana, Prometheus, ELK, New Relic, or equivalent, with the ability to assess, design, and improve monitoring strategies in complex environments.
  • Hands-on experience operating cloud platforms (Azure, AWS, and / or GCP), including production support, reliability engineering, and operational troubleshooting.
  • Strong automation and scripting skills using tools such as Python, Bash, Ansible, or equivalent, with a mindset focused on reducing toil and improving operational efficiency.
  • Excellent communication skills in English (French considered an asset), with the ability to clearly articulate technical concepts to both technical and non-technical stakeholders.
  • Proven track record of improving system reliability, availability, and operational stability, including measurable reductions in incident frequency or impact.
  • Experience analyzing and documenting operational workflows, support models, and escalation paths within IT or platform operations environments.
  • Ability to facilitate technical and operational workshops with engineers, operations teams, and service stakeholders to validate findings and align on improvements.
  • Working knowledge of ITSM / ITIL practices (Incident, Problem, Change), particularly as they relate to reliability, supportability, and operational maturity.
  • Experience working in regulated, enterprise, or public-sector environments where documentation quality, security classification, and auditability are required.

CGI is providing a reasonable estimate of the pay range for this role. The determination of this range includes factors such as skill set level, geographic market, experience and training, and licenses and certifications. Compensation decisions depend on the facts and circumstances of each case. A reasonable estimate of the current range is $90,–$,. This role is a future opportunity.

Use of the term ‘engineering’ in this job posting refers to the technical sense related to Information Technology (IT) and does not imply that the individual practices engineering or possesses the requisite license as prescribed by the applicable provincial or territorial engineering regulator. We are seeking individuals with expertise in IT engineering-related functions, but licensure from an engineering regulator is not a prerequisite for this position. Engineering is a regulated profession in Canada which is restricted in terms of use of titles and designation.

Skills :

  • Finance&Ops Apps Solution Arch