Talent.com
Senior Site Reliability Engineer
0000050007 Royal Bank of Canada • TORONTO, Ontario, Canada
30+ days ago
Job type
  • Full-time
Job description

What is the opportunity?

This is an exciting opportunity to join a high-impact team responsible for ensuring the reliability, scalability, and performance of critical ATM production systems. As a Senior Service Reliability Engineer, you will play a pivotal role in shaping the future of our ATM services by driving innovation, implementing cutting-edge technologies, and ensuring seamless operations in a fast-paced, mission-critical environment. You will have the chance to work on complex challenges, such as optimizing system performance, automating processes, and enhancing system resilience, all while collaborating with a team of talented engineers who are passionate about delivering exceptional customer experiences. This role offers the opportunity to make a tangible impact on millions of users who rely on our ATM services daily.

What will you do?

As a Senior Service Reliability Engineer in the ATM team, you will be responsible for ensuring the reliability, performance, and scalability of our production environment. Your day-to-day responsibilities will include:
  • Analyze operational pain points and define automation requirements for development teams.
  • Deploy, maintain, and optimize PowerShell-based automation and self-healing tools.
  • Validate and test new automation solutions in production, collaborating with QE for robustness.
  • Monitor tool effectiveness and provide data-driven feedback to development teams.
  • Own SCCM deployment and configuration management for ~4,000 ATMs, ensuring reliability and scalability.
  • Drive integration between ATM platforms and bank infrastructure, partnering with development teams.
  • Enforce security protocols and regulatory compliance (e.g., banking standards) across systems.
  • Lead advanced troubleshooting using PowerShell tools for critical production issues.
  • Participate in 24/7 on-call rotation, prioritizing rapid incident resolution to minimize downtime.
  • Collaborate with NCR engineering teams on complex technical escalations.
  • Facilitate daily standups with development, QE, and leadership to align technical priorities.
  • Coordinate maintenance windows and major deployments with cross-functional stakeholders.
  • Maintain up-to-date technical documentation and operational runbooks for operations teams.
What do you need to succeed?

Must have:
  • Bachelor’s degree in Computer Science, Information Technology, Engineering, or a related technical field
  • Minimum 5-7 years of experience in ATM technology operations or enterprise system administration
  • Strong knowledge of PowerShell scripting concepts and automation frameworks for requirements analysis
  • Expert-level experience with Microsoft SCCM administration, deployment, and troubleshooting
  • Deep technical knowledge of NCR ATM hardware and software platforms
  • Experience with Windows Server administration and enterprise system management
  • Knowledge of network protocols (TCP/IP, VPN) and network troubleshooting techniques
  • Understanding of SQL Server databases and basic querying capabilities
  • Experience with monitoring tools (such as Nagios, SCOM, or SolarWinds) and alerting systems
  • Strong analytical and problem-solving skills for complex technical issues
  • Excellent communication skills for cross-functional team collaboration
  • Experience with change management processes and deployment coordination
  • Certified Kubernetes Administrator (CKA): Demonstrates knowledge and skills in deploying, managing, and maintaining Kubernetes clusters
Nice-to-have: While not required, the following skills and experiences would give candidates an edge and help them ramp up faster in the role of Senior Service Reliability Engineer in the ATM team:
  • Previous experience with RBC technology infrastructure or Canadian banking systems
  • Experience with PowerShell Desired State Configuration (DSC) and advanced automation
  • Knowledge of machine learning concepts for predictive maintenance and monitoring
  • Understanding of artificial intelligence applications in operations management
  • Experience with geographic information systems (GIS) for ATM location management
  • Experience with vendor management and technical relationship coordination
  • Puppet Certified Professional: Demonstrates expertise in using Puppet for automation, including manifest files, modules, and classes.
  • HashiCorp Certified: Terraform Associate: Shows knowledge and skills in using Terraform for infrastructure as code, including configuration files, modules, and state management.
  • Red Hat Certified Engineer (RHCE): Validates expertise in using Red Hat Enterprise Linux, including system administration, networking, and security.
  • ITIL (Information Technology Infrastructure Library) Foundation Certificate: Demonstrates understanding of IT service management best practices, including incident management, problem management, and change management.
  • Familiarity with container networking: Understanding of container networking concepts, including Docker networking, Kubernetes networking, or Calico.
  • Experience with logging and monitoring: Knowledge of logging and monitoring tools, including ELK Stack, Splunk, or New Relic.
  • Familiarity with continuous integration and delivery: Understanding of continuous integration and delivery concepts, including Jenkins, GitLab CI/CD, or CircleCI.
  • Knowledge of cybersecurity: Familiarity with cybersecurity principles, including threat modeling, vulnerability assessment, and penetration testing.
  • Experience with IoT: Knowledge of Internet of Things (IoT) concepts, including device management, data processing, and analytics.
  • Experience with data science: Knowledge of data science concepts, including data wrangling, visualization, and statistical analysis.
  • Familiarity with DevSecOps: Understanding of DevSecOps principles, including security integration, compliance, and risk management.
  • Familiarity with disaster recovery: Understanding of disaster recovery concepts, including backup and restore, failover, and disaster recovery planning.
What's in it for you?

We thrive on the challenge to be our best, progressive thinking to keep growing, and working together to deliver trusted advice to help our clients thrive and communities prosper. We care about each other, reaching our potential, making a difference to our communities, and achieving success that is mutual.
  • A comprehensive Total Rewards Program including bonuses and flexible benefits, competitive compensation, commissions, and stock where applicable.
  • Leaders who support your development through coaching and managing opportunities.
  • Ability to make a difference and lasting impact
  • Work in a dynamic, collaborative, progressive, and high-performing team
  • A world-class training program in financial services
  • Flexible work/life balance options.
  • Opportunities to do challenging work.
  • Opportunities to take on progressively greater accountabilities.
  • Opportunities to build close relationships with clients.
Job Skills

Agile Methodology, Application Infrastructure, Group Problem Solving, IT Automation, IT Monitoring, Operations Support, Production Support, Software Development Life Cycle (SDLC), Software Engineering, Software Product Technical Knowledge, System Applications, Systems Software

Additional Job Details

Address:

RBC WATERPARK PLACE, 88 QUEENS QUAY W:TORONTO

City:

Toronto

Country:

Canada

Work hours/week:

37.5

Employment Type:

Full time

Platform:

TECHNOLOGY AND OPERATIONS

Job Type:

Regular

Pay Type:

Salaried

Posted Date:

2025-12-01

Application Deadline:

2026-04-03

Note: Applications will be accepted until 11:59 PM on the day prior to the application deadline date above.

Our Employment Opportunities

At RBC, we are guided by living shared values of Client First, Integrity, Collaboration, Respect and Excellence and winning together as One RBC. We believe an inclusive workplace that has diverse perspectives is core to our continued growth as one of the largest and most successful banks in the world. Maintaining a workplace where our employees feel supported to perform at their best, effectively collaborate, drive innovation, and grow professionally helps to bring our Purpose to life and create value for our clients and communities. RBC strives to deliver this through policies and programs intended to foster a workplace based on respect, belonging and opportunity for all.

Join our Talent Community

Stay in-the-know about great career opportunities at RBC. Sign up and get customized info on our latest jobs, career tips and Recruitment events that matter to you.

Expand your limits and create a new future together at RBC. Find out how we use our passion and drive to enhance the well-being of our clients and communities at jobs.rbc.com.

RBC is presently inviting candidates to apply for this existing vacancy. Applying to this posting allows you to express your interest in this current career opportunity at RBC. Qualified applicants may be contacted to review their resume in more detail.