Sr. DevOps Engineer (Disaster Recovery)

Healthcare of Ontario Pension Plan, Toronto, Ontario, Canada

More than 30 days ago

Salary

CA$150,000.00 – CA$160,000.00 per year

Contract type

Full-time

Job description

Why you’ll love working here:

high-performance, people-focused culture

our commitment that equity, diversity, and inclusion are fundamental to our work environment and business success, which helps employees feel valued and empowered to be their authentic selves

membership in HOOPP’s world-class defined benefit pension plan, which can serve as an important part of your retirement security

competitive, 100% company-paid extended health and dental benefits for permanent employees, including coverage supporting our team's diversity and mental health (gender affirmation, fertility and drug treatment, psychological support benefits of $2,500 per year, and newly extended maternity / parental leave top-up of 26 weeks)

optional post-retirement health and dental benefits subsidized at 50%

yoga classes, meditation workshops, nutritional consultations, and wellness seminars

access to an annual wellness reimbursement program for health and wellness-related expenses for permanent and temporary employees

the opportunity to make a difference and help take care of those who care for us, by providing a financially secure retirement for Ontario healthcare workers

Job Summary

HOOPP’s IT division pushes beyond corporate technological norms to bring our members, our Fund and our enterprise the most innovative, efficient, and secure solutions. We support one of the best pension funds in the world, and we need people who are passionate and up for a challenge.

Our IT Corporate Solutions Group (CSG) is looking for an experienced individual who can fill a permanent, full-time Sr. DevOps Engineer, Disaster Recovery role. This highly collaborative role is responsible for Backups and Disaster Recovery, supporting CSG’s app dev teams in delivering Business Continuity and Resiliency to CSG’s business stakeholders.

As a senior technical resource within CSG’s Agile Corporate team (IT4C), your responsibilities include backups; Disaster Recovery and Cyber Recovery; support for teams to ensure they implement and adhere to HOOPP’s disaster recovery policies; and a senior role in other IT4C DevOps activities beyond backup and recovery.

Why you will love working with CSG:

We are a valued part of the IT division that is foundational to HOOPP’s daily operations.
We ensure the organization leverages the most effective cloud-based technology and software to allow cross-team collaboration while working in the office or remotely.
We are responsible for supporting HOOPP’s divisions with strategic, configurable, and innovative solutions to solve business issues and enhance operational efficiencies.
We are a hybrid workforce, with our teams working remotely and onsite. We offer a flexible hybrid work model that embraces remote work in Ontario for eligible roles.
Our efficiency, focus and hard work are complemented by a social and enthusiastic attitude.
We are a diverse group of people and that is one of our greatest strengths. We are animal lovers, golfers, mountain bikers, hikers, amateur chefs and foodies, parents and board game enthusiasts. We are collegial and enjoy each other’s company. We push each other to be our best selves and we can’t wait to expand our work family.

What you will do

Actively participate in Agile Scrum practices such as daily standups, backlog refinement, planning and sprint retrospectives.

Create a safe, supportive and participatory environment that produces ongoing mutual respect

Play an active role in delivering backup and DR solutions for new and existing features, supporting and collaborating with our app dev, platform and infrastructure teams.

Monitor operations of daily backups

Assess, adapt and evolve operational strategies for DR, BC and application resilience.

Partner with DR / BC leads in the IT4Enterprise and Governance & Risk teams.

Educate peers & stakeholders who will use these solutions and mitigate potential issues.

Monitor and support backup infrastructure, applications, services and network.

Write wiki articles and participate in issue and team retrospectives. Ask powerful questions, create awareness and guide individuals and groups in exploring options and deciding what to do.

Learn our technical infrastructure through operational support activities.

Develop contingency plans in case of infrastructure failure or service interruption.

Evaluate, recommend, and apply new concepts and technologies.

Continuously improve corporate DevOps practices and drive automation.

Formulate and update solution standards and policies

Work with internal teams and vendors

Provide and participate in team training

Participate in the off-hours on-call rotation. Our off-hours support volume is very light, and we put in the effort to keep it that way.

Work with existing Terraform code, and write and maintain infrastructure as code.

Use Azure DevOps for CI / CD pipelines to automate, configure and deploy resources in Azure environments.

Who you are:

Bachelor’s degree or equivalent diploma program in Business, Computer Science, Information Management or related.

7+ years’ experience in Operations in progressively more senior roles

5+ years’ experience in setting up general application backup and recovery and operations

Knowledge of Disaster Recovery technologies such as Commvault, MS 365 backups, Azure Backup, Iron Mountain (or comparable technologies).

Technical writing and diagramming skills

Knowledge of service and hosting solutions in a private / public cloud using IaaS, PaaS and SaaS platforms and their integrations.

Knowledge of Enterprise and Cloud Technology, specifically Microsoft Azure backup services.

Knowledge of the various techniques for meeting Recovery Time Objectives (RTO) and Recovery Point Objectives (RPO) in the cloud, based on application criticality.

Knowledge of configuration management and automation tools such as Ansible, Terraform or Puppet for provisioning and recovery of application / cloud workloads.

General understanding of cloud networking principles, vulnerability management controls, and identity services (Entra ID preferred)

Strong interpersonal and communication skills with an ability to take end-to-end ownership.

innovative, motivated and a quick thinker

driven to be a trusted and valued contributor / partner

collaborative and a strong team player, adept at building relationships

able to thrive under pressure and pivot easily to adapt to change, based on business needs

passionate about “moving the needle,” being a change agent and an influencer of growth

satisfied in seeing your team’s ideas and dreams become reality to support the business’s objectives and mission of delivering the pension promise to our members.

Experience with distributed storage technologies and backup solutions. Knowledge of disaster recovery planning and processes.

Enterprise Application Recovery: Experience with enterprise application recovery, understanding of high availability architecture, failover and redundancy practices.

Hold relevant industry certifications, including but not limited to: Azure / AWS Certified, DRII CBCP, BCI CBCI

Knowledge of database consistency & duplication. Cloud-based SQL Azure or AWS RDS SQL would be considered an asset

Experience with databases and SQL (MS SQL, Cosmos DB and related technologies) would be considered an asset