Site Reliability Engineer

TEEMA

Vanouver, BC, CA

$100K-$175K a year (estimated)

Full-time

We are sorry. The job offer you are looking for is no longer available.

MUST LIVE IN CANADA NEAR AN AIRPORT

Monitoring and logging services are a must 2 or 3 of them that are listed and Orchestration

Close to a city with the ability to traveling up to 4 X a year to Vancouver.

1st - technical interview

Team Size - 2 team members already onboarded plus manager

Work is very meaningful - province wide for public safety - big project roll out.

Pensioned position - municipality pension plan is better. Stable and room to grow.

MUST HAVE - 5+ years permanent residence or Citizenship (cant have lived out of Canada for the last 5 years)

Our client is seeking a highly skilled and motivated Site Reliability Engineer (SRE) to join their dynamic and innovative team.

As an SRE, you will play a critical role in maintaining and enhancing the reliability, availability, and performance of their systems.

Your expertise in both software engineering and systems administration will be key in building and automating scalable infrastructure solutions.

In this role, you will be responsible for improving the reliability and performance of production applications and infrastructure with a focus of automation, system design and improvements to system resilience.

We are seeking a technical expert who understands the criticality of our systems and who is able to manage risk and support the improvement of more resilient and reliable technological capabilities.

What you will be doing :

Collaborate with cross-functional teams to design, deploy, and maintain reliable and scalable services

Implement best practices for monitoring, logging, and alerting to ensure rapid detection and resolution of issues

Troubleshoot and resolve incidents related to the infrastructure, applications, and network to minimize downtime and improve system reliability

Participate in capacity planning and performance optimization efforts to handle increasing user demands and traffic growth

Develop and maintain automation tools for configuration management, deployment, and continuous integration / continuous deployment (CI / CD) pipelines

Conduct thorough post-incident reviews and work towards preventing similar incidents in the future

Perform regular security assessments and ensure compliance with industry standards and regulations

Stay up-to-date with the latest technologies and industry trends to propose innovative solutions and improvements

What you must have :

Completion of a degree or diploma program in computer science or a related discipline plus 5 years of related experience, or an equivalent combination of training and experience

ITIL Foundation v3 or later accreditation preferred

Sound experience (5+ years) of running services in a large scale enterprise environment

Experience in one of the leading cloud platforms such as AWS, Azure or Google Cloud

Experience with distributed monitoring and logging solutions (such as Prometheus, Thanos, Splunk, Elasticsearch, Grafana, Dynatrace, New Relic, Honeycomb)

Experience with containers and container orchestration (such as docker, podman, kubernetes)

Experience with DevOps platform (such Gitlab, Github, Azure DevOps, Teamcity, Octopus)

Knowledge of application performance monitoring (such as Dynatrace, New Relic, Appdynamics)

Knowledge of Scaling, Capacity Planning and Disaster Recovery

Knowledge of Chaos Engineering

Ability to design, author, and release code in any language (Go, Python, Ruby or Java would be a plus)

1 day ago

Related jobs

Site Reliability Engineer

TEEMA

Vancouver, British Columbia

Full-time

We are looking for a Site Reliability Engineer in. As a Site Reliability Engineer for TEEMA you will be in charge of..

Promoted

Site Reliability Engineer

Altis Technology

Greater Vancouver Metropolitan Area, Canada

Full-time

From small businesses to enterprises. We are looking for experienced Site Reliability Engineers in.. As a critical member of our Engineering team, the ideal candidate will combine engineering experience..

Promoted

Site Reliability Engineer

Jotform

Vancouver, British Columbia

Full-time

From small businesses to enterprises. We are looking for experienced DevOps Engineers in Vancouver for.. ABOUT THE ROLE Jotform is looking for DevOps Engineers who have an interest in system administration and..

Promoted

Site Reliability Engineer (4443)

New Value Solutions

Richmond, British Columbia

Full-time

Who Can Apply Candidates must be legally authorized to work in Canada Job Description Insight Global is looking for a Site Reliability Engineer to join one of the largest healthcare technology..

Site Reliability Engineer

Insight Global

Richmond, British Columbia

Part-time

Required Skills and Experience. A minimum of 8 years of experience. Hold a Professional Engineer or P.. This is a permanent, full time role that requires on site work. The successful candidate must have a..

Site Reliability Engineer

Stafflink

Vancouver, British Columbia

Full-time

Job Description On behalf of our public sector client, we are looking for a Senior Cloud Systems Engineer for a one year contract, with possible extension. Principally remote, with a few days..

Staff Engineer Site Reliability

Albertsons Companies, Inc.

Burnaby, British Columbia

Full-time

Requisition Number. 183499 Position Title. Software Engineer III External Description. EA's Digital.. Description The CPE team is looking for a talented Site Reliability Engineer to join our team. We are..

Site Reliability Engineer III

Electronic Arts Inc

Burnaby, British Columbia

Full-time

Requisition Number. 182915 Position Title. Software Engineer II External Description. At our core.. In your role as a Software Engineer, you will work with a team to build APIs and web applications to..