Talent.com
Azure Kubernetes & Site Reliability Engineer (SRE)

KLANIK • Montréal, Canada
30+ days ago
Job type
  • Full-time
Job description

KLANIK is an IT engineering consulting firm that supports its clients in their digital and technology projects.

The KLANIK group now has more than 750 employees working across 16 offices in Europe, North America, Africa, and the Middle East. They are committed, unconventional, and passionate experts, involved in strategic projects thanks to their high level of expertise in Software, DevOps, Cloud, Agile, Cybersecurity, Big Data & AI.

Alongside their day-to-day work, KLANIK employees are supported in their personal and professional development through several engaging and innovative initiatives:

KONSCIOUS: internal community committed to ecological, social, and environmental issues

KAMPUS: certified technical training institute

KORNER: technology start-up incubator

KLANIK ESPORT: professional e-sports club open to employees

Position description:

Job title: Site Reliability Engineer (SRE) - Kubernetes on Azure

The Site Reliability Engineer (SRE) specializing in Kubernetes on Azure will be responsible for ensuring the reliability, scalability, and availability of the company's Kubernetes fleet on the Azure platform. They will work closely with the other SRE teams to ensure that the organization's cloud services meet the required service level objectives (SLOs) and service level agreements (SLAs).

Main responsibilities:

Design, implement, and maintain the Kubernetes infrastructure and its cross-cutting services to ensure high availability, scalability, and performance.

Establish and maintain monitoring, alerting, and incident response procedures to ensure a rapid response to system and service issues.

Develop and maintain automation scripts and tools to streamline the deployment and management of containerized applications on Kubernetes.

Collaborate with the other SRE teams to design, implement, and maintain disaster recovery and business continuity plans.

Develop and maintain security policies and procedures to keep Kubernetes services on Azure secure.

Stay up to date on new Kubernetes on Azure features and capabilities, and recommend changes or upgrades where needed.

Provide guidance and training to other SRE team members on Azure best practices and procedures.

Develop and maintain documentation for the Kubernetes infrastructure and services on Azure.

Desired profile:

Degree in computer science, information technology, or a related field.

Minimum of 5 years of experience in site reliability engineering or a similar role, with a focus on cloud infrastructure on the Azure platform.

In-depth knowledge of Kubernetes services on Azure, particularly compute, networking, and storage.

Experience with Kubernetes automation tools such as Terraform, Helm, FluxCD, or Kustomize.

Experience with monitoring and dashboard creation (Datadog, Grafana).

Excellent problem-solving and troubleshooting skills.

Excellent communication and collaboration skills.

Microsoft Azure certifications are preferred.

