Talent.com
Senior Site Reliability Engineer (SRE)
Senior Site Reliability Engineer (SRE)Intelcom • Canada, Quebec, Montreal
Senior Site Reliability Engineer (SRE)

Senior Site Reliability Engineer (SRE)

Intelcom • Canada, Quebec, Montreal
Il y a plus de 30 jours
Type de contrat
  • Temps plein
Description de poste

Intelcom | Dragonfly

With more than 100 sorting stations and operations across three continents, Intelcom | Dragonfly is Canada’s leader in last-mile logistics. Our vision is clear : to deliver fast, accurate, and reliable service powered by cutting-edge technology.

A Strategic Role at the Heart of Logistics

Responsibilities

Incident Management : Detect and respond to issues, ensuring rapid recovery to minimize downtime. Current on-call contributors need better coordination and structure in investigations. This role involves off-hours events, but these are cyclical with quieter periods. Define and implement an escalation process. Ensure the communication and adhesion of all the stakeholders across the business to the process. Document incident reports and conduct post-mortems to promote a continuous improvement approach.

Collaboration : Work closely with development and operations teams to ensure smooth deployment and operation of applications. Provide primary operational support and engineering for large-scale distributed software applications. Collaborate with development teams to improve services through rigorous testing and release procedures. Participate in system design consulting, platform management, and capacity planning. This requires a diligent follow-up and close collaboration with all teams

Influence : Create sustainable systems and services through automation and enhancements. Promote a culture of innovation and continuous improvement within the SRE team and the broader organization. Coordinate the SRE team in establishing and executing operational policies that promote agility and scalability. Coordinate and mentor SRE team members, fostering professional growth and development. Work closely with development and operations teams to ensure smooth deployment and operation of applications.

Automation : Automate repetitive tasks to improve efficiency and reduce human errors. Improve the reliability, quality, and time-to-market of our software solutions. Measure and optimize system performance anticipating business needs.

Monitoring and Alerting : Implement and enhance monitoring systems (e.g., Datadog) to track the health and performance of applications and infrastructure. There are existing systems, but additional ones are needed. Monitor and maintain the production environment, ensuring high availability and system health. Gather and process metrics from operating systems and applications to assist in performance tuning and fault finding. Develop an health monitoring dashboard to enable the visibility of our various stakeholders on our production environment.

Disaster Recovery : Prepare and implement disaster recovery plans to manage unexpected outages.

Performance Optimization : Continuously improve system performance and scalability.

Capacity Planning : Ensure the infrastructure can handle current and future demands.

Chaos Engineering : Intentionally introduce failures to test system resilience and improve robustness.

Qualifications

Bachelor's degree in software engineering, computer science or equivalent.

Minimum of 7 years experience in cloud management, development and / or SRE responsibilities.

Experience in Agile methodology and technical project execution.

Knowledgeable in DevOps concepts, AWS, Azure, GCP, observability tools (Datadog, cloudflare), Terraform, PagerDuty and how to integrate all these things together.

Other Skills :

Strong initiative and resilience, with a demonstrated ability to explore new ideas and innovative approaches to solving complex problems.

Excellent interpersonal and communication skills in both French and English.

Be able and comfortable evolving in fast-moving environment.

Schedule : Primarily daytime hours, but on-call availability is required for the initial months to observe and refine existing processes.

Why Join Us?

At Intelcom | Dragonfly , you’ll thrive in a flexible and stimulating environment, surrounded by passionate talent. You’ll also enjoy a wide range of benefits :

On-site gym with a personal trainer

Employer-provided lunch of your choice

Comprehensive group insurance

Group RRSP plan

Wellness days

Partial reimbursement for public transportation

Employee Assistance Program

…and much more.

This position has been opened to address a genuine organizational need within the company.

At Intelcom | Dragonfly , we move forward guided by strong values : collaboration, innovation, excellence, and responsibility.

We embrace diversity, ensure equity, and foster a true sense of belonging.

Accommodation measures are available for individuals with disabilities throughout our recruitment process, in compliance with the law. Please let us know if you have any specific needs.

Créer une alerte emploi pour cette recherche

Senior Site Reliability Engineer SRE • Canada, Quebec, Montreal

Offres similaires
Senior SRE Engineer — Build Reliable, Scalable Systems

Senior SRE Engineer — Build Reliable, Scalable Systems

PowerToFly • Montreal
Temps plein
A global financial firm in Montreal is seeking a Systems Reliability Engineer to enhance service availability and reliability for technology products. This role involves collaborating within a fast-...Voir plus
Dernière mise à jour : il y a 15 jours • Offre sponsorisée
Chef d'Équipe Lean - Amélioration Continue & Santé / Sécurité

Chef d'Équipe Lean - Amélioration Continue & Santé / Sécurité

Prattwhitney • Longueuil H4H, QC, Canada
Temps plein
Une entreprise manufacturière renommée cherche à recruter un gestionnaire pour superviser les employés dans un environnement syndiqué à Longueuil, Québec. Ce rôle exige des compétences en communicat...Voir plus
Dernière mise à jour : il y a 18 jours • Offre sponsorisée
Implementation Engineer - Client Support & Systems Integration

Implementation Engineer - Client Support & Systems Integration

Services de Gestion Quantum Ltée • Montréal / Saint Laurent, Quebec, Canada
Permanent
Position : Implementation Engineer - Client Support & IntegrationLocation : Saint-Laurent and Hybrid Salary : $70K to $80K based on experience Benefits : Full benefits package, employer RRSP contri...Voir plus
Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
Site Reliability Engineer

Site Reliability Engineer

ApTask • Montreal
Temps plein
Direct message the job poster from ApTask.Looking for an intermediate between 2 to 5 years' experience.The Application Infrastructure (Al) department is seeking a Site Reliability Engineer (SRE) to...Voir plus
Dernière mise à jour : il y a 15 jours • Offre sponsorisée
Azure Kuberbetes & Site Reliability Engineer (SRE)

Azure Kuberbetes & Site Reliability Engineer (SRE)

Klanik • Montreal
Temps plein
KLANIK est une société de conseil en Ingénierie IT qui accompagne ses clients dans leurs projets digitaux et technologiques. Le groupe KLANIK compte désormais plus de 750 talents, évoluant dans 16 a...Voir plus
Dernière mise à jour : il y a 15 jours • Offre sponsorisée
Senior SRE : Cloud Reliability & Scale (Remote)

Senior SRE : Cloud Reliability & Scale (Remote)

Veeva Systems • Ahuntsic North, ca
Télétravail
Temps plein
A leading life sciences technology company is looking for a Senior Software Engineer - SRE to join its Vault Platform team in Ottawa. In this role, you will ensure the scalability and reliability of...Voir plus
Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
Specialist Site Reliability Engineer

Specialist Site Reliability Engineer

Global Talent Alliance, Canada • Montreal
Temps plein
About the job Specialist Site Reliability Engineer.The role of the Specialist Site Reliability Engineer (SRE) is to execute RAM analysis and engineering in support of the I&T solutions.The overall ...Voir plus
Dernière mise à jour : il y a 15 jours • Offre sponsorisée
Site Reliability Engineer

Site Reliability Engineer

Vertex Elite LLC • Ahuntsic North, ca
Temps plein
Duration : Contract Key Skills : Monitoring / Observability tools - Dynatrace, ELK etc.Platform / cloud Observability - OpenShift, Prometheus / Azure Cloud etc. Key Responsibilities : Collaborate with v...Voir plus
Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
Site Reliability Engineer

Site Reliability Engineer

Noramtec Consultants Inc. • Montreal
Temps plein
A major global financial services institution is partnering with us to hire a.Site Reliability Engineer (SRE).Montreal-based Application Infrastructure team. This pivotal role will focus on.ServiceN...Voir plus
Dernière mise à jour : il y a 15 jours • Offre sponsorisée
Senior Reliability Engineer - Low-Latency Trading

Senior Reliability Engineer - Low-Latency Trading

Tower Research Capital • Montreal
Temps plein
A leading quantitative trading firm in Montreal is seeking a Technical Support Engineer who will provide essential support for trading applications while interacting closely with traders and develo...Voir plus
Dernière mise à jour : il y a 15 jours • Offre sponsorisée
Agente, agent de réadaptation - Banque de candidatures

Agente, agent de réadaptation - Banque de candidatures

Centre de services scolaire des Samares • Saint-Lin-Laurentides, QC, Canada
Temps plein
Banque de candidatures pour tout le territoire desservi par le Centre de services scolaire des Samares.Chaque élève a droit à un parcours scolaire. Viens nous aider à créer un environnement où tous ...Voir plus
Dernière mise à jour : il y a 2 heures • Offre sponsorisée • Nouvelle offre
Supervisor, Building Officials

Supervisor, Building Officials

City of Whitehorse • saint-esprit, QC, ca
Temps plein
Scope and Responsibilities Reporting to the Manager, Land & Building Services, the Supervisor, Building Officials is the top building official in and for the City and provides...Voir plus
Dernière mise à jour : il y a 8 jours • Offre sponsorisée
Agent à la vérification de projets

Agent à la vérification de projets

Lanauco ltée • Saint-Alexis-de-Montcalm
Temps plein
Lanauco est actuellement à la recherche d’un.Programme d’assurances collectives avec contribution de l’employeur;.Programme d’aide aux employés et leur famille tout à fait gratuit et confidentiel;....Voir plus
Dernière mise à jour : il y a 9 jours • Offre sponsorisée
Site Reliability Engineer

Site Reliability Engineer

High Tech Genesis • Montreal
Temps plein
WE'RE HIRING! At HTG, you’ll push boundaries with the latest tech and collaborate with a team that loves what they do.Be part of a design services company that is among the companies that lead the ...Voir plus
Dernière mise à jour : il y a 15 jours • Offre sponsorisée
Site Reliability Engineer

Site Reliability Engineer

Tecsys Inc. • Montreal
Temps plein +1
Having recognized the advantages of remote work, including employee morale, productivity, reduced commuting on employee wellbeing and the environment, we are proud to be a digital-first company.Our...Voir plus
Dernière mise à jour : il y a 5 jours • Offre sponsorisée
Site Reliability Engineering developer

Site Reliability Engineering developer

National Bank • Montreal
Temps plein
A career as a site reliability engineering developer in the corporate sector Cloud Support team at National Bank means serving as a reliability and automation specialist for AWS data platforms and ...Voir plus
Dernière mise à jour : il y a 3 jours • Offre sponsorisée
Lead Site Reliability Engineering (SRE)

Lead Site Reliability Engineering (SRE)

freelance.ca • Montreal, Canada
Temps plein
Lead Site Reliability Engineering (SRE).Vous serez responsable de bâtir et de maintenir des pipelines CI / CD partagés, d’implanter des pratiques exemplaires en matière de résilience et de stabilité,...Voir plus
Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
Senior Accident Reconstruction Engineer

Senior Accident Reconstruction Engineer

Confidential Jobs • mercier, QC, ca
Temps plein
About the Company Global consulting firm is seeking an experienced Accident Reconstruction Engineer.The ideal c...Voir plus
Dernière mise à jour : il y a 8 jours • Offre sponsorisée