Talent.com
Senior Site Reliability Engineer (SRE)
Intelcom Express Inc. • Montreal, Montreal (administrative region), CA
30+ days ago
Job type
  • Full-time
Job description

Locations: Canada, Quebec, Montreal
Time type: Full time
Posted on: Today
Job requisition ID: JR109652

  • Ride the next mile with us!
Responsibilities
  • Incident Management: Detect and respond to issues, ensuring rapid recovery to minimize downtime. Current on-call contributors need better coordination and structure in investigations. This role involves off-hours events, but these are cyclical, with quieter periods in between. Define and implement an escalation process. Communicate the process to all stakeholders across the business and ensure they adhere to it. Document incident reports and conduct post-mortems to promote continuous improvement.
  • Collaboration: Work closely with development and operations teams to ensure smooth deployment and operation of applications. Provide primary operational support and engineering for large-scale distributed software applications. Collaborate with development teams to improve services through rigorous testing and release procedures. Participate in system design consulting, platform management, and capacity planning. This requires diligent follow-up and close collaboration with all teams.
  • Influence: Create sustainable systems and services through automation and enhancements. Promote a culture of innovation and continuous improvement within the SRE team and the broader organization. Coordinate the SRE team in establishing and executing operational policies that promote agility and scalability. Coordinate and mentor SRE team members, fostering professional growth and development. Work closely with development and operations teams to ensure smooth deployment and operation of applications.
  • Automation: Automate repetitive tasks to improve efficiency and reduce human error. Improve the reliability, quality, and time-to-market of our software solutions. Measure and optimize system performance in anticipation of business needs.
  • Monitoring and Alerting: Implement and enhance monitoring systems (e.g., Datadog) to track the health and performance of applications and infrastructure. There are existing systems, but additional ones are needed. Monitor and maintain the production environment, ensuring high availability and system health. Gather and process metrics from operating systems and applications to assist in performance tuning and fault finding. Develop a health monitoring dashboard that gives our various stakeholders visibility into the production environment.
  • Disaster Recovery: Prepare and implement disaster recovery plans to manage unexpected outages.
  • Performance Optimization: Continuously improve system performance and scalability.
  • Capacity Planning: Ensure the infrastructure can handle current and future demands.
  • Chaos Engineering: Intentionally introduce failures to test system resilience and improve robustness.
Qualifications
  • Bachelor's degree in software engineering, computer science or equivalent.
  • Minimum of 7 years of experience in cloud management, development and / or SRE responsibilities.
  • Experience in Agile methodology and technical project execution.
  • Knowledgeable in DevOps concepts, AWS, Azure, GCP, observability tools (Datadog, Cloudflare), Terraform, and PagerDuty, and in how to integrate them.
Other Skills
  • Strong initiative and resilience, with a demonstrated ability to explore new ideas and innovative approaches to solving complex problems.
  • Excellent interpersonal and communication skills in both French and English.
  • Able and comfortable working in a fast-moving environment.
Schedule
  • Primarily daytime hours, but on-call availability is required for the initial months to observe and refine existing processes.

Intelcom is a leading last-mile carrier in the e-commerce sector. Our teams across Canada as well as our network of independent contractors contribute to Intelcom’s daily operations.

Our goal is simple: in a constantly evolving business sector, we don't just follow, we get ahead. In addition to standing out through innovative services and delivery methods, Intelcom is also undergoing a technological transformation where the integration of customer experience and logistics technologies is at the heart of its evolution.

At Intelcom, we know experience comes in many forms and are committed to building a culture where difference is valued. We are always looking for talented and diverse individuals to join our teams. With over 60 delivery centers across Canada, we may have the right opportunity for you.
Whether you’re interested in a job at Intelcom or at Dragonfly, it’s from this site that you will make your application. We are growing and looking for dynamic people to join our team. We offer challenging and rewarding career opportunities where you’ll work to achieve our goals… and yours. If you’re interested, apply today.


Similar jobs
Senior Site Reliability Engineer (SRE)
Intelcom Express Inc. • Montreal
Full-time
Senior Site Reliability Engineer (SRE). Locations: Canada, Quebec, Montreal. Time type: Full time. Posted on: Today. Job requisition ID: ...
Last updated: 30+ days ago • Promoted
Site Reliability Engineer
TMC Canada • Montreal
Full-time +1
The Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to help drive the reliability engineering, operations and customer support services for Morgan Stanley's ...
Last updated: 1 day ago • Promoted
Site Reliability Engineer
ApTask • Montreal
Full-time
Looking for an intermediate candidate with 2 to 5 years' experience. The Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to...
Last updated: 30+ days ago • Promoted
Site Reliability Engineer
AKUR8 • Montreal
Full-time
Akur8 is a fast-growing Insurtech scale-up that transforms insurance pricing and reserving with transparent machine learning. Our SaaS platform injects speed, performance and reliability into insure...
Last updated: 30+ days ago • Promoted
Site Reliability Engineering Specialist (Hybrid)
Morgan Stanley • Montreal
Full-time
We're seeking someone to join our Data Protection Fleet as a Site Reliability Engineering (SRE) Spe...
Last updated: 13 days ago • Promoted
Senior Engineer, Reliability
VIA Rail Canada • Montreal
Full-time
Did you know that VIA Rail is carrying out ambitious projects to modernize its services and infrastructure? From our new ultramodern train fleet to ongoing improvement of our infrastructure, we’re ...
Last updated: 13 days ago • Promoted
Site Reliability Engineer
Compunnel, Inc. • Montreal
Full-time
Client is seeking an experienced Site Reliability Engineer (SRE) to support and enhance the reliability, performance, and operational efficiency of our global ServiceNow SaaS platform. As part of th...
Last updated: 30+ days ago • Promoted
Site Reliability Engineer (SRE)
Open Systems Technologies • Montreal
Full-time
The Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to help drive the reliability engineering, operations and customer support services for Morgan Stanley's ...
Last updated: 1 day ago • Promoted
Specialist Site Reliability Engineer
Global Talent Alliance, Canada • Montreal
Full-time
The role of the Specialist Site Reliability Engineer (SRE) is to execute RAM analysis and engineering in support of the I&T solutions. The overall ...
Last updated: 6 days ago • Promoted
Senior Site Reliability Engineer - GitOps & IaC Lead
Canonical • Montreal
Full-time
A leading open-source software provider in Montreal is seeking a Senior Site Reliability / GitOps Engineer to drive operations automation and manage infrastructure as code across various clouds. The...
Last updated: 13 days ago • Promoted
Site Reliability Engineering Specialist (Hybrid)
PowerToFly • Montreal
Full-time
We're seeking someone to join our Data Protection Fleet as a Site Reliability Engineering (SRE) Specialist in Cyber to help drive performance, reliability, enhanced observability and efficiency for...
Last updated: 13 days ago • Promoted
Senior Site Reliability Engineer
Targeted Talent • Montreal, QC, Canada
Permanent
We are looking for an experienced Senior Site Reliability Engineer. Our client is a global enterprise company with a product that you've likely used. Experience with coding / software development, ...
Last updated: 30+ days ago • Promoted
Site Reliability Engineer / Platform Operations Engineer
Targeted Talent • Montreal, QC, Canada
Permanent
We are looking for an experienced Site Reliability Engineer or Platform Operations Engineer for our client. This is a permanent position that is remote to start with later relocation to. Our client i...
Last updated: 30+ days ago • Promoted
Site Reliability Engineer
High Tech Genesis • Montreal
Full-time
At HTG, you’ll push boundaries with the latest tech and collaborate with a team that loves what they do. Be part of a design services company that is amongst the com...
Last updated: 30+ days ago • Promoted
Site Reliability Engineer w / Python (Onsite Hybrid)
NTT DATA, Inc. • Montreal
Full-time
NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adapt...
Last updated: 30+ days ago • Promoted
Senior Site Reliability Engineer (SRE)
Devopshunt • Montreal
Full-time
Digital Infrastructure Team Lead. This is an opportunity to make a significant impact in a fast-paced, innovative environment. If you’re passionate about buildi...
Last updated: 13 days ago • Promoted
Senior SRE - Retail Platform & Kubernetes Reliability
Lightspeed • Montreal
Full-time
A global commerce platform is seeking a Senior Site Reliability Engineer to join their Retail group in Montreal. The role involves ensuring the reliability and scalability of their POS systems infra...
Last updated: 1 day ago • Promoted
Site Reliability Engineer (Linux / Cloud Infrastructure)
Atlantis IT Group • Montreal
Full-time
Site Reliability Engineer (Linux / Cloud Infrastructure) role with hands-on experience across Linux, distributed systems, scripting, databases, monitoring, containers, cloud SaaS integrations, mess...
Last updated: 30+ days ago • Promoted