Talent.com
Site Reliability Engineer
High Tech Genesis • Montreal (administrative region), QC, CA
4 days ago
Job type
  • Full-time
Job description

WE'RE HIRING! At HTG, you’ll push boundaries with the latest tech and collaborate with a team that loves what they do. Be part of a design services company that ranks among the world's leaders in technology and innovation.

Your next chapter starts here.

Responsibilities

  • Perform advanced troubleshooting and service recovery for residential and small-business networks, energy systems, and IoT technologies.
  • Support and troubleshoot Azure workloads, cloud integrations, and general IT issues across Windows, Linux, and hybrid environments.
  • Perform detailed incident investigations, document findings, and implement corrective actions. Strengthen the team’s ability to resolve complex technical issues.
  • Communicate clearly with customers and internal teams, providing timely updates, guidance, and expectations throughout the incident lifecycle.
  • Assist L1 analysts through training, coaching, and escalation support to increase first-contact resolution rates.
  • Stay current with developments in EV charging, solar energy, and related emerging technologies to enhance overall technical support.
  • Apply security-first practices in daily operations, including secrets management, patching cycles, baseline image maintenance, identity hygiene, and RBAC reviews.
  • Work with security and compliance teams on vulnerability management, audit documentation, and evaluation of control effectiveness across applicable frameworks.
  • Develop and maintain operational documentation such as SOPs, runbooks, knowledge base articles, escalation procedures, and service catalogs.
  • Ensure documentation accuracy, version control, and comprehensive coverage; treat documentation as a critical operational output.
  • Utilize Jira for incident, problem, and change workflows, including SLAs, dashboards, and reporting.
  • Collaborate with Engineering and DevOps teams to define operational requirements, enhance service design, and prioritize reliability improvements.
  • Provide ongoing mentorship and training opportunities to L1 analysts to support skill growth and improve initial resolution outcomes.
  • Communicate effectively with both internal and external stakeholders regarding incidents, maintenance activities, service enhancements, and post-incident analyses.

Qualifications

  • At least 3 years of experience in network operations, site reliability, or cloud platform support roles managing production systems.
  • Strong understanding of networking, VPNs, firewalls, load balancers, DNS, and certificate management.
  • Hands-on experience with cloud services including compute, storage, networking, and identity management.
  • Practical experience with both Linux and Windows systems administration.
  • Proficiency in one or more scripting languages such as Python, PowerShell, or Bash, and ability to create dependable automation workflows.
  • Familiarity with monitoring, alerting, and telemetry systems, including the design of meaningful service-level indicators.
  • Working knowledge of service management platforms and workflow automation tools.
  • Proven ability to write accurate operational documentation, including procedures and troubleshooting guides.
  • Strong communication skills for both technical and customer-facing interactions.

Preferred Qualifications

  • Experience with Infrastructure-as-Code tools (e.g., Terraform, Bicep) and CI/CD systems.
  • Knowledge of IoT or distributed device management at scale.
  • Understanding of system reliability concepts such as graceful degradation and autoscaling.
  • Exposure to industrial or energy systems involving telemetry, control, or gateway operations.
  • Relevant certifications such as Azure Administrator, Azure Network Engineer, ITIL, or CCNA (or equivalents).

High Tech Genesis Inc. is an Equal Opportunity Employer committed to building inclusive teams where diverse perspectives drive innovation.

We support an accessible recruitment process and are happy to provide accommodation upon request.

Applicants must be legally authorized to work in Canada, and resumes should be submitted in Microsoft Word format.

Location: Montreal, Quebec, Canada
