Talent.com
Site Reliability Engineer (Linux / Cloud Infrastructure)
Site Reliability Engineer (Linux / Cloud Infrastructure)Atlantis IT Group • Montreal, Montreal (administrative region), CA
Site Reliability Engineer (Linux / Cloud Infrastructure)

Site Reliability Engineer (Linux / Cloud Infrastructure)

Atlantis IT Group • Montreal, Montreal (administrative region), CA
30+ days ago
Job type
  • Full-time
Job description

Overview

Site Reliability Engineer (Linux / Cloud Infrastructure) role with hands-on experience across Linux, distributed systems, scripting, databases, monitoring, containers, cloud SaaS integrations, messaging, load balancers, security, and incident management.

Responsibilities

  • Provide hands-on administration of Linux 7.x and related infrastructure.
  • Work with Service Oriented Architecture, distributed systems, and scripting (Python, shell).
  • Manage relational databases (e.g., Sybase, DB2, SQL, Postgres) and application integration, configuration, and troubleshooting.
  • Operate observability and monitoring tools : Open Telemetry, Prometheus, Grafana, Splunk, Ansible.
  • Manage web servers (Apache, Nginx) and application servers (Tomcat, JBoss) for integration and troubleshooting.
  • Work with Docker containers, Kubernetes, and SaaS platform integration.
  • Understand messaging systems (e.g., Kafka) and their role in the architecture.
  • Design and implement load balancing, web proxies, and storage platforms (NAS / SAN) from an implementation perspective.
  • Apply basic security policies for secure hosting solutions, including Kerberos and encryption methods (SSL / TLS).
  • Experience in managing large web-based, multi-tier (n-tier) applications in secure cloud environments.
  • Apply SRE principles with appropriate tooling approach; strong Linux / Unix admin, storage, networking, and web technologies knowledge.
  • Troubleshoot application issues and manage incidents effectively.
  • Exhibit excellent verbal and written communication skills.

Qualifications

  • Hands-on experience with Linux 7.x operating system (5+ years) at an advanced level.
  • Hands-on experience with SOA, distributed systems, and scripting (Python, shell).
  • Experience with relational databases (Sybase, DB2, SQL, Postgres).
  • Exposure to tools : Open Telemetry, Prometheus, Grafana, Splunk, Ansible.
  • Hands-on experience with web servers (Apache, Nginx) and application servers (Tomcat, JBoss).
  • Experience with Docker, Kubernetes, and SaaS platform integration.
  • Experience with Kafka and messaging technologies.
  • Understanding of load balancers, web proxies, and NAS / SAN storage from an implementation perspective.
  • Familiar with security policies for secure hosting, Kerberos, SSL / TLS.
  • Experience managing large web-based n-tier applications in secure cloud environments.
  • Strong knowledge of SRE principles and tooling.
  • Strong infrastructure knowledge in Linux / Unix administration, storage, networking, and web technologies.
  • Excellent troubleshooting and incident management capabilities.
  • Senioriry level

    Mid-Senior level

    Employment type

    Contract

    Job function

    Information Technology

    Industries

    IT Services and IT Consulting

    #J-18808-Ljbffr

    Create a job alert for this search

    Site Reliability Engineer Linux Cloud Infrastructure • Montreal, Montreal (administrative region), CA

    Similar jobs
    Site Reliability Engineer

    Site Reliability Engineer

    Tekshapers • Montreal
    Full-time
    Production experience in SRE / Infrastructure / ops for large-scale systems.Strong programming / scripting skills (Python, Go, Java, or equivalent). Deep experience with containerization (Docker), orc...Show more
    Last updated: 1 hour ago • Promoted • New!
    Site Reliability Engineer

    Site Reliability Engineer

    TMC Canada • Montreal
    Full-time +1
    The Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to help drive the reliability engineering, operations and customer support services for Morgan Stanley's ...Show more
    Last updated: 8 days ago • Promoted
    Staff Platform Site Reliability Specialist (Observability & Kubernetes)

    Staff Platform Site Reliability Specialist (Observability & Kubernetes)

    Everbridge • Montreal
    Full-time
    Everbridge is seeking a Staff Platform Site Reliability Specialist to own, operate, and evolve our enterprise observability platform. In this role, you will be responsible for the up-keep, reliabili...Show more
    Last updated: 3 days ago • Promoted
    Chef d'Équipe Lean - Amélioration Continue & Santé / Sécurité

    Chef d'Équipe Lean - Amélioration Continue & Santé / Sécurité

    Prattwhitney • Longueuil H4H, QC, Canada
    Full-time
    Une entreprise manufacturière renommée cherche à recruter un gestionnaire pour superviser les employés dans un environnement syndiqué à Longueuil, Québec. Ce rôle exige des compétences en communicat...Show more
    Last updated: 11 days ago • Promoted
    Linux Infrastructure Engineer - Production Reliability

    Linux Infrastructure Engineer - Production Reliability

    PowerToFly • Montreal
    Full-time
    A leading financial services firm in Montreal is seeking a Linux Infrastructure Specialist to manage and implement Linux infrastructure. The role involves diagnosing production issues, collaborating...Show more
    Last updated: 8 days ago • Promoted
    Senior Linux & Cloud Platform Engineer

    Senior Linux & Cloud Platform Engineer

    Barracuda Networks • Ahuntsic North, ca
    Full-time
    A leading cybersecurity company is looking for a Senior Software Engineer in Ottawa, Canada.You will develop and maintain the Operating System platform, collaborate with development teams, and trou...Show more
    Last updated: 30+ days ago • Promoted
    Algebra Private Tutoring Jobs Lanaudi

    Algebra Private Tutoring Jobs Lanaudi

    Superprof • Lanaudi, Canada
    Full-time +1
    Superprof is Canada's #1 tutoring platform, and we're actively recruiting passionate tutors! Whether you're a student, a professional, or simply someone who loves teaching, join the largest communi...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Vertex Elite LLC • Ahuntsic North, ca
    Full-time
    Duration : Contract Key Skills : Monitoring / Observability tools - Dynatrace, ELK etc.Platform / cloud Observability - OpenShift, Prometheus / Azure Cloud etc. Key Responsibilities : Collaborate with v...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Noramtec Consultants Inc. • Montreal
    Full-time
    A major global financial services institution is partnering with us to hire a.Site Reliability Engineer (SRE).Montreal-based Application Infrastructure team. This pivotal role will focus on.ServiceN...Show more
    Last updated: 8 days ago • Promoted
    Specialist Site Reliability Engineer

    Specialist Site Reliability Engineer

    Global Talent Alliance, Canada • Montreal
    Full-time
    About the job Specialist Site Reliability Engineer.The role of the Specialist Site Reliability Engineer (SRE) is to execute RAM analysis and engineering in support of the I&T solutions.The overall ...Show more
    Last updated: 8 days ago • Promoted
    Senior Reliability Engineer - Low-Latency Trading

    Senior Reliability Engineer - Low-Latency Trading

    Tower Research Capital • Montreal
    Full-time
    A leading quantitative trading firm in Montreal is seeking a Technical Support Engineer who will provide essential support for trading applications while interacting closely with traders and develo...Show more
    Last updated: 8 days ago • Promoted
    Senior Cloud Platform Engineer – Linux & Kubernetes

    Senior Cloud Platform Engineer – Linux & Kubernetes

    Aptiv • Ahuntsic North, ca
    Full-time
    A leading technology company based in Ottawa is seeking a Senior Engineer to develop high-quality, testable code for processes that run natively on Linux. The ideal candidate will have a Bachelor's ...Show more
    Last updated: 14 days ago • Promoted
    Site Reliability Engineer w / Python (Onsite Hybrid)

    Site Reliability Engineer w / Python (Onsite Hybrid)

    NTT DATA, Inc. • Montreal
    Full-time
    Site Reliability Engineer w / Python (Onsite Hybrid).NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adapt...Show more
    Last updated: 8 days ago • Promoted
    Site Reliability Architect - Kubernetes and Container Solutions

    Site Reliability Architect - Kubernetes and Container Solutions

    Synopsys Inc • Ahuntsic North, ca
    Full-time
    Site Reliability Architect - Kubernetes and Container Solutions Join to apply for the.Site Reliability Architect - Kubernetes and Container Solutions. We Are : At Synopsys, we drive the innovations t...Show more
    Last updated: 16 days ago • Promoted
    Site Reliability Engineer (Linux / Cloud Infrastructure)

    Site Reliability Engineer (Linux / Cloud Infrastructure)

    Atlantis IT Group • Montreal
    Full-time
    Site Reliability Engineer (Linux / Cloud Infrastructure) role with hands-on experience across Linux, distributed systems, scripting, databases, monitoring, containers, cloud SaaS integrations, mess...Show more
    Last updated: 8 days ago • Promoted
    Senior SRE - Retail Platform & Kubernetes Reliability

    Senior SRE - Retail Platform & Kubernetes Reliability

    Lightspeed • Montreal
    Full-time
    A global commerce platform is seeking a Senior Site Reliability Engineer to join their Retail group in Montreal.The role involves ensuring the reliability and scalability of their POS systems infra...Show more
    Last updated: 8 days ago • Promoted
    Senior DevOps SRE : Reliability, Automation & Cloud

    Senior DevOps SRE : Reliability, Automation & Cloud

    TechDoQuest • Montreal
    Full-time
    A technology company in Montreal is looking for a skilled Site Reliability Engineer (SRE) to enhance system reliability and performance. This position involves automating infrastructure, managing ob...Show more
    Last updated: 8 days ago • Promoted
    Lead Site Reliability Engineering (SRE)

    Lead Site Reliability Engineering (SRE)

    freelance.ca • Montreal, Canada
    Full-time
    Lead Site Reliability Engineering (SRE).Vous serez responsable de bâtir et de maintenir des pipelines CI / CD partagés, d’implanter des pratiques exemplaires en matière de résilience et de stabilité,...Show more
    Last updated: 30+ days ago • Promoted