Talent.com

Reliability engineer Jobs in Laval, QC

Last updated: 2 days ago
Site Reliability Engineer
Xsolla
Montreal, Montreal (administrative region), CA
Full-time
At Xsolla, we believe that great games begin as ideas, driven by the curiosity, dedication, and grit of creators around the world. Our mission is to empower these visionaries by providing the suppor...
Last updated: 30+ days ago

Linux Infrastructure Engineer - Production Reliability
PowerToFly
Montreal (administrative region), QC, CA
Full-time
A leading financial services firm in Montreal is seeking a Linux Infrastructure Specialist to manage and implement Linux infrastructure. The role involves diagnosing production issues, collaborating...
Last updated: 19 days ago

Site Reliability Engineer
ApTask
Montreal, Montreal (administrative region), CA
Full-time
Looking for an intermediate candidate with 2 to 5 years' experience. The Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to...
Last updated: 30+ days ago

IAM Reliability Engineer
Compunnel, Inc.
Montreal, Montreal (administrative region), CA
Full-time
We are seeking a highly motivated individual to join the authentication and identity management reliability engineering group within an agile team. The ideal candidate will be responsible for suppor...
Last updated: 30+ days ago

Site Reliability Engineer 3
Behavox
Montreal, Montreal (administrative region), CA
Full-time
Behavox is shaping the future of how businesses harness their most important raw material – data. Our mission is bold: Organize ente...
Last updated: 23 days ago

Senior Engineer, Reliability
VIA Rail Canada
Montreal (administrative region), QC, CA
Full-time
Did you know that VIA Rail is carrying out ambitious projects to modernize its services and infrastructure? From our new ultramodern train fleet to ongoing improvement of our infrastructure, we’re ...
Last updated: 14 days ago

Site Reliability Engineer
High Tech Genesis
Montreal (administrative region), QC, CA
Full-time
WE'RE HIRING! At HTG, you’ll push boundaries with the latest tech and collaborate with a team that loves what they do. Be part of a design services company that is among the companies that lead the ...
Last updated: 4 days ago

Site Reliability Engineer 3
Behavox Limited.
Montreal, Montreal (administrative region), CA
Full-time
Behavox is shaping the future of how businesses harness their most important raw material – data. Our mission is bold: organize enterprise data into actionable information that protects and promotes...
Last updated: 24 days ago

Site Reliability Engineer
Vertex Elite LLC
Ahuntsic North, CA
Full-time
Duration: Contract. Key skills: monitoring / observability tools (Dynatrace, ELK, etc.); platform / cloud observability (OpenShift, Prometheus / Azure Cloud, etc.). Key responsibilities: Collaborate with v...
Last updated: 8 days ago

HW / SW Reliability Engineer
Nokia
Ahuntsic North, CA
Full-time
As a HW / SW Reliability Engineer in the NI-IP Organization, you will be responsible for the Reliability of product design of the latest developments. This position requires self-starters who can mana...
Last updated: 24 days ago

Specialist Site Reliability Engineer
Global Talent Alliance, Canada
Montreal (administrative region), QC, CA
Full-time
The role of the Specialist Site Reliability Engineer (SRE) is to execute RAM analysis and engineering in support of the I&T solutions. The overall ...
Last updated: 2 days ago

Site Reliability Engineer - Observability
Flinks
Montreal, Montreal (administrative region), CA
Full-time
Flinks is where financial data moves, with purpose, trust, and impact. We’re on a mission to simplify access to financial data and help businesses build better, faster, and more secure financial prod...
Last updated: 30+ days ago

Senior Site Reliability Engineer
Targeted Talent
Montreal, QC, Canada
Permanent
We are looking for an experienced Senior Site Reliability Engineer. Our client is a global enterprise company with a product that you've likely used. Experience with coding / software development, ...
Last updated: 30+ days ago

Site Reliability Engineer
High Tech Genesis Inc.
Montreal, Montreal (administrative region), CA
Full-time
At HTG, you’ll push boundaries with the latest tech and collaborate with a team that loves what they do. Be part of a design services company that is amongst the companies that lead the world in tec...
Last updated: 30+ days ago

Hardware Reliability Engineer
Lyft
Montreal, Montreal (administrative region), CA
Full-time
At Lyft, our purpose is to serve and connect. We aim to achieve this by cultivating a work environment where all team members belong and have the opportunity to thrive. Ensure product quality and rel...
Last updated: 30+ days ago

Site Reliability Engineer
AKUR8
Montreal, Montreal (administrative region), CA
Full-time
Akur8 is a fast-growing Insurtech scale‑up that transforms insurance pricing and reserving with transparent machine learning. Our SaaS platform injects speed, performance and reliability into insure...
Last updated: 30+ days ago

Site Reliability Engineer
Canonical
Laval (administrative region), QC, CA
Full-time
Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiat...
Last updated: 3 days ago

Hardware Reliability Engineer
Socotra, Inc.
Montreal, Montreal (administrative region), CA
Full-time
At Lyft, our purpose is to serve and connect. We aim to achieve this by cultivating a work environment where all team members belong and have the opportunity to thrive. Ensure product quality and rel...
Last updated: 30+ days ago

Senior Site Reliability Engineer (SRE)
Intelcom Express Inc.
Montreal, Montreal (administrative region), CA
Full-time
Locations: Canada, Quebec, Montreal. Time type: Full time. Posted on: Posted Today. Job requisition ID: ...
Last updated: 30+ days ago

Site Reliability Engineer

Xsolla
Montreal, Montreal (administrative region), CA
Last updated: 30+ days ago
Job type: Full-time
Job description

ABOUT US

At Xsolla, we believe that great games begin as ideas, driven by the curiosity, dedication, and grit of creators around the world. Our mission is to empower these visionaries by providing the support and resources they need to bring their games to life. We are committed to leveling the playing field, ensuring that every creator has the opportunity to share their passion with the world.

Headquartered in Los Angeles, with offices in Berlin, Seoul, and beyond, we partner with industry leaders like Valve, Twitch, and Ubisoft to clear the paths for innovation in gaming. Our global reach spans over 200 geographies, offering more than 700 payment methods in 130+ currencies.

Longevity, Opportunity, Vision, Enjoy the game!

Requirements

  • Proven experience as a Site Reliability Engineer, or in a similar software engineering role, in a large-scale production environment (5 to 10 years overall in IT, as Ops or Developer).
  • Proficiency in scripting languages such as Python and Bash. A strong understanding of Go and PHP is a plus.
  • Deep knowledge of monitoring systems such as Datadog, Prometheus, Grafana.
  • Good understanding of continuous integration / continuous delivery processes and platforms (GitLab preferred). Experience with Helm.
  • Experience with Docker, Kubernetes, or other container orchestration systems.
  • Familiarity with infrastructure automation tools like Terraform.
  • Experience with automation, system administration, and system hardening.
  • Experience with Linux-based infrastructures, Linux / Unix administration.
  • Demonstrated problem-solving skills, particularly debugging and troubleshooting complex software systems. Ability to work under pressure.
  • Excellent communication skills with a capacity to articulate and solve complex technical problems.
  • Xsolla Technology Stack: Ubuntu, Kubernetes, GitLab, Terraform, Terragrunt, Puppet, Nginx, Google Cloud Platform, Datadog, Prometheus, Grafana, ELK, Zabbix, and Harbor.

Responsibilities

  • Ensure high reliability and availability and meet SLAs, SLOs, and SLIs.
  • Monitor the system for issues and respond to incidents, ensuring quick resolution to maintain high system availability.
  • Drive incident resolution and process improvements to minimize downtime and increase operational transparency.
  • Ensure all key services are measured and monitored, and that they raise alerts when needed.
  • Develop comprehensive monitoring solutions to provide full visibility to the different platform components using tools and services like Kubernetes, Datadog, Prometheus, Grafana and others.
  • Support services before they go live through activities such as capacity planning, monitoring setup, logging, and production readiness reviews.
  • Engage in service capacity planning and demand forecasting, performance analysis, and system tuning.
  • Collaborate with the development teams to enhance the product's operational stability.
  • Build and drive the automation systems that maintain system health.

Education

  • IT professional certifications are not required but are a plus:
  • Certified Kubernetes Administrator or Developer
  • HashiCorp certifications
  • GCP certifications

$120,000 - $150,000 a year

Benefits

We are passionate about fostering a supportive environment for our team, so we prioritize the physical, mental, and emotional well-being of our employees and their families through a comprehensive Benefits Program. This includes 100% company-paid medical, dental, and vision plans, unlimited Flexible Time Off, and a personalized career roadmap for each employee. By investing in professional development through training and educational opportunities, we ensure that our team thrives both personally and professionally. Together, we’re not just building a business; we’re cultivating a community that values creativity, collaboration, and the transformative power of play.

By submitting the following job application form, you consent to Xsolla processing your data for career-related inquiries and potential employment opportunities. We process your data in accordance with the Xsolla Privacy Notice for Job Applicants. Please direct any inquiries regarding your data privacy to careers@xsolla.com.