Talent.com
Platform Engineer (Cloud Reliability Engineer)
Platform Engineer (Cloud Reliability Engineer)Nanometrics Inc. • Ahuntsic North, ca
Platform Engineer (Cloud Reliability Engineer)

Platform Engineer (Cloud Reliability Engineer)

Nanometrics Inc. • Ahuntsic North, ca
6 days ago
Job type
  • Full-time
Job description

Platform Engineer (Cloud Reliability Engineer)

Reports to : Director, Global Operations

Based in Ottawa, ON

Term : Full Time

About Nanometrics

With 40 years of seismic technology and industry application experience, we are a global, award‑winning company providing monitoring solutions and equipment for studying artificial and natural seismicity. From mission‑critical seismic arrays, tsunami and early‑earthquake warning systems in over 90 countries across the globe to induce seismicity monitoring in the energy sector. We specialize in full‑service, integrated solutions for studying artificial and natural seismicity, including turnkey seismic networks, industry‑leading precision instrumentation, complete data processing, analysis services, and software applications.

At Nanometrics, we take pride in fostering a culture of innovation, collaboration, and excellence. We are passionate about making a global impact through cutting‑edge technology while staying rooted in values of intentional innovation, trust, ethics, and stability.

About the role

This is an exciting opportunity for a motivated and experienced Platform Engineer to evolve, enhance and lead the technological footprint of our Seismic Monitoring Services portfolio. Nanometrics provides a top tier portfolio of tools and services which is supported by a continuously evolving cloud based platform.

The Platform Engineer / Cloud Reliability Engineer ensures the reliability, performance, and operational excellence of cloud‑hosted seismic monitoring and data processing services. This role blends software engineering, cloud infrastructure management, and SRE practices to build resilient systems, reduce manual toil through automation, and improve observability across AWS and Kubernetes ecosystems.

The successful candidate will use Terraform or similar Infrastructure-as-Code technologies (Pulumi, AWS CDK, CloudFormation, OpenTofu) to deliver consistent, automated, scalable infrastructure.

Responsibilities

Cloud Reliability & Resilience

Ensure uptime, performance, and reliability of AWS-hosted services and Kubernetes workloads

Implement self-healing patterns, automated rollbacks, health checks, and safe-deployment strategies

Participate in on-call rotation and lead first-response triage for cloud and platform incidents

Build and maintain service-level indicators (SLIs) and service-level objectives (SLOs)

Automation & Infrastructure Engineering

Develop automation for cloud operations using Python, Bash, and IaC (Terraform)

Reduce operational toil through automated runbooks, event-driven remediation, and system orchestration

Improve deployment reliability in collaboration with Platform Engineering and R&D teams

Implement and refine configuration standards, CI / CD hygiene, and environment stability

Observability & Operational Intelligence

Maintain and extend observability stack (Prometheus, Grafana, InfluxDB, OpenTelemetry)

Tune alerts for accuracy, reduce noise, and implement actionable alerting tied to SLOs

Analyze logs, metrics, and traces to detect reliability issues and validate system behavior

Build dashboards that provide real‑time visibility into system health and reliability trends

Operational Excellence

Support release processes, platform upgrades, and cloud infrastructure changes

Conduct root‑cause analysis and drive post‑incident corrective actions

Maintain operational documentation, runbooks, and environment validation workflows

Collaborate cross‑functionally with NetOps, Platform Engineering, Field Ops, and R&D

Requirements

Education and Experience

Bachelor's degree or higher in Software Engineering, Computer Science, or related field.

7+ years experience in software development

3+ years hands‑experience working with cloud providers like AWS, etc and cloud‑native technologies like Kubernetes, Helm, etc. and related technologies including observability platforms.

Experience with database operations (MySQL, PostgreSQL, MongoDB, Redis) in cloud and on‑prem environments.

Cloud & Infrastructure

Strong experience with AWS (EC2, S3, IAM, VPC, EKS / ECS, CloudWatch)

Solid understanding of Kubernetes , Helm charts, and container orchestration

Familiarity with hybrid cloud environments (cloud + on‑prem integration)

Infrastructure as Code & Automation

Hands‑on experience with Terraform

Scripting skills in Python and Bash

Ability to build automated workflows and cloud operations tooling

CI / CD & Deployment Engineering

Experience with deployment pipelines (Jenkins, Bitbucket Pipelines, ArgoCD)

Familiarity with GitOps workflows

Understanding of build systems (Maven, Gradle)

Monitoring & Observability

Experience with monitoring / metrics / logging tools such as Prometheus, Grafana, InfluxDB

Familiarity with OpenTelemetry for distributed tracing

Ability to diagnose performance issues in distributed systems

Reliability Engineering Concepts

Knowledge of SLOs / SLIs / error budgets

Incident management principles

Understanding of resilience patterns (retry, circuit breakers, autoscaling, etc.)

Why Nanometrics?

We are a global leader in seismic solutions and a Canada's Best Managed Companies Platinum member.

We value sustainable growth that benefits our employees, our community, and the environment.

Maximize your productivity with our flexible hybrid work model. Our centrally located office space offers a stimulating environment for collaboration and focused work. Plus, enjoy a convenient commute with easy access to biking paths and public transportation.

Engage in virtual and onsite social events centered around collaboration, learning, and fun, including volunteer events, celebrations, and team-building activities.

Our comprehensive group benefits program includes RRSP matching, health / dental benefits, a corporate bonus program, education assistance, and a health spending account.

Our Employee Assistance Program (EAP) provides services and support for health, work‑life solutions, legal guidance, financial resources, wellness tools, and more.

Enjoy a competitive leave program, including a holiday shutdown (December 25 to January 1).

Grow your career with learning and development opportunities.

Collaborate with high-performing teams and some of the industry's top minds.

Nanometrics is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. Should you require accommodation as part of the recruitment and selection process, please reach out to careers@nanometrics.ca

#J-18808-Ljbffr

Create a job alert for this search

Platform Engineer Cloud Reliability Engineer • Ahuntsic North, ca

Similar jobs
Ingnieur(e) infonuagique / Cloud Engineer

Ingnieur(e) infonuagique / Cloud Engineer

Taiga Motors • Montreal, QC, Canada
Full-time
Taiga Motors, une entreprise de technologie et de fabrication de vhicules lectriques hors route en pleine expansion, est la recherche dun(e) ingnieur(e). Dans ce rle, vous serez responsable de la co...Show more
Last updated: 30+ days ago • Promoted
Cloud Solution Integration Specialist

Cloud Solution Integration Specialist

Astra North Infoteck Inc. • Montreal, QC, ca
Full-time
Quick Apply
Solution Integration Specialist.The Cloud Solution Integration Specialist is a key technical contributor within IT projects, focusing on designing, implementing, and integrating cloud-based s...Show more
Last updated: 4 days ago
Software Architect - Randstad Digital Americas

Software Architect - Randstad Digital Americas

Randstad Digital Americas • saint-esprit, qc, ca
Full-time
Oakville, Ontario (Hybrid - 3 days onsite / week).We are seeking a pragmatic and visionary.In this role, you will be the bridge between complex business requirements and robust technological solution...Show more
Last updated: 1 hour ago • Promoted • New!
Chef d'Équipe Lean - Amélioration Continue & Santé / Sécurité

Chef d'Équipe Lean - Amélioration Continue & Santé / Sécurité

Prattwhitney • Longueuil H4H, QC, Canada
Full-time
Une entreprise manufacturière renommée cherche à recruter un gestionnaire pour superviser les employés dans un environnement syndiqué à Longueuil, Québec. Ce rôle exige des compétences en communicat...Show more
Last updated: 1 day ago • Promoted
Optical Engineer (On site - Montreal, Canada)

Optical Engineer (On site - Montreal, Canada)

Astronics • Dorval, QC, Canada
Full-time
Astronics - Luminescent Systems Canada Inc.Reporting to the Director of Engineering, the incumbent will be responsible for optical design, inspection test design, automation software tools.He will ...Show more
Last updated: 30+ days ago • Promoted
Professionnel Intégration Systèmes Cybersécurité / Cybersecurity Systems Integration Professional

Professionnel Intégration Systèmes Cybersécurité / Cybersecurity Systems Integration Professional

Airbus Canada Limited Partnership • Côte-Saint-Luc, Canada, CA
Permanent
Job Description : • • • • •English job description follows • • • •Description de l'emploi : •Vous avez une expérience en aéronautique et un intérêt pour les systèmes avioniques, vous avez travaillé dans...Show more
Last updated: 2 days ago • Promoted
Senior Protocol Engineer Crypto Infrastructure Remote (EST or Lisbon timezone)

Senior Protocol Engineer Crypto Infrastructure Remote (EST or Lisbon timezone)

Inner Circle Agency Inc. • Montreal, QC, Canada
Remote
Full-time
Senior Protocol Engineer – Crypto Infrastructure – Remote (EST or Lisbon).Remote (Must overlap with EST or Lisbon time zones). Full-time, flexible hours with strong overlap to EST or Lis...Show more
Last updated: 30+ days ago • Promoted
Lead DevSecOps Engineer (Remote, Montreal, QC, Canada)

Lead DevSecOps Engineer (Remote, Montreal, QC, Canada)

HR POD - Hiring Talent Globally • Montreal, QC, Canada
Remote
Full-time
Proven track record of redesigning and scaling production infrastructure for high-growth companies.Deep expertise in AWS services including RDS, EC2, ELB / ALB, Route53, VPC, IAM, and.Strong security...Show more
Last updated: 4 days ago • Promoted
English Private Tutoring Jobs Lanaudi

English Private Tutoring Jobs Lanaudi

Superprof • Lanaudi, Canada
Full-time +1
Superprof is Canada's #1 tutoring platform, and we're actively recruiting passionate tutors! Whether you're a student, a professional, or simply someone who loves teaching, join the largest communi...Show more
Last updated: 30+ days ago • Promoted
DevOps Engineer

DevOps Engineer

VBeyond Corporation • saint-esprit, QC, ca
Full-time
We are seeking a DevOps Engineer.The role focuses on infrastructure setup, deployment automation, performance, security, and operational stability throughout the migration and post-launch phases.Ke...Show more
Last updated: 5 hours ago • Promoted • New!
Director of Enterprise Architecture & Cloud Strategy

Director of Enterprise Architecture & Cloud Strategy

Fairstone Bank • Montreal (administrative region), QC, Canada
Full-time
A financial services organization based in Montreal is seeking a Director of Enterprise Architecture to define and govern technology architecture strategy. You will lead the development of framework...Show more
Last updated: 12 days ago • Promoted
Cloud DevOps Engineer

Cloud DevOps Engineer

Targeted Talent • Montreal, QC, Canada
Permanent
We are looking for an experienced.This is a permanent position that is remote to start with later relocation to.Our client is a global enterprise company with a product that you've likely used....Show more
Last updated: 30+ days ago • Promoted
Ingénieur logiciel senior - Plateforme cloud / Senior Software Engineer - Cloud Platform

Ingénieur logiciel senior - Plateforme cloud / Senior Software Engineer - Cloud Platform

Tait • Montreal, QC, Canada
Full-time
Créer des moments qui touchent les gens.Vous conceverez et mettrez en œuvre des fonctionnalités complexes, façonnerez des normes de codage et guiderez les décisions tech...Show more
Last updated: 30+ days ago • Promoted
Site Reliability Engineer (SRE)

Site Reliability Engineer (SRE)

Kutir Technologies • Montreal, QC, Canada
Full-time
Quick Apply
Role : Site Reliability Engineer (SRE) Location : Montreal, QC, Canada Experience in Python is a MUST The...Show more
Last updated: less than 1 hour ago • New!
Founder - Loud Solutions

Founder - Loud Solutions

Loud Solutions • saint-esprit, qc, ca
Full-time
Loud has partnered with a well-capitalized, highly active VC deploying capital into AI-driven businesses across large, legacy industries. What’s missing is the right person to steer the ship.We are ...Show more
Last updated: 7 hours ago • Promoted • New!
Senior Backend Engineer

Senior Backend Engineer

Meroka Inc. • Montréal, Quebec, Canada, H4A 2H4
Full-time
Meroka is building the future of independent medicine in the United States.We provide operations, finance, tech, and strategic support to physician-owned practices-helping them grow sustainably and...Show more
Last updated: 2 days ago
Senior AI-Driven Cloud & API Solutions Architect

Senior AI-Driven Cloud & API Solutions Architect

Banque Nationale du Canada • Montreal (administrative region), QC, Canada
Remote
Full-time
A leading Canadian bank is seeking a Solutions Architect to join their Technology and Operations team.This role involves designing and integrating innovative solutions, with a focus on cloud servic...Show more
Last updated: 11 days ago • Promoted
Forward Deployed Engineer - Montreal Canada

Forward Deployed Engineer - Montreal Canada

Fiveonefour Labs Inc • Montreal, Quebec, Canada, H1A 0A1
Full-time
We believe that data is the key to unleashing human potential.We've seen firsthand how data helps bridge art and science to create delightful experiences, impactful insights, and seamless automatio...Show more
Last updated: 30+ days ago