Talent.com
Platform Engineer (Cloud Reliability Engineer)
Platform Engineer (Cloud Reliability Engineer)Nanometrics • Ottawa, ON, CA
Platform Engineer (Cloud Reliability Engineer)

Platform Engineer (Cloud Reliability Engineer)

Nanometrics • Ottawa, ON, CA
30+ days ago
Job type
  • Full-time
Job description

Job Title :

Platform Engineer (Cloud Reliability Engineer)

Reports to :

Director, Global Operations

Based in : Ottawa, ON

Term : Full Time

About Nanometrics :

With 40 years of seismic technology and industry application experience, we are a global, award-winning company providing monitoring solutions and equipment for studying artificial and natural seismicity. From mission-critical seismic arrays, tsunami and early earthquake warning systems in over 90 countries across the globe to induce seismicity monitoring in the energy sector. We specialize in full-service, integrated solutions for studying artificial and natural seismicity, including turnkey seismic networks, industry-leading precision instrumentation, complete data processing, analysis services, and software applications.

At Nanometrics, we take pride in fostering a culture of innovation, collaboration, and excellence. We are passionate about making a global impact through cutting-edge technology while staying rooted in values of intentional innovation, trust, ethics, and stability.

About the role :

This is an exciting opportunity for a motivated and experienced Platform Engineer to evolve, enhance and lead the technological footprint of our Seismic Monitoring Services portfolio. Nanometrics provides a top tier portfolio of tools and services which is supported by a continuously evolving cloud based platform.

The Platform Engineer / Cloud Reliability Engineer ensures the reliability, performance, and operational excellence of cloud-hosted seismic monitoring and data processing services. This role blends software engineering, cloud infrastructure management, and SRE practices to build resilient systems, reduce manual toil through automation, and improve observability across AWS and Kubernetes ecosystems.

The successful candidate will use Terraform or similar Infrastructure-as-Code technologies (Pulumi, AWS CDK, CloudFormation, OpenTofu) to deliver consistent, automated, scalable infrastructure.

Responsibilities :

Cloud Reliability & Resilience

Ensure uptime, performance, and reliability of AWS-hosted services and Kubernetes workloads

Implement self-healing patterns, automated rollbacks, health checks, and safe-deployment strategies

Participate in on-call rotation and lead first-response triage for cloud and platform incidents

Build and maintain service-level indicators (SLIs) and service-level objectives (SLOs)

Automation & Infrastructure Engineering

Develop automation for cloud operations using Python, Bash, and IaC (Terraform)

Reduce operational toil through automated runbooks, event-driven remediation, and system orchestration

Improve deployment reliability in collaboration with Platform Engineering and R&D teams

Implement and refine configuration standards, CI / CD hygiene, and environment stability

Observability & Operational Intelligence

Maintain and extend observability stack (Prometheus, Grafana, InfluxDB, OpenTelemetry)

Tune alerts for accuracy, reduce noise, and implement actionable alerting tied to SLOs

Analyze logs, metrics, and traces to detect reliability issues and validate system behavior

Build dashboards that provide real-time visibility into system health and reliability trends

Operational Excellence

Support release processes, platform upgrades, and cloud infrastructure changes

Conduct root-cause analysis and drive post-incident corrective actions

Maintain operational documentation, runbooks, and environment validation workflows

Collaborate cross-functionally with NetOps, Platform Engineering, Field Ops, and R&D

Requirements :

Education and Experience

Bachelor's degree or higher in Software Engineering, Computer Science, or related field.

7+ years experience in software development

3+ years hands-experience working with cloud providers like AWS, etc and cloud-native technologies like Kubernetes, Helm, etc. and related technologies including observability platforms.

Experience with database operations (MySQL, PostgreSQL, MongoDB, Redis) in cloud and on-prem environments.

Cloud & Infrastructure

Strong experience with AWS (EC2, S3, IAM, VPC, EKS / ECS, CloudWatch)

Solid understanding of Kubernetes, Helm charts, and container orchestration

Familiarity with hybrid cloud environments (cloud + on-prem integration)

Infrastructure as Code & Automation

Hands-on experience with Terraform

Scripting skills in Python and Bash

Ability to build automated workflows and cloud operations tooling

CI / CD & Deployment Engineering

Experience with deployment pipelines (Jenkins, Bitbucket Pipelines, ArgoCD)

Familiarity with GitOps workflows

Understanding of build systems (Maven, Gradle)

Monitoring & Observability

Experience with monitoring / metrics / logging tools such as Prometheus, Grafana, InfluxDB

Familiarity with OpenTelemetry for distributed tracing

Ability to diagnose performance issues in distributed systems

Reliability Engineering Concepts

Knowledge of SLOs / SLIs / error budgets

Incident management principles

Understanding of resilience patterns (retry, circuit breakers, autoscaling, etc.)

Why Nanometrics?

We are a global leader in seismic solutions and a Canada's Best Managed Companies Platinum member.

We value sustainable growth that benefits our employees, our community, and the environment.

Maximize your productivity with our flexible hybrid work model. Our centrally located office space offers a stimulating environment for collaboration and focused work. Plus, enjoy a convenient commute with easy access to biking paths and public transportation.

Engage in virtual and onsite social events centered around collaboration, learning, and fun, including volunteer events, celebrations, and team-building activities.

Our comprehensive group benefits program includes RRSP matching, health / dental benefits, a corporate bonus program, education assistance, and a health spending account.

Our Employee Assistance Program (EAP) provides services and support for health, work-life solutions, legal guidance, financial resources, wellness tools, and more.

Enjoy a competitive leave program, including a holiday shutdown (December 25 to January 1).

Grow your career with learning and development opportunities.

Collaborate with high-performing teams and some of the industry's top minds.

Create a job alert for this search

Reliability Engineer • Ottawa, ON, CA

Similar jobs
Senior Platform Engineer

Senior Platform Engineer

Facilisgroup • Ottawa
Full-time
Senior Platform Engineer - Product Infrastructure.Facilisgroup is a leading technology provider in the Promotional Products industry. We build software-as-a-service solutions that help promotional p...Show more
Last updated: 4 days ago • Promoted
Cloud Architect

Cloud Architect

Akkodis • Ottawa
Full-time
Get AI-powered advice on this job and more exclusive features.Join Our IT Support Talent Network – Future Opportunities in Ottawa. Akkodis is currently building a network of.Design and implement sec...Show more
Last updated: 26 days ago • Promoted
DevOps Engineer

DevOps Engineer

Octopus HR • Ottawa, ON, Canada
Full-time
DevOps EngineerAbout Octopus HR.Octopus HR is a fractional HR consultancy that partners with high-growth startups to build exceptional teams. We specialize in talent acquisition, people operations, ...Show more
Last updated: 17 days ago • Promoted
Senior AI & Cloud Software Engineer

Senior AI & Cloud Software Engineer

Export Development Canada | Exportation et développement Canada • Ottawa H2B, ON, Canada
Remote
Full-time
A financial services organization based in Ottawa seeks a Software Engineer or Senior Software & AI Engineer to join their Digital Delivery Marketing and Architects team. This role involves designin...Show more
Last updated: 18 hours ago • Promoted • New!
Cloud DevOps Engineer

Cloud DevOps Engineer

Targeted Talent • Ottawa, ON, Canada
Permanent
We are looking for an experienced.This is a permanent position that is remote to start with later relocation to.Our client is a global enterprise company with a product that you've likely used....Show more
Last updated: 30+ days ago • Promoted
Site Reliability Engineer / Platform Operations Engineer

Site Reliability Engineer / Platform Operations Engineer

Targeted Talent • Ottawa, ON, Canada
Permanent
We are looking for an experienced Site Reliability Engineer or Platform Operations Engineer for our client.This is a permanent position that is remote to start with later relocation to.Our client i...Show more
Last updated: 30+ days ago • Promoted
Cloud Network Engineer

Cloud Network Engineer

Paymentology • Ottawa
Full-time
As the first truly global issuer-processor, we give banks and fintechs the technology and talent to launch and manage Mastercard and Visa cards at scale—across more than 60 countries.Our advanced, ...Show more
Last updated: 5 days ago • Promoted
Senior Platform Engineer – Cloud Orchestration & Analytics

Senior Platform Engineer – Cloud Orchestration & Analytics

Wind River • Ottawa
Full-time
A global software development firm in Ottawa is seeking a Senior Engineer to develop distributed cloud-based orchestration and automation platform solutions. The ideal candidate should have over 5 y...Show more
Last updated: 5 days ago • Promoted
Cloud Operations Engineer : 24 / 7 Reliability & Automation

Cloud Operations Engineer : 24 / 7 Reliability & Automation

Fuel Industries • Ottawa
Full-time
Fuel Industries, a leader in interactive entertainment, seeks a Cloud Operations Specialist.You will support our cloud infrastructure, ensuring stability and reliability while collaborating with de...Show more
Last updated: 26 days ago • Promoted
Senior Cloud & Microsoft Platform Engineer

Senior Cloud & Microsoft Platform Engineer

IC 360 Solutions • Ottawa
Full-time
A dynamic technology services company is seeking a Senior Systems Engineer to lead the design and deployment of cloud solutions, primarily within the Microsoft ecosystem. This role involves collabor...Show more
Last updated: 5 days ago • Promoted
Cloud DevOps Developer

Cloud DevOps Developer

March Networks • Ottawa, ON, Canada
Full-time
Cloud DevOps Developer – DevOps Team – Ottawa.March Networks is proud to be recognized as one of Ottawa’s Best Places to Work. March Networks is an established global leader in the...Show more
Last updated: 30+ days ago • Promoted
Senior Software Engineer - Go Cloud Networking (Hybrid)

Senior Software Engineer - Go Cloud Networking (Hybrid)

Illumio • Ottawa, ON, Canada
Full-time
A cybersecurity company is seeking a Senior Backend Software Engineer to enhance the Azure Firewall Management Program.This position requires expertise in Go / Golang and cloud environments like Az...Show more
Last updated: 23 days ago • Promoted
Relocate to Malta Azure Cloud Solution Architect (Consulting / Big 4)

Relocate to Malta Azure Cloud Solution Architect (Consulting / Big 4)

Black Pen Recruitment • Ottawa, ON, Canada
Full-time
Our client’s Microsoft Business Solutions team is a Microsoft Gold Partner and leader in Microsoft software implementations for medium to large organisations, providing their clients with the...Show more
Last updated: 30+ days ago • Promoted
Senior Cloud Operations Engineer

Senior Cloud Operations Engineer

Canada Mortgage and Housing Corporation • Toronto, Montreal (Administrative Region), Ottawa
Full-time
A national housing authority in Toronto is looking for a Cloud Support Engineer to oversee and maintain cloud infrastructure primarily within Azure. Responsibilities include managing access permissi...Show more
Last updated: 1 day ago • Promoted
Principal DevOps Engineer – Cloud, CI / CD & SRE Leader

Principal DevOps Engineer – Cloud, CI / CD & SRE Leader

Veem • Ottawa
Full-time
A technology company in Ontario is seeking a skilled Principal DevOps Engineer to lead their infrastructure strategy and automation processes. The ideal candidate will have over 8 years of expertise...Show more
Last updated: 19 days ago • Promoted
Senior Cloud Systems Engineer - Azure & Automation (Hybrid)

Senior Cloud Systems Engineer - Azure & Automation (Hybrid)

IC 360 Solutions • Ottawa
Full-time
A dynamic technology services company in Canada is seeking a Senior Systems Engineer to lead the design and deployment of cloud and IT solutions. You will work with clients to implement cutting-edge...Show more
Last updated: 5 days ago • Promoted
Senior Cloud Engineer – AWS, IaC & Security

Senior Cloud Engineer – AWS, IaC & Security

Tree Trust • Ottawa
Full-time
A prominent survey platform is looking for a Cloud Engineer to design and optimize AWS environments, ensuring security and reliability. The role requires 3-8 years of AWS hands-on experience and kno...Show more
Last updated: 24 days ago • Promoted
Site Reliability Engineer : Cloud, Kubernetes & AI

Site Reliability Engineer : Cloud, Kubernetes & AI

The Pythian Group • Ottawa
Full-time
A multinational technology company in Ottawa is seeking talented Site Reliability Engineers to join their next-generation engineering team. This role involves designing, deploying, and operating lar...Show more
Last updated: 5 days ago • Promoted