Talent.com
Platform Engineer (Cloud Reliability Engineer)
Platform Engineer (Cloud Reliability Engineer)Nanometrics Inc. • Ottawa, ON, CA
Platform Engineer (Cloud Reliability Engineer)

Platform Engineer (Cloud Reliability Engineer)

Nanometrics Inc. • Ottawa, ON, CA
3 days ago
Job type
  • Full-time
Job description

Platform Engineer (Cloud Reliability Engineer)

Reports to : Director, Global Operations

Based in Ottawa, ON

Term : Full Time

About Nanometrics

With 40 years of seismic technology and industry application experience, we are a global, award‑winning company providing monitoring solutions and equipment for studying artificial and natural seismicity. From mission‑critical seismic arrays, tsunami and early‑earthquake warning systems in over 90 countries across the globe to induce seismicity monitoring in the energy sector. We specialize in full‑service, integrated solutions for studying artificial and natural seismicity, including turnkey seismic networks, industry‑leading precision instrumentation, complete data processing, analysis services, and software applications.

At Nanometrics, we take pride in fostering a culture of innovation, collaboration, and excellence. We are passionate about making a global impact through cutting‑edge technology while staying rooted in values of intentional innovation, trust, ethics, and stability.

About the role

This is an exciting opportunity for a motivated and experienced Platform Engineer to evolve, enhance and lead the technological footprint of our Seismic Monitoring Services portfolio. Nanometrics provides a top tier portfolio of tools and services which is supported by a continuously evolving cloud based platform.

The Platform Engineer / Cloud Reliability Engineer ensures the reliability, performance, and operational excellence of cloud‑hosted seismic monitoring and data processing services. This role blends software engineering, cloud infrastructure management, and SRE practices to build resilient systems, reduce manual toil through automation, and improve observability across AWS and Kubernetes ecosystems.

The successful candidate will use Terraform or similar Infrastructure-as-Code technologies (Pulumi, AWS CDK, CloudFormation, OpenTofu) to deliver consistent, automated, scalable infrastructure.

Responsibilities

Cloud Reliability & Resilience

  • Ensure uptime, performance, and reliability of AWS-hosted services and Kubernetes workloads
  • Implement self-healing patterns, automated rollbacks, health checks, and safe-deployment strategies
  • Participate in on-call rotation and lead first-response triage for cloud and platform incidents
  • Build and maintain service-level indicators (SLIs) and service-level objectives (SLOs)

Automation & Infrastructure Engineering

  • Develop automation for cloud operations using Python, Bash, and IaC (Terraform)
  • Reduce operational toil through automated runbooks, event-driven remediation, and system orchestration
  • Improve deployment reliability in collaboration with Platform Engineering and R&D teams
  • Implement and refine configuration standards, CI / CD hygiene, and environment stability
  • Observability & Operational Intelligence

  • Maintain and extend observability stack (Prometheus, Grafana, InfluxDB, OpenTelemetry)
  • Tune alerts for accuracy, reduce noise, and implement actionable alerting tied to SLOs
  • Analyze logs, metrics, and traces to detect reliability issues and validate system behavior
  • Build dashboards that provide real‑time visibility into system health and reliability trends
  • Operational Excellence

  • Support release processes, platform upgrades, and cloud infrastructure changes
  • Conduct root‑cause analysis and drive post‑incident corrective actions
  • Maintain operational documentation, runbooks, and environment validation workflows
  • Collaborate cross‑functionally with NetOps, Platform Engineering, Field Ops, and R&D
  • Requirements

    Education and Experience

  • Bachelor's degree or higher in Software Engineering, Computer Science, or related field.
  • 7+ years experience in software development
  • 3+ years hands‑experience working with cloud providers like AWS, etc and cloud‑native technologies like Kubernetes, Helm, etc. and related technologies including observability platforms.
  • Experience with database operations (MySQL, PostgreSQL, MongoDB, Redis) in cloud and on‑prem environments.
  • Cloud & Infrastructure

  • Strong experience with AWS (EC2, S3, IAM, VPC, EKS / ECS, CloudWatch)
  • Solid understanding of Kubernetes , Helm charts, and container orchestration
  • Familiarity with hybrid cloud environments (cloud + on‑prem integration)
  • Infrastructure as Code & Automation

  • Hands‑on experience with Terraform
  • Scripting skills in Python and Bash
  • Ability to build automated workflows and cloud operations tooling
  • CI / CD & Deployment Engineering

  • Experience with deployment pipelines (Jenkins, Bitbucket Pipelines, ArgoCD)
  • Familiarity with GitOps workflows
  • Understanding of build systems (Maven, Gradle)
  • Monitoring & Observability

  • Experience with monitoring / metrics / logging tools such as Prometheus, Grafana, InfluxDB
  • Familiarity with OpenTelemetry for distributed tracing
  • Ability to diagnose performance issues in distributed systems
  • Reliability Engineering Concepts

  • Knowledge of SLOs / SLIs / error budgets
  • Incident management principles
  • Understanding of resilience patterns (retry, circuit breakers, autoscaling, etc.)
  • Why Nanometrics?

  • We are a global leader in seismic solutions and a Canada's Best Managed Companies Platinum member.
  • We value sustainable growth that benefits our employees, our community, and the environment.
  • Maximize your productivity with our flexible hybrid work model. Our centrally located office space offers a stimulating environment for collaboration and focused work. Plus, enjoy a convenient commute with easy access to biking paths and public transportation.
  • Engage in virtual and onsite social events centered around collaboration, learning, and fun, including volunteer events, celebrations, and team-building activities.
  • Our comprehensive group benefits program includes RRSP matching, health / dental benefits, a corporate bonus program, education assistance, and a health spending account.
  • Our Employee Assistance Program (EAP) provides services and support for health, work‑life solutions, legal guidance, financial resources, wellness tools, and more.
  • Enjoy a competitive leave program, including a holiday shutdown (December 25 to January 1).
  • Grow your career with learning and development opportunities.
  • Collaborate with high-performing teams and some of the industry's top minds.
  • Nanometrics is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. Should you require accommodation as part of the recruitment and selection process, please reach out to careers@nanometrics.ca

    #J-18808-Ljbffr

    Create a job alert for this search

    Reliability Engineer • Ottawa, ON, CA

    Similar jobs
    Senior Linux & Cloud Platform Engineer

    Senior Linux & Cloud Platform Engineer

    Barracuda Networks • Ottawa H2B, ON, Canada
    Full-time
    A leading cybersecurity company is looking for a Senior Software Engineer in Ottawa, Canada.You will develop and maintain the Operating System platform, collaborate with development teams, and trou...Show more
    Last updated: 30+ days ago • Promoted
    Equity-Eligible Principal DevOps Lead (Cloud, CI / CD & SRE)

    Equity-Eligible Principal DevOps Lead (Cloud, CI / CD & SRE)

    Veem • Ottawa H2B, ON, Canada
    Full-time
    A technology company in Ontario is seeking a skilled Principal DevOps Engineer to lead their infrastructure strategy and automation processes. This key role focuses on architecting CI / CD pipelines a...Show more
    Last updated: 3 days ago • Promoted
    Hybrid Cloud DevOps Engineer – CI / CD & Automation Lead

    Hybrid Cloud DevOps Engineer – CI / CD & Automation Lead

    TULLOCH • Ottawa
    Full-time
    A leading consulting engineering firm based in Ottawa seeks a DevOps Engineer to drive automation, scalability, and efficiency across software development and IT operations.This role involves desig...Show more
    Last updated: 30+ days ago • Promoted
    Platform Engineer (Cloud Reliability Engineer)

    Platform Engineer (Cloud Reliability Engineer)

    Nanometrics Inc. • Ottawa
    Full-time
    Platform Engineer (Cloud Reliability Engineer).Reports to : Director, Global Operations.With 40 years of seismic technology and industry application experience, we are a global, award‑winning compan...Show more
    Last updated: 4 days ago • Promoted
    Cloud DevOps Engineer

    Cloud DevOps Engineer

    Targeted Talent • Ottawa, ON, Canada
    Permanent
    We are looking for an experienced.This is a permanent position that is remote to start with later relocation to.Our client is a global enterprise company with a product that you've likely used....Show more
    Last updated: 30+ days ago • Promoted
    Senior Cloud Platform Developer

    Senior Cloud Platform Developer

    Telesat Corporation • Ottawa
    Full-time
    Telesat (Nasdaq and TSX : TSAT) is a leading global satellite operator, providing reliable and secure satellite-delivered communications solutions worldwide to broadcast, telecommunications, corpora...Show more
    Last updated: 30+ days ago • Promoted
    Cloud Network Engineer

    Cloud Network Engineer

    Paymentology • Ottawa
    Full-time
    As the first truly global issuer-processor, we give banks and fintechs the technology and talent to launch and manage Mastercard and Visa cards at scale—across more than 60 countries.Our advanced, ...Show more
    Last updated: 11 days ago • Promoted
    Senior Platform Engineer – Cloud Orchestration & Analytics

    Senior Platform Engineer – Cloud Orchestration & Analytics

    Wind River • Ottawa
    Full-time
    A global software development firm in Ottawa is seeking a Senior Engineer to develop distributed cloud-based orchestration and automation platform solutions. The ideal candidate should have over 5 y...Show more
    Last updated: 11 days ago • Promoted
    Azure Cloud Engineer

    Azure Cloud Engineer

    RedMane Technology LLC • Ottawa
    Full-time
    RedMane Technology Canada is an application software consulting and systems integration company in British Columbia and Ottawa, Canada. We deliver software solutions for our clients throughout Canad...Show more
    Last updated: 30+ days ago • Promoted
    Cloud Operations Engineer : 24 / 7 Reliability & Automation

    Cloud Operations Engineer : 24 / 7 Reliability & Automation

    Fuel Industries • Ottawa
    Full-time
    Fuel Industries, a leader in interactive entertainment, seeks a Cloud Operations Specialist.You will support our cloud infrastructure, ensuring stability and reliability while collaborating with de...Show more
    Last updated: 30+ days ago • Promoted
    Solutions Architect (Cloud, Software & Enterprise IT)

    Solutions Architect (Cloud, Software & Enterprise IT)

    TULLOCH • Ottawa
    Full-time
    We want to build an organization where everyone loves their job and their leaders care for them.Over the last 30 years, TULLOCH has built a robust multi-disciplinary consulting engineering firm rec...Show more
    Last updated: 30+ days ago • Promoted
    Senior SRE : Cloud Reliability & Scale (Remote)

    Senior SRE : Cloud Reliability & Scale (Remote)

    Veeva Systems • Ottawa H2B, ON, Canada
    Remote
    Full-time
    A leading life sciences technology company is looking for a Senior Software Engineer - SRE to join its Vault Platform team in Ottawa. In this role, you will ensure the scalability and reliability of...Show more
    Last updated: 30+ days ago • Promoted
    Cloud Platform Engineer — Edge & OpenStack (Hybrid)

    Cloud Platform Engineer — Edge & OpenStack (Hybrid)

    Wind River Systems • Ottawa
    Full-time
    A leading software solutions company in Ottawa seeks a software engineer with expertise in Linux processes and Kubernetes applications. The successful candidate will develop high-quality software an...Show more
    Last updated: 30+ days ago • Promoted
    Lead Kafka & Streaming Platform Engineer

    Lead Kafka & Streaming Platform Engineer

    Telesat Corporation • Ottawa
    Full-time
    A leading global satellite operator is seeking a skilled Kafka Expert based in Ottawa, Canada, to join their data platform team. The role involves designing, deploying, and optimizing Kafka-based da...Show more
    Last updated: 30+ days ago • Promoted
    Senior Cloud & Microsoft Platform Engineer

    Senior Cloud & Microsoft Platform Engineer

    IC 360 Solutions • Ottawa
    Full-time
    A dynamic technology services company is seeking a Senior Systems Engineer to lead the design and deployment of cloud solutions, primarily within the Microsoft ecosystem. This role involves collabor...Show more
    Last updated: 11 days ago • Promoted
    Senior Cloud Platform Engineer – Multi-Tenant, Global Scale

    Senior Cloud Platform Engineer – Multi-Tenant, Global Scale

    March Networks • Ottawa
    Full-time
    A leading technology company in Ottawa is seeking a Senior Cloud Platform Software Developer to architect, build, and evolve their cloud-native platform. This role includes leading design reviews, i...Show more
    Last updated: 1 day ago • Promoted
    Senior Cloud Platform Developer

    Senior Cloud Platform Developer

    Telesat • Ottawa
    Full-time
    Senior Cloud Platform Developer – Telesat.We are seeking a highly skilled.Design, deploy, and manage Apache Kafka clusters across development, testing, and production environments.Deploy and manage...Show more
    Last updated: 30+ days ago • Promoted
    Senior Cloud Platform Engineer - Kubernetes & IaC

    Senior Cloud Platform Engineer - Kubernetes & IaC

    March Networks Corporation • Ottawa
    Full-time
    A leading tech firm in video surveillance seeks a Senior Cloud Platform Software Developer to architect and evolve their cloud-native platform. This role involves leading the design of distributed s...Show more
    Last updated: 3 days ago • Promoted