Staff Cloud DevOps/Site Reliability Engineer

Inworld AI
Canada
$170K-$220K a year
Full-time

Why Join Inworld

Inworld is the best-funded startup in AI and games with a $500 million valuation and backing from top-tier investors like Intel, Microsoft, Lightspeed, Bitkraft, Founders Fund, Kleiner Perkins, and more.

Inworld was recognized by CB Insights as one of the ten most promising AI companies in the US, ranking in the top two in both the Early Stage and Vertical AI categories among all companies worldwide in 2024.

Inworld is the leading AI engine for games, enabling developers to build groundbreaking game mechanics, dynamic NPCs and worlds that evolve with each action.

Inworld powers experiences built by Ubisoft, NVIDIA, Niantic, NetEase Games and LG, among others, and has partnerships with key industry players such as Microsoft / Xbox, Epic Games and Unity.

Our Technical Operations team manages the infrastructure, DevOps, and Site Reliability of our platform. We are looking for a Staff Cloud DevOps / Site Reliability Engineer to join our team.

Qualifications

  • Bachelor's degree in Computer Science, Engineering, or a related field
  • 7+ years of experience as a DevOps, Infrastructure, Operations, or Site Reliability Engineer (or as a software engineer with relevant experience).
  • At least 2 years experience each with :

Terraform

Helm

Kubernetes

AWS, Azure, or GCP

CI / CD using modern tools (GitOps)

Optional (not required but considered a plus) :

MLOps (building, orchestrating, and maintaining Machine Learning Pipelines)

Prometheus / Grafana

Multi-cloud deployments (2 or more)

ArgoCD

Network management and VPNs

Responsibilities

  • Infrastructure : Maintain and contribute to Infrastructure-as-Code (Terraform)
  • DevOps and CI / CD Pipelines : Orchestrate pipelines using Github Actions, Helm, ArgoCD
  • Microservices scalability : Kubernetes Administration
  • Cloud Administration
  • Site Reliability : Measure and monitor availability, latency, and overall service health, drive incident management and post-mortem analysis

In-office location : Vancouver, Canada.

Remote location : Canada.

The Canada base salary range for this full-time position is CAD $170,000 - $220,000. In addition to base pay, total compensation includes bonus, equity and benefits.

Within the range, individual pay is determined by work location and additional factors, including competencies and experience.

30+ days ago
Related jobs
Promoted
Mizuho Financial Group Inc.
Canada

Technical knowledge and experience in cloud architectures, hybrid cloud and cloud native solutions to leverage reliable designs in cloud to improve operational efficiencies. Site Reliability Engineer page is loaded. Join the Mizuho team as a Lead Site Reliability Engineer (SRE)!. Software Engineeri...

Promoted
Behavox Limited
Canada

As a Site Reliability Engineer you will be responsible for the availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning of all production systems and services. You will work together with other DevOps, Product and Engineering teams to d...

Promoted
Cribl, Inc.
Canada

We are looking for Cloud Site Reliability Engineers and Developers at all levels at Cribl, who enjoy being in the thick of it. Cribl Inc is seeking a Senior Site Reliability Engineer to join our mission to unlock the value of all observability data. You provide your creative input into all things Cl...

Promoted
Magic Labs
Canada

Cloud Engineering experience (DevOps/SRE) designing production systems in public cloud environments (AWS [preferred], GCP, or Azure). As a Staff Cloud Engineer, you will play a pivotal role in designing, implementing, and maintaining robust infrastructure solutions supporting all of Magic’s products...

Promoted
? Grafana Enterprise
Canada
Remote

We are looking for a Senior SRE to help us support our highest value Grafana Cloud customers by increasing the reliability of our Cloud databases that are based on Mimir, Loki, Tempo, and Pyroscope. SLOs or user experience for the customer (learn what is special about each of these customers, and mi...

Promoted
Practice Better
Canada

We are on a mission to build an industry-leading product on a strong foundation built by a world-class engineering, product, and design team! We seek an experienced and dedicated Senior Site Reliability Engineer (SRE) to join our growing Engineering, Product, Design, and Growth team. Senior Site Rel...

Promoted
Menlo Security, Inc.
Canada

As a Site Reliability Engineer, you'll join a group of experienced engineers located in the North America region who are part of a globally distributed team responsible for managing the company's core infrastructure services and maintaining our constantly growing platform. Be an advocate of Site Rel...

Promoted
Mosaec
Canada

In this role, you will not only support our customers using Atmosphere, our open-source cloud product, but also provide critical support to our internal CloudOps team that manages our public and private cloud infrastructure. We are seeking someone with a strong background in Linux and cloud technolo...

Jobber
Canada
Remote

Senior Site Reliability Engineer. As a part of our cloud infrastructure team (SRE), you'll play a critical role in empowering our product development teams, ensuring the safe release of software, and maintaining top-tier application performance and reliability. Our Software Engineering team is pivot...

Behavox
Canada

As a Site Reliability Engineer, you will be responsible for the availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning of all production systems and services. You will work together with other DevOps, Product, and Engineering teams to...