Staff Cloud DevOps/Site Reliability Engineer

Inworld AI
British Columbia, Canada
170K $-220K $ / an
Temps plein

Why Join Inworld

Inworld is the best-funded startup in AI and games with a $500 million valuation and backing from top-tier investors like Intel, Microsoft, Lightspeed, Bitkraft, Founders Fund, Kleiner Perkins, and more.

Inworld was recognized by CB Insights as one of the ten most promising AI companies in the US, ranking in the top two in both the Early Stage and Vertical AI categories among all companies worldwide in 2024.

Inworld is the leading AI engine for games, enabling developers to build groundbreaking game mechanics, dynamic NPCs and worlds that evolve with each action.

Inworld powers experiences built by Ubisoft, NVIDIA, Niantic, NetEase Games and LG, among others, and has partnerships with key industry players such as Microsoft / Xbox, Epic Games and Unity.

Our Technical Operations team manages the infrastructure, DevOps, and Site Reliability of our platform. We are looking for a Staff Cloud DevOps / Site Reliability Engineer to join our team.

Qualifications

  • Bachelor's degree in Computer Science, Engineering, or a related field
  • 7+ years of experience as a DevOps, Infrastructure, Operations, or Site Reliability Engineer (or as a software engineer with relevant experience)

Experience with at least 2 years in :

  • Terraform
  • Helm
  • Kubernetes
  • AWS, Azure, or GCP
  • CI / CD using modern tools (GitOps)

Nice-to-Have :

  • MLOps (building, orchestrating, and maintaining Machine Learning Pipelines)
  • Prometheus / Grafana
  • Multi-cloud deployments (2 or more)
  • ArgoCD
  • Network management and VPNs

Responsibilities

  • Infrastructure : Maintain and contribute to Infrastructure-as-Code (Terraform)
  • DevOps and CI / CD Pipelines : Orchestrate pipelines using Github Actions, Helm, ArgoCD
  • Microservices scalability : Kubernetes Administration
  • Cloud Administration
  • Site Reliability : Measure and monitor availability, latency, and overall service health, drive incident management and post-mortem analysis

Work Location : British Columbia, Canada.

The base salary range for this full-time position is CAD $170,000 - $220,000. In addition to base pay, total compensation includes bonus, equity and benefits.

Within the range, individual pay is determined by work location and additional factors, including competencies and experience.

Il y a 16 jours
Emplois reliés
Offre sponsorisée
Sigmaways Inc
Vancouver, Colombie-Britannique

We're seeking a Site Reliability Engineer to join our team with expertise in. Years experience in Site reliability space. ...

Offre sponsorisée
Arista Networks
Vancouver, Colombie-Britannique

Arista Networks is an industry leader in data-driven, client-to-cloud networking for large data center, campus and routing environments. We leverage the latest advancements in cloud computing, artificial intelligence, and software-defined networking to provide our clients with a competitive edge in ...

CIRCLE
Vancouver, Colombie-Britannique

As a Senior Site Reliability Engineer at Circle, you will design, build, and maintain Circle’s infrastructure estate to meet the growing worldwide customer base on public cloud providers across multiple regions. Staff Site Reliability Engineer (IV). Staff Site Reliability Engineer (IV). Staff Site R...

Offre sponsorisée
T-Net British Columbia
Vancouver, Colombie-Britannique

Intermediate/Senior DevOps, Site Reliability (Linux). In this DevOps/SRE role, you will be responsible for the reliability and smooth operation of your service in both production and test environments. For over 20 years, Global Relay has set the standard in enterprise information archiving with indu...

Offre sponsorisée
Senioren-Residenz Bertram GmbH
Canada

LogiSense is seeking a Senior Cloud Systems Administrator/DevOps Engineer to join our team and work on fascinating technology, while enjoying a work-from-home environment with an outstanding group of people who are solving new challenges around monetizing IoT, Communications and XaaS products and se...

Leica Geosystems
Canada

Senior DevOps Engineer / Site Reliability. DevOps &/or Site Reliability Engineering principles. Senior DevOps Engineer / Site Reliability | Hexagon Geosystems. As a Senior DevOps/SRE Engineer, you will help build solutions that allow our cloud-based platform, HxDR, to continue to evolve and grow thr...

Electronic Arts
Vancouver, Colombie-Britannique

You will build and operate distributed, large-scale, cloud-based infrastructure using modern open-source software solutions. Cloud Computing (AWS preferred), Linux server support, Server Administration, Systems Administration. ...

BMO
Canada, Canada

Migrates existing applications to the cloud, and modifies existing applications for the cloud, or builds new cloud-native applications. Focuses on the technical design, development, enhancement, testing, debugging and maintenance of Cloud applications and supports the design of business processes in...

circle
Burnaby, Colombie-Britannique

As a Senior Site Reliability Engineer at Circle, you will design, build, and maintain Circle's infrastructure estate to meet the growing worldwide customer base on public cloud providers across multiple regions. Staff Site Reliability Engineer (IV). Staff Site Reliability Engineer (IV). Senior Site ...

Arista Networks
Vancouver, Colombie-Britannique

SREs at Arista combine strong software and systems engineering with a passion for operating production systems at scale. As an SRE you’ll be responsible for our global CloudVision service fleet. Driving infrastructure and cloud-based application security design. Arista’s CloudVision is an enterprise...