Staff Cloud DevOps/Site Reliability Engineer

Inworld AI
Canada
170K $-220K $ / an
Temps plein

Why Join Inworld

Inworld is the best-funded startup in AI and games with a $500 million valuation and backing from top-tier investors like Intel, Microsoft, Lightspeed, Bitkraft, Founders Fund, Kleiner Perkins, and more.

Inworld was recognized by CB Insights as one of the ten most promising AI companies in the US, ranking in the top two in both the Early Stage and Vertical AI categories among all companies worldwide in 2024.

Inworld is the leading AI engine for games, enabling developers to build groundbreaking game mechanics, dynamic NPCs and worlds that evolve with each action.

Inworld powers experiences built by Ubisoft, NVIDIA, Niantic, NetEase Games and LG, among others, and has partnerships with key industry players such as Microsoft / Xbox, Epic Games and Unity.

Our Technical Operations team manages the infrastructure, DevOps, and Site Reliability of our platform. We are looking for a Staff Cloud DevOps / Site Reliability Engineer to join our team.

Qualifications

  • Bachelor's degree in Computer Science, Engineering, or a related field
  • 7+ years of experience as a DevOps, Infrastructure, Operations, or Site Reliability Engineer (or as a software engineer with relevant experience).
  • At least 2 years experience each with :

Terraform

Helm

Kubernetes

AWS, Azure, or GCP

CI / CD using modern tools (GitOps)

Optional (not required but considered a plus) :

MLOps (building, orchestrating, and maintaining Machine Learning Pipelines)

Prometheus / Grafana

Multi-cloud deployments (2 or more)

ArgoCD

Network management and VPNs

Responsibilities

  • Infrastructure : Maintain and contribute to Infrastructure-as-Code (Terraform)
  • DevOps and CI / CD Pipelines : Orchestrate pipelines using Github Actions, Helm, ArgoCD
  • Microservices scalability : Kubernetes Administration
  • Cloud Administration
  • Site Reliability : Measure and monitor availability, latency, and overall service health, drive incident management and post-mortem analysis

In-office location : Vancouver, Canada.

Remote location : Canada.

The Canada base salary range for this full-time position is CAD $170,000 - $220,000. In addition to base pay, total compensation includes bonus, equity and benefits.

Within the range, individual pay is determined by work location and additional factors, including competencies and experience.

Il y a plus de 30 jours
Emplois reliés
Offre sponsorisée
Inworld AI
Canada

We are looking for a Staff Cloud DevOps/Site Reliability Engineer to join our team. DevOps, Infrastructure, Operations, or Site Reliability Engineer (or as a software engineer with relevant experience). Our Technical Operations team manages the infrastructure, DevOps, and Site Reliability of our pla...

Offre sponsorisée
Understanding Recruitment
Canada

We are seeking a fully remote Staff DevOps Engineer a Series B RiskOps scale-up building AI into underwriting predictions, extracting data from hundreds of sources to revolutionise the underwriting process. You’d join them to build out and maintain their GCP (the team are using some of the latest Go...

GlossGenius
Canada
Télétravail

Production Engineer, Cloud Engineer, Site Reliability Engineer, or DevOps equivalent roles. In this role, you'll have the opportunity to join GlossGenius as one of the first Senior Site Reliability Engineer as part of the Platform Engineering team. As a Site Reliability Engineer, you will play a key...

Unreal Gigs
CA
Télétravail

We are seeking an experienced Site Reliability Engineer (SRE) who is passionate about leveraging data and automation to optimize a highly dynamic infrastructure. Provide platform support to engineering teams, leveraging data insights to drive decision-making. Collaborate with engineering to redefine...

Life360
Remote, Canada, US
Télétravail

As an SRE on the Location Engineering group you will help build and operate scalable services powering Life360 product. Our cloud team ensures that our API's are able to process hundreds of thousands of requests a second with the ability to scale 10x. Engage with product and engineering teams to des...

Deloitte
Canada, Canada

Review application architecture reviews to recommend improvements for better reliability and application performance. Understanding of the Reliability and configuration management principles. Good knowledge in Cloud Migrations. Developer, Instrumentation, Database, SQL, Sharepoint, Technology, Engin...

Jobber
Canada
Télétravail

Senior Site Reliability Engineer. As a part of our cloud infrastructure team (SRE), you'll play a critical role in empowering our product development teams, ensuring the safe release of software, and maintaining top-tier application performance and reliability. Our Software Engineering team is pivot...

Deloitte
Canada, Canada

Providing recommended improvements to the team to improve the reliability and performance of applications. Presenting analyses and recommendations to team or discussing the technical merits of solutions with engineers and architects. Owning the day-to-day health, uptime, monitoring, and reliability ...

Behavox
Canada

As a Site Reliability Engineer you will be responsible for the availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning of all production systems and services. You will work together with other DevOps, Product and Engineering teams to d...

Behavox
Canada

As a Site Reliability Engineer, you will be responsible for the availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning of all production systems and services. You will work together with other DevOps, Product, and Engineering teams to...