Search jobs > Toronto, ON > Site reliability engineer

Site Reliability Engineer 3

Behavox
Toronto
$124K-$186K a year (estimated)
Full-time

About the Role

The Behavox Platform is a scalable, fault-tolerant and highly performant storage and processing system which allows us to manage and analyze massive volumes of data.

We have an extensive and flexible set of APIs to develop products that allow our clients to work through millions of data items, by searching, filtering, and visualizing relationships between entities in the system.

As a Site Reliability Engineer, you will be responsible for the availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning of all production systems and services.

You will work together with other DevOps, Product, and Engineering teams to design and implement SRE practice at Behavox to build foundational infrastructure allowing to support the rapid growth of the Behavox client base.

This is an incredible opportunity to discover the world of high-load data processing and face the challenges of distributed Big Data systems.

It will also provide you the opportunity to :

1. Work with high-load and business-critical services that will have a big impact on the company

2. Implement your ideas in an environment that strives for continuous improvement

3. Be part of a fast-growing dynamic company and with modern technologies

More information about the tools and solutions used at Behavox can be found on our engineering blog https : / / blog.behavox.engineering

What You'll Bring

  • A deep and genuine interest in Behavox as demonstrated by a connection to its mission, marketplace and / or technologies
  • 5+ years of experience as an SRE / DevOps engineer responsible for deployment and maintenance of production systems
  • Experience with Public Clouds (GCP / AWS). Knowledge of Google cloud Dataflow, Cloud Functions, Pub / Sub, or similar AWS technologies would be a plus
  • Automation skills - SaltStack or equivalent tools(Ansible), knowledge of programming languages (Python, Golang, Java)
  • Experience with Hashicorp stack : Terraform, Nomad, Consul, Vault

What You'll Do

  • Perform deployment and maintenance of high-load and large-scale distributed storage and data processing systems in Public clouds
  • Monitor, develop, and troubleshoot applications to resolve issues, lead incident support, be part of the on-call team
  • Automate routine operations using Python / Golang
  • Maintain cloud-based services in the public cloud providers (GCP / AWS)
  • Administer and troubleshoot Linux operating systems and networks

What We Offer

  • A truly global mission with a passionate community in locations all over the world
  • Huge impact and learning potential as our aspirations require bold innovation
  • Highly competitive compensation with 100% bonus pay already integrated
  • Benefits include great health coverage for employee and family
  • Generous time-off policy and flexible work schedule
  • 30+ days ago
Related jobs
circle
Toronto, Ontario

As a Senior Site Reliability Engineer at Circle, you will design, build, and maintain Circle's infrastructure estate to meet the growing worldwide customer base on public cloud providers across multiple regions. Staff Site Reliability Engineer (IV). Senior Site Reliability Engineer (III). Senior Sit...

Index Exchange
Toronto, Ontario

We are seeking an experienced Staff Engineer with a strong background in Site Reliability Engineering (SRE) to own and develop on-premise and hybrid cloud environments, with a focus on optimizing performance low-latency on Kubernetes platforms supporting a robust developer experience framework. As w...

Bourse de Montreal Inc.
Toronto, Ontario

Previous experience as a Site Reliability Engineer (SRE). The Devops Engineering team is responsible for working closely with various business units and stakeholders to solve complex problems using innovative solutions, quickly and effectively using agile, lean and devops methodologies, while ensuri...

0000050007 Royal Bank of Canada
Toronto, Ontario

As a Senior Site Reliability Engineer on the Client360 Advisor Platform team you will be responsible for monitoring, deploying, and maintaining applications built on the Salesforce platform & applications used to integrate Salesforce with other RBC systems. Agile Methodology, Application Infrastruct...

Scotiabank
Toronto, Ontario

We are looking for a developer to join our Digital Engineering Operations. Develop software following sound software engineering principles and lead investigations for production issues and come up with solutions that meet security standards defined by the organization. If you require accommodation ...

Thomson Reuters
Toronto, Ontario

Thomson Reuters is seeking a Senior Site Reliability Engineer to join our Service Management, Technology team. In this opportunity as Senior Site Reliability Engineer, you will:. You're a fit for the role of Senior Site Reliability Engineer if your background includes:. DevOps Engineer, Cloud Engine...

Etraveli Group
Toronto, Ontario

As a Senior member of the Site Reliability Engineering Team working closely with Software Engineers, you will be helping them to deploy and operate various production systems and processes. ...

Jobber
Canada
Remote

Senior Site Reliability Engineer. Our Software Engineering team is pivotal to Jobber's success, creating software that adds value to tens of thousands of users worldwide. As a part of our cloud infrastructure team (SRE), you'll play a critical role in empowering our product development teams, ensuri...

Broadridge
Toronto, Ontario

Broadridge is growing! We are seeking a Site Reliability Engineer Lead to join our team in Toronto. Broadridge associates helped us envision our Connected Workplace - a work model that allows associates around the globe, dependent upon their role responsibilities, take advantage of the benefits of b...

Criteo
Toronto, Ontario

As a Site Reliability Engineer, you’ll work closely with product engineering to improve the reliability of our apps, systems and pipelines and assess where optimization is needed most. The concept of Product Reliability Engineering (PRE) was born from an industry leading online SRE book (go ahead, “...