Talent.com
Site Reliability Engineer
Site Reliability EngineerOmniscient Neurotechnology (o8t) • Toronto, Canada
Site Reliability Engineer

Site Reliability Engineer

Omniscient Neurotechnology (o8t) • Toronto, Canada
18 days ago
Job type
  • Full-time
Job description

Overview

Omniscient (o8t®) is the world leader in using AI to decode the human brain—a field known as connectomics. Our mission is to improve the lives of billions through connectomics. Today, Omniscient’s connectomic analysis platform, Quicktome®, generates personalized, patient-specific maps of an individual’s brain networks, or connectome. These critical insights inform prognosis and planning across neurologic conditions, from cranial surgery and neuro-oncology to stroke and beyond. Tomorrow, Omniscient is poised to revolutionize brain health and help conquer conditions such as Alzheimer’s disease and depression through truly personalized brain medicine.

Our products deliver these insights with enterprise-grade efficiency and usability, enabling broader access to vital subject-specific neurological insights.

Since founding, we have grown exponentially and achieved several world firsts with the development of the world’s first connectomic neurosurgical planning and visualization platform to be cleared by regulatory bodies. Omniscient recently expanded its product offering to include the first FDA-cleared neurological planning and visualization tool using resting-state fMRI, opening up new horizons for clinicians to assess brain connectivity and function in cases such as brain surgery, stroke, disorders of consciousness, and oncology.

With continued development, we intend to improve the lives of billions with both medical and non-medical products and services that drastically change how the human brain is understood, treated, and even enhanced.

About the role

As a global company, we have development and commercial interests which spanning several continents and availability zones. Consequently, ourSREteam is distributed between Australia and North America, providing support to the Delivery, Data Science and our Production Environments.

As a member of this small team, you will design, maintain, and improve our practices across development, staging and production environments while working closely with the delivery teams. You will be working with best-in-class technologies both OSS and close. We host applications and services on Kubernetes (EKS) by default and our cloud is composed of technologies including : Istio, Helm, ArgoCD, Argo Workflows, Cloudflare and Datadog. We run data science workflows on medical imaging datasets at scale to guide neurosurgical and neurological decision making. As such, the reliability and security of our environments is of very high importance.

We seek team members that are excited about learning new things, improving existing things and challenging themselves to mastery. When incidents do occur, you will serve at the front-line of our incident response : restoring availability, running post-mortems, and then working to develop monitoring solutions to be notified earlier as well as practices and technical solutions to avoid similar pitfalls in the future.

Responsibilities

Work closely with the application development team to improve upon our testing, release, and deployment processes.

Work with the field and tech support teams to ensure smooth customer onboarding and reliability of our cloud services, particularly in the North American timezone.

Build internal tools for debugging, performance analysis, compliance, monitoring and enforcement of code and security best practices.

Prepare for the worst : build and conduct experiments that explore performance and induce failure to see how our systems respond. Translate those learnings into updates to our platform and practices to achieve greater resiliency.

Design the future of our platform as we expand into new features, products or markets, we will need to take on new technologies and architectural patterns. We want an SRE who is excited about exploring trade-offs and opinionated about the ways in which we should grow in order to maintain a world-class platform.

Qualifications

KEY REQUIREMENTS :

Bachelor’s degree in Computer Science, Engineering or relevant STEM field

5+ years of experience directly in DevOps, SRE or a similar role

Minimum demonstrated 5+ years in production AWS.

Extensive knowledge of AWS cloud services (and / or Google Cloud) and infrastructure orchestration tools, e.g. Terraform, Helm

Demonstrated experience with Kubernetes management and tools, e.g. Argo CD, Istio

Demonstrated experience in networking concepts, e.g. DNS, routing, TCP and UDP protocols, AWS VPC

Experience with best practices for logging, monitoring, and alerting, e.g. Datadog

HIGHLY DESIRED :

Management of CI / CD workloads, e.g. Gitlab CI

Designing and maintaining infrastructure for security

Previous experience as a software engineer, e.g. NodeJS, Python, Go

Experience with computationally intensive workloads

Experience working in a regulated environment or with personally identifying information, e.g. Aerospace, Automotive, Banking

Perks & Benefits

Flexible and remote working - we value work-life balance

If you're seeking professional growth and enjoy working on large, distributed, cloud-based applications that change the world of brain care then apply now to be considered for the position!

#J-18808-Ljbffr

Create a job alert for this search

Site Reliability Engineer • Toronto, Canada

Similar jobs
Staff Site Reliability Engineer

Staff Site Reliability Engineer

ContactMonkey • Toronto, ON, Canada
Full-time
Hey there! We're ContactMonkey 👋.Our mission? To power measurable employee engagement worldwide.And we'd love for you to join us!. About the job - Staff Site Reliability Engineer.You are no...Show more
Last updated: 30+ days ago • Promoted
Site Reliability Engineer

Site Reliability Engineer

Humankind Global Recruitment • Toronto, Canada
Full-time
Linking exceptional talent with leading companies.Our client is a dynamic Information Technology services company that partners with leading global organizations to deliver innovative, high-quality...Show more
Last updated: 20 days ago • Promoted
Site Reliability Engineer

Site Reliability Engineer

Curve Dental • Toronto, Canada
Full-time
Site Reliability Engineer – Calgary, Alberta, Canada Join us at Curve Dental to apply for the Site Reliability Engineer role. Curve Dental provides award‑winning software to dental practices, enabli...Show more
Last updated: 20 days ago • Promoted
Site Reliability Engineer 3

Site Reliability Engineer 3

Behavox • Toronto, Canada
Full-time
About Behavox Behavox is shaping the future of how businesses harness their most important raw material - data.Our mission is bold : Organize enterprise data into actionable information that protect...Show more
Last updated: 20 days ago • Promoted
Site Reliability Engineer

Site Reliability Engineer

Dayforce US, Inc. • Toronto, Canada
Full-time
Posted Thursday, January 29, 2026 at 12 : 00 AM | Expires Sunday, March 1, 2026 at 11 : 59 PM.Dayforce is a global human capital management (HCM) company headquartered in Toronto, Ontario, and Minneapo...Show more
Last updated: 30+ days ago • Promoted
Site Reliability Engineer

Site Reliability Engineer

Tyk • Toronto, Canada
Full-time
About Tyk The Tyk API Management platform is helping to drive the connected world and power new products and services.We're changing the way that organisations connect any number of their systems a...Show more
Last updated: 20 days ago • Promoted
Site Reliability Engineer (SRE)

Site Reliability Engineer (SRE)

Accelerate Her Future® • Toronto C6A, ON, Canada
Full-time +1
Tangerine is Canada’s leading direct bank.We offer flexible and accessible banking options, innovative products, and award-winning Client service. The reason why Tangerine employees come to work eac...Show more
Last updated: 19 days ago • Promoted
Site Reliability Engineer

Site Reliability Engineer

Tecsys Inc. • Toronto, Canada
Permanent
Having recognized the advantages of remote work, including employee morale, productivity, reduced commuting on employee wellbeing and the environment, we are proud to be a digital-first company.The...Show more
Last updated: 30+ days ago • Promoted
Site Reliability Engineer

Site Reliability Engineer

Capgemini • Toronto, Canada
Full-time
Talent Acquisition Business Partner – Strategic Business Unit at Capgemini America Inc.Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d ...Show more
Last updated: 30+ days ago • Promoted
Senior Site Reliability / Infrastructure Platform Engineer

Senior Site Reliability / Infrastructure Platform Engineer

Nextologies Limited • Markham, ON, Canada
Full-time
Senior Site Reliability / Infrastructure Platform Engineer.Virtualization, distributed systems, Linux performance, and service reliability). Act as senior escalation point for service outages, platf...Show more
Last updated: 25 days ago • Promoted
Site Reliability Engineer

Site Reliability Engineer

Manulife Financial • Toronto, Canada
Full-time
We are seeking a motivated Site Reliability Engineer (SRE) to join the Manulife Bank Service Delivery Management (SDM) team. In this role, you will be responsible for ensuring the reliability, avail...Show more
Last updated: 1 day ago • Promoted
Site Reliability Engineer

Site Reliability Engineer

ALLTECH CONSULTING SVC INC • Toronto, Canada
Full-time
Job Description : Technology / Role / Department at our Company Enterprise Technology & Services (ETS) delivers shared technology services for the Firm supporting all business applications and end users...Show more
Last updated: 20 days ago • Promoted
Site Reliability Engineer

Site Reliability Engineer

Funded.club • Toronto, ON, Canada
Full-time
April 2016 and now with more than 70 million users.We believe that the internet was created so that people across the globe could have access to any type of information, no matter where they are.Ou...Show more
Last updated: 20 days ago • Promoted
Site Reliability Engineer

Site Reliability Engineer

Windscribe • Toronto, Canada
Full-time
April 2016 and now with more than 70 million users.We believe that the internet was created so that people across the globe could have access to any type of information, no matter where they are.Ou...Show more
Last updated: 8 days ago • Promoted
Site Reliability Engineer

Site Reliability Engineer

Tata Consultancy Services • Toronto, Canada
Full-time
Tata Consultancy Services (TCS) is an equal opportunity employer, and embraces diversity in race, nationality, ethnicity, gender, age, physical ability, neurodiversity, and sexual orientation, to c...Show more
Last updated: 15 days ago • Promoted
Site Reliability Engineer

Site Reliability Engineer

Blue Signal Search • Toronto, Canada
Full-time
Direct message the job poster from Blue Signal Search.Executive Recruiter at Blue Signal Search Site Reliability Engineer. Our client is a fast‑growing provider of AI‑driven edge‑computing platforms...Show more
Last updated: 20 days ago • Promoted
Site Reliability Engineer

Site Reliability Engineer

iManage • Toronto, ON, Canada
Full-time
SRE is part of a global organization that leverages the latest technology to communicate with our colleagues across the globe. We organize ourselves into distributed teams SRE teams are anchored ...Show more
Last updated: 30+ days ago • Promoted
Site Reliability Engineer

Site Reliability Engineer

66degrees • Toronto, Canada
Full-time
AI transformation partner that guides enterprises from complex business challenges to clear, quantifiable outcomes.Our company is the culmination of several successful firms, each a leader in its o...Show more
Last updated: 3 days ago • Promoted