Talent.com
BetterCloud
Staff Software Developer, Search & Distributed SystemsBetterCloud • Toronto, Canada
No longer accepting applications
Staff Software Developer, Search & Distributed Systems

Staff Software Developer, Search & Distributed Systems

BetterCloud • Toronto, Canada
12 days ago
Job type
  • Full-time
Job description
The Role We are looking for a Staff Software Engineer who would thrive on being accountable for our Search infrastructure: its scalability, reliability, and data resiliency. We don't just need someone who knows how to write a complex query; we need a battle-scarred Distributed Systems expert who understands the deep internals of Elasticsearch and who has a deep toolbox for analyzing, monitoring, alerting, and quickly resolving critical issues as they arise.

You know exactly how Elasticsearch fails, why it fails under load, and how to architect a topology that prevents it. Because our search ecosystem doesn’t exist in a vacuum, you will also own the architectural connective tissue—ensuring our service layers and event-based ecosystem interact with Search flawlessly.

As a Staff Engineer, you will set the technical standard, drive systemic reliability, and mentor senior engineers across the organization.

What You Will Do

Architect for Scale:

Design, configure, and scale our Elasticsearch clusters. You will define our global strategies for shard routing, Index Lifecycle Management (ILM), heap tuning, and data tiering to support massive auction throughput.

Master the Failure Modes:

Anticipate and engineer away points of failure. You will design circuit breakers, implement backpressure mechanisms, and tune asymmetric timeouts to prevent retry storms between our BFFs, K8s services, and the Search layer.

Expert Troubleshooting & IR:

Act as the ultimate technical escalation point for complex, cross-system performance degradation. You will dive deep into JVM metrics, Garbage Collection pauses, K8s network bottlenecks, and slow logs to uncover and remediate root causes.

Holistic System Ownership:

Manage the entire data lifecycle. You will optimize the ingestion pipelines syncing our event datastreams driven by producers and consumers (Kafka) to Elasticsearch, ensuring eventual consistency and data integrity at scale.

Drive Engineering Excellence:

Draft authoritative architectural Blueprints, SOPs, and Runbooks. You will elevate the surrounding engineering culture by coaching teams on distributed systems design, observability best practices, and incident management.

Modernize & Innovate:

Scan the horizon for emerging technologies. You will help evaluate and integrate next-generation search capabilities (e.g., Vector Search, RAG architectures) to support our broader AI and machine learning initiatives.

What You Bring (Requirements)

Experience:

8+ years of software engineering experience, with at least 3+ years operating at a Senior or Staff level focusing on distributed systems and high-throughput platforms.

Elasticsearch Mastery:

Deep, authoritative knowledge of Elasticsearch internals. You have managed large-scale clusters and deeply understand mapping, analysis, query optimization, cluster state management, and split-brain mitigation.

Full-Stack Context:

Proficiency in the systems upstream and downstream of Search. You have hands-on experience with Kubernetes (EKS/GKE), API Gateway/BFF architectures, and event streams (Kafka).

Resilience Engineering:

A proven track record of implementing fault-tolerant patterns (retries, rate limiting, circuit breaking, dead letter queues) in microservice architectures.

Observability:

Expert-level ability to instrument systems and diagnose complex performance issues using modern observability stacks (Datadog, Prometheus, Grafana, OpenTelemetry).

Leadership:

Strong communication skills with a proven ability to influence cross-functional teams, build consensus around architectural decisions (the Knoster model!), and mentor mid-level and senior engineers.

Bonus Points

Experience with Infrastructure as Code (Terraform, Helm) for stateful applications.

Familiarity with FinOps practices, specifically optimizing Elasticsearch compute and storage costs.

Experience integrating AI-assisted development tools into your daily workflow.

#J-18808-Ljbffr
Create a job alert for this search

Staff Software Developer, Search & Distributed Systems • Toronto, Canada

Similar jobs

Staff Software Developer - ShareGate Backup

ShareGate GroupToronto, Ontario, Canada
Full-time

Staff Software Developer - ShareGate Backup ShareGate is the leading Microsoft 365 migration and governance platform, trusted by over 100,000 IT professionals for its unmatched simplicity.It offers... Show more

 • Promoted

Staff Software Developer Focused on Cutting-Edge Backup Features

WorkleapToronto, ON, CA
Full-time

Lead technical innovation by developing a vital backup feature as a Staff Developer.This role combines architectural vision with hands-on coding in a collaborative, mature development environment.Y... Show more

 • Promoted

Staff Software Engineer

Bank On LoopToronto, ON, CA
Full-time

We're a fast-growing fintech startup building the only truly cross-border banking platform.We help businesses seamlessly manage money around the world so they can access more markets, accelerate th... Show more

 • Promoted

Staff Software Developer, Full Stack (Provider Experience)

DialogueToronto, ON, CA
Full-time

Dialogue is the #1 virtual care provider in Canada.By developing our Integrated Health Platform, we provide exceptional online health and wellness programs (primary care, mental health, iCBT, EAP, ... Show more

 • Promoted

Guidewire Developer/Tech Lead - Delta System & Software, Inc.

Delta System & Software, Inc.newmarket, on, ca
Full-time

Job Title: Guidewire Tech Lead.Must have: Guidewire ACE Certification.Strong hands-on experience in PolicyCenter, BillingCenter, or ClaimCenter (v10.Lead and mentor a team of developers, driving de... Show more

 • Promoted

Staff Software Engineer

ConfluentToronto, ON, CA
Full-time

We’re not just building better tech.We’re rewriting how data moves and what the world can do with it.With Confluent, data doesn’t sit still.Our platform puts information in motion, streaming in nea... Show more

 • Promoted

Staff Software Developer - Communications

Jane AppToronto, ON, CA
Full-time

Hi, I’m Bryce Evans, and I’m a leader in the Booking organization at Jane.I’ve been here for about a year and a half, and I lead Engineering for the Communications group.Our team sits at the heart ... Show more

 • Promoted

Lead Principal Software Engineer - richmond hill

Auxo | Growth Partnerrichmond hill, on, ca
Full-time

SaaS products and platforms at scale.You’ll work closely with the Head of Product & Technology to shape technical direction across multiple teams.This is a high-impact IC role where you’ll influenc... Show more

 • Promoted

Staff Software Developer, Search & Distributed Systems

ACV AuctionsToronto, Ontario, Canada
Full-time

If you are looking for a career at a dynamic company with a people-first mindset and a deep culture of growth and autonomy, ACV is the right place for you! Competitive compensation packages and lea... Show more

 • Promoted

Staff Software Engineer — Search Platform, API & Infrastructure

Thomson ReutersToronto, Ontario, Canada
Full-time

Overview Advanced Content Engineering (ACE) is seeking a Staff Software Engineer to lead the design and delivery of the search platform’s control‑plane API and cloud infrastructure.The platform’s c... Show more

 • Promoted

Senior Software Engineer – Agentic AI Foundations

SocureGreater Toronto Area, Canada, Canada
Full-time

Socure is building the identity trust infrastructure for the digital economy — verifying 100% of good identities in real time and stopping fraud before it starts.The mission is big, the problems ar... Show more

 • Promoted

Staff Software Developer, Search & Distributed Systems

BetterCloudToronto, ON, CA
Full-time

We are looking for a Staff Software Engineer who would thrive on being accountable for our Search infrastructure: its scalability, reliability, and data resiliency.We don't just need someone who kn... Show more

 • Promoted

Senior API and Middleware Developer - Coforge

Coforgerichmond hill, on, ca
Full-time

Job Title/Role: API Middleware Developer.We at Coforge are seeking a API Middleware Developer with the following skillset:.As a Senior API & Middleware Developer for the Platform Engineering team, ... Show more

 • Promoted

Staff Engineer, Search Systems

MongoDBToronto, ON, CA
Full-time

Staff Engineer, Search Systems.MongoDB’s mission is to empower innovators to create, transform, and disrupt industries by unleashing the power of software and data.We enable organizations of all si... Show more

 • Promoted

Staff Software Developer - ShareGate Backup

WorkleapToronto, Ontario, Canada
Full-time

Staff Software Developer - ShareGate Backup.Workleap is a Montreal‑based tech company on a mission to make work simpler.Since 2006, we’ve been building game‑changing products that tackle HR and IT’... Show more

 • Promoted

Staff Software Developer - Communications

Jane.appToronto, ON, CA
Full-time

Staff Software Developer - Communications.Hi, I’m Bryce Evans, and I’m a leader in the Booking organization at Jane.I’ve been here for about a year and a half, and I lead Engineering for the Commun... Show more

 • Promoted

Staff Software Developer

Air-tekToronto, ON, CA
Full-time

Air-tek is a Canadian-based software company with a powerful suite of unique products that have already achieved a significant share of a huge global market.Our product‑market fit is strong, and cu... Show more

 • Promoted

Staff Software Engineer — Search Platform, Api & Infrastructure

Thomson ReutersToronto, Canada
Full-time

Overview Advanced Content Engineering (ACE) is seeking a Staff Software Engineer to lead the design and delivery of the search platform’s control‑plane API and cloud infrastructure.The platform’s c... Show more

 • Promoted

Software Engineer – Advanced Systems - Draganfly Inc.

Draganfly Inc.newmarket, on, ca
Full-time

Company”) has been a recognized technology leader within the commercial UAV space for over two decades.We helped establish the commercial market & adoption of multi-rotor helicopters for public saf... Show more

 • Promoted

Staff Software Engineer

Xtract One TechnologiesToronto, ON, CA
Full-time

We design and deliver advanced threat detection screening systems that help organizations protect people, property, and public spaces—without slowing them down.Our solutions combine cutting‑edge ha... Show more