Talent.com
P2P
Senior Site Reliability Engineer - Payward ServicesP2P • Toronto, Canada
No longer accepting applications
Senior Site Reliability Engineer - Payward Services

Senior Site Reliability Engineer - Payward Services

P2P • Toronto, Canada
10 days ago
Job type
  • Full-time
Job description
Location United Kingdom; Brazil; Canada; Cyprus; Czech Republic; Ireland; Lithuania; Mexico; Poland; Portugal; Romania; Spain; Switzerland; United Arab Emirates

Employment Type Full time

Location Type Remote

Department Engineering SRE / DevOps

Building the Future of Crypto Our Krakenites are a world‑class team with crypto conviction, united by our desire to discover and unlock the potential of crypto and blockchain technology.

What makes us different? Kraken is a mission‑focused company rooted in crypto values. As a Krakenite, you’ll join us on our mission to accelerate the global adoption of crypto, so that everyone can achieve financial freedom and inclusion. For over a decade, Kraken’s focus on our mission and crypto ethos has attracted many of the most talented crypto experts in the world.

Our remote team has Krakenites in 70+ countries who speak over 50 languages. Krakenites are industry pioneers who develop premium crypto products for experienced traders, institutions, and newcomers to the space. Kraken is committed to industry‑leading security, crypto education, and world‑class client support through our products like Kraken Pro, Desktop, Wallet, and Kraken Futures.

Become a Krakenite and build the future of crypto!

Proof of work The team This role is fully remote, with a strong preference for candidates in EU timezones. The Payward Services (PWS) business unit powers Kraken's B2B and institutional product suite, serving external partners and institutional clients under contractual SLAs.

As a Senior SRE, you will partner with PWS development and operations teams to manage infrastructure, improve CI/CD pipelines, and support operational excellence. You will bring expertise in infrastructure, monitoring, and automation to ensure performant, resilient, and continuously improving services.

The opportunity

Manage and support infrastructure for Payward Services, including Nomad, Kubernetes, databases, and 3rd party system integration

Provide operational support across multiple teams, helping debug issues in staging and production environments

Participate in incident response and post‑incident reviews to improve system resilience

Consult with teams on performance, monitoring, and alerting best practices — with awareness of partner‑facing SLA commitments

Build tooling, automation, and dashboards to improve observability and empower development teams

Maintain and troubleshoot CI pipelines, ensuring reliable and fast build, test, and deployment cycles

Collaborate with developers, QA, and product managers to streamline development and release cycles

Support a fully distributed team operating across multiple timezones

Skills you should HODL

5+ years in DevOps or SRE role

Proficiency with hybrid‑cloud infrastructure environments

Git source version‑control and CI/CD configuration proficiency

Deep understanding of monitoring and alerting systems, preferably Prometheus and Grafana

Ability to debug complex distributed systems, networks, and Linux operating systems issues

Containerization and orchestration experience (Docker, Nomad, Kubernetes a plus)

Strong scripting skills (Bash, Python, or Go)

Self‑starter capable of thriving independently and remotely in fast‑paced environments

Nice to haves

Background working with distributed systems and technologies (Kafka, gRPC, Redis, etc.)

Experience operating services with external SLAs or in a B2B/enterprise context

Experience with benchmarking, performance tuning, and identifying system bottlenecks

Proficiency with databases (SQL and NoSQL) and production operations experience

Interest in lower‑level programming languages such as Rust

Experience integrating with APIs (GitLab, Jira, Slack)

Unless a specific application deadline is stated in the job posting, applications are accepted on an ongoing basis.

Please note, applicants are permitted to redact or remove information on their resume that identifies age, date of birth, or dates of attendance at or graduation from an educational institution.

We consider qualified applicants with criminal histories for employment on our team, assessing candidates in a manner consistent with the requirements of the San Francisco Fair Chance Ordinance.

As an equal opportunity employer, we don’t tolerate discrimination or harassment of any kind. Whether that’s based on race, ethnicity, age, gender identity, citizenship, religion, sexual orientation, disability, pregnancy, veteran status or any other protected characteristic as outlined by federal, state or local laws.

#J-18808-Ljbffr
Create a job alert for this search

Senior Site Reliability Engineer - Payward Services • Toronto, Canada

Similar jobs

Site Reliability Engineer - toronto

E-ITtoronto, on, ca
Full-time

Incident Management and Reliability:.Lead the incident management process, ensuring high availability and performance of the applications.Develop and implement SRE practices to improve system relia... Show more

 • Promoted

Site Reliability Engineer

CapgeminiToronto, ON, CA
Full-time

Talent Acquisition Business Partner – Strategic Business Unit at Capgemini America Inc.Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d ... Show more

 • Promoted

Senior Site Reliability Engineer in Crypto

P2PToronto, ON, CA
Full-time

Join Kraken as a Senior Site Reliability Engineer, contributing to innovative crypto solutions from anywhere in the world.This remote role emphasizes managing infrastructure and enhancing CI/CD pro... Show more

 • Promoted

Site Reliability Engineer

Tecsys Inc.Toronto, ON, CA
Permanent

Having recognized the advantages of remote work, including employee morale, productivity, reduced commuting on employee wellbeing and the environment, we are proud to be a digital-first company.The... Show more

 • Promoted

Site Reliability Engineer

KyndrylToronto, ON, CA
Full-time +1

Join to apply for the Site Reliability Engineer role at Kyndryl.Direct message the job poster from Kyndryl.Recruitment & Strategic Staffing @Kyndryl | Partnering with IT Consultants in Financial Se... Show more

 • Promoted

Senior Site Reliability Engineer I

InstacartToronto, ON, CA
Permanent

Join our team as a Senior Site Reliability Engineer II, where your expertise will play a crucial role in maintaining the backbone of our platform's operations.You'll take on challenges directly, en... Show more

 • Promoted

Site Reliability Engineer

TELUS DigitalToronto, ON, CA
Full-time

Welcome to TELUS Digital — where innovation drives impact at a global scale.As an award-winning digital product consultancy and the digital division of TELUS, one of Canada’s largest telecommunicat... Show more

 • Promoted

Senior Site Reliability Engineer

SimCorpToronto, ON, CA
Full-time

Senior Site Reliability Engineer page is loaded## Senior Site Reliability Engineerlocations: Torontotime type: Full timeposted on: Posted Todayjob requisition id: R-211168Job Advertisement*... Show more

 • Promoted

Senior Site Reliability Engineer

Apptoza Inc.Toronto, ON, CA
Full-time

Job Title: Senior Platform Engineer / Senior SRE Developer – Observability (Dynatrace).Work Style: Hybrid (2 days per week in-person at Toronto office preferred).Skills: Digital : Python~Digital : ... Show more

 • Promoted

Site Reliability Engineer, Observability

PricelineToronto, ON, CA
Full-time

This role is eligible for our hybrid work model: Two days in-office.Site Reliability Engineer, Observability.Our Technology team is the backbone of our company: constantly creating, testing, learni... Show more

 • Promoted

Senior Site Reliability Engineer

RootlyToronto, ON, CA
Full-time

At Rootly, we are on a mission to be the go‑to way companies respond when things go wrong, helping every organization be more reliable.We do this by building an industry‑leading incident management... Show more

 • Promoted

Site Reliability Engineer

Momentum Financial Services GroupToronto, ON, CA
Full-time

At Momentum Financial Services Group, we help people move forward by reimagining how money works for those who need it most.With more than 40 years of experience, we’re the team behind Money Mart—C... Show more

 • Promoted

Site Reliability Engineer

HCLTechtoronto, on, ca
Full-time

Hands-on experience with at least one major public cloud platform (Azure, AWS, or GCP).Strong understanding of cloud infrastructure and application runtime components, including compute, storage, n... Show more

 • Promoted

Site Reliability Engineer

E-ITtoronto, on, ca
Full-time

Incident Management and Reliability:.Lead the incident management process, ensuring high availability and performance of the applications.Develop and implement SRE practices to improve system relia... Show more

 • Promoted

Senior Site Reliability Engineer

CaptivateIQToronto, ON, CA
Full-time

The Site Reliability Engineering team in CaptivateIQ operates across the engineering organization, supporting our development teams by providing them with the tools and processes they need to get t... Show more

 • Promoted

Senior Site Reliability Engineer Focused on Kubernetes Infrastructure

Chainlink LabsToronto, ON, CA
Full-time

Elevate decentralized architecture as a Senior Site Reliability Engineer.Spearhead Kubernetes-based infrastructure for decentralized applications, driving scalability, security, and operational eff... Show more

 • Promoted

Senior Site Reliability Engineer Role

ITRidersToronto, ON, CA
Full-time

Elevate your career as a Senior Site Reliability Engineer at our company.Craft observability-as-code solutions using Terraform while optimizing system reliability across diverse environments.We see... Show more

 • Promoted

Site Reliability Engineer

LongbridgeToronto, ON, CA
Full-time

Longbridge is a fast-growing online brokerage platform on a mission to make investing smarter, simpler, and more accessible for everyone.As part of our global expansion, we’re looking for a.Site Re... Show more

 • Promoted

Senior Site Reliability Engineer- Remote

ClickHouseToronto, ON, CA
Remote
Full-time

Senior Site Reliability Engineer- Remote.Recognized on the 2025 Forbes Cloud 100 list, ClickHouse is one of the most innovative and fast-growing private cloud companies.With more than 3,000 custome... Show more

 • Promoted

Senior Site Reliability Engineer Focusing on Metrics and Automation

ScotiabankToronto, ON, CA
Full-time

Seize the opportunity as a Senior Site Reliability Engineer focused on enhancing resilience and performance metrics.Drive automation and improve service reliability in a dynamic operational landsca... Show more