Talent.com
Staff Site Reliability Engineer, Database
No longer accepting applications
Alpaca • Toronto, ON, CA
2 days ago
Job type
  • Full-time
Job description

Overview

Who We Are: Alpaca is a US-headquartered self-clearing broker-dealer and brokerage infrastructure for stocks, ETFs, options, crypto, fixed income, 24/5 trading, and more. Our recent Series D funding round brought our total investment to over $320 million, fueling our ambitious vision. Amongst our subsidiaries, Alpaca is a licensed financial services company, serving hundreds of financial institutions across 40 countries with our institutional-grade APIs. This includes broker-dealers, investment advisors, wealth managers, hedge funds, and crypto exchanges, totalling over 9 million brokerage accounts. Our global team is a diverse group of experienced engineers, traders, and brokerage professionals who are working to achieve our mission of opening financial services to everyone on the planet. We're deeply committed to open-source contributions and fostering a vibrant community, continuously enhancing our award-winning, developer-friendly API and the robust infrastructure behind it. Alpaca is proudly backed by top-tier global investors, including Portage Ventures, Spark Capital, Tribe Capital, Social Leverage, Horizons Ventures, Unbound, SBI Group, Derayah Financial, Elefund, and Y Combinator.

Our Team

We're a dynamic team of 230+ globally distributed members who thrive working from our favorite places around the world, with teammates spanning the USA, Canada, Japan, Hungary, Nigeria, Brazil, the UK, and beyond!

We're searching for passionate individuals eager to contribute to Alpaca's rapid growth. If you align with our core values (Stay Curious, Have Empathy, and Be Accountable) and are ready to make a significant impact, we encourage you to apply.

Your Role

As a Site Reliability Engineer (SRE) at Alpaca, you will ensure the reliability, scalability, and performance of our systems and services. You will work closely with development, operations, and DevOps teams to build and maintain robust applications, ensuring they run smoothly and efficiently. This role requires a blend of software engineering and operations skills, with a strong ability to troubleshoot technical issues and resolve problems before they impact our users.

Responsibilities

  • Triage difficult technical problems and implement solutions
  • Improve our observability stack (monitoring, logging, profiling)
  • Incident Management: Respond to and resolve incidents in a timely manner, conducting post-incident reviews to identify and implement improvements.
  • Collaboration: Work closely with development teams to ensure new features and services are designed with reliability and scalability in mind.
  • Capacity Planning: Monitor system capacity and performance, making recommendations and implementing changes to handle future growth.

Qualifications

  • 5+ years of experience in Site Reliability Engineering, Performance Engineering, or similar roles.
  • 5+ years of experience with multi-terabyte scale PostgreSQL clusters.
  • Proven track record of managing and maintaining large-scale, high-availability, high-performance PostgreSQL databases.
  • Experience designing and implementing SLIs, SLOs, and SLAs for internal systems and databases.
  • Experience with troubleshooting PostgreSQL performance problems and slow queries.
  • Extensive experience with efficient schema design and efficient query design.
  • Experience migrating multi-terabyte tables into more efficient schemas.
  • Proficient with Go.
  • Proficient with Prometheus.
  • Proficient with Linux.
  • Knowledgeable in trading/fintech domains.
  • Experience with low-latency systems.
  • Experience with distributed tracing.
  • Experience scaling PostgreSQL clusters rapidly.
  • Experience with pgx, gorm, or sqlc.

Benefits

  • Competitive Salary & Stock Options
  • Health Benefits
  • New Hire Home-Office Setup: One-time USD $500
  • Monthly Stipend: USD $150 per month via a Brex Card

Alpaca is proud to be an equal opportunity workplace dedicated to pursuing and hiring a diverse workforce.
