Talent.com
Site Reliability Professional (DB2 LUW)
Site Reliability Professional (DB2 LUW)IBM • Markham, York Region, CA
Site Reliability Professional (DB2 LUW)

Site Reliability Professional (DB2 LUW)

IBM • Markham, York Region, CA
3 days ago
Job type
  • Full-time
Job description

Introduction

At IBM Software, we transform client challenges into solutions. Building the world’s leading AI-powered, cloud-native products that shape the future of business and society. Our legacy of innovation creates endless opportunities for IBMers to learn, grow, and make an impact on a global scale. Working in Software means joining a team fueled by curiosity and collaboration. You’ll work with diverse technologies, partners, and industries to design, develop, and deliver solutions that power digital transformation. With a culture that values innovation, growth, and continuous learning, IBM Software places you at the heart of IBM’s product and technology landscape. Here, you’ll have the tools and opportunities to advance your career while creating software that changes the world.

Your Role And Responsibilities

We are looking for an IBM DB2 LUW Database Reliability Professional to design, build, and operate highly available and resilient database systems supporting business-critical applications. The ideal candidate combines deep expertise in IBM DB2 LUW with strong SRE principles , focusing on automation, observability, and performance optimization across hybrid and IBM Cloud environments.

  • Manage and optimize DB2 LUW instances across multiple environments (dev / test / prod) with a focus on availability, scalability, and performance.
  • Implement Site Reliability Engineering (SRE) practices and proactive monitoring for database platforms.
  • Automate database provisioning, configuration, and maintenance tasks using Ansible and shell scripting.
  • Design and maintain HA / DR configurations (HADR, Pacemaker, Q-Replication, etc.) ensuring zero data loss and minimal downtime.
  • Build and operate DB2 on IBM Cloud and other hybrid cloud platforms, ensuring compliance with security and performance standards.
  • Integrate DB2 operational metrics into observability stacks (e.g., Instana, Prometheus, Grafana).
  • Conduct performance tuning, query optimization, and capacity planning to meet SLAs and prevent incidents.
  • Support incident response, root cause analysis (RCA), and continuous improvement efforts to enhance system reliability.
  • Maintain comprehensive runbooks, automated playbooks, and operational documentation for all database services.

Required Technical And Professional Expertise

  • 8+ years of experience as a DB2 LUW Database Administrator.
  • Proven expertise in DB2 administration, backup / recovery, performance tuning, and troubleshooting.
  • Solid understanding of Linux / Unix systems, networking, and security principles.
  • Strong scripting skills in Ansible and shell scripting, or equivalent automation tools.
  • Experience implementing HADR, Pacemaker, Replication, and DB2 clustering solutions.
  • Familiarity with infrastructure-as-code (IaC) concepts and configuration management.
  • Hands‑on experience with monitoring, alerting, and observability tools in SRE environments.
  • Preferred Technical And Professional Experience

  • Analytical mindset with strong troubleshooting and performance optimization skills.
  • Collaborative and proactive in driving reliability initiatives across teams.
  • Excellent communication, documentation, and mentoring abilities.
  • Strong sense of ownership and accountability for system uptime and performance.
  • #J-18808-Ljbffr

    Create a job alert for this search

    Site Reliability Professional DB2 LUW • Markham, York Region, CA

    Similar jobs
    Site Lead

    Site Lead

    St. Alban's Boys And Girls Club • Toronto C6A, ON, Canada
    Full-time
    Humber Boulevard South (Humber Children's TCHC).BGC Weston Mount Dennis & Lawrence Heights Club serves children and youth in the Weston Mount Dennis / Lawrence Heights and surrounding communities pr...Show more
    Last updated: 18 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Staples • Richmond Hill
    Full-time
    The Site Reliability Engineer (SRE) is responsible for ensuring the reliability, availability, and operational excellence of Staples Canada’s digital platforms. This role supports production systems...Show more
    Last updated: 13 hours ago • Promoted • New!
    Site Reliability Engineer III

    Site Reliability Engineer III

    ACV Auctions • Toronto
    Full-time
    Posted Tuesday, January 13, 2026 at 5 : 00 AM.If you are looking for a career at a dynamic company with a people-first mindset and a deep culture of growth and autonomy, ACV is the right place for yo...Show more
    Last updated: 13 hours ago • Promoted • New!
    Site Reliability Engineer (SRE)

    Site Reliability Engineer (SRE)

    Scotiabank • Toronto
    Full-time
    Site Reliability Engineer (SRE) – Scotiabank.Join a purpose‑driven winning team, committed to results, in an inclusive and high‑performing culture. As an SRE, you will implement, measure, and gather...Show more
    Last updated: 13 hours ago • Promoted • New!
    Sr. Manager, Site Reliability Engineering

    Sr. Manager, Site Reliability Engineering

    OpenText • Richmond Hill
    Full-time
    OpenText - The Information Company.OpenText is a global leader in information management, where innovation, creativity, and collaboration are the key components of our corporate culture.As a member...Show more
    Last updated: 13 hours ago • Promoted • New!
    Power Platform Reliability Lead - CoE & Platform Innovation

    Power Platform Reliability Lead - CoE & Platform Innovation

    Manulife Financial • Toronto
    Full-time
    A leading financial services provider in Toronto is seeking a Lead Power Platform Reliability Engineer.In this pivotal role, you'll enhance enterprise-level applications and engage directly with st...Show more
    Last updated: 13 hours ago • Promoted • New!
    Senior / Staff Site Reliability Engineer

    Senior / Staff Site Reliability Engineer

    Circle • Toronto
    Full-time
    Circle (NYSE : CRCL) is one of the world’s leading internet financial platform companies, building the foundation of a more open, global economy through digital assets, payment applications, and pro...Show more
    Last updated: 13 hours ago • Promoted • New!
    Site Reliability Engineer (SRE) - Platform Infrastructure team (100 Remote - Canada)

    Site Reliability Engineer (SRE) - Platform Infrastructure team (100 Remote - Canada)

    Hopper • Toronto, On
    Remote
    Full-time
    Senior Site Reliability Engineer.If you care about automation, scalability, and developer experience — and want to make a tangible impact on a growing travel tech company — this could be the perfec...Show more
    Last updated: 7 hours ago • Promoted • New!
    Staff Site Reliability Engineer

    Staff Site Reliability Engineer

    Achievers • Toronto
    Full-time +1
    Our Site Reliability Engineering team sits at the intersection of software engineering and operations, building reliable, scalable cloud systems that our teams and customers can trust.Staff Site Re...Show more
    Last updated: 7 hours ago • Promoted • New!
    Site Reliability Engineer, Inference Infrastructure

    Site Reliability Engineer, Inference Infrastructure

    The Rundown AI, Inc. • Toronto
    Full-time
    Our mission is to scale intelligence to serve humanity.We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like cont...Show more
    Last updated: 13 hours ago • Promoted • New!
    On-Site Professional Services Lead

    On-Site Professional Services Lead

    Betz Pools Ltd. • Whitchurch-Stouffville
    Full-time
    A construction service company in Whitchurch-Stouffville is seeking an Operations Manager to oversee project teams and ensure efficient operations. The role requires 1-2 years of relevant experience...Show more
    Last updated: 13 hours ago • Promoted • New!
    Staff Site Reliability Engineer

    Staff Site Reliability Engineer

    Updata Partners • Toronto
    Full-time
    Hey there! We’re ContactMonkey 👋.Our mission? To power measurable employee engagement worldwide.And we’d love for you to join us!. About the job - Staff Site Reliability Engineer.You are not just b...Show more
    Last updated: 7 hours ago • Promoted • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Dexian • Toronto
    Full-time
    Working Location : Toronto, ON [Hybrid 2 days a week in office].The DevOps and Automation is looking for a Site Reliability Engineer with strong expertise in Dynatrace to ensure the reliability, per...Show more
    Last updated: 7 hours ago • Promoted • New!
    L&D Specialist : Design Impactful Hybrid Learning

    L&D Specialist : Design Impactful Hybrid Learning

    ERGO | Munich Re | MEAG • Toronto
    Full-time
    A premier engineering-driven specialty insurer in Toronto is seeking a Corporate Training & Development Specialist to design and deliver training programs across various departments.This role deman...Show more
    Last updated: 13 hours ago • Promoted • New!
    Site Reliability Engineer (SRE)

    Site Reliability Engineer (SRE)

    Tangerine • Toronto C6A, ON, Canada
    Full-time +1
    As Canada’s leading digital bank, Tangerine technology is at the heart of everything we do.We have redefined what digital banking is and we continue to evolve on what it can be, using technology to...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Canonical • Toronto, Canada
    Full-time
    Site Reliability Engineer Join to apply for the.Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets.Our platform, Ubuntu, i...Show more
    Last updated: 18 days ago • Promoted
    Engineering Sr. Site Reliability Engineer Palo Alto, California

    Engineering Sr. Site Reliability Engineer Palo Alto, California

    getjerry.com • Toronto
    Full-time
    Join a pre-IPO startup with capital, traction and runway ($240M funded | 60X revenue growth in 5 years | $2T market size). Work closely with brilliant leaders and teammates from companies like McKin...Show more
    Last updated: 13 hours ago • Promoted • New!
    Senior Reliability Leader - Framework & Asset Excellence

    Senior Reliability Leader - Framework & Asset Excellence

    Irving Consumer Products • Toronto
    Full-time
    A leading consumer product manufacturer is seeking a Reliability Engineer to develop and implement a reliability framework. You will lead auditing processes, collaborate with teams to improve reliab...Show more
    Last updated: 13 hours ago • Promoted • New!