Talent.com
PointClickCare
Intermediate site reliability Engineer- (AIOps)PointClickCare • Toronto, Canada
No longer accepting applications
Intermediate site reliability Engineer- (AIOps)

Intermediate site reliability Engineer- (AIOps)

PointClickCare • Toronto, Canada
12 days ago
Job type
  • Full-time
Job description
At PointClickCare our mission is simple: to help providers deliver exceptional care. And that starts with our people. As a leading health tech company that’s founder-led and privately held, we empower our employees to push boundaries, innovate, and shape the future of healthcare.

With the largest long-term and post-acute care dataset and a Marketplace of 400+ integrated partners, our platform serves over 30,000 provider organizations, making a real difference in millions of lives. We also reinvest a significant percentage of our revenue back into research and development, ensuring our employees have the resources to innovate and make a lasting impact. Recognized by Forbes as a top private cloud company and honored as one of Canada’s Most Admired Corporate Cultures, we offer flexibility, growth opportunities, and meaningful work.

At PointClickCare, we empower our people to be the architects of a smarter healthcare future; one that is human-first and accelerated by AI to create meaningful and lasting change. Employees harness AI as a catalyst for creativity, productivity, and thoughtful decision-making. By integrating AI tools into our daily workflows, collaboration is enhanced, outcomes are improved, and every team member has the proficiency to maximize their impact. It all starts with our hiring practices where we uncover AI expertise that complements our mission, and we continue to invest in training and development to nurture innovation throughout the employee journey.

Join us in redefining healthcare — so it doesn’t just survive, it thrives. To learn more about PointClickCare, check out Life at PointClickCare and connect with us on Glassdoor and LinkedIn.

Travel to Office expectations For Remote Roles : If this role is remote, there will be in-office events that will require travel to and from the Mississauga and/or Salt Lake City office. These will include, but not limited to, onboarding, team events, semi-annual and annual team meetings.

For Hybrid Roles : If this role is Hybrid, there will be an expectation to reside within commutable distance to the office/location specified in the job listing. This will include, but not limited to, weekly/bi-weekly/monthly events in the office with your specific team. This is a requirement for this role.

Intermediate Site Reliability Engineer SRE – AI Reliability & Automation Are you a software engineer passionate about building intelligent systems that make infrastructure smarter, faster, and more resilient?

Join us as we reimagine operational engineering through AI-first principles. In this role, you’ll design and implement AI-powered solutions that drive observability, automate incident response, and optimise cloud-native platforms.

This is more than a traditional SRE role — it’s a chance to engineer the future of reliability using machine learning, generative AI, and predictive analytics.

What You’ll Work On AI-Driven Observability Build ML-based anomaly detection and pattern recognition systems. Enhance telemetry with smart tagging and metadata for better AI insights.

Intelligent Automation Develop event-driven workflows and self-healing systems using AI triggers. Automate incident response with generative AI and custom AI agent orchestration.

Predictive Reliability Use time-series forecasting and predictive modelling to anticipate failures. Optimise infrastructure with AI-powered autoscaling and cost-aware resource allocation.

Software Engineering for Resilience Build scalable, fault-tolerant systems in a cloud-native environment. Participate in on‑call rotations and lead incident response for critical systems. Skilled in API integration for streamlined data exchange and system connectivity.

Team Enablement Run internal AIOps workshops and help teams adopt AI maturity models. Champion responsible AI practices and ethical automation.

Tech Stack & Skills Languages: Python, Java, Bash, Terraform Platforms: Azure, Kubernetes, Docker Tools: Datadog, Prometheus, AppDynamics, ELK, GitHub Actions ML/AI: MCP framework, AI agents, Vector store, Agent orchestration (LangChain), RAG CI/CD: Jenkins, ArgoCD, Spinnaker Databases: SQL Server, PostgreSQL, MySQL

You’ll Thrive If You Have 5+ years' experience in software engineering. Experience with SRE principles. Experience with AI/ML in production environments A passion for automation, intelligent systems, and operational excellence Strong debugging, problem-solving, and system design skills

Bonus Points Experience with AIOps platforms. Contributions to open-source or AI communities Familiarity with Responsible AI frameworks Participation in AI hackathons or conferences

This is your opportunity to code with purpose — building systems that think, learn, and adapt. If you're excited about the fusion of software engineering and AI, let’s talk.

$115,000 - $128,000 a year

At PointClickCare, base salary is one of the many components that make up our total rewards package. The CAD base salary range for this position is $115,000-$128,000 + bonus + benefits. Our salary ranges are determined by job and level. The range displayed on each job posting reflects the target for new hire salaries for the position across all CAD locations. Within the range, individual compensation is determined by job-related skills and knowledge, relevant experience including professional and lived experience, and/or work location. Your recruiter can share more information about our total rewards package during the hiring process.

PointClickCare Benefits & Perks Benefits starting from Day 1! Retirement Plan Matching Flexible Paid Time Off Wellness Support Programs and Resources Parental & Caregiver Leaves Fertility & Adoption Support Continuous Development Support Program Employee Assistance Program Allyship and Inclusion Communities Employee Recognition … and more!

It is the policy of PointClickCare to ensure equal employment opportunity without discrimination or harassment on the basis of race, religion, national origin, status, age, sex, sexual orientation, gender identity or expression, marital or domestic/civil partnership status, disability, veteran status, genetic information, or any other basis protected by law. PointClickCare welcomes and encourages applications from people with disabilities. Accommodations are available upon request for candidates taking part in all aspects of the selection process. Please contact recruitment@pointclickcare.com should you require any accommodations. As part of our commitment to a streamlined and equitable hiring experience, PointClickCare uses AI tools to assist with candidate screening and assessment.

When you apply for a position, your information is processed and stored with Lever, in accordance with Lever’s Privacy Policy. We use this information to evaluate your candidacy for the posted position. We also store this information, and may use it in relation to future positions to which you apply, or which we believe may be relevant to you given your background. When we have no ongoing legitimate business need to process your information, we will either delete or anonymize it. If you have any questions about how PointClickCare uses or processes your information, or if you would like to ask to access, correct, or delete your information, please contact PointClickCare’s human resources team: recruitment@pointclickcare.com

PointClickCare is committed to Information Security. By applying to this position, if hired, you commit to following our information security policies and procedures and making every effort to secure confidential and/or sensitive information.

#J-18808-Ljbffr
Create a job alert for this search

Intermediate site reliability Engineer- (AIOps) • Toronto, Canada

Similar jobs

Site Reliability Engineer

DexianToronto, ON, CA
Full-time

Working Location: Toronto, ON [Hybrid 2 days a week in office].The DevOps and Automation is looking for a Site Reliability Engineer with strong expertise in Dynatrace to ensure the reliability, per... Show more

 • Promoted

Senior Site Reliability Engineer – Cloud & Automation Lead

Tecsys Inc.Toronto, ON, CA
Full-time

A leading supply chain solutions provider is seeking a Site Reliability Engineer to optimize and ensure the reliability of their cloud infrastructure across AWS and Kubernetes.This role emphasizes ... Show more

 • Promoted

Site Reliability Engineer for Observability

PricelineToronto
Full-time

Shape the future of observability as a Site Reliability Engineer in a hybrid environment.Focus on optimizing production visibility and telemetry management to enhance system reliability.This pivota... Show more

 • Promoted

Impactful Site Reliability Engineer Fostering Reliability and Performance

RootlyToronto, ON, CA
Full-time

Join as an impactful Site Reliability Engineer, shaping the technical future and enhancing system reliability.Tackle rewarding challenges in a collaborative startup atmosphere.As a key player, you’... Show more

 • Promoted

Site Reliability Engineer

TELUS DigitalToronto, ON, CA
Full-time

Welcome to TELUS Digital — where innovation drives impact at a global scale.As an award-winning digital product consultancy and the digital division of TELUS, one of Canada’s largest telecommunicat... Show more

 • Promoted

Site Reliability Engineer

McCain FoodsToronto, ON, CA
Full-time

Our Global Technology team’s goal is to leverage technology and data to drive profitable growth, focus on enhancing customer experience and to further our purpose of 'Celebrating real connections t... Show more

 • Promoted

Expert Site Reliability Engineer Position

Okta for DevelopersToronto, ON, CA
Full-time

Ensure secure identity management as a Senior Site Reliability Engineer.Collaborate in a remote team to enhance the reliability and scalability of mission-critical authentication systems.The SRE po... Show more

 • Promoted

Lead Site Reliability Engineer - Access

SimCorpToronto, ON, CA
Full-time

Lead Site Reliability Engineer - Access page is loaded## Lead Site Reliability Engineer - Accesslocations: Torontotime type: Full timeposted on: Posted Yesterdayjob requisition id: R-211433... Show more

 • Promoted

Senior Site Reliability Engineer

Apptoza Inc.Toronto, ON, CA
Full-time

Job Title: Senior Platform Engineer / Senior SRE Developer – Observability (Dynatrace).Work Style: Hybrid (2 days per week in-person at Toronto office preferred).Skills: Digital : Python~Digital : ... Show more

 • Promoted

Site Reliability Engineer — Scale, Automate & Observability

BitcompleteToronto, Ontario, Canada
Full-time

A technology company in Canada is seeking an Intermediate Site Reliability Engineer to enhance cloud infrastructure reliability and support teams in managing large distributed systems.The role invo... Show more

 • Promoted

Senior Site Reliability Engineer II - Remote, Scale-Focused

InstacartToronto, ON, CA
Remote
Full-time

A leading grocery delivery service is seeking a Senior Site Reliability Engineer II in Calgary, Alberta.You will ensure optimal performance and reliability of the platform while establishing incide... Show more

 • Promoted

Site Reliability Engineer

HCLTechtoronto, on, ca
Full-time

Hands-on experience with at least one major public cloud platform (Azure, AWS, or GCP).Strong understanding of cloud infrastructure and application runtime components, including compute, storage, n... Show more

 • Promoted

Senior Site Reliability Engineer Role

ITRidersToronto, ON, CA
Full-time

Elevate your career as a Senior Site Reliability Engineer at our company.Craft observability-as-code solutions using Terraform while optimizing system reliability across diverse environments.We see... Show more

 • Promoted

Site Reliability Engineer with Automation Focus

YelpToronto, ON, CA
Full-time

Join a collaborative, remote SRE team dedicated to ensuring service reliability.In this role, leverage your expertise in automation and systems management to support a platform serving millions.You... Show more

 • Promoted

Senior Site Reliability Engineer Focused on Kubernetes Infrastructure

Chainlink LabsToronto, ON, CA
Full-time

Elevate decentralized architecture as a Senior Site Reliability Engineer.Spearhead Kubernetes-based infrastructure for decentralized applications, driving scalability, security, and operational eff... Show more

 • Promoted

Senior Site Reliability Engineer

CaptivateIQToronto, ON, CA
Full-time

The Site Reliability Engineering team in CaptivateIQ operates across the engineering organization, supporting our development teams by providing them with the tools and processes they need to get t... Show more

 • Promoted

Site Reliability Engineer

LongbridgeToronto, ON, CA
Full-time

Longbridge is a fast-growing online brokerage platform on a mission to make investing smarter, simpler, and more accessible for everyone.As part of our global expansion, we’re looking for a.Site Re... Show more

 • Promoted

Senior Site Reliability Engineer - Remote & Scale Impact

ClickHouseToronto, ON, CA
Remote
Full-time

A leading cloud company is seeking a Senior Site Reliability Engineer to build and lead processes ensuring the reliability and performance of their remote cloud infrastructure.This role requires co... Show more

 • Promoted

Senior Site Reliability Engineer — Kubernetes, AWS & Observability

ThinkificToronto, ON, CA
Full-time

A leading e-learning provider in Canada is seeking a Senior Site Reliability Engineer to enhance and secure their infrastructure supporting online course creators.This role involves improving perfo... Show more

 • Promoted

Senior Site Reliability Engineer- Remote

ClickHouseToronto, ON, CA
Remote
Full-time

Senior Site Reliability Engineer- Remote.Recognized on the 2025 Forbes Cloud 100 list, ClickHouse is one of the most innovative and fast-growing private cloud companies.With more than 3,000 custome... Show more