Talent.com
Azure CloudAI Ops Engineer
Azure CloudAI Ops Engineer2105 Merge Canada Holdco Unlimited Liability Company • Mississauga, Ontario
Azure CloudAI Ops Engineer

Azure CloudAI Ops Engineer

2105 Merge Canada Holdco Unlimited Liability Company • Mississauga, Ontario
29 days ago
Job type
  • Full-time
Job description

Join a team dedicated to supporting the crucial mission of improving health outcomes.

At Merative, you can apply your skills – and grow new ones – with colleagues who have deep expertise in health and technology. Merative provides data, analytics and software for the health industry. Our clients include providers, health plans, employers, life sciences companies and governments around the world. With industry-leading products and focused innovation, we help customers improve decision-making and performance so that together, we drive real progress in health. Learn more at merative.com

Merge medical imaging solutions, offered by Merative, combine intelligent, scalable imaging workflow tools with deep and broad expertise to help healthcare organizations improve their confidence in patient outcomes and optimize care delivery.
We’re evolving our CloudOps function into AIOps‑driven SRE. You’ll keep our multi‑tenant SaaS services reliable on Azure/AKS, reduce alert noise, and automate safe fixes using PagerDuty AIOps and Azure AIOps. This role blends coding for operations with hands‑on incident management, and it’s ideal for candidates who enjoy partnering with dev/product while improving patient‑impacting healthcare technology.

What You’ll Do

  • Operate with AIOps: Use Azure Monitor/Log Analytics and Application Insights to detect anomalies, triage issues, and speed up RCA; apply Issues & Investigations (preview) to guide troubleshooting.
  • Make PagerDuty smarter: Configure Event Intelligence for dedup/correlation and set up Event Orchestration/Automation Actions to trigger safe auto‑remediation (AKS rollout restarts, pod auto scaling, Kafka queue drain) with approvals/audit.
  • Run SaaS on AKS: Own Kubernetes health, rollouts, scaling, and config hygiene across regions/tenants.
  • Automate everything: Build tools and runbooks in Python (primary) and PowerShell/Bash; integrate APIs and ChatOps for one‑click remediation.
  • Ship reliably: Define SLIs/SLOs and error budgets; embed reliability gates in CI/CD (Azure DevOps/GitHub Actions/Jenkins), support canary/blue‑green, and enable fast rollback.
  • Provision via code: Manage infra with Terraform, Bicep, and ARM for repeatable, auditable changes.
  • Lead incidents: Take point on on‑call response, clear stakeholder communication, and blameless postmortems; convert learnings into durable SOPs/runbooks.
  • Protect data: Apply Entra ID/RBAC, secrets hygiene, policy‑as‑code, and privacy/security practices appropriate for healthcare SaaS.

Must‑Have Skills

  • Coding for ops: Strong Python; PowerShell/Bash for automation and tooling.
  • Azure AIOps: Azure Monitor/Log Analytics (KQL), Application Insights (Smart Detection, Application Map), and Issues & Investigations (observability agent).
  • PagerDuty AIOps: Event Intelligence (dedup/correlation) and Automation Actions/Event Orchestration for safe remediation; confident on‑call operations.
  • AKS/Kubernetes: Rollout strategies, HPA/KEDA autoscaling, health checks.
  • IaC & CI/CD: Terraform/Bicep/ARM and pipelines in Azure DevOps/GitHub Actions/Jenkins.
  • SRE fundamentals: SLIs/SLOs, error budgets, RCA, blameless postmortems → runbooks/SOPs.
  • Security & compliance: Entra ID/RBAC, secrets hygiene, policy‑as‑code; familiarity with healthcare privacy/security expectations.

Nice‑to‑Have

  • Kafka ops (lag monitoring, safe catch‑up)
  • Teams/ChatOps remediation and status broadcasting
  • Cost & capacity automation (tagging, idle cleanup, forecasting)
  • Resilient routing & security (service mesh, App Gateway/WAF)
  • Multi‑region DR, chaos/game days with documented improvement

How We Work

  • On‑call: Rotating 24×7 coverage; you’ll lead response and keep comms clear.
  • Collaboration: Partner closely with dev/product/security; automation over tickets.
  • Growth: No deep ML required—willingness to learn AI‑assisted ops (e.g., Copilot‑generated KQL, AIOps triage findings) is valued.

Compensation


The salary range provided in this job posting is intended to reflect the general market value for the position. The actual salary offered may vary based on factors such as the candidate’s experience, qualifications, skills, and the specific requirements of the role. This range may also be subject to change as market conditions evolve. We encourage open communication throughout the interview process to discuss compensation expectations. For base-salary + commission sales roles, the range represents On-Target Earnings.

Min – Max :

$85,276.80 - $127,915.20 (CAD)

Benefits

The benefits described represent the current offerings at our organization, however, benefits are subject to change and may vary by location and employment status. We strive to provide a comprehensive benefits package that supports our employee’s health, wellness, and financial goals. Please note that benefits may be discussed in more detail during the hiring process.

  • Vacation to help you rest, recharge, and connect with loved ones

  • Paid leave benefits

  • Extended health, paramedical, dental, and vision benefits

  • Registered retirement and tax-free savings plans

  • Tuition reimbursement, life insurance, EAP – and more!

Create a job alert for this search

Azure CloudAI Ops Engineer • Mississauga, Ontario

Similar jobs
Ace Certified Guidewire Policy Developer

Ace Certified Guidewire Policy Developer

Coforge • burlington, ON, ca
Full-time
Job Title: Ace Certified Guidewire Policy DeveloperSkills: Guidewire cloud, Policy, Gosu, REST/SOAPExperience: 8+ yearsLocation: RemoteDuration:<...Show more
Last updated: 14 days ago • Promoted
Azure CloudAI Ops Engineer

Azure CloudAI Ops Engineer

Merative • Mississauga
Full-time
Join a team dedicated to supporting the crucial mission of improving health outcomes.At Merative, you can apply your skills – and grow new ones – with colleagues who have deep expertise in health a...Show more
Last updated: 26 days ago • Promoted
AI Integration & Onboarding Specialist - oakville

AI Integration & Onboarding Specialist - oakville

SayVo AI • oakville, on, ca
Full-time
AI Integration & Onboarding Specialist.Contractor (with full-time potential).SayVo AI is a fast-moving startup that builds AI employees that call, text, email, and close the loop in the CRM.In a nu...Show more
Last updated: 2 days ago • Promoted
FinOps Analyst/Consultant - oakville

FinOps Analyst/Consultant - oakville

VeUP • oakville, on, ca
Full-time
VeUP exists to empower ambitious businesses to scale rapidly, aligning elite technical execution with strategic commercial goals.We are an AWS Advanced Tier Partner leveraging a team of industry sp...Show more
Last updated: 1 day ago • Promoted
Systems Monitoring & Infrastructure Specialist - oakville

Systems Monitoring & Infrastructure Specialist - oakville

Dexcent • oakville, on, ca
Full-time
Systems Monitoring & Infrastructure Specialist.Operational Technology (OT) environment.This is a contract opportunity and the individual can be fully remote.Dexcent) is an engineering consulting fi...Show more
Last updated: 8 days ago • Promoted
AWS Serverless & Cloud Infrastructure Engineer

AWS Serverless & Cloud Infrastructure Engineer

Astra North Infoteck Inc. • Oakville North, ON, ca
Full-time
Quick Apply
AWS Serverless & Cloud Infrastructure Engineer.AWS Serverless Architecture Design, Core AWS Services, AWS CDK, JavaScript, Node.Experience in designing end to end AWS Serverless solutions.Knowledge...Show more
Last updated: 10 days ago
Operations Specialist - Global Cash Management

Operations Specialist - Global Cash Management

TEKsystems • Bradford West Gwillimbury, Ontario, Canada
Full-time +2
TEKsystems client, a major Big 5 Bank here in Canada, is currently looking to hire an Operations Specialist to join their Global Cash Management team in downtown Toronto! E.USD, GBP, EUR, and exper...Show more
Last updated: 2 days ago • Promoted
Security Administrator - Titanium Transportation Group Inc.

Security Administrator - Titanium Transportation Group Inc.

Titanium Transportation Group Inc. • bolton, on, ca
Full-time
As Security Administrator, you’ll play a pivotal role in safeguarding our organization’s digital assets including its systems and networks.The Security Administrator develops and implements a cyber...Show more
Last updated: 27 days ago • Promoted
Earn money by taking surveys - Remote

Earn money by taking surveys - Remote

Almedia • Vaughan
Remote
Full-time
Get paid for testing apps, games and surveys.Almedia runs a dynamic platform where users earn money online by completing tasks, playing games, and filling out surveys.Since our launch 5 years ago, ...Show more
Last updated: 30+ days ago • Promoted
Cloud Solutions Architect - Azure & AWS Leader

Cloud Solutions Architect - Azure & AWS Leader

Softchoice • Oakville
Full-time
A leading IT solutions provider located in Oakville, Ontario, is seeking a Solution Development Architect.This pivotal role demands expertise in cloud technologies like Azure and AWS, emphasizing i...Show more
Last updated: 30+ days ago • Promoted
Snowflake Cortex expert

Snowflake Cortex expert

Amaris Consulting • vaughan, ON, ca
Full-time
We are looking for a Snowflake Cortex & Snowpark Specialist to design, implement, and optimize advanced data and AI-driven solutions within the Snowflake Data Cloud.You will work closely with Data ...Show more
Last updated: 8 days ago • Promoted
EMS/SCADA Engineer - oakville

EMS/SCADA Engineer - oakville

Pacer Group • oakville, on, ca
Full-time
Network or Transmission Application preferably Reliance.LINUX and Windows Operating Systems.Proficient in Electric Transmission EMS / SCADA /Implementation.Good knowledge of Electric SCADA applicat...Show more
Last updated: 26 days ago • Promoted
Crypto Operations Analyst

Crypto Operations Analyst

Netcoins • Oakville, Ontario, Canada
Full-time
Netcoins is a Canadian cryptocurrency trading platform focused on building trusted, regulated digital asset infrastructure.We operate in a compliance-forward environment and are committed to combin...Show more
Last updated: 2 days ago • Promoted
Infrastructure Specialist

Infrastructure Specialist

Randstad Digital Americas • Oakville, Ontario, Canada
Full-time
We are seeking a highly experienced.Senior Technical Authority for our enterprise application platforms.If you are a seasoned infrastructure expert with a deep command of.In this role, you will act...Show more
Last updated: 17 days ago • Promoted
Senior full stack developer with AWS experience - Luxoft

Senior full stack developer with AWS experience - Luxoft

Luxoft • burlington, on, ca
Full-time
Our Customer is one of the world's largest investment management companies.Based in Southern California, our client manages close to $2 trillion in assets and is looking for a new partner to partic...Show more
Last updated: 2 days ago • Promoted
REMOTE Talend/Databricks Integration Architect - oakville

REMOTE Talend/Databricks Integration Architect - oakville

Insight Global • oakville, on, ca
Remote
Full-time
Insight Global is seeking a Talend/Databricks Integration Architect to join a top aerospace manufacturer in Montreal, QC.This position is remote across Canada following EST working hours.The Talent...Show more
Last updated: 6 days ago • Promoted
Workday Integrations Analyst

Workday Integrations Analyst

Focus on WD • burlington, on, ca
Full-time
We are looking for a Workday Technical Analyst to join a growing team and play a key role in taking Workday to the next level across the organisation.This is a hands-on technical role where you wil...Show more
Last updated: 2 days ago • Promoted
Full Stack Engineer - Set 2 Close | B Corp

Full Stack Engineer - Set 2 Close | B Corp

Set 2 Close | B Corp • burlington, on, ca
Full-time
The ideal candidate brings strong backend development experience, solid database skills, and the ability to contribute to scalable, maintainable applications.Develop and maintain backend services u...Show more
Last updated: 30+ days ago • Promoted