Talent.com

Aws certification Jobs in Ottawa, ON

Create a job alert for this search

Aws certification • ottawa on

Last updated: 4 days ago

Senior Site Reliability Engineer (Remote Canada)

TechInsightsOttawa, ON, CA
Remote
Permanent
Quick Apply

OUR STORY TechInsights is the information Platform for the semiconductor industry.Regarded as the most trusted source of actionable, in-depth intelligence related to semiconductor innovation and su...Show more

Cardiologist for a Premier Clinic in Ottawa

MDSearchOttawa, ON, ca
Full-time +1

Job type: Full-time, Part-time or Flexible .This opportunity is excellent for recent graduates or experienced professionals seeking dynamic environments.The Cardiologist will be responsible for pro...Show more

Responsible AI & Governance Lead – Enterprise AI Platforms - AIRLHV

NavitasPartnersOttawa, Ontario, Canada
Full-time

Responsible AI & Governance Lead – Enterprise AI Platforms.US / Canada (Remote/Hybrid) .We are seeking a Responsible AI & Governance Lead to drive ethical, compliant, and scalable AI adoption acros...Show more

Directeur.trice en certification (Région de l'Outaouais)

Forvis Mazars, CanadaGatineau, Québec, Canada
Permanent

Le Directeur en certification pour la région de l’Outaouais assure la gestion des mandats de certification (audit, examen, missions spéciales) et de compilation, tout en contribuant activement au d...Show more

Software Engineer

h2o.aiOttawa, ON, CA
Full-time
Quick Apply

As the world’s leading agentic AI company, H2O.Generative and Predictive AI to help enterprises and public sector agencies develop purpose-built GenAI applications on their private data.With a focu...Show more

Full Stack Developer Intern

NodaOttawa, Ontario, CA
Full-time
Quick Apply

Connect with us to discover our latest job opportunities! Even if nothing suits you right now, stay in touch — your perfect role may be just around the corner! .Noda is a data and analytics company...Show more

Administrateur de système infonuagique - Microsoft 365/AWS - intermédiaire

TEHORAOttawa, ON, CA
Full-time
Quick Apply

TEHORA est présentement à la recherche d’un(e).Sans être exhaustifs, voici les services et livrables que devra fournir la personne retenue :.Administrer les services infonuagiques Microsoft 365 et ...Show more

Web Developer CN

SimeraOttawa, Ontario, Canada
Full-time

We are seeking a detail-oriented and creative Web Developer to design, build, and maintain our web applications.The ideal candidate has strong coding skills, an eye for design, and a passion for cr...Show more

Senior Journeyperson Gasfitter — Class A

HRS Talent SolutionsOttawa, ON, Canada
Full-time
Quick Apply

Senior Journeyperson Gasfitter — Class A.Day shift, with possible overtime, on-call rotation, or project-based work depending on employer needs.This opportunity is best suited to a senior-level gas...Show more

Nanny Wanted - Seeking Experienced Nanny In Ottawa, On For 2 Kids (Almost 2 And Almost 6); Must Have Access To A Car And Cpr Certification.

CanadianNanny.caOttawa, Ontario, Canada
Part-time

Hey there! Our family in Ottawa, Ontario is in search of a nurturing nanny to care for our infant and primary school-aged children.This live-out position is part-time and offers a competitive hourl...Show more

 • Promoted

Junior Pentester

Software SecuredOttawa, Ontario, Canada
Full-time

Software Secured is an application security firm located in Ottawa, Ontario.We help software development teams get ahead of hackers using a suite of services and products.Software Secured is a plac...Show more

Senior DevOps Engineer

High Tech GenesisOttawa, ON, CA
Full-time

At HTG, you’ll push boundaries with the latest tech and collaborate with a team that loves what they do.Be part of a design services company that is amongst the companies that lead the world in tec...Show more

Full Stack Developer Intern

Work in OttawaOttawa, Ontario, CA
Full-time
Quick Apply

Work in Ottawa is an initiative of Invest Ottawa, the economic development agency with a mandate of facilitating economic growth and job creation for the city of Ottawa.As a hub for innovation and ...Show more

System Administrator (Network and Cybersecurity) - Client Delivery

MalleumOttawa, ON, CA
Full-time
Quick Apply

Location: Hybrid - on-site at client locations as required About Malleum Malleum is at the forefront of next-generation cyber defense, partnering with marquee clients across government, defense, fi...Show more

Job Description Writer

QMR Consulting & Professional StaffingOttawa, Ontario, CA
Full-time

The ideal candidates have 5+ years of federal government experience, writing job descriptions.Requirements also include relevant education (degree or diploma), Human Resources certification, and/or...Show more

Auditor (Certification and Verification services)

ERMOttawa, Canada
Full-time

If you’re looking to accelerate your career in ESG, sustainability, and accredited audits—while working with global organizations driving real change—this role places you at the center of it all.ER...Show more

Architecte senior en sécurité cloud / Senior Cloud Security Architect, Services Professionnels AWS / AWS professional services

Amazon Web Services Canada, Inc.Ottawa, Ontario, CAN
Full-time

Aimeriez-vous aider les clients à mettre en œuvre des solutions innovantes sécurisés et à résoudre leurs plus grands défis ? Souhaiteriez-vous le faire en utilisant les dernières pratiques, outils ...Show more

Systems Administrator (Lead)

RebelOttawa, ON, CA
Full-time
Quick Apply

Position Title: Systems Administrator Team Lead Location: Ottawa, ON (377 Dalhousie Street) Work Model: Hybrid - 4 days onsite, 1 day work from home About Rebel OUR CUSTOMERS BRING A VISION - WE BR...Show more

Principal Web Developer

March NetworksOttawa, ON, CA
Full-time
Quick Apply

At March Networks, our goal is to create a positive working environment where all of our employees can thrive.When you join our team, you will enjoy flexibility and support for a healthy work-life ...Show more

People also ask
Senior Site Reliability Engineer (Remote Canada)

Senior Site Reliability Engineer (Remote Canada)

TechInsightsOttawa, ON, CA
9 days ago
Job type
  • Permanent
  • Remote
  • Quick Apply
Job description

OUR STORY TechInsights is the information Platform for the semiconductor industry.

Regarded as the most trusted source of actionable, in-depth intelligence related to semiconductor innovation and surrounding markets, TechInsights’ content informs decision makers and professionals whose success depends on accurate knowledge of the semiconductor industry—past, present, or future.

Over 650 companies and 150,000 users access the TechInsights Platform, the world’s largest vertically integrated collection of unmatched reverse engineering, teardown, and market analysis in the semiconductor industry.

This collection includes detailed circuit analysis, imagery, semiconductor process flows, device teardowns, illustrations, costing and pricing information, forecasts, market analysis, and expert commentary.

TechInsights’ customers include the most successful technology companies who rely on TechInsights’ analysis to make informed business, design, and product decisions faster and with greater confidence.

For more information, visit www.techinsights.com .

WHY WORK WITH US Company-sponsored training and development opportunities Comprehensive benefits package (health, dental, vision, wellness, RRSP Matching, annual fitness reimbursement) Flexible vacation policy Community involvement opportunities through charitable alliances: https://www.techinsights.com/community-involvement Wellness resources and support I nclusive environment that prioritizes diversity, equity, and accessibility High-growth company driven by high performance Expected salary range: $125,200 - $132,500 CAD THE OPPORTUNITY: TechInsights is building the reliability and AI operations foundation for its next chapter — an AI-first intelligence platform that runs the most demanding semiconductor intelligence workflows in the world.

We're looking for a Senior Site Reliability Engineer who wants to own that foundation.

This is a senior individual contributor role at the technical leadership tier of our Site Reliability Engineering team.

You'll own strategic reliability initiatives end-to-end: setting technical direction, defining SLOs and error budgets across our production platform, designing reliability patterns for the AI agent pipelines that power our platform's AI-first capabilities, and enabling our development and AI Engineering teams to build and ship with confidence.

What sets this role apart is its scope.

You're not just keeping the lights on — you're building the observability, Internal Developer Platform (IDP), and service catalog that a fast-scaling AI platform needs from day one.

You'll be the reliability voice in architectural decisions, the engineer who closes the loop between agent failure modes and platform resilience, and the mentor who builds the team's capability rather than their own indispensability.

If you have deep SRE experience and want to apply it to AI workloads — agent loop observability, blast radius management, LLM infrastructure reliability — this is the role where that expertise becomes a differentiator.

This role is a remote role for candidates based in Canada.

  • WHAT YOU’LL DO Platform Reliability & AI Operations Own SLOs, SLIs, and error budgets for all production services; drive error budget discipline across engineering Design reliability patterns for AI agent pipelines: LLM observability, tool-use tracking, failure detection, and graceful degradation Architect for blast radius containment — agent failures must have bounded customer impact through isolation, circuit breaking, and rapid recovery Mature our Canada Central/West active-active architecture toward 24-hour RTO with full regional failover Lead incident response and post-incident reviews that produce durable fixes; maintain DR procedures through regular testing Developer & AI Engineering Enablement Serve as the primary reliability liaison to Software and AI Engineering, translating requirements into actionable standards Partner with AI Engineering on compute provisioning, model serving, inference latency, and workload isolation Own CI/CD pipeline strategy (Bitbucket Pipelines, GitHub Actions) — set standards, optimize deployment frequency, and ensure teams can ship confidently Drive IDP adoption and enable teams on SRE practices: on-call readiness, SLO definition, runbook development, and self-service tooling Represent reliability in architectural discussions; surface risk before it's committed to design Observability, IDP & Service Catalog Own the service catalog — a living inventory of all services, AI agents, dependencies, ownership, and SLOs Operate Datadog as the single pane of glass for service health, infrastructure, and agentic pipeline telemetry Extend observability to AI workloads: LLM latency, token consumption, agent completion rates, and pipeline throughput Build golden path templates in Backstage and/or Atlassian Compass so teams ship reliably without routine SRE involvement Apply AIOps in Datadog to automate anomaly detection, incident triage, and remediation recommendations FinOps, IaC & Continuous Improvement Own infrastructure as code via Terraform and GitOps; enforce IaC policy in partnership with Trust Assurance Own FinOps visibility into AWS cost segments; model cloud cost impact as AI/ML workloads scale Formally mentor junior and intermediate SRE engineers, with accountability for their technical growth and career progression Build AI-assisted automation to progressively reduce toil and scale the team's operational capacity WHAT YOU’LL BRING Technical Requirements Bachelor's degree in Computer Science, Engineering, or equivalent combination of education and experience 6–8 years of progressive experience in site reliability engineering, platform engineering, or DevOps, with demonstrated technical leadership at the senior individual contributor level Deep expertise in AWS (EKS, Lambda, CloudWatch, AWS Config) and multi-region architecture patterns Proficiency with Terraform and GitOps; experience with policy-as-code (Sentinel, OPA/Rego, or equivalent) Hands-on Datadog experience at operational depth: dashboards, SLO tracking, alerting, log management, distributed tracing Strong containerization expertise: Docker, Kubernetes (EKS preferred) Proficiency in Python and/or Bash; experience building operational tooling; solid understanding of Java and Spring Boot microservice architecture sufficient to make reliability and deployment decisions for EKS-hosted services Deep expertise in CI/CD pipeline design and optimization using Bitbucket Pipelines and GitHub Actions Familiarity with IDP tooling (Backstage, Atlassian Compass, or equivalent) strongly preferred Experience with AI/ML workload infrastructure, LLM API integration, or agentic system operations considered a strong asset Professional Skills Leads and owns strategic reliability initiatives end-to-end with a high degree of autonomy; accountable for outcomes, not just tasks Sets technical direction and influences team and department strategy Solves complex, ambiguous reliability problems through systematic analysis and first-principles thinking Formally mentors junior and intermediate engineers; builds team capability through coaching and knowledge transfer Communicates technical reliability concepts clearly to engineering, product, and leadership audiences Approaches operational work with an AI-first posture: builds automation and intelligent tooling as the default Preferred Qualifications Experience designing reliability architecture for agentic AI systems: agent loop observability, blast radius isolation, graceful degradation for LLM-dependent services AWS certifications: Solutions Architect Professional, DevOps Engineer Professional, or equivalent FinOps Certified Practitioner or demonstrated cloud cost management experience at scale IDP implementation or developer experience program leadership Experience in semiconductor, SaaS, or data-intensive platform environments Experience operating in environments with export-controlled or regulated data Knowledge of BCP/DR program management and formal recovery testing As part of the recruitment process for this position, you will be required to submit your latest citizenship and/or permanent residency information.

This information will be used to comply with U.S.

Export Control Laws and Regulations.

WORKING ARRANGEMENT This is a remote position for candidates based in Canada.

Occasional travel may be required.

  • Technology knows no bounds, and neither does TechInsights.

Bringing together talented humans from different perspectives, backgrounds and abilities is something we take seriously.

We’re committed to building an inclusive environment that welcomes you to be your authentic self and allows us to push past the boundaries together.

TechInsights is committed to meeting the needs of people with disabilities.

Accommodations are available on request for candidates taking part in all aspects of the selection process.

AI technology may be used to assist in the screening and assessment of applications for this position.

Our recruiters are involved at every stage, and all hiring decisions are made by People and hiring teams.

As part of any recruitment process, TechInsights collects and processes personal data relating to job applicants.

We are committed to being transparent about how we collect and use that data and to meeting our data protection obligations.

Our Privacy policy can be referenced here: https://www.techinsights.com/privacy-policy Powered by JazzHR