Talent.com
SRE Engineer
SRE Engineerkloia • Markham, York Region, CA
SRE Engineer

SRE Engineer

kloia • Markham, York Region, CA
Il y a plus de 30 jours
Type de contrat
  • Temps plein
Description de poste

Join to apply for the SRE Engineer role at kloia

Description

Kloia is a recognized AWS Premier Consulting Partner and CNCF member with a focus on Application Modernization and Digital Transition projects.

Our teams are growing rapidly, and we’re hiring a Site Reliability Engineer primarily for our managed services provided to customers, as well as for internal projects to build a scalable and reliable platform of common services.

What does SRE do?

In Kloia, the SRE Team focuses on eliminating toil in production workloads. Our main goal is to achieve 24x7 SLA with a support system and team that ‘Follow-the-Sun’ .

Key responsibilities include participating in design and development, making trade-offs between performance, cost, security, and reliability, and supporting the system in production as a reliable escalation point.

As an SRE, you will :

  • Eliminate toil through automation, re-architecting, and refactoring.
  • Approach incidents with an “Automate Everything” mindset.
  • Collaborate with software engineers to troubleshoot incidents.
  • Drive complex infrastructure changes with transparency and zero downtime.
  • Design and implement self-healing, reliable, and scalable infrastructure in a cloud-native environment.
  • Guide and unblock developers across teams to push their products forward.
  • Define SLOs and error quotas for production services.
  • Support our dev-ops culture, including participation in the follow-the-sun on-call rota.

Position : SRE (Site Reliability Engineer)

Location : Remote - LATAM / APAC

Level : Junior / Medior

What does an average day look like?

Proactively support production workloads, troubleshoot to find root causes, and write or review postmortems. Identify infrastructure and observability weaknesses.

Technical challenges include :

  • Optimizing resource allocation in Kubernetes for application performance.
  • Including API Gateway monitoring in APM for full observability.
  • Reducing database query hits.
  • Guiding development team on data layer caching.
  • Our stack is cloud-native, including AWS, Terraform, Docker / Kubernetes, Helm, ELK, Instana, OpsGenie, Node.js, Java, Typescript, Python. We expect candidates to have a deep understanding of Linux-based distributed systems at scale and relevant experience.

    Who should apply?

    This role suits those eager to work with cutting-edge cloud infrastructure at scale, passionate about automation, and capable of explaining complex concepts simply.

    Career benefits :

    Exposure to new technologies, working on products with global reach, and opportunities to develop both development and operations skills. We encourage continuous learning with initiatives like hack days and training.

    Requirements :

  • Excellent communication skills
  • Deep knowledge of Linux distributed systems at scale
  • Experience with AWS or other cloud providers
  • Experience with SQL / NoSQL databases at scale
  • Experience with service lifecycle and monitoring
  • Experience as a software or platform engineer / SRE
  • Experience with DevOps practices
  • Good understanding of Docker
  • Automation mindset
  • Nice to have :

  • Knowledge of Kubernetes
  • Experience with Terraform or other Infrastructure as Code tools
  • Benefits include :

  • Remote work flexibility
  • Home office budget
  • Hackathon days
  • Access to AWS and CNCF / Kubernetes training and certifications
  • R&D focus
  • Social activities like weekly Lunch & Learn, Fridays, socials, and online games
  • #J-18808-Ljbffr

    Créer une alerte emploi pour cette recherche

    Engineer Sre • Markham, York Region, CA

    Offres similaires
    SRE – Kubernetes, Cloud, Terraform, Azure

    SRE – Kubernetes, Cloud, Terraform, Azure

    Astra North Infoteck Inc. • Markham, ON, ca
    Temps plein
    Quick Apply
    Help the Cloud Architectural Artifacts.Networking, firewalling, routing.Implementation level architecture for scalable IaC strategies (e. Contribute to and maintain our org wide Architecture Decisio...Voir plus
    Dernière mise à jour : il y a 18 jours
    Site Reliability Engineer (SRE) – AWS

    Site Reliability Engineer (SRE) – AWS

    Astra North Infoteck Inc. • Toronto, ON, ca
    Temps plein
    Quick Apply
    Required Skills : Digital : Amazon Web Service(AWS) Cloud Computing~Digital : Site Reliability Engineering (SRE)~Dynatrace. Job description : ⦁ SRE Key Responsibilities.Design, implement, and maintain...Voir plus
    Dernière mise à jour : il y a 22 jours
    Staff SRE - Developer Experience & Velocity Leader

    Staff SRE - Developer Experience & Velocity Leader

    Updata Partners • Toronto
    Temps plein
    A tech company in Toronto is looking for a Staff Site Reliability Engineer to enhance engineering productivity and develop self-service platforms. This role demands over seven years of SRE and cloud...Voir plus
    Dernière mise à jour : il y a 7 jours • Offre sponsorisée
    Senior SRE : Global SaaS Platform & Kubernetes

    Senior SRE : Global SaaS Platform & Kubernetes

    Kong Inc. • Toronto
    Temps plein
    A leading developer of API technologies is seeking a Senior Site Reliability Engineer to join the global Platform SRE team in Toronto. The role involves building, operating, and scaling a multi-regi...Voir plus
    Dernière mise à jour : il y a 28 jours • Offre sponsorisée
    Hybrid Cloud SRE : Resilience & Observability

    Hybrid Cloud SRE : Resilience & Observability

    iManage • Toronto
    Temps plein
    A leading cloud software company is seeking a Site Reliability Engineer to enhance scalability and reliability of its cloud platform. The role involves automation, incident management, and collabora...Voir plus
    Dernière mise à jour : il y a 28 jours • Offre sponsorisée
    Sr. Site Reliability Engineer

    Sr. Site Reliability Engineer

    Jerry • Toronto, Canada
    Temps plein
    Senior Site Reliability Engineer Join Jerry.Senior Site Reliability Engineer.IPO startup, $240 million funded, 60× revenue growth, and tackling a $2 trillion market for car ownership.We’re building...Voir plus
    Dernière mise à jour : il y a 7 jours • Offre sponsorisée
    Site Reliability Engineer

    Site Reliability Engineer

    Kyndryl • Toronto
    Temps plein +1
    Join to apply for the Site Reliability Engineer role at Kyndryl.Direct message the job poster from Kyndryl.Recruitment & Strategic Staffing @Kyndryl | Partnering with IT Consultants in Financial Se...Voir plus
    Dernière mise à jour : il y a 21 jours • Offre sponsorisée
    Senior SRE — Scale, Automation & Cloud Reliability

    Senior SRE — Scale, Automation & Cloud Reliability

    getjerry.com • Toronto, Canada
    Temps plein
    A rapidly growing tech startup is looking for a Sr.Site Reliability Engineer to own its infrastructure.This role demands extensive experience with AWS services and strong coding abilities in Python...Voir plus
    Dernière mise à jour : il y a 5 jours • Offre sponsorisée
    Reservoir Engineer

    Reservoir Engineer

    Aramco • Toronto
    Temps plein
    Aramco energizes the world economy.Aramco occupies a special position in the global energy industry.We are one of the world's largest producers of hydrocarbon energy and chemicals, with among the l...Voir plus
    Dernière mise à jour : il y a 5 jours • Offre sponsorisée
    Senior SRE : AI-Driven CI / CD Platform

    Senior SRE : AI-Driven CI / CD Platform

    RBC • Toronto
    Temps plein
    A leading financial institution in Toronto is seeking a Senior Site Reliability Engineer to enhance their CI / CD deployment portal. The role focuses on improving application delivery and operational ...Voir plus
    Dernière mise à jour : il y a 28 jours • Offre sponsorisée
    Lead SRE - Azure Cloud, Automation & Reliability (Hybrid)

    Lead SRE - Azure Cloud, Automation & Reliability (Hybrid)

    SimCorp • Toronto
    Temps plein
    A global financial technology firm in Toronto is looking for a Lead Site Reliability Engineer to enhance product reliability and efficiency. The role requires extensive experience with Azure and str...Voir plus
    Dernière mise à jour : il y a 28 jours • Offre sponsorisée
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    PowerToFly • Toronto
    Temps plein
    We are seeking a highly motivated and experienced Senior Site Reliability Engineer (SRE) to manage critical cloud infrastructure and site reliability operations for Autodesk's global Product Access...Voir plus
    Dernière mise à jour : il y a 17 jours • Offre sponsorisée
    DevOps / SRE Engineer (Remote)

    DevOps / SRE Engineer (Remote)

    Rivalry • Toronto, ON, Canada
    Télétravail
    Temps plein
    Rivalry is a startup uniquely positioned to disrupt the dated online gambling space.The founders and staff come from the gaming and esports scene and are now working their way into the betting worl...Voir plus
    Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
    Senior Cloud SRE & IaC Engineer (Kubernetes)

    Senior Cloud SRE & IaC Engineer (Kubernetes)

    OpenText • Richmond Hill
    Temps plein
    A leading information management firm is seeking a Site Reliability Administrator to enhance cloud deployment processes.The role involves managing Kubernetes, on-call support, and collaborating wit...Voir plus
    Dernière mise à jour : il y a 7 jours • Offre sponsorisée
    Software Engineer (SRE)

    Software Engineer (SRE)

    Scotiabank • Toronto C6A, ON, Canada
    Télétravail
    Temps plein
    Global Banking and Markets Engineering (GBME) is the fast-moving, award-winning technology engine that powers Scotiabank’s Corporate, Investment Banking and Capital Markets businesses.The successfu...Voir plus
    Dernière mise à jour : il y a 17 jours • Offre sponsorisée
    Senior Platform Engineer (SRE)

    Senior Platform Engineer (SRE)

    Rover • Toronto
    Temps plein
    Get AI‑powered advice on this job and more exclusive features.We’re hiring a Staff Platform Engineer to establish Rover’s reliability foundation — from infrastructure to observability, from message...Voir plus
    Dernière mise à jour : il y a 28 jours • Offre sponsorisée
    SRE Engineer

    SRE Engineer

    J&M Group • Toronto
    Temps plein
    Proven experience as a Site Reliability Engineer or similar role.Strong hands-on expertise in Observability tools and practices. Solid experience with Ansible for automation and configuration manage...Voir plus
    Dernière mise à jour : il y a 28 jours • Offre sponsorisée
    Site Reliability Engineer (contract)

    Site Reliability Engineer (contract)

    Capgemini • Toronto
    Temps plein
    Site Reliability Engineer (contract) at Capgemini.We are seeking a Site Reliability Engineer to provide hands-on SRE support, ensuring application reliability, incident resolution, automation, and ...Voir plus
    Dernière mise à jour : il y a 28 jours • Offre sponsorisée