Join to apply for the SRE Engineer role at kloia
Description
Kloia is a recognized AWS Premier Consulting Partner and CNCF member with a focus on Application Modernization and Digital Transition projects.
Our teams are growing rapidly, and we’re hiring a Site Reliability Engineer primarily for our managed services provided to customers, as well as for internal projects to build a scalable and reliable platform of common services.
What does SRE do?
In Kloia, the SRE Team focuses on eliminating toil in production workloads. Our main goal is to achieve 24x7 SLA with a support system and team that ‘Follow-the-Sun’ .
Key responsibilities include participating in design and development, making trade-offs between performance, cost, security, and reliability, and supporting the system in production as a reliable escalation point.
As an SRE, you will :
Position : SRE (Site Reliability Engineer)
Location : Remote - LATAM / APAC
Level : Junior / Medior
What does an average day look like?
Proactively support production workloads, troubleshoot to find root causes, and write or review postmortems. Identify infrastructure and observability weaknesses.
Technical challenges include :
Our stack is cloud-native, including AWS, Terraform, Docker / Kubernetes, Helm, ELK, Instana, OpsGenie, Node.js, Java, Typescript, Python. We expect candidates to have a deep understanding of Linux-based distributed systems at scale and relevant experience.
Who should apply?
This role suits those eager to work with cutting-edge cloud infrastructure at scale, passionate about automation, and capable of explaining complex concepts simply.
Career benefits :
Exposure to new technologies, working on products with global reach, and opportunities to develop both development and operations skills. We encourage continuous learning with initiatives like hack days and training.
Requirements :
Nice to have :
Benefits include :
#J-18808-Ljbffr
Engineer Sre • Markham, York Region, CA