This is Prashant, a Senior Recruiter from Triunity Software Inc., a leading staffing organization.
JD :
NOTE : Only candidates local to Montreal or elsewhere in Quebec will be considered, and for full-time employment (FTE) only.
Role : Private Cloud Site Reliability Specialist
Location : Montreal, QC (hybrid role, on-site 3 days / week)
Type : FTE
Experience : 5+ Years
Responsibilities :
- Provide L3 support for a private cloud, including on-call rotation
- Work closely with the internal engineering team and provide input on testing of new component releases and infrastructure upgrades, as well as performance, capacity, and monitoring
- Create and improve processes for support, including training, documentation, customer engagement, incident, problem, and change management
- Contribute to internally developed CLIs and APIs that automate SRE activities and platform operations
- Work together with L2 teams and other L3 team members internationally
Qualifications :
- 5 to 10 years of relevant experience in platform maintenance / development
- Experience in at least one programming language
- Experience maintaining complex production systems with cloud and legacy technologies
- Proven Kubernetes and Docker experience
- Knowledge of monitoring stack usage (Grafana, Prometheus, Splunk)
- Strong organizational skills and ability to manage multiple tasks and high-pressure situations for outage resolution
- Ability to communicate effectively with various user groups, e.g. developers and engineers, as well as remote team members
- Experience in developing monitoring architecture and implementing monitoring agents and alerts
- Experience in Golang, React, Kubernetes Operators
- Knowledge of security protocols, e.g. SSL / TLS, Kerberos