About the role
Our growing AI Engineering Team at the Lab is looking for a DevOps – AWS / Cloud.
This role is part of the AI Engineering team supporting AI and Data services on Cloud as well as on-prem infrastructure.
The DevOps will work closely with AI Developers on building solid solution to enable advanced AI capabilities for Data Scientists and will also contribute to day-to-day operation on the infrastructure management, security, monitoring, and alerting.
What you'll do here :
Administrate multiple systems :
Apply routinely updates especially security patches.
Respond to incidents, and minimize service interruptions.
Investigate and fixes complex defects.
Establish good monitoring and observability on systems health and cost.
Support developers and users :
Provide support in solving system issues and escalations which require complex technical expertise to troubleshoot.
Perform root cause analysis, interprets the results, and develops action plan in backlog (identifying short- and long-term solutions).
Maintain open and efficient communication with developers and users while troubleshooting issues.
Participate in proof-of-concept for medium to large initiatives
Facilitate development and design of technical solutions :
Enhance existing products, and propose new / future releases for better security, scalability, flexibility and efficiency.
Assist development of requirements, understanding impact of architecture on overall solution.
Assist development teams by providing guidance of operation best practices.
What you bring to the table :
Bachelor’s degree in computer science, computer engineering or any combination of equivalent education and experience.
5 to 8 years (senior)
Must have in this role : Kubernetes administration, Network administration, Unix sysadmin, Terraform, AWS basics (EC2, S3, VPC, Route53, Security, logging), GitLab (especially CICD) or Github actions administration , ELasticSearch & Kibana administration
Preferred Experiences : AWS advanced (Lambda, ECS, LB, WAF, Aurora, Sagemaker), Databricks, Airflow, PostgreSQL, VMWare vCenter, Prometheus & Grafana
A self-starter, who can drive and follow through the initiatives from the beginning to the end.
Excellent communication skills, spoken and written.
For candidates located in Quebec, bilingualism is required considering the necessity to interact on a regular basis with English-speaking colleagues across the country.
No Canadian work experience required however must be eligible to work in Canada.
#LI-Hybrid
Il s'agit d'un nouveau rôle au sein de notre équipe en plein croissance | This role is a new member of our growing team.
Senior Ai • Bourassa, Robert,Montréal