Talent.com
08763 Citi Canada Technology Services ULC
Cloud Platform DevOps Engineer - Assistant Vice President

08763 Citi Canada Technology Services ULC • Mississauga Ontario Canada
2 days ago
Job type
  • Full-time
Job description

We are seeking an experienced (5+ years), motivated, and hands-on Cloud Platform DevOps Engineer to join our North American AI and DevOps Platform Engineering team. In this critical role, you will be responsible for enhancing the stability, reliability, and performance of our AI and DevOps platforms, which support a diverse ecosystem of AI applications, developer tools, and CI/CD pipeline technologies across the organization. You will actively contribute to infrastructure design, implementation, and maintenance, and facilitate agile development within the team.

The ideal candidate is a strong technical leader who champions agile practices, drives continuous improvement, and excels in both coding and coaching. You have a deep understanding of the infrastructure and operational considerations for Artificial Intelligence and Machine Learning initiatives, proven hands-on experience with DevOps tools and technologies such as Kubernetes, Docker, Helm, Ansible, or similar CI/CD platforms, and proficiency in scripting and automation (e.g., Python, Bash). We are looking for someone with a track record of implementing scalable, resilient, and high-performance solutions, strong communication and collaboration skills, and the ability to mentor and guide junior team members. You will join a dynamic team committed to fostering innovation and collaboration.

Responsibilities:

Hands-on DevOps & Infrastructure Engineering

  • Design & Implementation: Lead the design, implementation, and ongoing management of secure, scalable, and resilient infrastructure components.

  • Secret & Certificate Management: Administer and maintain secret and certificate management solutions using HashiCorp Vault, including policy definition and integration.

  • Database Management: Perform hands-on administration and optimization of database systems (PostgreSQL, Oracle, MongoDB), including performance tuning, backup, and recovery strategies.

  • Workflow Orchestration: Deploy, monitor, and troubleshoot data orchestration workflows using Apache Airflow, and develop/optimize DAGs.

  • Messaging Systems: Implement and manage messaging queues such as Kafka and IBM MQ, including cluster setup and configuration.

  • API Integrations: Develop, maintain, and troubleshoot RESTful API and SOAP integrations critical for system connectivity.

  • Build Automation: Implement and optimize build and deployment processes using Gradle.

  • Container Orchestration: Design, implement, and manage container orchestration platforms with Kubernetes and Helm, including integration with CyberArk and HashiCorp Vault for secrets management. Create, debug, and troubleshoot Kubernetes Pods, Jobs, and Deployments using YAML.

  • Storage Management: Configure and manage persistent storage solutions including PVC, SONiC NAS, and S3, with an awareness of storage requirements for AI/ML workloads.

  • Networking & Load Balancing: Set up and maintain load balancing solutions (e.g., Nginx, HAProxy, AWS ELB/ALB, Kubernetes Ingress controllers) for high availability and performance.

  • Monitoring & Logging: Implement, configure, and utilize comprehensive monitoring and logging solutions (Prometheus, Grafana, ELK Stack) to ensure system health and proactively identify issues, including those relevant to AI/ML applications.

  • Automation & Scripting: Develop robust automation scripts and tools using Python, Bash, Go, or similar languages to streamline operations and enhance efficiency.

  • Incident Response: Participate actively in on-call rotations, responding to and resolving critical incidents with hands-on troubleshooting.

  • Documentation: Create and maintain technical documentation, architecture diagrams, and runbooks for infrastructure components and processes.

  • Impediment Resolution: Proactively identify and resolve technical impediments and process bottlenecks within the team and across organizational boundaries, paying special attention to unique challenges posed by AI/ML infrastructure.

  • Backlog Refinement: Collaborate closely with stakeholders (e.g., product owners, technical leads) to ensure a well-defined and prioritized backlog for infrastructure work, technical debt, operational improvements, and AI/ML platform needs.

  • Process Improvement: Drive continuous improvement in the team's agile and DevOps practices, helping them adapt and optimize their workflow for maximum efficiency and quality.

Required Qualifications:

Hands-on DevOps & Infrastructure Engineering Expertise

  • Secret & Certificate Management: Proven hands-on experience with HashiCorp Vault (installation, configuration, policy management, integrations).

  • Database Administration: Strong hands-on experience with at least two of PostgreSQL, Oracle, or MongoDB (installation, tuning, replication, backup/restore).

  • Workflow Orchestration: Hands-on experience deploying, managing, and developing DAGs for Apache Airflow.

  • Messaging Systems: Solid hands-on experience with Kafka and/or IBM MQ (cluster setup, topic management, producer/consumer configuration).

  • Container Orchestration: In-depth hands-on experience with Kubernetes and Helm, including YAML configuration, troubleshooting Pods/Jobs/Deployments, and integrations with secrets management (CyberArk, HashiCorp Vault).

  • Storage Management: Practical experience with Kubernetes PVCs, Persistent Volumes, S3, and/or enterprise NAS solutions (e.g., SONiC NAS).

  • Monitoring & Logging: Strong hands-on experience with Prometheus, Grafana, and the ELK Stack (setup, dashboard creation, query optimization, alert configuration).

  • Scripting & Automation: High proficiency in Python, Bash, or Go for automation, tooling development, and system administration.

  • Cloud Platforms: Extensive hands-on experience with at least one major cloud provider (AWS, Azure, GCP).

  • Infrastructure as Code (IaC): Proficiency with IaC tools such as Terraform or Ansible.

  • CI/CD: Experience designing, implementing, and maintaining CI/CD pipelines (e.g., Jenkins, GitLab CI, GitHub Actions).

  • API Integration: Experience with RESTful API and SOAP web services.

  • Build Tools: Proficiency with Gradle for build automation.

AI/ML Awareness & Support

  • AI/ML Infrastructure Concepts: Understanding of the specific infrastructure requirements for deploying, managing, and scaling Artificial Intelligence and Machine Learning workloads (e.g., GPU resources, specialized storage, MLOps pipelines).

  • Data for AI/ML: Awareness of data management strategies and data governance principles relevant to AI/ML models and training datasets.

  • Monitoring AI/ML Systems: Familiarity with metrics and monitoring approaches for the performance and health of AI/ML applications and their underlying infrastructure.

Agile & Leadership Skills

  • Working Scrum Master Experience: Proven experience acting as a Scrum Master within a technical team where you also performed significant hands-on engineering.

  • Agile & Scrum Mastery: In-depth knowledge and practical application of Agile principles and the Scrum framework.

  • Facilitation & Coaching: Excellent facilitation, coaching, and mentoring skills within a technical context.

  • Communication: Strong verbal and written communication skills, able to bridge technical and process discussions.

  • Technical Leadership: Ability to guide technical discussions, influence architectural decisions, and drive best practices.

Preferred Qualifications:

  • Certified ScrumMaster (CSM) or Professional Scrum Master (PSM) certification.

  • Relevant cloud certifications (e.g., AWS Certified DevOps Engineer, Azure DevOps Engineer Expert, GCP Professional Cloud DevOps Engineer).

  • Experience with site reliability engineering (SRE) principles and practices.

  • Familiarity with other Agile scaling frameworks (e.g., SAFe, LeSS).

  • Exposure to MLOps platforms or tools (e.g., Kubeflow, MLflow).

Education:

  • Bachelor's or Master's degree in Computer Science, Engineering, or a related technical field, or equivalent experience.

------------------------------------------------------

Job Family Group:

Technology

------------------------------------------------------

Job Family:

Applications Development

------------------------------------------------------

Time Type:

Full time

------------------------------------------------------

Primary Location Full Time Salary Range:

$94, - $141,

------------------------------------------------------

Most Relevant Skills

Please see the requirements listed above.

------------------------------------------------------

Other Relevant Skills

For complementary skills, please see above and/or contact the recruiter.

------------------------------------------------------

Automated Processing and AI

We use automated processing, including artificial intelligence, for our legitimate business interests (or our reasonable and appropriate business purposes) to identify and align the candidate's skills and abilities with a specific job opening. Additionally, if you so choose, or consent, we can match your skills and abilities to other suitable roles at Citi.

Importantly, all our hiring processes and decisions, including determining your suitability for a role, are conducted, checked, and decided by individuals. Our automated processing and AI do not involve relying on automatic or autonomous decision-making. Please refer to any Jurisdictional Considerations, with specific provisions for your country (where relevant) for further details.

------------------------------------------------------
