Job descriptionRole Overview: We are seeking a Platform Specialist to join our team supporting a large-scale Kubernetes platform. The ideal candidate will have deep hands-on expertise in Kubernetes operations, VMware Telco Cloud Automation (TCA), Tanzu Kubernetes Grid (TKG), and VMware-based infrastructure. This role involves designing, deploying, and operating production-grade Kubernetes clusters across multiple data centers in a private cloud environment, with a focus on reliability, automation, and scalability. Work Location: Brampton, Ontario / Onsite Salary Range: CAD 102,000 to CAD 115,000/ yearly Key Responsibilities: - Design, deploy, and manage Kubernetes clusters at scale across multiple production sites using VMware Tanzu Kubernetes Grid (TKG) and VMware Telco Cloud Automation (TCA). - Operate and maintain VMware-based infrastructure including vSphere, VCF, NSX-T, TCA, and TKG/VKS. - Manage cluster lifecycle activities including upgrades, patching, capacity planning, and security hardening. - Contribute to platform automation, monitoring, observability, and disaster recovery practices. - Troubleshoot complex production issues spanning Kubernetes, networking, storage, and underlying infrastructure. - Configure and maintain ingress, load balancing (AVI/AKO), and service mesh solutions. - Implement and maintain GitOps-based deployment pipelines and Infrastructure as Code Experiences desired: - 8+ years of experience in enterprise infrastructure, with at least 4+ years focused on Kubernetes/TKG. - Strong hands-on expertise with Kubernetes administration (CKA certification preferred). - Hands-on experience with VMware Tanzu Kubernetes Grid (TKG) and VMware Telco Cloud Automation (TCA), including cluster provisioning, lifecycle management, and CNF/VNF onboarding. - Solid VMware background including vSphere, VCF, NSX-T, AVI and Tanzu/TKG/VKS. - Proficiency with Infrastructure as Code tools (Terraform, Ansible) and GitOps workflows (ArgoCD) - Experience operating Kubernetes platforms in production at scale across multiple sites. - Working knowledge of container registries (Harbor), Helm, and OCI standards. - Familiarity with monitoring and observability tooling. - Strong understanding of Linux systems, networking, and storage fundamentals. - Excellent troubleshooting and problem-solving skills in complex distributed systems. Nice to Have: • VMware certifications (VCP-DCV, VCAP-DCV, VMware Telco Cloud). • Experience in telecom or large-scale enterprise environments. • Scripting skills in Bash and at least one of Python/Go.