Search jobs > Toronto, ON > Cloud engineer

Cloud Ops Engineer

Extreme Reach
Toronto, ON
Full-time

XR is a global technology platform powering the creative economy. Its unified platform moves creative and productions forward, simplifying the fragmentation and delivering global insights that drive increased business value.

XR operates in 130 countries and 45 languages, serving the top global advertisers and enabling $150 billion in video ad spend around the world.

More than half a billion creative brand assets are managed in XR's enterprise platform.

Above all, we are a supportive and collaborative culture dedicated to DEI. We are caring, dedicated, positive, genuine, trustworthy, experienced, passionate and fun people with loyalty to our customers and our fellow teammates.

It is our belief that the better we work together to help our clients achieve their goals, the more successful XR will be.

The Opportunity

If building and maintaining cloud scalable systems and solving complex problems is your passion, you are reliable, collegial and thrive in a fast-paced collaborative environment, this is the job for you.

The Cloud Operations Engineer will use their expertise to design, develop, and document our cloud infrastructure, and cloud monitoring solutions.

This individual will work with team members to gather requirements and deploy cloud technology to help scale our platform used to power the world's video advertising.

Job Responsibilities :

  • Automate manual ops tasks to streamline processes and reduce manual effort.
  • Identify areas where systems can be improved to increase system reliability and reduce system incidents.
  • Manage and support proactive monitoring solutions across the Production environment
  • Look for trends and themes in issues reported in Live Applications and facilitate investigations by Developers to avoid repeated occurrences
  • Perform actions on the Product codebase (backend / frontend) for real-time diagnosis of major incidents in Live systems
  • Analyze and diagnose 'difficult' or tricky to reproduce problems
  • Perform analysis and reporting on frequently occurring Live problems
  • Assist Developers who are fixing bugs to understand the detail and user scenarios around reported bugs to accelerate triage and fixing
  • Serve in IT Tier 3 support of Extreme Reach Production infrastructure, part of on-call rotations supporting infrastructure and services 24x7
  • Creates and manages Automated Infrastructure solutions. Automated builds and configuration management.
  • Understands the fundamentals of large scale on-prem and cloud mission critical systems; networking, security, redundancy, scalability, monitoring, & performance KPIs.
  • Fast, adaptable, with a proven ability to integrate and exploit new technologies, PAAS offerings, and API's

Requirements

  • 5+ years of hands-on experience with DevOps and tools including GIT, CI / CD environments; designing / writing / delivering system automation
  • AWS Professional certification preferred
  • Strong PowerShell experience, as well as other programming or scripting languages Python, Bash / Shell, Java, JavaScript and / or node.js.
  • Experience working with Jenkins, Ansible and Terraform
  • Ability to think holistically, putting the customer first for a given project or problem.
  • Creative, resourceful, problem solver with an aptitude for systems thinking.
  • Strong written and oral communication skills including the ability to communicate complex issues to technical and non-technical staff and management.
  • Hands-on experience of AWS preferably in a large-scale enterprise system
  • Understanding of Docker & Kubernetes and Container technology
  • Knowledge of Monitoring and alerting tools such as Grafana and DataDog
  • Understanding of general security architecture and design.
  • Understanding of source control and change management.
  • Ability to create and maintain technical references for team members through either Api integrations or static intranet articles including diagrams, spreadsheets, and checklists
  • Ability to prioritize and multitask in a fast-paced environment

Benefits

ER Culture & Why You Will Love Working Here

  • XR has 23 offices worldwide and teams spread throughout the US, EMEA and APAC, our multicultural teams work cross-departmentally and across continents and cultures towards a shared goal
  • It is our belief that the better we work together to help our clients achieve their goals, the more successful XR will be
  • Our leadership is provided a great deal of autonomy and freedom in their individual roles, they are encouraged to be self starters and to continuously develop their skills
  • Feedback from internal Employee Engagement Surveys cites the People, Teamwork and Flexibility as the most rewarding aspects of working at XR.
  • We are a supportive and collaborative culture that values multiple perspectives, fresh thinking and is dedicated to DEI
  • XR celebrates diversity of ideas, people and experiences
  • Generous PTO, flexible work schedules and hybrid working arrangements create a rewarding work-life balance
  • 23 hours ago
Related jobs
Stafflink
Toronto, Ontario

We are seeking an experienced Cloud Ops Engineer to join our client's dynamic team. Proficiency in AWS technologies such as ECS, ELB, EC2, S3, RDS, Redis, IAM, WAF, Route 53, CloudFront, CodeDeploy, CloudFormation, and CloudWatch. Experience setting up and configuring cloud monitoring tools such as ...

Highbrow LLC
Toronto, Ontario

Implement the enterprise cloud capability and enhance the cloud orchestration platform for automated provisioning, management and scalability of hosts, containers, applications, and cloud services (AquaSec, Wiz. Develop APIs and Webhook for multi-directional integration of cloud orchestration platfo...

Extreme Reach
Toronto, Ontario

The Cloud Operations Engineer will use their expertise to design, develop, and document our cloud infrastructure, and cloud monitoring solutions. If building and maintaining cloud scalable systems and solving complex problems is your passion, you are reliable, collegial and thrive in a fast-paced co...

Highbrow LLC
Toronto, Ontario

Implement the enterprise cloud capability and enhance the cloud orchestration platform for automated provisioning, management and scalability of hosts, containers, applications, and cloud services (AquaSec, Wiz. Develop APIs and Webhook for multi-directional integration of cloud orchestration platfo...

Extreme Reach
Toronto, Ontario

The Cloud Operations Engineer will use their expertise to design, develop, and document our cloud infrastructure, and cloud monitoring solutions. If building and maintaining cloud scalable systems and solving complex problems is your passion, you are reliable, collegial and thrive in a fast-paced co...

Promoted
Outlier
Vaughan, Ontario

Are you an experienced software engineer who would like to lend your coding expertise to train AI models?. PLEASE NOTE: We collect, retain and use personal data for our professional business purposes, including notifying you of opportunities that may be of interest and sharing with our affiliates. W...

Promoted
freelance.ca
Toronto, Ontario

Systems administrator experience with service now – 5+ years. Since its inception, the Group has based its development on a strong culture of entrepreneurship and innovation, and on the support and upskilling of its 7800 employees who are committed every day to promoting the complementarity between ...

Promoted
IT Accel
Toronto, Ontario

The Infrastructure Technology Solutions - Cloud Compute team is responsible for the introduction and maintenance of technologies that support our Azure and Google public cloud environments. Assess, engineer, and deliver IaaS, storage, and other compute related technologies and solutions to the publi...

Promoted
Hire IT People Inc
Toronto, Ontario

Systems Administrator working with Solaris 10/11, RHEL 6/7/8, CentOS 7. Maintenance of Linux/Unix system administration in Dev/QA/UAT/DR/Prod environments. ...

SkySys
Toronto, Ontario

We have a newly created position for a WorkspaceONE Engineer to join our team. Working incidents/service request in the WorkspaceONE domain, Engineer will be responsible for day-to-day client interaction and remotely able to assist the greater shared service team. ...