No longer accepting applications
Senior Site Reliability Developer (SRE)

Autodesk • Toronto, Canada
13 days ago
Job type
  • Full-time
Job description
Job Requisition ID #26WD94664

Position Overview

We are seeking a highly motivated and experienced Senior Site Reliability Developer (SRE) to manage critical cloud infrastructure and site reliability operations for the Autodesk Platform Services and Emerging Technologies organization. The team delivers high-value, exabyte-scale cloud data platform components that power desktop, mobile, and web products. These components enable our product teams to build cohesive in-product data experiences, our partners to integrate and expand our data, and our end-users to work with their data across all Autodesk products.

This pivotal role focuses on ensuring the highest reliability, availability, and performance of our AWS-hosted cloud infrastructure. Reporting to the Engineering Manager, you will lead the design and development of resilient, scalable architecture and innovative solutions for the platform. You will independently manage and deliver end-to-end solutions while engaging with key stakeholders and partners.

Responsibilities

Lead architecture, solution design, development and maintenance of cloud infrastructure for microservices architecture

Independently manage requirement analysis, solution design, implementation, and release planning

Ensure strict adherence to security, trust, compliance guidelines, and standards

Streamline CI/CD processes, improve system reliability, and ensure infrastructure scalability and security

Automate infrastructure deployment, scaling, and management using modern DevOps tools and practices

Implement and maintain configuration management and infrastructure as code (IaC) using Terraform

Lead Disaster Recovery (DR) strategies, failover exercises, gamedays, and periodic maintenance activities

Contribute to remediation of critical vulnerabilities (CVEs)

Promote and document security and best practices across all pillars of DevOps/SRE throughout system design

Provide real-time operational support and collaborate across functions to resolve system, infrastructure, and CI/CD issues

Participate in on-call rotations, providing critical 24x7 support for production systems

Minimum Qualifications

Bachelor’s degree or higher in Computer Science, Engineering, or a related field

5+ years of progressive experience in Site Reliability Engineering, DevOps, or a similar field

Proficiency with managing AWS resources and understanding of networking and security protocols

Expertise in infrastructure as code (IaC) and cloud automation tools such as Terraform, Serverless, and CloudFormation

Expertise in defining and building CI/CD processes with tools like Jenkins, GitHub, and Artifactory

Experience with container-based technologies like Docker, Kubernetes and AWS ECS

Experience with monitoring and logging tools such as Dynatrace, Grafana, DataDog, ELK Stack, and CloudWatch

Experience in Linux Systems Administration, scripting, and troubleshooting in a production environment

Strong experience with UNIX/Linux systems and programming languages such as Python, Go, Bash, Groovy, and Node.js

Technology Stack: Java/SpringBoot, AWS (ECS Fargate, ElastiCache, Lambda, Kinesis, DynamoDB, VPC, IAM policies, API Gateway, NLB/ALB, Route 53, CloudWatch, Kibana, OpenSearch), Kafka, Flink, Jenkins, GitHub, Jira, Google Apigee, ServiceNow, and Splunk

Preferred Qualifications

Knowledge of applying AI and ML solutions for engineering processes and/or DevOps automation

Knowledge of standardized observability frameworks such as OpenTelemetry

Relevant certifications (e.g., AWS Certified DevOps Engineer, AWS Site Reliability Engineer)

Broad knowledge of AWS, Redis, server programming, databases, and cloud architectures

Broad knowledge of data streaming pipelines like Kinesis, Firehose, and Kafka

Knowledge of core Java and SpringBoot concepts, including JVM optimization

Knowledge of build tools such as Gradle

Strong interpersonal and communication skills to effectively collaborate in an Agile/Scrum-oriented environment

Self-directed team player and independent contributor, demonstrating accountability and end-to-end ownership

Learn More About Autodesk

Welcome to Autodesk! Amazing things are created every day with our software – from the greenest buildings and cleanest cars to the smartest factories and biggest hit movies. We help innovators turn their ideas into reality, transforming not only how things are made, but what can be made.

We take great pride in our culture here at Autodesk – it’s at the core of everything we do. Our culture guides the way we work and treat each other, informs how we connect with customers and partners, and defines how we show up in the world.

When you’re an Autodesker, you can do meaningful work that helps build a better world designed and made for all. Ready to shape the world and your future? Join us!

Salary transparency

Salary is one part of Autodesk’s competitive compensation package. For Canada-BC based roles, we expect a starting base salary between $107,000 and $157,300. Offers are based on the candidate’s experience and geographic location, and may exceed this range. In addition to base salaries, our compensation package may include annual cash bonuses, commissions for sales roles, stock grants, and a comprehensive benefits package.

Diversity & Belonging

We take pride in cultivating a culture of belonging where everyone can thrive. Learn more here: https://www.autodesk.com/company/diversity-and-belonging

Are you an existing contractor or consultant with Autodesk?

Please search for open jobs and apply internally (not on this external site).

What Autodesk Has to Offer

Insurance: Health/Dental/Vision/Life

Work-life balance

Paid volunteer time off

6-week paid sabbatical every 4 years

Employee Resource Groups

A "week of rest" at year's end
