Talent.com
Hitachi Rail
SRE/DevOps Engineer - 67533Hitachi Rail • CA Toronto
SRE/DevOps Engineer - 67533

SRE/DevOps Engineer - 67533

Hitachi Rail • CA Toronto
8 days ago
Job type
  • Full-time
Job description

Function

Cloud & Data Engineering

Our Company

We’re Hitachi Digital Services, a global digital solutions and transformation business with a bold vision of our world’s potential. We’re people-centric and here to power good. Every day, we future-proof urban spaces, conserve natural resources, protect rainforests, and save lives. This is a world where innovation, technology, and deep expertise come together to take our company and customers from what’s now to what’s next. We make it happen through the power of acceleration.

Imagine the sheer breadth of talent it takes to bring a better tomorrow closer to today. We don’t expect you to ‘fit’ every requirement – your life experience, character, perspective, and passion for achieving great things in the world are equally as important to us.

Job description

Meet Our Team

Join our Site Reliability Engineering (SRE) Operations team, where reliability, automation, and operational excellence are at the heart of everything we do. We ensure the stability, availability, and performance of enterprise applications running across modern cloud-native and hybrid platforms, including Kubernetes, APIs, cloud services, databases, Kafka, and API gateways.

As an L1 SRE Operations Engineer, you'll be the first line of defense, monitoring production environments, responding to alerts, executing operational runbooks, and partnering with senior engineers to maintain highly available and resilient platforms. This is an excellent opportunity for professionals looking to build hands-on experience in cloud operations, DevOps, and Site Reliability Engineering.

What You'll Be Doing

  • Monitor enterprise applications, infrastructure, dashboards, logs, and alerts across cloud and on-premises environments.
  • Perform first-level incident triage by analyzing alerts, collecting logs and metrics, and determining whether issues are application or platform related.
  • Execute standardized operational runbooks for incident resolution, deployments, maintenance activities, and routine operational tasks.
  • Monitor and support Kubernetes environments by validating pod health, deployments, namespaces, logs, and service endpoints.
  • Troubleshoot infrastructure and application issues using Linux utilities, networking tools, and monitoring platforms.
  • Escalate complex incidents to L2/L3 engineering teams with complete diagnostic information to accelerate resolution.
  • Support API gateways, web application firewalls (WAF), Kafka platforms, databases, and cloud infrastructure across AWS, Azure, and GCP.
  • Maintain accurate incident documentation, operational records, and knowledge base updates while identifying opportunities to improve runbooks and automation.
  • Collaborate with development, platform engineering, and infrastructure teams during incident response and production support.
  • Assist with onboarding new applications into the operational support framework while ensuring monitoring, alerting, and operational readiness.
  • Contribute to continuous improvement by identifying repetitive manual activities suitable for automation.
  • Provide timely and professional communication to stakeholders during production incidents and operational events.

What You'll Bring to the Team

Required Qualifications

  • 2–5 years of experience in IT Operations, NOC, SRE, DevOps, or Infrastructure Support.
  • Working knowledge of Kubernetes administration and day-to-day cluster operations.
  • Good understanding of Linux administration and command-line troubleshooting.
  • Familiarity with cloud platforms such as AWS, Microsoft Azure, or Google Cloud Platform.
  • Experience with observability and monitoring tools such as Prometheus, Grafana, Splunk, ELK Stack, Datadog, Argos, or AIOps platforms.
  • Ability to execute operational runbooks and follow structured incident response procedures.
  • Experience using Kubernetes CLI (kubectl) to verify pod health, deployments, namespaces, and application logs.
  • Basic scripting knowledge in Python, Bash, or PowerShell for operational automation.
  • Understanding of networking fundamentals including DNS, HTTP/HTTPS, TCP/IP, firewalls, WAF, proxies, connectivity troubleshooting, and diagnostic tools such as ping, curl, netstat, and traceroute.
  • Strong analytical and troubleshooting skills using structured problem-solving techniques such as 5 Whys and Fishbone Analysis.
  • Excellent documentation, communication, and stakeholder management skills.

Preferred Qualifications

  • Experience working with API gateways such as Apigee or Gloo API Gateway.
  • Basic knowledge of SQL and NoSQL databases with the ability to validate database connectivity.
  • Familiarity with messaging platforms such as Apache Kafka.
  • Experience with ITSM and incident management tools including ServiceNow, Jira, xMatters, or similar platforms.
  • Exposure to automation and self-service operations initiatives.
  • Experience using AI-assisted operational tools or chatbots for runbook search, log summarization, and incident analysis.
  • Understanding of cloud-native application architectures, CI/CD pipelines, and production support best practices.
  • Passion for continuous learning, operational excellence, and improving system reliability through automation.

About us

We’re a global, team of innovators. Together, we harness engineering excellence and passion to co-create meaningful solutions to complex challenges. We turn organizations into data-driven leaders that can make a positive impact on their industries and society. If you believe that innovation can bring a better tomorrow closer to today, this is the place for you.

Fostering innovation through diverse perspectives

Hitachi is a global company operating across a wide range of industries and regions. One of the things that sets Hitachi apart is the diversity of our business and people, which drives our innovation and growth.

We are committed to building an inclusive culture based on mutual respect and merit-based systems. We believe that when people feel valued, heard, and safe to express themselves, they do their best work.

How we look after you

We help take care of your today and tomorrow with industry-leading benefits, support, and services that look after your holistic health and wellbeing. We’re also champions of life balance and offer flexible arrangements that work for you (role and location dependent). We’re always looking for new ways of working that bring out our best, which leads to unexpected ideas. So here, you’ll experience a sense of belonging, and discover autonomy, freedom, and ownership as you work alongside talented people you enjoy sharing knowledge with.

We’re proud to say we’re an equal opportunity employer and welcome all applicants for employment without attention to race, colour, religion, sex, sexual orientation, gender identity, national origin, veteran, age, disability status or any other protected characteristic. Should you need reasonable accommodations during the recruitment process, please let us know so that we can do our best to set you up for success.

Create a job alert for this search

SRE/DevOps Engineer - 67533 • CA Toronto

Similar jobs

SRE-DevSecOps Engineer

High Tech GenesisToronto, ON, CA
Full-time

Allowed Staffing Countries: Canada, Costa Rica, Mexico or Brazil, (Remote).High Tech Genesis is seeking a 3-month contractor who can hit the ground running to support our SaaS platform on AWS.Kuber... Show more

 • Promoted

Senior SRE: Global SaaS Platform & Kubernetes

Kong Inc.Toronto, Ontario, Canada
Full-time

A leading developer of API technologies is seeking a Senior Site Reliability Engineer to join the global Platform SRE team in Toronto.The role involves building, operating, and scaling a multi-regi... Show more

 • Promoted

DevOps Engineer

Onico SolutionsRichmond Hill, York Region, CA
Permanent

You own and represent the services and tools everyone needs to be successful in the organization.Champion DevOps adoption and ensure best practice are followed and ensure no developers are left beh... Show more

 • Promoted

Senior SRE — Distributed Data Platforms & Cloud Infra

OpenTextRichmond Hill, York Region, CA
Full-time

A leading information management company is seeking a Senior Site Reliability Engineer to ensure the reliability of their customer-facing SaaS platforms.You will work on distributed systems using t... Show more

 • Promoted

Sr. DevOps Engineer

Layer 6 AIToronto
Full-time

We are looking for world-class devops engineers and problem solvers.You will be interacting with machine learning scientists and engineers to develop and automate machine learning pipelines that wo... Show more

 • Promoted

Senior SRE/DevOps Engineer - Kubernetes & Observability

Infotek Consulting Inc.Toronto
Full-time

A consulting firm in Canada is seeking a skilled Site Reliability / DevOps Engineer for a contract role in Toronto.The ideal candidate will have over 10 years of experience in SRE/DevOps, strong ex... Show more

 • Promoted

Sr. DevOps Engineer

emergiTEL Inc.Toronto
Full-time +1

Senior DevOps Engineer – AWS (Fintech / Cryptocurrency)EmergiTEL is hiring a Senior DevOps Engineer – AWS for our client in the fintech / cryptocurrency industry.Compensation: $144,000 – $180,000 p... Show more

 • Promoted

Senior Infrastructure/ DevOps Engineer

LazerToronto, Ontario, Canada
Full-time

Lazer is a world-class digital product studio composed of 180+ senior engineers and designers with backgrounds from companies like Apple, Google, Coinbase, and more.With our product experience, we ... Show more

 • Promoted

DevOps Engineer

LuxeTech Inc.Toronto, Ontario, Canada
Full-time

Senior DevOps Engineer (FinTech / Mission-Critical Infrastructure) Overview LuxeTech is representing a high-growth financial institution currently executing a digital transformation from legacy mon... Show more

 • Promoted

Senior SRE

ViafouraToronto, ON, CA
Full-time

Senior Site Reliability Engineer.Viafoura is a leading audience engagement platform that powers real-time conversations and community experiences for digital publishers and brands worldwide.We're s... Show more

 • Promoted

Senior SRE – Kubernetes Platform & CRE Scaling

Chainlink LabsToronto, ON, CA
Full-time

A global blockchain technology company is seeking an experienced Infrastructure Engineer to design and build foundational infrastructure for its decentralized oracle networks.The ideal candidate wi... Show more

 • Promoted

Senior DevOps Engineer

TeraWatt InfrastructureToronto, ON, CA
Permanent

The once in a century transition to autonomous and electric vehicles is underway and will require a multi-trillion-dollar investment in energy and charging infrastructure, and the real estate to si... Show more

 • Promoted

SRE / DevOps Manager

UpshopToronto
Full-time

We are seeking a seasoned SRE / DevOps Manager to lead our reliability and operations engineering team.You will be responsible for ensuring the scalability, security, and performance of our infrast... Show more

 • Promoted

Senior Infrastructure/ DevOps Engineer

Lazer TechnologiesToronto, Ontario, Canada
Full-time

Senior Infrastructure/ DevOps Engineer at Lazer Technologies Lazer is a world-class digital product studio composed of 180+ senior engineers and designers with backgrounds from companies like Apple... Show more

 • Promoted

Senior FinTech DevOps & SRE: Zero Trust & Cloud

LuxeTech Inc.Toronto, Ontario, Canada
Full-time

A high-growth financial institution in Canada seeks a Senior DevOps Engineer to lead the development of secure and automated deployment frameworks.The successful candidate will possess advanced ski... Show more

 • Promoted

Sr. DevOps Engineer (GCP)

InfoyaToronto, ON, CA
Permanent

Infoya is a global IT solutions provider specializing in transforming complex challenges into streamlined, AI-powered outcomes.Through proprietary technology accelerators and full-scale enterprise ... Show more

 • Promoted

SRE Ansible developer

Tata Consultancy ServicesToronto, ON, CA
Full-time

Tata Consultancy Services (TCS) is an equal opportunity employer, and embraces diversity in race, nationality, ethnicity, gender, age, physical ability, neurodiversity, and sexual orientation, to c... Show more

 • Promoted

Senior DevOps Engineer - AWS (Remote)

LumenaltaToronto, ON, CA
Remote
Full-time

At Lumenalta, we create impactful software solutions that drive innovation and transform businesses.Since 2000, we’ve partnered with visionary leaders to build cutting‑edge tech, solve complex chal... Show more

 • Promoted

Sr. DevOps Engineer

Publicis Groupe CanadaToronto, ON, CA
Full-time

Publicis Groupe Canada is the Canadian subsidiary of Publicis Groupe, the second largest communications group in the world and a global leader concentrated within four main activities: Communicatio... Show more

 • Promoted

DevOps Engineer

KnotchToronto, Ontario, Canada
Full-time

We’re a growth‑stage technology company helping brands optimize content performance and apply AI to modern marketing.Our culture is fast‑paced, entrepreneurial, and highly adaptable—we move quickly... Show more