Talent.com
Specialist Site Reliability Engineer
Specialist Site Reliability EngineerGlobal Talent Alliance, Canada • Montreal (administrative region), QC, CA
Specialist Site Reliability Engineer

Specialist Site Reliability Engineer

Global Talent Alliance, Canada • Montreal (administrative region), QC, CA
Il y a 6 jours
Type de contrat
  • Temps plein
Description de poste

About the job Specialist Site Reliability Engineer

(#11072)

The role of the Specialist Site Reliability Engineer (SRE) is to execute RAM analysis and engineering in support of the I&T solutions. The overall mandate is to ensure that these solutions have attributes of high robustness, reliability, and availability. This involves system and product analysis, modeling and requirements assessment during the development phase and the analysis of field RAM data to determine solution RAM KPIs and to drive corrective action programs. With the advent of Cloud Computing there is also a need for a RAM specialist that is well versed in Cloud based technologies as well as solution architectures for the cloud.

Separate specializations may exist for hardware and software RAM. The technologies used are primarily distributed digital control systems, communication networks, Global Navigation Satellite Systems (GNSS), embedded and virtualized computing as well as Cloud based solutions.

Main Responsibilities

Solution RAM Assessments

  • Review and approve solution requirements for RAM
  • Determine non-functional requirements and targets for RAM performance
  • Perform analysis and modeling to predict RAM behaviour
  • Adhere to the I&T Development Process

Solution RAM Field Performance

  • Assign requirements to solutions and products to ensure they support the ability to measure RAM Key Performance Indicators (KPIs)
  • Use the field performance measurement to identify key contributors and drive corrective action plans when necessary
  • Review vendor specifications, test results, analysis artifacts
  • Participate in failure review board for selected vendors
  • Review corrective action plans from the vendors
  • Drive to completion the vendor corrective action plans
  • Use the field performance measurement to identify key contributors and drive corrective action plans when necessary
  • Requirements

    Experience

  • Minimum 5-10 years overall work experience
  • Minimum 5 years experience in RAM engineering for complex systems, or 7 years experience in product development for high reliability / availability, or safety critical systems with accountability for product field performance
  • Skills / Knowledge

    Knowledge of hardware and / or software design and development practices and processes with focus on high reliability and high availability applications

  • Knowledge of RAM analysis techniques such as failure rate prediction, Reliability Block Diagrams (RBD), Markov models, Monte Carlo methods, Failure Modes Effects Analysis (FMEA), Fault Tree Analysis (FTA)
  • Analysis of reliability and failure field data, statistical estimation, Root Cause Analysis (RCA)
  • Critical thinking and judgement
  • Ability to assimilate new information quickly and apply to the assignment
  • Ability to deliver with autonomy
  • Organizing work to support multiple projects in parallel
  • Knowledge and / or experience in the following areas

  • Multi-Cloud / Multi-Zone-Based designs with High Availability (HA)
  • Compute Infrastructure : Google Compute Engine (GCE) (servers, databases, firewalls, load balancers, networking and storage)
  • Services for Google Cloud Platform (GCP)
  • Databases including NoSQL Databases, Big Data technologies (Oracle, SQL Server, Postgres, Spark, Hadoop, Cloud databases)
  • Application development concepts and technologies (CI / CD, Java, Python)
  • Education / Certification / Designation

  • Bachelors degree in Electrical Engineering, Mechanical Engineering, Computer Science, Computer Engineering or equivalent degree & experience
  • Assets

  • Knowledge of product design and standards for the rail industry
  • Knowledge of rail industry or other transportation industry operations
  • Working Conditions

    This role may require occasional business travel within North America in accordance with company policy

    #J-18808-Ljbffr

    Créer une alerte emploi pour cette recherche

    Site Reliability Engineer • Montreal (administrative region), QC, CA

    Offres similaires
    Site Reliability Engineer

    Site Reliability Engineer

    TMC Canada • Montreal
    Temps plein +1
    The Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to help drive the reliability engineering, operations and customer support services for Morgan Stanley's ...Voir plus
    Dernière mise à jour : il y a 1 jour • Offre sponsorisée
    Reliability Specialists

    Reliability Specialists

    Laurentide Controls ltd • Montreal
    Temps plein
    Come join the largest supplier of automation and reliability solutions in our region.Industry thrive in Eastern Canada.Several positions for contract assignments are available, offering flexible wo...Voir plus
    Dernière mise à jour : il y a 13 jours • Offre sponsorisée
    Reliability Engineering Lead (contract)

    Reliability Engineering Lead (contract)

    Capgemini • Montreal
    Temps plein
    Reliability Engineering Lead (contract).Be among the first 25 applicants.Reliability Engineering Lead (contract).Get AI-powered advice on this job and more exclusive features.Mirabel facility in Qu...Voir plus
    Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
    Site Reliability Engineer

    Site Reliability Engineer

    High Tech Genesis • Montreal, QC, CA
    Temps plein
    At HTG, you’ll push boundaries with the latest tech and collaborate with a team that loves what they do.Be part of a design services company that is amongst the companies that lead the world in tec...Voir plus
    Dernière mise à jour : il y a plus de 30 jours
    Site Reliability Engineer

    Site Reliability Engineer

    ApTask • Montreal
    Temps plein
    Direct message the job poster from ApTask.Looking for an intermediate between 2 to 5 years' experience.The Application Infrastructure (Al) department is seeking a Site Reliability Engineer (SRE) to...Voir plus
    Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
    Site Reliability Engineer

    Site Reliability Engineer

    AKUR8 • Montreal
    Temps plein
    Akur8 is a fast-growing Insurtech scale‑up that transforms insurance pricing and reserving with transparent machine learning. Our SaaS platform injects speed, performance and reliability into insure...Voir plus
    Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
    Site Reliability Engineering Specialist (Hybrid)

    Site Reliability Engineering Specialist (Hybrid)

    Morgan Stanley • Montreal
    Temps plein
    Site Reliability Engineering Specialist (Hybrid).Site Reliability Engineering Specialist (Hybrid).We're seeking someone to join our Data Protection Fleet as a Site Reliability Engineering (SRE) Spe...Voir plus
    Dernière mise à jour : il y a 14 jours • Offre sponsorisée
    Specialist Site Reliability Engineer

    Specialist Site Reliability Engineer

    Global Talent Alliance, Canada • Montreal
    Temps plein
    About the job Specialist Site Reliability Engineer.The role of the Specialist Site Reliability Engineer (SRE) is to execute RAM analysis and engineering in support of the I&T solutions.The overall ...Voir plus
    Dernière mise à jour : il y a 6 jours • Offre sponsorisée
    Site Reliability Engineer (SRE)

    Site Reliability Engineer (SRE)

    Open Systems Technologies • Montreal
    Temps plein
    The Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to help drive the reliability engineering, operations and customer support services for Morgan Stanley's ...Voir plus
    Dernière mise à jour : il y a 1 jour • Offre sponsorisée
    Site Reliability Engineering Specialist (Hybrid)

    Site Reliability Engineering Specialist (Hybrid)

    PowerToFly • Montreal
    Temps plein
    We're seeking someone to join our Data Protection Fleet as a Site Reliability Engineering (SRE) Specialist in Cyber to help drive performance, reliability, enhanced observability and efficiency for...Voir plus
    Dernière mise à jour : il y a 14 jours • Offre sponsorisée
    Senior Site Reliability Expert (Retail)

    Senior Site Reliability Expert (Retail)

    Lightspeed • Montreal
    Temps plein
    Are you actively seeking a new opportunity, or simply exploring the market? Either way, you might have just found the right place!. We’re looking for a Senior SRE to join our Lightspeed Retail group...Voir plus
    Dernière mise à jour : il y a 1 jour • Offre sponsorisée
    Site Reliability Engineer w / Python (Onsite Hybrid)

    Site Reliability Engineer w / Python (Onsite Hybrid)

    NTT DATA North America • Montreal
    Temps plein
    Site Reliability Engineer / ServiceNow SaaS (Onsite Hybrid).NTT DATA is seeking a Site Reliability Engineer to join our Montreal, Quebec, Canada team. The position is onsite‑hybrid, requiring office a...Voir plus
    Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Targeted Talent • Montreal, QC, Canada
    Permanent
    We are looking for an experienced.Senior Site Reliability Engineer.Our client is a global enterprise company with a product that you've likely used. Experience with coding / software development, ...Voir plus
    Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
    Site Reliability Engineer / Platform Operations Engineer

    Site Reliability Engineer / Platform Operations Engineer

    Targeted Talent • Montreal, QC, Canada
    Permanent
    We are looking for an experienced Site Reliability Engineer or Platform Operations Engineer for our client.This is a permanent position that is remote to start with later relocation to.Our client i...Voir plus
    Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
    Senior Site Reliability Engineer (SRE)

    Senior Site Reliability Engineer (SRE)

    Intelcom | Dragonfly • Montreal
    Temps plein
    Be among the first 25 applicants.Get AI-powered advice on this job and more exclusive features.Incident Management : Detect and respond to issues, ensuring rapid recovery to minimize downtime.Curren...Voir plus
    Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
    Site Reliability Engineer w / Python (Onsite Hybrid)

    Site Reliability Engineer w / Python (Onsite Hybrid)

    NTT DATA, Inc. • Montreal
    Temps plein
    Site Reliability Engineer w / Python (Onsite Hybrid).NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adapt...Voir plus
    Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
    Senior Site Reliability / Gitops Engineer

    Senior Site Reliability / Gitops Engineer

    Canonical • Montreal
    Temps plein
    Senior Site Reliability / Gitops Engineer.Join Canonical, a leading provider of open‑source software and operating systems, as a Senior Site Reliability / Gitops Engineer.In this role you will driv...Voir plus
    Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée
    Site Reliability Engineer (Linux / Cloud Infrastructure)

    Site Reliability Engineer (Linux / Cloud Infrastructure)

    Atlantis IT Group • Montreal
    Temps plein
    Site Reliability Engineer (Linux / Cloud Infrastructure) role with hands-on experience across Linux, distributed systems, scripting, databases, monitoring, containers, cloud SaaS integrations, mess...Voir plus
    Dernière mise à jour : il y a plus de 30 jours • Offre sponsorisée