Senior Site Reliability Engineer

Unreal Gigs
CA
Remote
Full-time
Quick Apply

We are seeking an experienced Site Reliability Engineer (SRE) who is passionate about leveraging data and automation to optimize a highly dynamic infrastructure.

This role entails managing infrastructure and internal tooling to streamline operations and ensure a seamless customer experience.

As a member of our team, you will be instrumental in scaling our infrastructure, automating tasks to reduce manual effort, and fostering a culture of innovation and continuous improvement.

Requirements

What You’ll Do

  • By 30 Days :
  • Utilize your expertise in observability to enhance our existing tools and scale our platform to accommodate growth.
  • Improve automation processes to streamline infrastructure scaling and enhance the development experience.
  • By 90 Days :
  • Contribute to diversifying and scaling our platform across additional regions to meet growing demands.
  • Assess options for upgrading our real-time data pipeline to support enhanced multi-regional capabilities.
  • Provide platform support to engineering teams, leveraging data insights to drive decision-making.
  • By 1 Year :
  • Collaborate with engineering to redefine observability standards for the Fathom platform and implement improvements to minimize friction.
  • Participate in designing and implementing enhancements to our elastic multi-regional storage platform.
  • Lead initiatives to enhance platform reliability and efficiency.

Requirements

Hard Skills :

  • Proficiency with Infrastructure as Code (IaC) and GitOps tools.
  • Strong foundation in Observability best practices and implementation.
  • Experience working in a Software as a Service (SaaS) or Platform as a Service (PaaS) environment.
  • Familiarity with our tech stack, including Google Compute, Kubernetes, Message Queues, Prometheus, ClickHouse, ArgoCD, and Github Actions.

Knowledge of Golang is a plus, and familiarity with Ruby / Rails is a bonus.

Soft Skills :

  • Curiosity-driven with a focus on delivering tangible results.
  • Ability to tackle a wide range of challenges with a generalist mindset.
  • Resilience and determination to solve complex problems.
  • Openness to diverse perspectives and a commitment to decisions once made.
  • Strong collaboration skills, with the ability to communicate complex insights effectively.
  • Independence in managing workload and priorities effectively.

Benefits

What You'll Get

  • The opportunity to shape the dynamic platform of a rapidly growing company.
  • A role that encompasses infrastructure scaling, development team support, and internal tooling development.
  • Collaboration with a dynamic and supportive team.
  • A supportive environment that fosters innovation, creativity, and personal growth.
  • Competitive compensation and benefits package, including :
  • Comprehensive health, dental, and vision insurance plans.
  • Flexible spending accounts for medical expenses.
  • Retirement savings plans with employer matching contributions.
  • Generous vacation and paid time off policies to support work-life balance.
  • Professional development stipend for ongoing learning and skill enhancement.
  • Wellness programs and resources, including gym memberships or wellness app subscriptions.
  • Employee assistance programs for mental health support and counseling.
  • Opportunities for remote work and flexible scheduling.
  • Company-sponsored events, team outings, and social activities to foster camaraderie and collaboration.

Join Us

If you're passionate about driving the data journey at Fathom and contributing your analytical expertise to our mission, we invite you to apply.

Join us and become a key player in our data-driven success story. Apply now!

4 days ago
Related jobs
Unreal Gigs
CA
Quick Apply
Remote
Full-time

We are looking for a Senior Site Reliability Engineer in. As a Senior Site Reliability Engineer for Unreal Gigs you will be in charge of..

0000050007 Royal Bank of Canada
Toronto, Ontario
Full-time

Job Description What is the opportunity? RBC Insurance Technology is seeking to hire a Senior Site Reliability Engineer for its Insurance Technology Platform Support team. The Insurance..

New!
Red Hat, Inc.
Remote, BC, CA
Remote
Full-time

Support and refine toolsets and practices, ensuring engineering squads are equipped with the latest in.. Advocate for reliability engineering principles, sharing best practices and insights across teams to..

Promoted
Tata Consultancy Services
Toronto, Ontario
Full-time

Skills and Responsibilities. Site Reliability Engineer responsibilities include monitoring computer.. A site reliability engineer is a unique role that requires either a background as a sysadmin, a software..

Promoted
LanceSoft, Inc.
Montreal, Quebec
Full-time

XML. Multi tier web or desktop application development experience. Working experience in NoSQL database. Application containers. Docker Skills Desired. LLMs. Prompt Engineering. Kubernetes..

Promoted
Altis Technology
Greater Vancouver Metropolitan Area, Canada
Full-time

One of our clients is looking for an experienced Senior Software Developer who loves to solve the types.. Ensure data quality and reliability through effective error handling and logging. Work cross..