Talent.com
Research Platform Engineer (HPC)
Research Platform Engineer (HPC)General Fusion • Richmond, BC, CA
Research Platform Engineer (HPC)

Research Platform Engineer (HPC)

General Fusion • Richmond, BC, CA
30+ days ago
Job type
  • Full-time
Job description

About Us:

Established in 2002, General Fusion is a global leader in the race to commercialize clean fusion energy. We are pursuing a uniquely practical approach, Magnetized Target Fusion, and aim to provide zero-carbon fusion power to the grid in the early to mid-2030s. Today at our state-of-the-art labs in Richmond, BC, we’re operating a groundbreaking fusion demonstration machine called Lawson Machine 26 (LM26), designed to achieve transformational technical milestones and accelerate General Fusion’s technology to commercialization. Our path to market is funded by a global syndicate of leading energy venture capital firms, industry leaders, and technology pioneers. Learn more at www.generalfusion.com.

Position Overview:

General Fusion research relies heavily on experimental data and computer simulation to design and operate its experimental devices. We’re seeking a versatile technical lead to support the infrastructure that empowers our scientists, including managing our High-Performance Computing (HPC) environment, and contributing to our research data infrastructure.

This is a dual role: as the HPC Administrator, half of your time will be spent ensuring our computer cluster is stable, optimized, and serving the needs of the science teams. The system runs Rocky Linux and comprises 70 computer nodes and 1PB of storage. The other half of your time will be spent contributing to our on-prem data systems that transform and serve our experimental data, with a focus on moving toward modern data architecture patterns and technologies.

This role will help shape the computational research infrastructure at a scientific R&D startup. You'll have opportunities to propose architectural changes, reduce complexity, and build out systems that directly accelerate scientific discovery. If you're energized by working at the intersection of infrastructure, data, and scientific computing, this role is for you.

Responsibilities:

  • Act as the primary source of HPC expertise within General Fusion
  • Cluster administration, including maintaining the OS and software environment, resource provisioning and allocation, managing the job scheduler (SLURM), user account management, and monitoring system health and performance
  • Provide training and support for HPC users
  • Collaborate with IT on networking and physical infrastructure; ensure alignment with IT policies, security standards, and corporate governance requirements, including applicable SOX controls
  • Design high-performance data architectures for storage, retrieval and analysis of complex research datasets; contribute to data versioning, result reuse, and metadata cataloging systems
  • Contribute to the modernization of data processing pipelines, with an eye toward simplification and maintainability
  • Proactive monitoring of system health and performance, across both compute nodes and data pipelines
  • Seek opportunities to consolidate tooling and reduce operational overhead
  • Act as a bridge between traditional HPC computing and modern data platform patterns, helping integrate simulation data with experimental data systems
  • Maintain and improve technical documentation

Contribute to strategic planning and decision-making to help drive the evolution of General Fusion’s data systems

Requirements:

  • Degree in Computer Science, Computer Engineering, Engineering Physics or related field
  • 5+ years professional experience in an applied R&D environment, working in scientific computing and/or research data infrastructure.
  • 2+ years of experience managing HPC clusters, with a solid understanding of InfiniBand, MPI/parallel computing concepts, storage architectures, and workload scheduling (SLURM)
  • 2+ years of platform or data engineering, specifically building systems that serve technical users
  • Experience across the modern Linux systems lifecycle, including OS administration (e.g. Rocky, Ubuntu, RHEL), container orchestration (Apptainer/Singularity, Docker), and declarative infrastructure to ensure environment reproducibility
  • Proficiency in low-level resource management (CPU/memory/IO) and system-level performance tuning
  • Experience implementing alerting, logging, and monitoring tools to track system health and performance (Prometheus, Grafana, or similar)
  • Experience with data pipelines and workflow orchestration, such as task queues, message brokers (e.g. RabbitMQ, Redis), workflow engines (e.g. Airflow, Prefect, Celery), or DAG-based processing
  • Professional Python development experience, including git/GitHub and code review practices
  • Excellent verbal and written communication skills; experience writing technical documentation
  • Proactive and collaborative, you’re comfortable taking ownership, proposing solutions, and can act as a bridge between development, IT, and research teams

Preferred:

  • Experience in a multidisciplinary research or R&D environment, with a background in physics, math, or advanced analytics
  • Good understanding of standard protocols like NFS, SMB, LDAP, DHCP and NTP
  • Database experience, including NoSQL (MongoDB)
  • Experience with big data tools and frameworks, such as modern 'lakehouse' patterns (e.g. Spark, Iceberg, Polars), high-performance analytical formats (Parquet, HDF5), and distributed OLAP engines (e.g. ClickHouse, DuckDB)
  • Experience with data versioning systems (e.g. DVC, LakeFS) and reproducible research best practices

The typical hiring range for this position is $126,000 – $154,000.General Fusion considers many factors when determining total compensation, including job-specific or highly specialized knowledge, skills and experience, proficiency, job location and internal equity.

What We Offer:

  • Flexible hours
  • Four weeks’ vacation
  • Comprehensive benefits package
  • RRSP Contribution – No Employee Match Needed!
  • Support for professional development
  • Great company culture – social events, food trucks, bike rides, Sun Run, etc.

Applications:

We thank all applicants for their interest; only those selected for an interview will be contacted.

General Fusion is committed to creating a diverse environment and is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, colour, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, or age.

Create a job alert for this search

Research Platform Engineer (HPC) • Richmond, BC, CA

Similar jobs

Research Engineer - Decentralized AI Systems

Yotta LabsVancouver, Metro Vancouver Regional District, CA
Full-time

Research Engineer - Decentralized AI Systems.Join to apply for the Research Engineer - Decentralized AI Systems role at Yotta Labs.Yotta Labs is pioneering the development of a Decentralized Operat...Show more

 • Promoted

Senior AI Engineer for Autonomous Systems in Healthcare Applications

Toboggan LabsVancouver, Metro Vancouver Regional District, CA
Full-time

Drive transformative AI projects as a Senior AI Engineer.Engage with innovative technology to deliver autonomous systems that improve efficiency and accuracy in healthcare settings.We invite applic...Show more

 • Promoted

Research Associate in Genetics and Bioinformatics

EURAXESS IrelandVancouver, Metro Vancouver Regional District, Canada
Full-time

Elevate your career as a Research Associate focusing on genetic and molecular biology analysis.The position offers an opportunity to work with high-throughput sequencing data and CRISPR models in a...Show more

 • Promoted • New!

Advanced Technology: R&D Engineer - AI/ML, HPC

Cerebras SystemsVancouver, Metro Vancouver Regional District, CA
Full-time

Advanced Technology: R&D Engineer - AI/ML, HPC.Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs.Our novel wafer-scale architecture provides the AI compute power of doz...Show more

 • Promoted

Co-op Research Engineer - AI Computing System

Huawei CanadaVancouver, Metro Vancouver Regional District, CA
Full-time

Huawei Canada has an immediate co-op opening for a Researcher.The Advanced Computing and Storage Lab, currently a part of the Vancouver Research Centre, aims to explore adaptive computing system ar...Show more

 • Promoted

Lead Research and Development Engineer

Sonic Incytes Medical Corp.Vancouver, Metro Vancouver Regional District, CA
Full-time

Lead Research and Development Engineer.Lead Research and Development Engineer.This role focuses on developing and refining signal processing and image reconstruction algorithms tailored for accurat...Show more

 • Promoted

AI Research Engineer - Computer Graphics

Robertson & Company Ltd.Vancouver, Metro Vancouver Regional District, CA
Full-time

Our client is a global leader in technology and telecommunications, housing an elite R&D laboratory that pushes the boundaries in hardware and software integration.You will be part of a high-calibe...Show more

 • Promoted

Distinguished Engineer, Search & AI Platform

Referral BoardVancouver, Metro Vancouver Regional District, CA
Full-time

A leading search technology firm in Canada is seeking a Distinguished Engineer to lead architecture and innovation in their Search products.This role involves defining developer and user experience...Show more

 • Promoted

Systems Engineer – AI Medical Devices (Fixed-Term - Canada) Prenuvo HQ - Vancouver - AI

Socotra, Inc.Vancouver, Metro Vancouver Regional District, CA
Temporary

Our award‑winning whole body scan is fast (under 1 hour), safe (MRI has no ionizing radiation), and non‑invasive (no contrast).Our unique integrated stack of optimized hardware, software, and incre...Show more

 • Promoted

Public Sector AI Engineer — Agentic Systems & Production

CohereVancouver, Metro Vancouver Regional District, CA
Full-time

A leading AI research company in Canada seeks a Member of Technical Staff - Public Sector to design and implement agentic AI systems.Responsibilities include building systems for critical use cases...Show more

 • Promoted

Frontier ML Research Engineer (Part-Time)

Great Value HiringVancouver, Metro Vancouver Regional District, CA
Part-time

A research organization seeks exceptional PhD candidates or PostDocs to tackle high-impact questions in AI/ML.Responsibilities include identifying transformative questions, building comprehensive k...Show more

 • Promoted

Principal Researcher - Systems & Networking - Microsoft Research

Microsoft CanadaVancouver, BC, Canada
Full-time

Microsoft Research Asia - Vancouver lab, located in the vibrant city of Vancouver, BC, Canada, our lab represents Microsoft Research Asia’s exciting expansion into the Asia-Pacific region.We’re on ...Show more

 • Promoted

Advanced Research Computing Architect

Simon Fraser UniversityBurnaby, Metro Vancouver Regional District, CA
Full-time

Lead the design and implementation of cutting-edge research computing infrastructure.Utilize expertise in AI and HPC to enhance collaborative research and scalable solutions.This role offers instit...Show more

 • Promoted

AI Implementation and Research Director

Info-Tech Research GroupVancouver, Metro Vancouver Regional District, CA
Full-time

Shape the future of applied AI by directing client engagements and system prototyping.This position melds hands-on delivery with innovative research to enhance AI applications.The AI Implementation...Show more

 • Promoted

PCR/qPCR Specialist in Advanced Diagnostics

Vitacore Industries Inc.Burnaby
Full-time

Elevate your career as a PCR/qPCR Specialist in a forward-thinking R&D team.Spearhead innovative assay design and optimize workflows at the intersection of molecular biology and AI.In this full-tim...Show more

 • Promoted

Remote Research Engineer - Decentralized AI Systems

Yotta LabsVancouver, Metro Vancouver Regional District, CA
Remote
Full-time

A leading tech company is seeking a Research Engineer specializing in decentralized AI systems.The role involves designing efficient workload orchestration for AI applications across a global netwo...Show more

 • Promoted

Principal Security Researcher Focused on Vulnerability and AI Strategies

1PasswordVancouver, Metro Vancouver Regional District, CA
Full-time

Join as a Principal Security Researcher, leading efforts to secure identity systems through advanced research and vulnerability identification.Engage collaboratively in a remote work setting.This s...Show more

 • Promoted

Innovative Systems Engineer for AI-Driven Medical Devices

PrenuvoVancouver, Metro Vancouver Regional District, CA
Full-time

Drive the advancement of AI medical devices as a Systems Engineer.Collaborate with cutting-edge technology teams to ensure compliance and validation in proactive healthcare solutions.In this pivota...Show more