Data Center (Lab) Engineer

AMD
Markham, ON
$123.5K-$171.5K a year (estimated)
Full-time

WHAT YOU DO AT AMD CHANGES EVERYTHING

We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded.

Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges.

We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. This is who we are at our best.

One Company. One Team.

AMD together we advance

THE ROLE :

The AI GPU Software (AGS) Datacenter group is looking for dynamic and skilled individuals who can contribute to the bring-up, support, and debug of complex computing systems.

The individual will be part of a growing lab team and will be required to perform hands-on experiments related to issue management as well as hardware-level reworks.

The position involves a wide range of activities, including deploying pre-production platforms and test equipment to enable silicon bring-up and validation, and contributing to the triage, debug, and rework needed to resolve complicated system-level issues.

As a key contributor, the individual will be part of a leading AI team to drive and enhance AMD's ability to deliver the highest quality, industry-leading technologies to market.

THE PERSON :

The ideal candidate possesses an innovative, problem-solving mindset, has a keen eye for system engineering support and development, and is diligent and passionate about technology.

KEY RESPONSIBILITIES :

  • Set up hardware (CPU / APU, GPU, memory cards) in server / workstation computing systems to facilitate user-defined workloads in the AGS data center labs.
  • Own initial troubleshooting and debug of a wide variety of system, firmware, or software issues encountered, while maintaining the integrity of a large amount of development equipment.
  • Triage, debug and reproduce issues and validate fixes identified by AGS development teams.
  • Install and configure various OS distros from the console.
  • Set up network gear; perform and validate network configurations.
  • Configure file systems using volume groups (VG), logical volumes (LV), and partitions.
  • Integrate automated testing in CI / CD environments (e.g. Jenkins, Ansible).
  • Provide logs and statistics that will help in further debug of issues.
  • Own inventory database management and administration through a managed system.
  • Participate in the Agile method of planning, delivery, and collaboration with internal and scaled agile teams.
  • Work with a managed ticketing system and communicate clearly on activities and steps.

PREFERRED EXPERIENCE :

  • Demonstrated experience working in a data center and managing all aspects of lab infrastructure.
  • Thorough understanding of server and workstation hardware architecture.
  • Able to read and interpret board schematics.
  • Demonstrated experience in PC / server environment hardware and software setup and administration.
  • Comfortable working in different operating system environments including Linux and Windows.
  • Excellent hardware and OS debugging and troubleshooting skills.
  • Familiarity with networking setup and configuration.
  • Hands-on experience with various storage solutions (NAS, SAN, etc.) and form factors.
  • Knowledge of Cobbler, OpenStack, Foreman, or any other provisioning automation tool is a big plus.
  • Experience with power supply monitoring and sequencing.
  • Proficient at documenting experimental results in a structured manner for ease of reference.
  • Demonstrated ability to work with JIRA and Confluence project management and documentation tools.
  • Team player with strong communication, analytical and problem-solving skills.
  • Must be a self-starter capable of working in a dynamic environment with minimal supervision and driving tasks to completion with utmost quality.

ACADEMIC CREDENTIALS :

Bachelor's or Master's degree in Computer / Electrical Engineering, or a related technical discipline.

LOCATION : Markham, Ontario, Canada

Benefits offered are described in: AMD benefits at a glance.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and / or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law.

We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.

30+ days ago