Search by job, company or skills

Dell Technologies Capital

Server Platform Validation Principal Engineer

Save
new job description bg glownew job description bg glow
  • Posted 18 hours ago
  • Be among the first 10 applicants
Early Applicant

Job Description

Function: Hardware / Platform Validation

Role Summary

Hands-on System Development Engineer responsible for end-to-end validation of x86 and ARM server platforms, with strong emphasis on server board‑level validation, firmware/BIOS testing, and system bring‑up. This is a lab‑intensive role spanning hardware, firmware, OS, and platform integration across pre-production and NPI environments.

Key Responsibilities

  • Perform end‑to‑end server solution validation across CPU, memory, PCIe, storage, networking, power, thermal, BIOS, BMC, FPGA, and OS.
  • Lead server board‑level bring‑up and validation, including power sequencing, POST/boot‑flow debug, and unit test validation.
  • Validate CPU initialization, memory training (DDR4/DDR5), chipset integration, power management, and RAS features.
  • Design and execute validation for BIOS/UEFI, OpenBMC, CPLD, secure boot, TPM, IPMI, and Redfish.
  • Execute hardware stability, stress, thermal, and power tests across multiple server platforms.
  • Analyze failures and root‑cause issues across HW / FW / OS boundaries using logs, serial console, and lab debug tools.
  • Collaborate with silicon vendors, ODMs, hardware, firmware, and OS teams to support NPI milestones.

Mandatory Qualifications

  • 10+ years of hands‑on server platform validation / system development experience. Bachelor's/Master's degree in EE, ECE, Computer Engineering, or equivalent.
  • Strong experience with server board‑level validation and bring‑up (power, clocks, reset, PCIe, memory).
  • Hands‑on experience with x86 and ARM server platforms across multiple generations.
  • Strong BIOS/UEFI and OpenBMC firmware validation experience.
  • Board‑level debug experience using JTAG/XDP, PCIe analyzers, logic analyzers, oscilloscopes.
  • Proven experience in hardware stability, stress, thermal, and power validation.
  • Strong debug skills across hardware, firmware, and Linux OS.

Core Technical Skills

  • PCIe Gen4/Gen5, DDR4/DDR5, NVMe, CXL
  • Server power, thermal, and high‑speed design concepts
  • Linux debug tools: lspci, dmidecode, mcelog, ipmitool
  • Python/Bash for automation; working knowledge of C/C++

Desired / Preferred Skills

  • Hands‑on experience with GPU validation and testing:

-NVIDIA GPUs (CUDA, nvidia‑smi, DCGM, NVLink/NVSwitch)

-AMD GPUs (ROCm, rocm‑smi, Infinity Fabric)

-AI/HPC benchmarks (MLPerf, HPL, NCCL/RCCL)

  • Exposure to multi‑GPU server platforms and high‑density AI systems.
  • Experience with OCP platforms, hyperscale or cloud server validation.
  • Familiarity with compliance testing (PCI‑SIG, UEFI SCT).

More Info

Job Type:
Industry:
Employment Type:

Job ID: 148523543