Function: Hardware / Platform Validation
Role Summary
Hands-on System Development Engineer responsible for end-to-end validation of x86 and ARM server platforms, with strong emphasis on server board‑level validation, firmware/BIOS testing, and system bring‑up. This is a lab‑intensive role spanning hardware, firmware, OS, and platform integration across pre-production and NPI environments.
Key Responsibilities
- Perform end‑to‑end server solution validation across CPU, memory, PCIe, storage, networking, power, thermal, BIOS, BMC, FPGA, and OS.
- Lead server board‑level bring‑up and validation, including power sequencing, POST/boot‑flow debug, and unit test validation.
- Validate CPU initialization, memory training (DDR4/DDR5), chipset integration, power management, and RAS features.
- Design and execute validation for BIOS/UEFI, OpenBMC, CPLD, secure boot, TPM, IPMI, and Redfish.
- Execute hardware stability, stress, thermal, and power tests across multiple server platforms.
- Analyze failures and root‑cause issues across HW / FW / OS boundaries using logs, serial console, and lab debug tools.
- Collaborate with silicon vendors, ODMs, hardware, firmware, and OS teams to support NPI milestones.
Mandatory Qualifications
- 10+ years of hands‑on server platform validation / system development experience.
- Bachelor's/Master's degree in EE, ECE, Computer Engineering, or equivalent.
- Strong experience with server board‑level validation and bring‑up (power, clocks, reset, PCIe, memory).
- Hands‑on experience with x86 and ARM server platforms across multiple generations.
- Strong BIOS/UEFI and OpenBMC firmware validation experience.
- Board‑level debug experience using JTAG/XDP, PCIe analyzers, logic analyzers, and oscilloscopes.
- Proven experience in hardware stability, stress, thermal, and power validation.
- Strong debug skills across hardware, firmware, and Linux OS.
Core Technical Skills
- PCIe Gen4/Gen5, DDR4/DDR5, NVMe, CXL
- Server power, thermal, and high‑speed design concepts
- Linux debug tools: lspci, dmidecode, mcelog, ipmitool
- Python/Bash for automation; working knowledge of C/C++
Desired / Preferred Skills
- Hands‑on experience with GPU validation and testing:
  - NVIDIA GPUs (CUDA, nvidia‑smi, DCGM, NVLink/NVSwitch)
  - AMD GPUs (ROCm, rocm‑smi, Infinity Fabric)
  - AI/HPC benchmarks (MLPerf, HPL, NCCL/RCCL)
- Exposure to multi‑GPU server platforms and high‑density AI systems.
- Experience with OCP platforms and hyperscale or cloud server validation.
- Familiarity with compliance testing (PCI‑SIG, UEFI SCT).