Search by job, company or skills

Etched Marketing

Rack Integration Engineer – AI Infrastructure

Save
new job description bg glownew job description bg glownew job description bg svg
  • Posted 21 hours ago
  • Be among the first 10 applicants
Early Applicant

Job Description

About Etched

Etched is building the world's first AI inference system purpose-built for transformers - delivering over 10x higher performance and dramatically lower cost and latency than a B200. With Etched ASICs, you can build products that would be impossible with GPUs, like real-time video generation models and extremely deep & parallel chain-of-thought reasoning agents. Backed by hundreds of millions from top-tier investors and staffed by leading engineers, Etched is redefining the infrastructure layer for the fastest growing industry in history.

Job Summary

We are seeking a highly skilled Rack Level Systems Integration Engineer to ensure the production quality and independent operation of our cutting-edge AI systems from the factory side in Taiwan. You will be a critical bridge between design and manufacturing, providing oversight, hands-on L11-level debugging, and failure triage for our Sohu system product. This includes Top-of-Rack (ToR) switches, high-power water cooling systems (CDUs, heat exchangers), and rack-level management functions. This role requires profound domain expertise and a combination of hands-on physical ability and deep system architecture knowledge to ensure optimal performance, reliability, and readiness for deployment.

Key Responsibilities

  • Production Quality & Oversight: Independently oversee the final assembly, quality control, integration, and failure triage of rack solutions at Contract Manufacturers (CMs) in Taiwan, ensuring test plans and SOPs manifest correctly on the production line.
  • System-Level Hardware Debugging: Apply strong methodology to identify, triage, and debug complex system-level hardware design failures (e.g., pinpointing bus/CPU failures) and rack-level errors.
  • Design Influence: Serve as a key contributor to next-generation product design by collecting factory-learned lessons and feeding feedback back to cross-functional US design and validation teams (EE, thermal, power).
  • Architectural Expertise: Utilize a deep understanding of system architecture, including CPU/memory/accelerator interaction, and knowledge of physical rack connections (PDUs, liquid cooling, and switches).
  • Integrate all components within a data center rack, including servers, storage, networking equipment, power, and cooling solutions.
  • Manage and integrate high-power water cooling systems, including CDUs, heat exchangers, and rack-level manifolds.
  • Configure and integrate Top-of-Rack (ToR) switches and smart PDUs within the rack environment.
  • Perform hands-on systems engineering tasks, including physical racking/stacking, launching a Linux machine, and reading debug logs.
  • Coordinate hardware and software integration efforts on the L11 level with key stakeholders.
  • Create and maintain detailed documentation for rack-level integrations, including installation procedures, test results, and final output assessments.

You may be a good fit if you have

  • A Bachelor's degree in Electrical Engineering, Mechanical Engineering, or a related technical field.
  • A minimum of 5 years of experience in system integration, hardware debug, system bring-up, and New Product Introduction (NPI) for racks.
  • Demonstrable domain expertise from integrating complex, high-density AI/GPU/TPU systems (Nvidia, AMD, etc.) at major ODMs (Wistron, Quanta, Foxconn).
  • Required Technical Depth: Proven ability to perform system-level hardware debugging and L11 validation, with a rigorous methodology for troubleshooting design failures (not just known components/general IT).
  • Strong hands-on technical skills in Linux environments, system-level debug logs, and physical rack assembly.
  • Exceptional communication skills for effectively assessing final output, providing technical direction, and sharing knowledge with factory and US-based teams.
  • Functional knowledge of ToR switches and extensive experience with high-power liquid cooling systems and rack-level management functions.
  • Experience or deep understanding of both Electrical and Mechanical engineering principles as they apply to rack-level integration.

We encourage you to apply even if you do not believe you meet every single qualification.

More Info

Job Type:
Industry:
Function:
Employment Type:

About Company

Job ID: 147307681