Department: Infrastructure – Technical Program Management
Reports to: Director, TPM – Global Data Center Delivery
About the Role
GMI Cloud is expanding its global AI infrastructure footprint with multi-megawatt data center clusters across the U.S. and Asia. We are seeking a Technical Program Manager (TPM) to drive the management and delivery of GPU cluster infrastructure, including partner sites. This role will work at the intersection of AI hardware platforms, high-performance networking, and data center infrastructure, coordinating across solution architects, engineering teams, vendors, and contractors to deliver production-ready AI clusters.
In this role, you will lead multi-disciplinary programs across procurement, engineering, and operations, ensuring on-time, on-budget, and high-quality delivery of data center infrastructure projects. You'll drive alignment among internal stakeholders and external partners, while establishing scalable processes and performance metrics for GMI Cloud's fast-growing deployment portfolio.
Responsibilities
- Coordinate programs for GMI GPU cluster planning, provisioning, and lifecycle management (procure, deploy, retire).
- Partner with internal stackholders, procurement, infrastructure architect, SRE, BD/Sales teams to translate customer needs into execution plans.
- Drive cross-team coordination spanning multiple timezones: define milestones, manage dependencies, surface risks, and ensure timely delivery.
- Establish KPIs: utilization, provisioning lead time and implement SLA frameworks, execution runbook, incident playbooks and post-incident reviews.
- Drive process standardization and vendors/partners accountability across multiple regional projects; introduce automation initiatives to reduce manual toil.
- Lead vendor and partner engagements, manage procurement timelines and contracts as needed.
- Communicate program status, trade-offs, and roadmap to senior leadership and stakeholders..
Qualifications
- 5+ years technical program management experience, with 2+ years focused on infrastructure, platform, or compute resource programs (preferably AI/GPU clusters).
- Strong technical background (BS/MS in CS, Engineering, or equivalent) with hands-on familiarity of Linux, orchestration (Kubernetes), and distributed storage/networking concepts.
- Track record coordinating cross-functional engineering organizations, resolving dependencies, and delivering complex infrastructure projects on schedule.
- Proven experience managing programs across distributed multi-timezone teams (US, APAC).
- Strong program management fundamentals: scope, schedule, budget, quality, risk, and change control.
- Data-driven: experience defining and tracking operational KPIs and driving improvements.
- Excellent written and verbal communication, documentation, stakeholder management skills.
- Experience with on-prem datacenter deployments, and procurement cycles.
- Flexible working hours and willingness to attend critical meetings outside standard local business hours; ability to travel domestically and internationally as required
- PMP, Agile/scrum or equivalent project management certification preferred.