Google welcomes people with disabilities.
Minimum qualifications:
- Bachelor's degree or equivalent practical experience.
- 2 years of experience with software development in Python or C++, or 1 year of experience with an advanced degree.
Preferred qualifications:
- Master's degree or PhD in Computer Science or related technical fields.
- Experience with Generative AI, Large Language Models (LLM), or Machine Learning infrastructure, including model deployment, performance optimization, profiling, and debugging.
- Experience with distributed computing leveraging graphics processing units (GPU) or tensor processing units (TPUs).
- Ability to scope and solve problems independently, and thrive in a dynamic, fluid environment where AI technologies are continuously advancing.
- Ability to collaborate effectively with cross-functional and cross-regional teams.
About the jobGoogle's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google's needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full-stack as we continue to push technology forward.
The AI and Infrastructure team is redefining what's possible. We empower Google customers with breakthrough capabilities and insights by delivering AI and Infrastructure at unparalleled scale, efficiency, reliability and velocity. Our customers include Googlers, Google Cloud customers, and billions of Google users worldwide.
We're the driving force behind Google's groundbreaking innovations, empowering the development of our cutting-edge AI models, delivering unparalleled computing power to global services, and providing the essential platforms that enable developers to build the future. From software to hardware our teams are shaping the future of world-leading hyperscale computing, with key teams working on the development of our TPUs, Vertex AI for Google Cloud, Google Global Networking, Data Center operations, systems research, and much more.
Responsibilities
- Optimize performance on Google's AI infrastructure across the technical stack.
- Conduct in-depth performance profiling, debugging, and troubleshooting of AI/ML training and inference workloads.
- Develop tools and software for the AI/ML Infrastructure to deliver exceptional end-to-end developer experience.
- Partner closely with cross-functional, cross-regional teams to ensure the AI/ML infrastructure delivers exceptional value and drives success for the customers.
- Shape the future of the AI/ML infrastructure by identifying gaps in the existing products and recommending enhancements.
Google is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. See also Google's EEO Policy and EEO is the Law. If you have a disability or special need that requires accommodation, please let us know by completing our Accommodations for Applicants form .