
TPU Software Area Tech Lead, Cloud Platforms
職缺摘要
技術需求
學歷要求
Bachelor
職缺描述
-
Drive the technical roadmap across a various hardware, data center, and cloud infrastructure portfolio while leading next-generation TPU product introductions.
-
Set and communicate team priorities, support the organization's goals and develop the mid-term technical goal and roadmap. Align strategy, processes, and decision-making across teams.
-
Develop, test, and help deploy and debug the lower level software for TPU systems including firmware, driver, user space libraries, Linux Kernel, power, thermal, and test development.
-
Design and implement superpod software to control and manage TPU AI hypercomputers containing thousands of TPU machines, constructing and connecting TPU slices with shape requested by users.
-
Build and evolve the TPU hypercomputer health ecosystem, integrating hardware and networking quality assurance, repair, and monitoring. Partner with cross-functional infrastructure, engineering, and external teams to plan and execute end-to-end programs, from product development to productivity gains. .
Minimum qualifications:
-
Bachelor's degree or equivalent practical experience.
-
8 years of experience in software development.
-
7 years of experience building and developing large-scale infrastructure, distributed systems or networks, or experience with compute technologies, storage, or hardware architecture.
-
5 years of experience with design and architecture; and testing/launching software products.
Preferred qualifications:
-
Master’s degree or PhD in Engineering, Computer Science, or a related technical field.
-
Experience in developing software that interacts with hardware (e.g., firmware, embedded systems, system software).
-
Experience in production monitoring, logging, and observability tools.
-
Familiarity with networking protocols and technologies.
-
Familiarity with machine learning concepts.