Infra Hardware TPM - Compute Systems (Technical Leadership)

Bellevue, Washington


Employer: Meta
Industry: 
Salary: Competitive
Job type: Full-Time

Meta is seeking a Technical Program Manager (TPM) experienced in managing large-scale compute hardware systems, including roadmapping and deployment. This position will collaborate with cross-functional teams in Meta's Infrastructure organization to build the core building blocks of Meta's infrastructure, which support our portfolio of applications. This role will focus on creating strategies, roadmaps, and executing plans to support the development and deployment of new generation compute products spanning across x86, and ARM products. These platforms are critical to supporting Meta's various software workloads, and are a key enabler to supporting our infrastructure scale. The TPM will work with external and internal partners to influence and define roadmaps based on technical and business considerations, lead customer and stakeholder alignment, and drive integration with Meta's capacity planning tools and systems across a portfolio of products. On individual programs, this role will be responsible for leading design and development, delivering hardware into data centers, influencing orchestration systems, software tooling, and provisioning, driving testing, software performance evaluation and tuning for production applications and use cases. This role will work with infrastructure hardware development, infrastructure software, capacity planning, data center, network infrastructure and infrastructure sourcing teams. Meta's Infrastructure Engineering organization is responsible for the growth, management, and 24x7 upkeep of all Meta's products and services.

Infra Hardware TPM - Compute Systems (Technical Leadership) Responsibilities


  • Own overall success in developing new compute systems and capabilities, managing the roadmap from concept to full production, and ensuring alignment with the rest of our infrastructure within a matrix organization. This role encompasses a range of areas, including infrastructure hardware development, infrastructure software, capacity planning, data centers, network infrastructure, and infrastructure sourcing teams, across multiple physical locations.

  • Create and gather alignment across multiple organizations regarding the business value of new compute systems. Value propositions include adopting new technologies, efficiency improvements, and enabling new use cases.

  • Create execution plans, including influencing hardware architecture, determining the most appropriate system and rack-level technologies, and ensuring they work together as a cohesive solution.

  • Deliver hardware products into the data center

  • partner with a range of teams to validate and optimize software performance in small and large scale testing.

  • Influence orchestration of system and cluster level software tooling and provisioning.

  • Support planning for migration of existing software applications to enable adoption of new generations of hardware.

  • Drive overall communication to leadership, stakeholder and core working teams in regular cadence to bring awareness.

  • Develop and manage the overall program including defining scope, requirements, development model, schedules, and deliverables with engineering teams, partners, and stakeholders.

  • Provide hands-on program management during analysis, benchmarking, design, development, testing, implementation, and post implementation phases.

  • Perform risk assessment, risk mitigation and change management on programs.

  • Drive internal process improvements across multiple teams and functions.


Minimum Qualifications


  • B.S. in Computer Science, Electrical Engineering or a related technical discipline, or equivalent experience.

  • 10+ years of systems engineering, hardware engineering, software engineering, or technical product/program management experience.

  • Understanding of compute hardware components, performance metrics, bottlenecks, and dependencies in single and distributed systems.

  • Experience delivering complex tech programs and/or products from inception to delivery.

  • Knowledge of user needs, gathering requirements, and defining scope.

  • Experience operating autonomously across multiple teams, demonstrated critical thinking, and thought leadership.

  • Communication experience and experience working with technical management teams to develop systems, solutions, and products.

  • Organizational, coordination and multi-tasking experience.

  • Analytical and problem-solving experience with large-scale systems.

  • Experience establishing work relationships across multi-disciplinary teams and multiple partners in different time zones.


Preferred Qualifications


  • Knowledge of storage, accelerator (GPU), and networking hardware and technologies.

  • Experience with data center architecture and deployment.

  • Experience working with ODMs (Original Design Manufacturer) and other vendors.

  • Experience working with capacity planning, migration and turn-up.

  • Web or Internet start-up environment and technical infrastructure management experience.

Created: 2024-08-27
Reference: 1139539013805299
Country: United States
State: Washington
City: Bellevue
ZIP: 98004


Similar jobs: