SiteOps Data Center Production Operations Engineer

Newton County, Georgia


Employer: Meta
Industry: 
Salary: Competitive
Job type: Full-Time

Meta Platforms, Inc. (Meta), formerly known as Facebook Inc., builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps and services like Messenger, Instagram, and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology. To apply, click "Apply to Job" online on this web page.

SiteOps Data Center Production Operations Engineer Responsibilities


  • Perform deep dives and analysis of complex technical issues within the data center, ranging from automated tooling to hardware failures and network issues.

  • Work as a subject matter expert with cross functional teams on large scale data center projects and initiatives.

  • Provide cross data center support and identify potentially larger issues, displaying effective communication when something is identified.

  • Work with internal hardware teams and vendors to help drive complex technical issues to resolution, provide an ownership stake in ensuring high quality levels of hardware, and influence future design to ensure ease of serviceability.

  • Use data analytics to drive maximum server fleet up-time and utilization rates by understanding hardware failure rates and SLAs to customers.

  • Identify trends and systemic issues in the fleet and drive resolution.

  • Build cross functional relationships and have the ability to influence policies and procedures to improve global data center operations.

  • Participate in an on-call rotation.


Minimum Qualifications


  • Requires a Bachelor's degree in Computer Science, Electronics and Communications Engineering, Information Systems, Analytics, or a related field, followed by five years of progressive, post-baccalaureate work experience in the job offered or a computer-related occupation. Requires five years of experience in the following:

  • 1. Hardware, OS repair, Tooling and Automation, or Project Management

  • 2. Triaging, debugging, and troubleshooting complex systemic issues in a Linux server environment

  • 3. Out-of-band/lights-out server communication methods, such as IPMI and serial console

  • 4. Interdependencies of data center functions and technologies including electrical, cooling, structured cabling, security, network and server systems

  • 5. Enterprise level networking and storage platforms

  • 6. Supporting large-scale AI clusters

  • 7. Debugging, modifying, and developing in at least one of the following industry-standard languages: Bash, PHP, Python, SQL, or Perl.


Start preparing
Learn about how to prepare for your interview with our interview guide, tips, and interactive experiences.
Visit interview prep

Created: 2024-06-17
Reference: 1110571966714201
Country: United States
State: Georgia
City: Newton County


Similar jobs: