Data Center Production Operations Engineer
Meta is seeking a Data Center Production Operations Engineer to support the reliability, efficiency, and scalability of our global data center infrastructure. In this role, you will perform hands-on server hardware operations, including deployment, maintenance, troubleshooting, and decommissioning of production server fleets that power Meta's family of apps and services. You will work within established operational procedures to ensure data center systems meet performance and availability standards, collaborating closely with infrastructure engineering, facilities, and supply chain teams to keep production environments running at scale.
Responsibilities
- Work within Meta's ticketing system
- Serve as the first point of contact for break-fix technicians
- Assist with projects (retrofits, new process details, etc.) and repairs throughout the data center
- Understand and debug hardware and Linux OS-related issues
- Identify and help create documentation for the global data center knowledge base
- Assist with process improvements and best practices in data center operations
- Participate in an on-call rotation (one week per month, after hours, as the first point of contact)
Qualifications
- Must obtain work authorization in the country of employment at the time of hire and maintain ongoing work authorization during employment
- Currently has, or is in the process of obtaining, a Bachelor's or Master's degree in a technical field, or equivalent experience/certification
- Knowledge of Linux and server hardware support
- Working knowledge and experience in at least one of the following core areas: Networking, Programming/Scripting, Hardware, or OS repair
- Solid communication skills are a requirement for this role
- Experience modifying and developing in Python, SQL, and/or shell scripting
- Working conceptual knowledge of technologies such as HTTP, DNS, RAID, and DHCP