Facebook

Data Center Production Operations Engineer

Posted on: 14 Apr 2021

Altoona, IA

Job Description

Facebook is seeking a forward thinking experienced Engineer to join the Production Operations team within Data Center Operations. Our data centers, and the tens of thousands of servers installed in them, are the foundation upon which our rapidly scaling infrastructure efficiently operates and upon which our innovative services are delivered. Facebook is at the leading edge of the global data center industry both in terms of how data centers are designed and operated. This person should enjoy working in a fast-paced environment where adaptability and flexibility will be key to their success. We seek an IT professional with advanced hands-on technical skills in Networks, Server Hardware and Linux (ideally in a Data Center environment). Having extensive knowledge of managing servers and performing complex projects in a large-scale distributed data center environment is a core competency of this individual. The candidate should also have deep knowledge and experience in at least one of the following core areas: Networking, Project Management, Tool and Automation, Hardware and OS repair.

Data Center Production Operations Engineer Responsibilities

* Perform deep dives and analyze complex technical issues within the data center, ranging from automated tooling to hardware failures and network issues.

* Work as a technical lead with cross functional teams on large scale data center projects and initiatives.

* Provide cross data center support and identify potentially larger issues, displaying effective communication when something is identified.

* Work with internal hardware teams and vendors to help resolve complex technical issues, maintain high hardware quality levels and influence future design to ensure ease of serviceability.

* Understand/analyze issues and be able to update and develop scripts and smaller sets of software.

* Use data to drive maximum server fleet up-time and utilization rates, by understanding hardware failure rates and SLAs to customers. Identify trends and systemic issues in the fleet and drive resolution.

* Mentor team members to evaluate and identify better ways to resolve issues and define updates to tools and processes.

* Provide guidance and mentor technical leads and the go-to technical resource for management.

* Build cross functional relationships and have the ability to influence policies and procedures to improve global data center operations.

* Participate in an on-call rotation.

Minimum Qualifications

* BS, BA or BEng in technical field or commensurate experience.

* 5+ years of infrastructure or related experience.

* Knowledge of Linux and hardware systems support in an Internet operations environment.

* Knowledge of the interdependencies of data center functions and technologies.

* Experience managing multiple projects within the same time schedule.

* Knowledge of enterprise level networking and storage equipment installs.

* Knowledge of out-of-band/lights-out server communication methods, such as IPMI and serial console.

* Time and project management experience.

* Experience in modifying and developing in commonly used scripting or programming languages.

* Proven communication skills.

Preferred Qualifications

* Experience in providing technical guidance to external vendors.

* Experience in a large-scale data center environment.

Locations

About the Facebook company

Facebook's mission is to give people the power to build community and bring the world closer together. Through our family of apps and services, we're building a different kind of company that connects billions of people around the world, gives them ways to share what matters most to them, and helps bring people closer together. Whether we're creating new products or helping a small business expand its reach, people at Facebook are builders at heart. Our global teams are constantly iterating, solving problems, and working together to empower people around the world to build community and connect in meaningful ways. Together, we can help people build stronger communities we're just getting started.

Facebook is committed to providing reasonable support (called accommodations) in our recruiting processes for candidates with disabilities, long term conditions, mental health conditions or who are neurodivergent, and to candidates with sincerely held religious beliefs or requiring pregnancy related support. If you need support, please reach out to accommodations-ext@fb.com.

Facebook

Menlo Park, CA

Facebook, Inc. provides various products to connect and share through mobile devices, personal computers, and other surfaces worldwide. The company’s products include Facebook that enables people to connect, share, discover, and communicate with each other on mobile devices and personal computers; Instagram, a community for sharing photos, videos, and messages; Messenger, a messaging application for people to connect with friends, family, groups, and businesses across platforms and devices; and WhatsApp, a messaging application for use by people and businesses to communicate in a private way. It also provides Oculus, a hardware, software, and developer ecosystem, which allows people to come together and connect with each other through its Oculus virtual reality products. As of December 31, 2018, it had approximately 1.52 billion daily active users. The company was founded in 2004 and is headquartered in Menlo Park, California.