Facebook

Data Center Production Operations Manager

Posted on: 9 Mar 2021

Los Lunas, NM

Job Description

Facebook is seeking a forward-thinking, experienced individual to join the Data Center Operations Team. The person should enjoy working in a fast-paced environment where adaptability and flexibility are key to their success. We seek an IT professional with management and leadership experience and advanced hands-on technical skills in Server Hardware, Project Management, Quality Management, Data Analytics, Networks, OS repair, Linux and Automation (ideally in a datacenter environment). Depth and breadth of knowledge in managing servers in a large-scale distributed environment is a core competency for this individual. The Production Operations Manager is responsible for managing and maintaining server production, including uptime, utilization, systemic technical issues and repairs throughout the Data Center.

Data Center Production Operations Manager Responsibilities

* Establishing and managing a Data Center Operations Team accountable for the maintenance and operation of server hardware and supporting infrastructure at scale

* Accountable for the health of server capacity delivering Facebook's products and services from the datacenter site, and for ensuring operational delivery through collaboration and partnership with both remote and local peer organizations

* Work with peer organizations and regional teams that affect and deliver services to datacenter operations such as network operations, project management, facilities/maintenance management, logistics, hardware design, automated tooling and supply chain operations in order to successfully maintain data center uptime to enable ongoing business growth

* Mentoring and developing engineers and technicians such that they can run daily operations with minimal supervision

* Build and lead a diverse, world-class data center operations team, developing both the technical capabilities and leadership qualities of engineers and technicians

* Collaborating with other Production Operations Managers in datacenter sites around the globe to evolve and optimize processes and approaches in a globally consistent way to allow Facebook to scale and grow effectively

* Creating and driving a culture of ownership, innovation, collaboration, accountability, and safety. Support and contribute thought leadership to the development and implementation of business practices, processes and automated tooling that enable the growth and ongoing management of our global datacenter IT footprint

* Manage server upgrades, integration, automated OS provisioning process, rebuilds and other projects as required. Understand and debug network, hardware, and Linux OS related issues

* Identify and participate in the creation of documentation for the global DC knowledge base. Implement process improvements and inform best practices in data center operations

* Predict data center growth and scaling issues before they occur and implement solutions. Maintain deep knowledge and ownership of a hyper-scale computing fleet through the use of data trending and analysis to identify trends and systemic issues, reporting out globally as required

* Drive specifications for tooling and automation that facilitate deployment, monitoring, automated remediation and decommissioning of server hardware at scale

Minimum Qualifications

* BS or BA in technical field or commensurate experience

* 4+ years of experience managing 5+ technical resources

* Knowledge of Linux and hardware systems support in an Internet operations environment

* Familiarity with Python, SQL and/or shell scripting

* 2+ years of experience managing multiple projects within the same time schedule

* Solid knowledge of enterprise level infrastructure

* Understanding of out-of-band/lights-out server communication methods, such as IPMI and serial console

* Communication skills

* Proven time and project management skills

* Experience training, mentoring, and leading other engineers and technicians

Preferred Qualifications

* 4+ years of experience in large-scale data center hardware deployments and building scalable infrastructure

About the Facebook company

Facebook's mission is to give people the power to build community and bring the world closer together. Through our family of apps and services, we're building a different kind of company that connects billions of people around the world, gives them ways to share what matters most to them, and helps bring people closer together. Whether we're creating new products or helping a small business expand its reach, people at Facebook are builders at heart. Our global teams are constantly iterating, solving problems, and working together to empower people around the world to build community and connect in meaningful ways. Together, we can help people build stronger communities. We're just getting started.

Facebook is committed to providing reasonable support (called accommodations) in our recruiting processes for candidates with disabilities, long-term conditions, or mental health conditions, candidates who are neurodivergent, and candidates with sincerely held religious beliefs or who require pregnancy-related support. If you need support, please reach out to accommodations-ext@fb.com.

(Colorado only*) Minimum salary of $137,000/year + bonus + equity + benefits
*Note: Disclosure as required by sb19-085(8-5-20)

Facebook

Menlo Park, CA

Facebook, Inc. provides various products to connect and share through mobile devices, personal computers, and other surfaces worldwide. The company’s products include Facebook that enables people to connect, share, discover, and communicate with each other on mobile devices and personal computers; Instagram, a community for sharing photos, videos, and messages; Messenger, a messaging application for people to connect with friends, family, groups, and businesses across platforms and devices; and WhatsApp, a messaging application for use by people and businesses to communicate in a private way. It also provides Oculus, a hardware, software, and developer ecosystem, which allows people to come together and connect with each other through its Oculus virtual reality products. As of December 31, 2018, it had approximately 1.52 billion daily active users. The company was founded in 2004 and is headquartered in Menlo Park, California.
