Facebook

Data Center Product Hardware Platform Lead Engineer

Posted on: 15 Mar 2021

Fremont, CA

Job Description

Facebook is seeking a forward thinking, experienced Product Hardware Platform Lead Engineer to join the Data Center Site Operations team. The Product Hardware Platform Engineering team is responsible for the overall performance of Facebooks production compute and storage platforms through their life-cycles in our data centers. This role will lead a team focused on maintaining and improving the health of the platform from verification testing into mass production through end-of-life. Key responsibilities include identifying systemic hardware, firmware, and tooling issues; engaging in hands-on problem solving; and collaborating effectively with cross-functional engineering and tooling teams to improve performance of the fleet. Our data centers, and the tens of thousands of servers installed in them, are the foundation upon which our rapidly scaling infrastructure efficiently operates and upon which our innovative services are delivered. Facebook is at the leading edge of the global data center industry both in terms of how data centers are designed and operated. This person should enjoy working in a fast-paced environment where adaptability and flexibility will be key to their success. We seek an individual who can quickly absorb and understand the technical challenges of subject matter experts and local site operations teams, create alignment between these globally distributed teams as well as partner organizations, and can set informed priorities and direction while getting buy-in and commitment from relevant stakeholders.

Data Center Product Hardware Platform Lead Engineer Responsibilities

* Lead the team that provides end-to-end lifecycle ownership (verification test through end of life decommissioning) of hardware platforms and new technologies in the data centers

* Serve as the central point of contact representing the hardware platforms and new technologies across SiteOps, and be the subject matter experts on hardware platform issues, for datacenter operations teams

* Drive complex technical investigations globally and spanning multiple disciplines such as Hardware, Software/Firmware, Networking and Power & Cooling

* Issue timely alerts and support fixes to operations teams, and assure a robust feedback pipeline to engineering teams

* Provide serviceability feedback on production hardware to engineering design teams

* Provide technical mentorship on large scale data center projects and initiatives to global, cross-functional teams

* Build strong relationships and collaboration with engineering and cross functional teams across the company. Actively solicit feedback from teams, and use that feedback to improve operational effectiveness as infrastructure scales

* Own the cross-functional communication with other technical operations groups to help resolve incidents

* Collaborate with stakeholders, functional owners and subject matter experts to interpret and articulate business and operations needs

* Ability to travel up to 30% required

Minimum Qualifications

* BS or BA in technical field or commensurate experience

* 10+ years experience in hardware validation, working with cross functional teams to deliver products to production

* Experience working across a diverse global organization and building partnerships with cross functional teams inside and outside of the organization

* Experience triaging and debugging hardware platforms

* Experience in processing and analyzing large sets of data

* Proven knowledge of server and storage platforms, principles, technologies, protocols, and standards

* Experience managing multiple concurrent projects and managing tight timelines

* Experience working independently within a multi-disciplinary team of hardware and operations engineers

* Experience working with Linux or Unix Operating systems

* Proven technical skills, experience creating documentation for users of all levels

Preferred Qualifications

* Large-scale data center environment experience, including hardware deployments, deep system knowledge of Linux, Server Hardware, networking, network protocols, supply chain and Data Center automation

* Bash, PHP, Python, or Perl scripting experience

* Experience in data center system and process automation

* Leadership presence and presentation skills

Locations

About the Facebook company

Facebook's mission is to give people the power to build community and bring the world closer together. Through our family of apps and services, we're building a different kind of company that connects billions of people around the world, gives them ways to share what matters most to them, and helps bring people closer together. Whether we're creating new products or helping a small business expand its reach, people at Facebook are builders at heart. Our global teams are constantly iterating, solving problems, and working together to empower people around the world to build community and connect in meaningful ways. Together, we can help people build stronger communities we're just getting started.

Facebook is committed to providing reasonable support (called accommodations) in our recruiting processes for candidates with disabilities, long term conditions, mental health conditions or who are neurodivergent, and to candidates with sincerely held religious beliefs or requiring pregnancy related support. If you need support, please reach out to accommodations-ext@fb.com.

Facebook

Menlo Park, CA

Facebook, Inc. provides various products to connect and share through mobile devices, personal computers, and other surfaces worldwide. The company’s products include Facebook that enables people to connect, share, discover, and communicate with each other on mobile devices and personal computers; Instagram, a community for sharing photos, videos, and messages; Messenger, a messaging application for people to connect with friends, family, groups, and businesses across platforms and devices; and WhatsApp, a messaging application for use by people and businesses to communicate in a private way. It also provides Oculus, a hardware, software, and developer ecosystem, which allows people to come together and connect with each other through its Oculus virtual reality products. As of December 31, 2018, it had approximately 1.52 billion daily active users. The company was founded in 2004 and is headquartered in Menlo Park, California.

Similar Jobs