Logo
META

Data Center Facility Operations Reliability Engineer

META, Boston, Massachusetts, us, 02298

Save Job

Overview

Summary:

Ready to make your application Please do read through the description at least once before clicking on Apply. Meta was built to help people connect and share, and over the last decade, our tools have played a critical part in changing how people around the world communicate with one another. With over two billion people using the service and hundreds of offices around the globe, a career at Meta offers countless ways to make an impact in a fast-growing organization.Our Data Centers are the foundation upon which our rapidly scaling infrastructure efficiently operates to deliver our innovative services. Meta is seeking an experienced and self-motivated Reliability Lead to join our Asset Management & Reliability team within Facility Operations. This person will work at the leading edge of Facility Operations to identify and manage asset reliability risks that could adversely affect data center operations. Managing stakeholders spread across time zones is a significant challenge and key to the success of our individual projects and overall asset management, quality and reliability program. Responsibilities

Support the asset care and maintenance strategies for critical assets based on Meta Processes

Support the development of standards, guidelines and processes to execute reliability program function

Lead and facilitate asset criticality assessments, RCM studies, PM Optimization and other reliability studies

Perform reliability analytics include Weibull distribution, Monte Carlo simulation and other reliability analysis

Act as liaison between Reliability and other partner teams (AM, Quality, SSU, Retrofits)

Support the development of standardized PM template to facilitate trending

Works with appropriate technical teams to evaluate reliability and maintainability of data center equipment to significantly influence reliability and maintainability improvements

Works with Asset Management and Quality teams to evaluate the failure data and other information and build that into a global reliability database

Provides input for key documents such as reliability process playbooks, executive, briefs, presentations and program metrics

Support the spares development and sustainment program

Support the development and stewardship of maintenance strategies

Support Master Data and asset onboarding process

Develop or recommend engineering solutions to repetitive failures and all other problems that adversely affect plant operations

Define, design, develop, monitor, and refine an asset maintenance plan that includes both (a) value-added preventive maintenance tasks and (b) predictive and other non-destructive testing methods designed to identify and isolate inherent reliability issues

Develop Reliability Improvement Process (RIP) reports on critical asset failures

Work with Maintenance to analyze asset characteristics, including: asset availability, overall equipment effectiveness, remaining useful life

Provide technical support to Operations, Maintenance management, and technical personnel

Apply value analysis to repair/replace, repair/redesign, and make/buy decisions

Minimum Qualifications

Bachelor’s degree in Mechanical, Electrical Reliability Engineering or similar technical discipline

10+ years of experience in reliability engineering (related to electrical or mechanical cooling equipment)

Experienced in Reliability Centered Maintenance (RCM)and Failure Maintenance Effect Analysis activities for maintenance /process/equipment design optimization to meet reliability requirements

Proficient in usage of EAM solutions to extract data and develop meaningful insights

Certifications in Maintenance & Reliability such as CMRP, CRL, CRE

Knowledgeable of relevant ISO standards (ISO 14224, ISO 17359, ISO 55000)

Preferred Qualifications

Experience with data center equipment such as critical cooling systems, generators, main switchboards, network gear

Proficient in data analysis techniques that can include Process Control, Reliability modeling and prediction, Fault Tree Analysis, Weibull Tree Analysis, Six Sigma (6σ) Methodology

Proficient in developing and executing test plans for assets

Certifications in Maintenance & Reliability such as CMRP, CRL, CRE

Knowledgeable of relevant ISO standards (ISO 14224, ISO 17359, ISO 55000)

Public Compensation $133,000/year to $190,000/year + bonus + equity + benefits Industry: Internet Equal Opportunity Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law. Meta participates in the E-Verify program in certain locations, as required by law. Please note that Meta may leverage artificial intelligence and machine learning technologies in connection with applications for employment. Meta is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance or accommodations due to a disability, please let us know at accommodations-ext@fb.com.

#J-18808-Ljbffr