Data Center Facilities Manager

Beijing ByteDance Technology Co Ltd

Ashburn, VA

JOB DETAILS
SKILLS
Asset Management, Budgeting, Business Strategy, Capital Expenditure (CAPEX), Capital Project, Change Management, Collocation, Commissioning, Communication Skills, Conflict Resolution, Construction, Construction Design, Continuous Improvement, Corrective Action, Corrective and Preventative Action (CAPA) Systems, Cost Control, Cross-Functional, Electrical Engineering, Electricity, Energy Efficiency, Environmental Health, Facilities Engineering, Facilities Management, Financial Management, Forecasting, Identify Issues, Leadership, Low Voltage (LV), Maintain Compliance, Mechanical Engineering, Mentoring, Network Operations Center, On Call, Operational Audit, Operational Expenditure (OPEX), Operational Support, Operations Processes, People Management, Performance Metrics, Power Plant, Preventative Maintenance, Project Lifecycle, Project Management Professional (PMP), Regulations, Resource Management, Risk, Risk Management, Root Cause Analysis, Safety Compliance, Safety/Work Safety, Semiconductors, Service Level Agreement (SLA), Standard Operating Procedures (SOP), Succession Planning, Talent Management, Vehicle Fleets, Vendor/Supplier Evaluation, Vendor/Supplier Management, Vendor/Supplier Planning
LOCATION
Ashburn, VA
POSTED
6 days ago

The Datacenter Facility Operation team supports the company's hyper-scale growth by operating, maintaining, and optimizing our critical infrastructure. We ensure 100% uptime, maximum energy efficiency, and operational excellence across our global data center footprint. The team focuses on scaling critical infrastructure (Power and Cooling) through rigorous standard operational procedures, innovation, and culture of safety.

As a Data Center Facility Manager, you will be responsible for the overall operational excellence, reliability, and financial management of critical infrastructure within your assigned data center site(s). You will transition from tactical hands-on troubleshooting to strategic leadership-working with cross-functional teams, driving colocation partner governance, and maintaining ultimate accountability for site uptime, safety, and efficiency. You will bridge the gap between high-level business strategy and ground-level execution, ensuring that our infrastructure scales seamlessly to meet the demands of our server fleet.

Responsibilities People Leadership & Talent Development:

  • Lead, mentor, and develop a high-performing team of data center facility operation engineers and technicians; build a culture of accountability, safety, and continuous improvement.
  • Manage shift planning, resource allocation, and succession planning to ensure 24/7 technical coverage.

Operational Excellence & SLA Governance:

  • Accountable for site uptime and strict adherence to strict Service Level Agreements (SLAs). Serve as the escalation point for major site incidents.
  • Establish, audit, and govern the maintenance programs of colocation partners to ensure high-quality execution of preventative and corrective maintenance.
  • Oversee Root Cause Analysis (RCA) and Corrective Actions/Preventive Actions (CAPA) for critical infrastructure failures, ensuring lessons learned are institutionalized globally.

Vendor Strategy & Contract Management:

  • Manage critical colocation and vendor partnership, driving Key Performance Indicators (KPIs) and operational governance.
  • Lead regular Quarterly Business Reviews (QBRs) and operational audits with partners and critical equipment vendors (Generators, UPS, Chillers).

Financial & Asset Management (CapEx/OpEx):

  • Own the site operational budget (OpEx) and forecast lifecycle capital improvement projects (CapEx).
  • Identify opportunities for infrastructure optimization, energy efficiency (PUE reduction), and cost-saving initiatives without compromising reliability.

Risk & Change Management:

  • Serve as the assigned site authority for Critical Environment Work Authorizations (CEWA) and high-risk Method of Procedures (MOPs).
  • Enforce a zero-injury safety culture, ensuring compliance with global and local environmental, health, and safety (EHS) regulations.

Deployment, Commissioning & Lifecycle Support:

  • Partner with Design, Construction, and Global Commissioning teams to oversee data hall fit-outs, capacity expansion, commissioning tests and facility audits.
  • Ensure seamless handovers from construction to operations (Operational Readiness), including updated single-line diagrams, SOPs, and EOPs.Minimum Qualifications
  • Education: Bachelor's Degree in Electrical Engineering, Mechanical Engineering, or a related technical discipline.
  • Experience: 5+ years of experience in critical infrastructure operations (Data Centers, Semiconductor Fabs, or Power Plants).
  • Technical Acumen: Deep understanding of data center tiering standards (TIA-942 Rated 3/4, Uptime Tier III/IV), Power Redundancy (2N, N+1 distributed), and Cooling Redundancy (Concurrent Maintainability).
  • Operations Knowledge: Proven track record of managing high-risk change management (MOPs/SOPs/EOPs) and conducting Root Cause Analysis (RCA) for complex electrical/mechanical failures.
  • Systems Familiarity: Strong practical knowledge of Megawatt-class Diesel Generators, Static/Rotary UPS, Medium/Low Voltage Switchgears, Chillers, Cooling Towers, CRAH/AHU units, and BMS/EPMS systems.

Preferred Qualifications

  • Certifications: CDCP (Certified Data Center Professional), CDFM (Certified Data Center Facilities Manager), PMP, or equivalent professional engineering licenses.
  • Vendor & Financial Management: Demonstrated experience managing large-scale vendors/colocation providers and controlling OpEx/CapEx budgets.
  • Soft Skills: Outstanding communication, cross-functional collaboration, and conflict-resolution skills. Ability to communicate complex technical incidents clearly to executive leadership. At least 2+ years experience in a people management, supervisory, or site-lead role within a hyperscale or large-scale colocation environment.
  • Agility: Ability to thrive in a fast-paced, ambiguous environment and participate in an on-call rotation for emergency escalation.

About the Company

B

Beijing ByteDance Technology Co Ltd