Data Center Incident Program Manager - IC4

Oracle Corp

Nashville, TN

JOB DETAILS
SALARY
$90,100–$209,500 Per Year
SKILLS
Accidental Death and Dismemberment (AD&D), Analysis Skills, Artificial Intelligence (AI), Automation, Best Practices, Cloud Computing, Communication Skills, Continuous Improvement, Corrective Action, Cross-Functional, Customer Relations, Data Analysis, Dental Insurance, Detail Oriented, Distribution Control, Documentation, Embedded Systems, Establish Priorities, Financial Planning, Flexible Spending Accounts, HVAC, Healthcare, Incident Management, Incident Response, Integrated Circuits (ICs), Leadership, Legal, Life Insurance, Machine Tool, Maintain Compliance, Network Architecture/Engineering, Network Operations Center, Occupational Health, Operational Improvement, Oracle, Organizational Skills, Problem Solving Skills, Process Improvement, Process Management, Project Tracking, Project/Program Management, Record Keeping, Reporting Skills, Risk Management, Root Cause Analysis, Security Protocols, Stock Purchase Plans, System Integration (SI), Team Building, Team Lead/Manager, Team Player, Test Requirements, Training Program, Vision Plan
LOCATION
Nashville, TN
POSTED
1 day ago

This hybrid position is available in Broomfield, Colorado, or Nashville, Tennessee. Candidates must be able to work on-site according to the team''s schedule.

The Incident Program Manager will be responsible for overseeing and supporting all incidents related to critical operations, ensuring rapid resolution and minimal disruption to data center functionality. This role involves managing incident response processes, supporting new training initiatives for teams, automating incident tooling to enhance efficiency, and driving root cause analysis (RCA) along with corrective actions to prevent future occurrences. You will serve as a key coordinator for incident management, collaborating across teams to maintain the security, reliability, and uptime of the data center''s operations.

Only Oracle brings together the data, infrastructure, applications, and expertise to power everything from industry innovations to life-saving care. And with AI embedded across our products and services, we help customers turn that promise into a better future for all. Discover your potential at a company leading the way in AI and cloud solutions that impact billions of lives.

True innovation starts when everyone is empowered to contribute. That's why we're committed to growing a workforce that promotes opportunities for all with competitive benefits that support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.

We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_mb@oracle.com or by calling 1-888-404-2494 in the United States.

Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.

Disclaimer:

Certain U.S. based or U.S. customer or client-facing roles may be required to comply with applicable requirements, such as immunization/occupational health mandates, and/or drug testing requirements.

Range and benefit information provided in this posting are specific to the stated locations only

US: Hiring Range in USD from: $90,100 to $209,500 per annum. May be eligible for bonus and equity.

Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle''s differing products, industries and lines of business.

Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.

Oracle US offers a comprehensive benefits package which includes the following:

  1. Medical, dental, and vision insurance, including expert medical opinion

  2. Short term disability and long term disability

  3. Life insurance and AD&D

  4. Supplemental life insurance (Employee/Spouse/Child)

  5. Health care and dependent care Flexible Spending Accounts

  6. Pre-tax commuter and parking benefits

  7. 401(k) Savings and Investment Plan with company match

  8. Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.

  9. 11 paid holidays

  10. Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.

  11. Paid parental leave

  12. Adoption assistance

  13. Employee Stock Purchase Plan

  14. Financial planning and group legal

  15. Voluntary benefits including auto, homeowner and pet insurance

The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.

Career Level - IC4

Responsibilities

  • Incident Support & Management: Provide comprehensive support for all incidents in data center critical operations, including monitoring, triage, and coordination of responses to issues involving power, cooling, HVAC, network, and other infrastructure systems to ensure uninterrupted service.
  • Training Initiatives: Support the development and implementation of new training programs for data center teams, focusing on incident response best practices, system familiarity, and proactive risk mitigation to build team capabilities and preparedness.
  • Automate Incident Tooling: Identify opportunities to automate incident detection, alerting, and resolution tools, integrating with existing systems like BMS and EPMS to streamline workflows, reduce manual intervention, and improve overall response times.
  • RCA & Corrective Actions: Manage and drive root cause analysis for incidents, collaborating with cross-functional teams to develop and implement corrective actions, track progress, and ensure lessons learned are applied to prevent recurrence.
  • Cross-Department Collaboration: Work closely with Data Center Operations, engineering, security, and vendor teams to align on incident protocols, share insights, and foster a culture of continuous improvement in critical operations.
  • Incident Documentation & Reporting: Maintain detailed records of all incidents, RCAs, corrective actions, and outcomes. Generate reports for stakeholders to provide visibility into incident trends, resolution effectiveness, and areas for enhancement.
  • Process Optimization & Continuous Improvement: Analyze incident data to recommend enhancements in processes, tools, and training, aiming to boost the efficiency, reliability, and resilience of data center operations.

Requirements

  • Minimum of 3 years of experience in incident management, program management, or related roles within data center operations or mission-critical environments.
  • Proven experience supporting incidents in data center infrastructure, including familiarity with systems like BMS, EPMS, power, cooling, and HVAC.
  • Hands-on experience with root cause analysis, corrective action planning, and incident tooling automation in high-stakes settings.
  • Knowledge of data center architecture, including network, servers, power distribution, environmental controls, and security protocols.
  • Strong problem-solving and analytical skills to lead RCAs and drive solutions under pressure.
  • Attention to detail in documenting incidents, tracking actions, and monitoring compliance.
  • Excellent communication and collaboration skills for training support, cross-team coordination, and reporting.
  • Ability to handle high-pressure situations with composure, prioritizing actions to minimize downtime.

Responsibilities

  • Incident Support & Management: Provide comprehensive support for all incidents in data center critical operations, including monitoring, triage, and coordination of responses to issues involving power, cooling, HVAC, network, and other infrastructure systems to ensure uninterrupted service.
  • Training Initiatives: Support the development and implementation of new training programs for data center teams, focusing on incident response best practices, system familiarity, and proactive risk mitigation to build team capabilities and preparedness.
  • Automate Incident Tooling: Identify opportunities to automate incident detection, alerting, and resolution tools, integrating with existing systems like BMS and EPMS to streamline workflows, reduce manual intervention, and improve overall response times.
  • RCA & Corrective Actions: Manage and drive root cause analysis for incidents, collaborating with cross-functional teams to develop and implement corrective actions, track progress, and ensure lessons learned are applied to prevent recurrence.
  • Cross-Department Collaboration: Work closely with Data Center Operations, engineering, security, and vendor teams to align on incident protocols, share insights, and foster a culture of continuous improvement in critical operations.
  • Incident Documentation & Reporting: Maintain detailed records of all incidents, RCAs, corrective actions, and outcomes. Generate reports for stakeholders to provide visibility into incident trends, resolution effectiveness, and areas for enhancement.
  • Process Optimization & Continuous Improvement: Analyze incident data to recommend enhancements in processes, tools, and training, aiming to boost the efficiency, reliability, and resilience of data center operations.

Requirements

  • Minimum of 3 years of experience in incident management, program management, or related roles within data center operations or mission-critical environments.
  • Proven experience supporting incidents in data center infrastructure, including familiarity with systems like BMS, EPMS, power, cooling, and HVAC.
  • Hands-on experience with root cause analysis, corrective action planning, and incident tooling automation in high-stakes settings.
  • Knowledge of data center architecture, including network, servers, power distribution, environmental controls, and security protocols.
  • Strong problem-solving and analytical skills to lead RCAs and drive solutions under pressure.
  • Attention to detail in documenting incidents, tracking actions, and monitoring compliance.
  • Excellent communication and collaboration skills for training support, cross-team coordination, and reporting.
  • Ability to handle high-pressure situations with composure, prioritizing actions to minimize downtime.

About the Company

O

Oracle Corp

For over three decades, Oracle has been the center of innovation for business software birthplace of the first commercially available relational database, the first suite of internet-based applications, and the next-generation enterprise-computing platform, Oracle Fusion. Today, Oracle provides the world's most complete, open, and integrated business software and hardware systems, with more than 370,000 customers including - 100 of the Fortune 100 - representing a variety of sizes and industries in more than 145 countries around the globe. And Oracle's 110,000 global employees - including 30,000 developers working full-time on Oracle products -are critical to that success. Oracle Supports Workforce Diversity
COMPANY SIZE
10,000 employees or more
INDUSTRY
Computer/IT Services
FOUNDED
1977