Artificial Intelligence (AI), Data Quality, Incident Response, Machine Tool, Reliability Engineering, Safety Systems, Systems Analysis, Workflow Analysis
Seeking an AI Platform Reliability Engineer to ensure the reliability, observability, and safety of AI systems and analytics workflows in production.
This role involves building monitoring, tracing, and diagnostics; implementing versioning and rollout controls; supporting data quality and anomaly detection; and responding to incidents.
The candidate should bring strong engineering discipline in observability, release safety, and operational tooling, and apply those skills to modern AI and agent-based systems.
Responsibilities include maintaining monitoring tools, designing evaluation practices, supporting data reliability, enforcing operational safeguards, and collaborating with engineering teams.
The position offers a competitive salary range, benefits, and opportunities to contribute to scalable, trustworthy AI solutions.
Oracle
For over three decades, Oracle has been the center of innovation for business software: the birthplace of the first commercially available relational database, the first suite of internet-based applications, and the next-generation enterprise-computing platform, Oracle Fusion. Today, Oracle provides the world's most complete, open, and integrated business software and hardware systems, with more than 370,000 customers, including 100 of the Fortune 100, representing a variety of sizes and industries in more than 145 countries around the globe. Oracle's 110,000 global employees, including 30,000 developers working full-time on Oracle products, are critical to that success.
Oracle Supports Workforce Diversity