citizenship is required for this position, as the successful candidate will be required to obtain (and maintain) a U.S. government security clearance after hire._**

**Required Skills**

**Infrastructure & Reliability**
+ Experience building and operating high-availability, fault-tolerant systems
+ Strong understanding of distributed systems, performance monitoring, and resiliency patterns
+ Experience with incident response, root-cause analysis, and production troubleshooting

**AI-Native Engineering (NEW)**
+ Hands-on experience applying Generative AI or Agentic AI (e.g., LangChain, AutoGPT, custom agents) to:
  + Infrastructure lifecycle management
  + Observability and anomaly detection
  + Incident response and remediation automation
+ Ability to design or integrate AI-driven workflows for operational efficiency and reliability
+ Familiarity with building or integrating autonomous agents for DevOps/SRE use cases

**Cloud & Multi-Cloud Ecosystems**
+ Strong experience with **multi-cloud environments** (OCI, AWS/Azure)
+ Deep understanding of cloud infrastructure design, deployment, and resource optimization
+ Experience managing hybrid or cross-cloud architectures

**DevOps/SRE Practices**
+ Advanced competency in CI/CD pipelines (Jenkins, Kubernetes)
+ Infrastructure as Code (Terraform)
+ Observability tools (Prometheus, Grafana)
+ Strong focus on **automation-first operations**

**Data Technologies**
+ Proficiency in Data Warehousing platforms (e.g., Vertica, Snowflake)
+ Experience with ETL frameworks and large-scale data processing
+ Understanding of columnar storage systems

**BI & Reporting**
+ Experience supporting or integrating BI tools (Tableau, Power BI, Oracle Analytics)

**Programming & Tools**
+ Strong proficiency in Python, Java, or Go
+ Experience with Docker, Kubernetes, and shell scripting

**Problem-Solving**
+ Strong troubleshooting skills with ability to perform root-cause analysis
+ Experience resolving complex production issues in distributed systems

**Responsibilities**

Work with the Site Reliability Engineering (SRE) team to take shared ownership of services and platform components.

Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business.