Do you want to help build some of the largest and most consequential enterprise and customer technology systems in the world? Join Apple's Information Systems and Technology (IS&T) organization. IS&T is the engine behind everything Apple does for customers and for the people who build for them. It's Apple's central nervous system. Supporting 2.5 billion active Apple devices, processing billions of secure transactions, and keeping the technology that defines modern life running flawlessly, IS&T makes the impossible feel effortless."
Do you love building solutions to handle global complexity and immense scale? Imagine what you could do here.
Infrastructure Services is part of IS&T and the foundation of Apples global network operations - managing data center equipment and systems to deliver compute, storage, and networking services for teams across Apple, including its internal developer community. From individual facilities to a worldwide network, Infrastructure Services ensures the technology underneath everything works without question.
We are seeking a Senior DevOps Engineer with deep expertise in cloud infrastructure, Kubernetes, and platform operations, combined with a forward-looking mindset in AI-driven automation. This role partners closely with the Senior DevOps Engineering Manager to scale and modernize our cloud platform, improve operational excellence, and embed intelligent automation across DevOps workflows. Cloud Platform and Infrastructure: Design, build, and operate scalable, secure, and cost-efficient cloud environments on AWS. Lead cloud migration and modernization efforts, from VMs to containers to cloud-native architectures. Establish and enforce infrastructure standards, governance, and best practices. Drive improvements in availability, scalability, and performance.
Kubernetes and Platform Engineering: Architect, deploy, and operate large-scale Kubernetes platforms. Build and maintain multi-cluster and multi-tenant architectures. Improve developer experience through platform tooling and self-service capabilities. Optimize workloads for cost, performance, and reliability. Lead cluster lifecycle management, upgrades, and security hardening.
AI-Driven Automation: Design and implement AI-powered automation across DevOps workflows, including incident triage, root cause analysis, and runbook automation. Build and integrate intelligent systems using LLM APIs and observability platforms. Identify automation opportunities across engineering teams and drive adoption. Evaluate and implement emerging AI/ML tools for operational use cases.
Software Engineering (Golang): Develop internal tools, operators, and automation systems using Golang. Build APIs, controllers, and integrations for infrastructure and platform services. Contribute to reusable frameworks for automation and orchestration.
Operations and Reliability Engineering: Own and improve production operations, including incident response and postmortems. Define and implement SLOs, SLIs, and error budgets. Enhance observability (metrics, logs, traces) using tools such as Prometheus and Grafana. Drive continuous improvement in operational processes and runbooks.
Leadership and Collaboration: Act as a technical leader and mentor within the DevOps team. Partner with architects, developers, and product teams on system design and reliability. Influence roadmap decisions for cloud platform evolution. Help upskill team members in Kubernetes, cloud, and AI-driven DevOps practices.Strong experience in DevOps, SRE, or Platform Engineering roles. Deep hands-on experience with Kubernetes in production at scale. Strong expertise in AWS (Azure or GCP experience also valued). Proficiency in Golang for building infrastructure and automation tools. Strong understanding of CI/CD systems and pipeline design. Strong understanding of distributed systems and microservices architectures. Proven experience in production operations and incident management. Experience building or maintaining observability platforms.10+ years experience in DevOps, SRE, or Platform Engineering roles. Experience with AI/ML or LLM-based automation in DevOps workflows. Familiarity with orchestration frameworks such as LangChain or LangGraph. Experience building Kubernetes operators or controllers. Background in platform engineering or internal developer platforms.
We’re a diverse collection of thinkers and doers, continually reimagining what’s possible to help us all do what we love in new ways. The people who work here have reinvented entire industries with the Mac, iPhone, iPad, and Apple Watch, as well as with services, including iTunes, the App Store, Apple Music, and Apple Pay. And the same passion for innovation that goes into our products also applies to our practices — strengthening our commitment to leave the world better than we found it.
There’s a place here for every kind of brilliant. Everyone here is an innovator, or an innovator-to-be, no matter what your team or your role. So bring your passion, courage, and original thinking and get ready to share it, because every new product, service, or feature we invent is the result of people working together to make each others’ ideas stronger. Innovation at this level depends on people who represent the variety of the human experience and inspire us with their own fresh perspectives. Together, we’ll do amazing work that can make a difference in people’s lives. Including your own. Learn more about working at Apple.