Drive, support, and deliver on a strategy to operate on a build broad use of Amazons utility computing web services (e.g., AWS EC2, AWS S3, AWS RDS, AWS CloudFront, AWS EFS, CloudWatch, EKS) • Identify opportunities to improve resiliency, availability, security, high-performance platforms in Public Cloud using JPMC best practices • Improve reliability, quality, and reduce time to resolve issues in production incidents on software applications • Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve • Provide primary operational support and engineering for the public cloud platform • Debug and optimize systems and automate routine tasks • Collaborate with a cross-functional team to identify potential risks in production and opportunities to improve user experiences at every interaction • Drive work streams to ensure Applications meet strict non-functional requirements for Public Cloud On-boarding • Evaluate production readiness through game days, resiliency tests, and chaos engineering exercises • Utilize programming languages like Java, Python, SQL, Node, Go, and Scala, Open Source RDBMS and NoSQL databases, Container Orchestration services including Docker and Kubernetes, and a variety of AWS tools and services • Monitor metrics and program health, anticipate and clear blockers, manage escalations. Experience across the SDLC process - Design and/or Development and/or support Experience/knowledge using monitoring solutions like CloudWatch, Prometheus, Datadog Experience/knowledge of writing Infrastructure-as-Code (IaC), using tools like CloudFormation or Terraform Experience with one or more public cloud platforms like AWS, GCP, Azure Experience with automation of Infrastructure tasks using Python or other languages Knowledge in leveraging AI/LLM tools to enable self-service on diagnosing infrastructure errors/failures.