site reliability engineering or related field)Proficiency in Terraform and programming languages such as Python, Go, or JavaDeep expertise in cloud platforms, particularly AWS, and container orchestrationStrong background in distributed systems, performance tuning, and automationHands-on experience with configuration management tools such as Puppet, Chef, or SaltPreferred:Bachelor''s Degree in Computer Science, Information Technology, Engineering, or associated disciplineExperience working with advanced ETL data workflows including technologies such as AWS EMR, Azure Synapse, Azure Data Factory, or Apache Hive/Spark/AirflowExperience with IaC deployment of AKS/EKS/GKE architectureExperience with enterprise Data Lake environments using technologies such as DataBricks or SnowflakeCompetencies:Expert analytical/quantitative, problem-solving, and deductive reasoning skills, experience performing advanced troubleshooting and root cause analysis of complex technical issuesExcellent organizational, planning, and time management skills and ability to work independently and in a team environment to manage competing priorities and meet deadlinesAdvanced verbal and written communication skills with the ability to present findings, conclusions, alternatives, and information clearly and conciselyExperience working with all levels of business professionals, management, stakeholders, and vendors with the ability to build effective relationships through trust and diplomacyCooley offers a competitive compensation and excellent benefits package and is committed to fair and equitable employment practices. Specific duties and responsibilities include, but are not limited to, the following:Position responsibilities:Monitor and maintain production systems to ensure high availability and performanceImplement and manage service-level indicators (SLIs), objectives (SLO's), agreements (SLA's), and error budgetsParticipate in on-call rotations and incident response, including root cause analysis and postmortemsDevelop and maintain infrastructure as code (IaC) using TerraformAutomate deployment, scaling, and recovery processes to reduce manual interventionPartner with DevOps to build and maintain CI/CD pipelines to support safe and efficient software deliveryImplement observability solutions using metrics, logs, traces, and alerting systems (Prometheus, Grafana, DataDog, etc.)Proactively identify and resolve system bottlenecks and reliability risksWork closely with Infrastructure, DevOps, Development, and security teams to embed reliability into the development lifecycleContribute to a culture of blameless post-mortems and continuous improvementDocument operational procedures and share knowledge across teamsAll other duties as assigned or requiredSkills and experience:Required:After orientation at Cooley LLP, exhibit proficiency in the Microsoft Office suite, iManage and other firm applicationsAbility to work extended and/or weekend hours, as requiredAbility to travel, as required6+ years direct applicable experience (e.g.