Site Reliability Engineer - CTJ - Poly

Microsoft Corp

Reston, VA

JOB DETAILS
SALARY
$119,800–$234,700 Per Year
SKILLS
ARM (Advanced RISC Machine), Amazon Web Services (AWS), Apache Hadoop, Apache Spark, Artificial Intelligence (AI), Automation, Bash Scripting, Best Practices, Cloud Computing, Computer Science, Continuous Improvement, Data Management, Data Recovery, Data Sets, Data Storage, Disaster Recovery, Docker, Enterprise Applications, Federal Government, Government, Government Requirements, Hybrid Cloud, Identify Issues, Incident Management, Information Technology & Information Systems, Integrated Circuits (ICs), Java, Legal, Local Government, Machine Tool, Microsoft C# (C Sharp), Microsoft Exchange Server, Microsoft Product Family, Microsoft SharePoint, Microsoft Windows Azure, Network Architecture/Engineering, On Call, Power Amplifier, Productivity Management, Programming Languages, Project/Program Management, Protective Services, Python Programming/Scripting Language, Regulatory Requirements, Reliability Engineering, Requirements Management, Scripting (Scripting Languages), Security Clearance, Sensitive Compartmented Information (SCI), Single Scope Background Investigation (SSBI), Skype, Software Engineering, Strategic Planning, Systems Administration/Management, Team Lead/Manager, Technical Writing, Telemetry, Top Secret Clearance, Transformation Tools, United States Citizen, Windows PowerShell
LOCATION
Reston, VA
POSTED
14 days ago

Overview

We are seeking a Senior Site Reliability Engineer to lead a team that builds and operates Microsoft CISO security engineering services in highly regulated environments, including U.S. Government Cloud deployments. In this space, success requires both operational rigor and strong software engineering fundamentals, maintainable code, extendable design, robust telemetry, and disciplined lifecycle practices that make reliability a built-in feature.

This role is rooted in software engineering as a reliability lever. You will work with teams that deliver production code, automation, and self-healing capabilities, and partner with feature engineering teams to bake in reliability, diagnosability, security, and compliance from design through operations. You will help operate and evolve large-scale enterprise applications, and multi-petabyte data platforms where availability, resilience, and uptime are mission critical. You will amplify impact by developing engineers, setting up reliability strategies, and influencing how services are built and run across organizational boundaries.

Responsibilities

Responsibilities:

  • Write secure, high-quality code that is maintainable, scalable, and performant.

  • Architect, implement, and optimize hybrid and cloud infrastructure using Infrastructure as Code (e.g., Containers, Bicep, Terraform, AKS etc.) to improve availability, scale, security, and operational efficiency.

  • Design and implement data governance, storage, backup, and disaster recovery for a multi-petabyte Azure environment, ensuring integrity, security, and performance.

  • Build and operate large-scale data pipelines and data transformations to support analytics, governance, and operational needs.

  • Evaluate emerging engineering tools and practices and incorporate them into the roadmap to continuously improve efficiency, reliability, and scale.

  • Deliver automation to improve service health, manageability, reliability, telemetry, and alerting, with a focus on resiliency.

  • Create and maintain clear technical documentation and design specifications aligned with best practices.

  • Partner with engineering, project management, and operations to evolve services and optimize infrastructure in support of organizational goals.

  • Participate in an on-call rotation to operate live services; troubleshoot and mitigate complex issues, escalate as needed, and write post-incident reviews to share learnings.

  • Identify opportunities for automation using scripts, pipelines, policy‑driven guardrails, or AI‑enabled tooling to reduce manual toil and increase engineering productivity.

Qualifications

Required/minimum qualifications:

Masters Degree in Computer Science, Information Technology, or related field AND 2+ years technical experience in software engineering, network engineering, or systems administration OR Bachelors Degree in Computer Science, Information Technology, or related field AND 4+ years technical experience in software engineering, network engineering, or systems administration OR equivalent experience.

Other requirements:

Security Clearance Requirements: Candidates must be able to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings:

  • The successful candidate must have an active U.S. Government Top Secret Clearance with access to Sensitive Compartmented Information (SCI) based on a Single Scope Background Investigation (SSBI) with Polygraph. Ability to meet Microsoft, customer and/or government security screening requirements are required pre-offer and post-hire for this role. Failure to maintain or obtain the appropriate U.S. Government clearance and/or customer screening requirements may result in employment action up to and including termination.
  • Clearance Verification: This position requires successful verification of the stated security clearance to meet federal government customer requirements. You will be asked to provide clearance verification information prior to an offer of employment.
  • Citizenship & Citizenship Verification: This position requires verification of U.S. citizenship due to citizenship-based legal restrictions. Specifically, this position supports United States federal, state, and/or local United States government agency customer and is subject to certain citizenship-based restrictions where required or permitted by applicable law. To meet this legal requirement, citizenship will be verified via a valid passport, or other approved documents, or verified US government Clearance.
  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Additional or preferred qualifications:

Doctorate Degree in Computer Science, Information Technology, or related field AND 3+ years technical experience in software engineering, network engineering, or systems administration OR Masters Degree in Computer Science, Information Technology, or related field AND 6+ years technical experience in software engineering, network engineering, or systems administration OR Bachelors Degree in Computer Science, Information Technology, or related field AND 8+ years technical experience in software engineering, network engineering, or systems administration OR equivalent experience.

  • 4+ years of experience building, deploying, and operating containerized applications and infrastructure as code (e.g., Docker, Kubernetes, Azure Container Apps/AKS/ACI, Terraform, Azure Bicep, ARM templates).

  • 4+ years of experience writing and maintaining scripts for deployment, orchestration, and automation (e.g., PowerShell, Python, Bash).

  • Experience working with large datasets, data pipelines, and data transformation patterns (batch and/or streaming).

  • Experience with one or more major cloud platforms (Azure, AWS, or Google Cloud).

  • Hands-on experience with Azure services and infrastructure (e.g., ARM templates, IaaS, VMs, Key Vault, Event Hubs, Synapse, Spark/Hadoop), or equivalent services in AWS or Google Cloud.

  • Familiarity with data pipeline and transformation tooling (e.g., Spark, Hadoop) and operating at scale.

  • Familiarity with large-scale Microsoft enterprise services (e.g., Microsoft 365: Exchange, SharePoint, Skype, Teams).

  • Familiarity with petabyte-scale datasets and building reliable data pipelines and transformations that support mission-critical services.

  • Proficiency in at least one programming language (e.g., C# or Java) and scripting languages such as PowerShell, Bash, and Python.

Site Reliability Engineering IC4 - The typical base pay range for this role across the U.S. is USD $119,800 - $234,700 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $160,200 - $261,000 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:

https://careers.microsoft.com/us/en/us-corporate-pay

This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.

About the Company

M

Microsoft Corp

DO WHAT YOU LOVE
Make your mark on the world’s most used technologies. Develop the next hit mobile application. Pioneer a startup that could be the next big thing. At Microsoft, you choose your path.

Headquartered in Redmond, Washington, Microsoft is a top innovator in both the consumer and enterprise technology industry. Just a few of the many things our products do are unleash creativity, connect businesses, and make learning more fun. But our continued success is based on one thing: our employees. We hire amazing, talented people and give them the opportunities—and the tools—to succeed.

WHY MICROSOFT?
As a Microsoft employee, you’re surrounded by a diverse group of the smartest people in your field. This fosters new ideas, better business results, and creates a dynamic work environment. In the office, you’re constantly challenged and supported by your colleagues. Every day holds something new and exciting.

We also offer unparalleled depth and breadth of career opportunities. As an industry leader in multiple fields, working for Microsoft means being able to do whatever you feel passionate about—and being able to make an impact in that field. From day one, we give our employees significant responsibility. This means that you’ll know that you directly contributed to something that has a positive impact on people worldwide. Whether you choose to work in management, dive deep into the newest technology, or explore multiple professions, you’ll find everything you need at Microsoft to drive your career—and to make a difference.

WE GET IT – YOU’RE MORE THAN YOUR JOB
Everyone works differently and is motivated by different things. We also understand that there’s more to you than your job. That’s why we offer competitive pay and a wide assortment of benefits-- to help you make the most of life at work and away from it.

GET THE BALL ROLLING
COMPANY SIZE
10,000 employees or more
INDUSTRY
Computer Software
FOUNDED
1975
WEBSITE
http://www.microsoft.com