Senior SRE Engineer

Blue Ribbon Global Technologies

Washington, DC

JOB DETAILS
SKILLS
Agile Programming Methodologies, Amazon Web Services (AWS), Analysis Skills, Artificial Intelligence (AI), Automation, Background Investigation, Budgeting, Building Codes, Cloud Computing, Computer Science, Continuous Deployment/Delivery, Continuous Integration, Cross-Functional, DevOps, Documentation, Enterprise Architecture, GitHub, IT Service Management (ITSM), ITIL (IT Infrastructure Library), Instrumentation, Jenkins, Linux Operating System, Machine Tool, Mentoring, Microservices, Microsoft Product Family, Microsoft Windows Azure, On Call, Performance Metrics, Production Support, Python Programming/Scripting Language, Reporting Dashboards, Root Cause Analysis, Scripting (Scripting Languages), ServiceNow, Software as a Service (SaaS), Technical Leadership, Technical/Engineering Design, Topology, User Interface/Experience (UI/UX)
LOCATION
Washington, DC
POSTED
2 days ago

We are seeking a high-caliber Senior SRE Engineer to join a premier client in Washington, DC, to spearhead the evolution of their enterprise observability platform. This is a high-impact role designed for a technical leader with nearly a decade of specialization in Dynatrace SaaS, tasked with architecting and automating large-scale monitoring solutions across complex AWS and Azure environments. You will bridge the gap between infrastructure and applications, leveraging Davis AI and Grail to drive proactive reliability, mentoring cross-functional DevOps teams, and establishing a gold standard for full-stack visibility in a mission-critical, multi-cloud landscape.

Core Responsibilities
  • Enterprise Architecture: Lead the design, governance, and rollout of Dynatrace observability for distributed microservices, serverless workloads, and multi-region cloud environments.
  • Full-Stack Optimization: Configure deep code-level visibility (PurePath), Smartscape topology mapping, and advanced APM instrumentation to ensure comprehensive system transparency.
  • AI-Driven Insights: Harness Davis AI for causal analysis and root cause identification; develop custom dashboards, alerting profiles, and auto-remediation workflows to minimize MTTR.
  • End-User Experience: Implement Real User Monitoring (RUM) and Synthetic Monitoring to analyze user journeys and establish performance KPIs.
  • Automation & DevOps: Drive "Observability as Code" by building CI/CD pipelines (GitHub Actions, Jenkins) and automating infrastructure via Terraform, CloudFormation, or AWS CDK.
  • Log Management: Manage high-volume log ingest pipelines and processing rules using Dynatrace Grail and Log Management features.
  • SRE Advocacy: Define and monitor SLIs, SLOs, and error budgets while participating in on-call rotations and developing detailed RCA documentation.
Qualifications
  • Extensive Expertise: 9+ years of hands-on experience specifically focused on Dynatrace implementation and management at an enterprise scale.
  • Foundational Experience: 5+ years in SRE, DevOps, or Cloud Infrastructure roles, with deep knowledge of Linux systems and networking.
  • Cloud Proficiency: Advanced experience navigating and securing AWS and Azure environments.
  • Automation Skills: Strong proficiency in Python or similar scripting languages for building self-service tooling and automation.
  • Tooling Integration: Proven ability to integrate observability stacks with ITSM and communication tools like ServiceNow, PagerDuty, and Microsoft Teams.
  • Methodology: Experience working within a SAFe Agile delivery environment and a solid understanding of the ITIL framework.
  • Education: Bachelor's degree in Computer Science, Engineering, or a related technical field.
  • Location/Flexibility: Ability to work on-site in the Washington, DC area as required and provide off-hours support for critical production incidents.

Required Skills :

Basic Qualification :

Additional Skills :

Background Check : No

Drug Screen : No

About the Company

B

Blue Ribbon Global Technologies