Solutions Architect SRE (DYNATRACE)

Soft source inc

Troy, DC

JOB DETAILS
SKILLS
Access Control, Agile Programming Methodologies, Amazon Web Services (AWS), Analysis Skills, Artificial Intelligence (AI), Automation, Best Practices, Cloud Applications, Cloud Computing, Communication Skills, Computer Science, Computer Security, Continuous Deployment/Delivery, Continuous Integration, Customer/Client Research, DevOps, Digital Certificates, Documentation, GitHub, HTTP (HyperText Transport Protocol), IT Service Management (ITSM), ITIL (IT Infrastructure Library), Identify Issues, Incident Management, Incident Response, Instrumentation, Internet Application, Jenkins, Linux Operating System, Metadata, Metrics, Microservices, Microsoft Windows Azure, Mobile Applications, On Call, Performance Analysis, Performance Metrics, Presentation/Verbal Skills, Python Programming/Scripting Language, Reporting Dashboards, Root Cause Analysis, Sales Pipeline, Scripting (Scripting Languages), ServiceNow, Software as a Service (SaaS), Topology, User Interface/Experience (UI/UX), Web Browsers, Web Client Plug-ins, Writing Skills
LOCATION
Troy, DC
POSTED
1 day ago
We are seeking a highly experienced Senior Observability Engineer with deep, hands?on expertise in Dynatrace SaaS to lead enterprise?scale observability deployments across AWS and Azure. This role will drive the design, automation, and rollout of Dynatrace capabilities for large, complex environments while partnering closely with DevOps, SRE, Cloud, and Application teams. The ideal candidate has 9+ years of Dynatrace implementation experience, strong DevOps and automation skills, and the ability to mentor engineering teams.

Dynatrace Expertise
  • Lead large enterprise-scale deployments of Dynatrace observability across distributed microservices, serverless workloads, and multi?region multi-cloud environments.
  • Maintain Dynatrace governance and best practices, support multi-tenants, fine grained access controls, and logical segmentation of teams, apps, and environments.
  • Configure and optimize APM instrumentation, Deep code?level visibility, PurePath distributed tracing, Smartscape topology mapping, and other advanced Dynatrace features to ensure full?stack observability.
  • Build and maintain custom dashboards, management zones, tagging rules and entity metadata strategies.
  • Develop and tune alerting profiles, anomaly detection rules, Davis AI configurations, and auto-remediation workflows.
  • Leverage Davis AI to automatically identify Root Cause using causal analysis, correlate metrics, logs, traces, and events to reduce noise and eliminate false positives.
  • Build HTTP, and Browser Synthetic Monitoring and performance baselines.
  • Configure Real User Monitoring (RUM) for web and mobile applications, including User journey analysis, User experience insights, and performance KPIs.
  • Implement and manage log ingest pipelines, log processing rules, retention policies, and Dynatrace Grail/Log Management features
  • Integrate with GitHub Actions, Jenkins, ServiceNow, PagerDuty, and Teams
  • Build OTel integrations and custom plugins.
DevOps Automation
  • Implement CI/CD pipelines using tools such as GitHub Actions, AWS CodePipeline, and Jenkins.
  • Automate infrastructure provisioning through Infrastructure-as-Code (IaC) using Terraform, CloudFormation, or AWS CDK.
  • Develop self-service automation tools using Python or other scripting languages.
Incident Management & Response
  • Proficient in ITIL framework and ITSM tools such as ServiceNow.
  • Production on-call responder with strong troubleshooting capabilities.
  • Develop RCA documentation, and Knowledge articles
  • Apply SRE principles, including SLIs, SLOs, and error budgets.
Security & Compliance Implementation
  • Manage service accounts and access permissions
  • Create, deploy, and manage digital certificates.
  • Respond to security incidents and execute remediation tasks effectively.
Education & Additional Experience
  • Bachelor's degree in Computer Science, Engineering, or related field
  • 9+ years of Dynatrace implementation experience,
  • 5+ years of experience in DevOps, SRE, or infrastructure roles
  • Knowledge of Linux systems and networking.
  • Working in a SAFe Agile delivery environment.
  • Excellent written and verbal communication skills.
  • Demonstrated ability to work independently and manage priorities.
  • Availability to work outside of standard business hours as required.

About the Company

S

Soft source inc