Senior Datadog Engineer (NO C2C)

inSync Staffing

Washington, DC




Citizenship Requirement: U.S. Citizen Only (No Dual Citizenship)

Location: SEC Headquarters, Washington, DC 20549
Position Type: Onsite (Government Site, No Telework)
Contract Duration: 6 Months (Extension Possible)

Clearance Requirement: Public Trust

Role Summary

We are seeking a Senior Datadog Cloud Engineer to provide technical leadership for enterprise observability, application performance monitoring (APM), distributed tracing, and cloud monitoring supporting the SEC Infrastructure Support Services (ISS) contract. This hands-on engineering role is responsible for designing, implementing, and optimizing an enterprise observability platform across hybrid cloud and containerized environments while improving system reliability, availability, and operational performance.

The ideal candidate will possess deep expertise in Datadog (or equivalent observability platforms), Azure and AWS cloud monitoring, Kubernetes/OpenShift, performance engineering, and enterprise-scale operational analytics.

Key Responsibilities

Observability Platform Engineering

  • Engineer, administer, and optimize enterprise observability platforms including:
    • Datadog (preferred)
    • Dynatrace
    • New Relic
    • Splunk Observability
    • Grafana/Prometheus
  • Design and maintain:
    • Dashboards
    • APM
    • Distributed tracing
    • Log pipelines
    • RUM (Real User Monitoring)
    • Synthetic monitoring
    • Network Performance Monitoring
  • Build and maintain:
    • SLOs
    • SLIs
    • Alerting policies
    • Monitoring standards
  • Instrument applications and infrastructure using:
    • OpenTelemetry
    • Language-specific tracers
    • W3C Trace Context propagation
    • Unified service tagging
  • Develop integrations between observability platforms and:
    • ServiceNow
    • CI/CD pipelines
    • On-call notification systems
  • Define and enforce enterprise telemetry tagging standards

Cloud & Container Monitoring

  • Design monitoring solutions for:
    • Microsoft Azure
    • Amazon Web Services (AWS)
  • Monitor:
    • PaaS services
    • Serverless workloads
    • Networking
    • Identity services
    • Managed databases
  • Engineer monitoring for:
    • AWS RDS
    • Aurora
    • Azure SQL
    • PostgreSQL
    • MySQL
    • SQL Server
    • Oracle
    • DynamoDB
    • Cosmos DB
    • ElastiCache/Redis
  • Design observability solutions for:
    • OpenShift
    • Kubernetes
    • Service Mesh
    • Cluster health
    • Workload performance
  • Develop reusable Infrastructure-as-Code monitoring modules

Performance Engineering

  • Lead performance investigations involving:
    • Latency
    • Reliability
    • Capacity
    • Resource utilization
  • Analyze:
    • Distributed traces
    • Dependency maps
    • Code profiling
    • Database query performance
    • Exception tracking
    • RUM correlations
  • Partner with engineering teams to implement performance improvements
  • Develop trace-based deployment tracking and change correlation processes
  • Provide technical leadership during production incidents and root cause analysis

Capacity & Reliability Engineering

  • Analyze operational telemetry and performance trends
  • Develop capacity planning dashboards and executive reports
  • Define:
    • Capacity thresholds
    • Alert baselines
    • Scaling recommendations
  • Improve:
    • Monitoring coverage
    • Alert quality
    • Operational maturity
    • SLA/KPI compliance

Required Technical Skills

  • Minimum 8 years of IT infrastructure or platform engineering experience
  • Minimum 5 years focused on:
    • Observability
    • Performance Engineering
    • Site Reliability Engineering (SRE)
  • Hands-on experience with enterprise observability platforms including:
    • Datadog (strongly preferred)
    • Dynatrace
    • New Relic
    • Splunk Observability
    • Grafana/Prometheus
  • Expertise with:
    • Application Performance Monitoring (APM)
    • Distributed tracing
    • OpenTelemetry
    • Continuous profiling
    • Service dependency mapping
  • Experience instrumenting applications using:
    • Java
    • .NET
    • Python
    • Node.js
    • Go
  • Experience with:
    • Microsoft Azure
    • Amazon Web Services (AWS)
    • Hybrid cloud environments
  • Experience monitoring:
    • Kubernetes
    • OpenShift
    • Container platforms
  • Experience with:
    • SQL performance tuning
    • Database monitoring
    • Query analytics
    • Execution plans
  • Experience integrating observability with:
    • ServiceNow
    • CI/CD pipelines
    • DevOps workflows
  • Experience with Infrastructure-as-Code using:
    • Terraform
    • ARM Templates
    • Bicep
  • Strong troubleshooting, analytical, and performance optimization skills

Preferred / Nice-to-Have Skills

  • Experience supporting federal government environments
  • Experience leading enterprise observability strategy
  • Experience with enterprise operational dashboards and executive reporting
  • Experience supporting 24x7x365 production environments
  • Advanced knowledge of cloud-native monitoring and telemetry governance

Qualifications & Experience

  • Bachelor's degree in:
    • Computer Science
    • Information Technology
    • Engineering
    • Related technical field
  • Minimum 8 years of relevant infrastructure/platform engineering experience
  • Minimum 5 years specializing in observability, SRE, or performance engineering
  • Ability to obtain and maintain a Public Trust clearance
  • U.S. Citizenship required (No Dual Citizenship)

About the Team / Company

This role supports the SEC Infrastructure Support Services (ISS) contract, providing senior technical leadership for enterprise observability across hybrid cloud, containerized, and on-premises environments. The engineer will collaborate with cloud, platform, and application teams to improve system reliability, operational visibility, and performance while supporting mission-critical SEC infrastructure.



Benefits (employee contribution):
  • Health insurance
  • Health savings account
  • Dental insurance
  • Vision insurance
  • Flexible spending accounts
  • Life insurance
  • Retirement plan

All qualified applicants will receive consideration for employment without regard to age, race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran. Rate of pay within the stated range will depend on the applicant's qualifications.

About the Company

inSync Staffing

We recognize that the VMS program management team is our customer and needs to be served with integrity, so we built, and continue to improve, our delivery methods as we strive to provide the highest-quality service possible. Ten years ago, inSync Staffing’s management team recognized the inevitable changes that technology was bringing to the staffing industry, along with the growing trend of Fortune 1000 corporations outsourcing management of their contingent workforces to meet compliance and cost-control goals. Rather than swim upstream against these changes, inSync Staffing has embraced MSP and VMS programs as our customers, not competitors. We asked program managers how they want to be served. As a result of their input, we have structured inSync Staffing as a recruiting and customer service organization, unlike traditional staffing companies that sell directly to the end client. Our delivery model allows us to concentrate our resources on how best to supply candidates in a very competitive MSP/VMS program environment.
COMPANY SIZE
50 to 99 employees
INDUSTRY
Staffing/Employment Agencies
FOUNDED
2014
WEBSITE
http://www.insyncstaffing.com/default.html