Job Role – DevOps / Observability Engineer
Location – Remote
Salary – $110k/yr + 5% org bonus + 5% performance bonus
Experience range – 8-12 years
Must-have skills: DevOps - AWS (Strong), New Relic, PagerDuty
Job Description:
DevOps / Observability Engineer (New Relic & PagerDuty)
We are seeking a highly skilled DevOps / Observability Engineer with strong expertise in New Relic telemetry and monitoring and in PagerDuty alert management to support and optimize the observability ecosystem for the TRAIT (Technician Reporting and Information Tool) application.
The ideal candidate will have hands-on experience in DevOps operations, application monitoring, incident management, and cloud/platform observability.
The role will focus on improving telemetry visibility, optimizing dashboards, reducing alert noise, and enhancing operational reliability.
Key Responsibilities
Analyze and optimize existing New Relic dashboards, telemetry, and monitoring setup for the TRAIT application.
Review and refine PagerDuty alert triggers, escalation policies, and incident workflows to ensure only actionable events generate alerts.
Identify obsolete dashboards, alerts, and monitoring components, and retire or update them based on current operational requirements.
Support DevOps operational activities including monitoring production environments, incident response, root cause analysis, and reliability improvements.
Collaborate with development, infrastructure, and support teams to improve application observability and operational health.
Assist in automation and monitoring integration within CI/CD and cloud environments.
Recommend and implement observability best practices for logging, metrics, tracing, and alerting.
Must-Have Skills
Strong hands-on experience with New Relic including APM, telemetry, dashboards, alerting, and observability optimization.
Experience in configuring and managing PagerDuty alerts, on-call workflows, escalation policies, and incident management processes.
Solid experience in DevOps/SRE operations, including production monitoring, troubleshooting, and operational support.
Experience with cloud and DevOps tools such as Amazon Web Services / Microsoft Azure, CI/CD pipelines, Linux, scripting, or infrastructure monitoring.