Expert Site Reliability Engineer

Altera Digital Health Inc

MO(remote)

JOB DETAILS
SALARY
$95,000–$110,000 Per Year
SKILLS
ARM (Advanced RISC Machine), Analysis Skills, Automation, Budgeting, Cloud Computing, Clustering Software, Computer Science, Continuous Deployment/Delivery, Continuous Improvement, Continuous Integration, Customer Experience, DNS (Domain Name System), DevOps, Enterprise Applications, Firewalls, GitHub, Healthcare, Healthcare Providers, Hybrid Cloud, IT Service Management (ITSM), ITIL (IT Infrastructure Library), Identify Issues, Incident Management, Incident Response, Information Technology & Information Systems, Load Balancing, Microsoft .NET, Microsoft IIS Web Server (Internet Information Services), Microsoft Message Queue (MSMQ), Microsoft Product Family, Microsoft SQL Server, Microsoft Windows Azure, Microsoft Windows Operating System, Microsoft Windows Server, Microsoft Windows System Administration, On Call, Operational Improvement, Patient Care, Performance Tuning/Optimization, Problem Solving Skills, Production Systems, Python Programming/Scripting Language, Query Optimization, Reliability Engineering, Root Cause Analysis, Scripting (Scripting Languages), ServiceNow, Software Administration, Software Engineering, State Laws and Regulations, Systems Administration/Management, Systems Engineering, TCP/IP (Transmission Control Protocol/Internet Protocol), Technical Leadership, Windows PowerShell
LOCATION
MO
POSTED
2 days ago

Site Reliability Engineer (SRE) - Remote

Overview

As a Site Reliability Engineer (SRE) at Altera, you will be responsible for ensuring the reliability, scalability, and performance of our hosted healthcare platforms. This role blends software and systems engineering to enhance service availability, automate operations, and improve the customer experience. You will act as a technical leader in monitoring, troubleshooting, incident response, and continuous improvement across our cloud and hybrid environments.

Key Responsibilities

  • Maintain and improve the reliability, availability, and performance of our production environments.
  • Lead the investigation and resolution of complex application, database, and infrastructure issues.
  • Participate in incident management, conduct root cause analysis (RCA), and contribute to post-incident reviews to prevent future occurrences.
  • Define and measure Service Level Indicators (SLIs) and Objectives (SLOs) to meet our service commitments.
  • Develop proactive monitoring and alerting strategies to identify and resolve issues before they impact customers.
  • Automate operational tasks using scripting and Infrastructure-as-Code (IaC) to improve efficiency.
  • Partner with engineering and cloud teams to refine deployment, monitoring, and support processes.
  • Provide technical leadership during major incidents and act as a key escalation point for critical issues.

Qualifications

Experience:

  • 7+ years of experience supporting enterprise applications, infrastructure, or cloud environments.
  • Monitoring & Observability: Strong experience with APM tools such as LogicMonitor, AppDynamics, Azure Monitor, SentryOne, Dynatrace, Datadog, or New Relic.
  • Microsoft Stack: Deep knowledge of Windows Server administration, IIS, .NET applications, Windows Clustering, MSMQ, Event Logs, and PerfMon.
  • Database Skills: Strong SQL Server experience, including performance tuning, query optimization, blocking analysis, and Always On Availability Groups.
  • Cloud & Networking: Experience with Azure cloud environments and a solid understanding of networking fundamentals (DNS, TCP/IP, load balancing, firewalls).
  • ITSM & ITIL: Familiarity with ServiceNow (or other ITSM platforms) and ITIL principles.

Preferred Skills:

  • Scripting with PowerShell, Python, or similar languages.
  • Infrastructure as Code (Terraform, ARM Templates, Bicep).
  • CI/CD pipelines and deployment automation (Azure DevOps, GitHub Actions).
  • Experience with Kubernetes and containerized workloads.
  • Experience implementing SLOs, SLIs, and Error Budgets.
  • Experience in a healthcare technology or patient care environment.

Education:

  • Bachelor's Degree in Computer Science, Information Technology, or Engineering is preferred; equivalent professional experience will be considered.

Working Arrangements

  • This is a remote position open to candidates within the United States.
  • You will participate in an on-call rotation to support our 24x7 healthcare environment.
  • Occasional after-hours work is required for activations, upgrades, and major incidents.

Travel

  • Travel is not a requirement for this role.

Our company complies with all local/state regulations in regard to displaying salary ranges. If required, the salary range(s) are displayed below and are specifically for those potential hires who will perform work in or reside in the location(s) listed, if selected for the role. Any offered salary is determined based on internal equity, internal salary ranges, market data, ranges, applicant's skills and prior relevant experience, certain degrees and certifications (e.g. JD, technology), for example.

Salary Range

$95,000-$110,000

Why Altera?

At Altera Digital Health, you will have the opportunity to profoundly impact the lives of patients by empowering healthcare providers to deliver superior care. You will join a passionate and gifted team committed to innovation and excellence. We offer a competitive compensation and benefits package and the opportunity to work in a fast-paced and dynamic environment.

About the Company

A

Altera Digital Health Inc