Production Support Analyst / Site Reliability Engineer (SRE)

Veterans Sourcing Group

South Jordan, UT

JOB DETAILS
SKILLS
Apache Kafka, Artificial Intelligence (AI), Automation, C++ Programming Language, Cloud Computing, Communication Skills, Consulting, Continuous Deployment/Delivery, Continuous Integration, Debugging Skills, Distributed Computing, Docker, Go Programming Language (Golang), ITIL (IT Infrastructure Library), Identify Issues, Incident Management, Java, Linux Operating System, On Call, Operating Systems, Operational Support, Problem Solving Skills, Production Control, Production Support, Production Systems, Python Programming/Scripting Language, Reliability Analysis, Reliability Engineering, Root Cause Analysis, SQL (Structured Query Language), Scala Programming Language, Scripting (Scripting Languages), ServiceNow, Splunk, Standard Operating Procedures (SOP), Systems Reliability, Unix Operating Systems
LOCATION
South Jordan, UT
POSTED
30+ days ago

Production Support Analyst / Site Reliability Engineer (SRE)
Location: South Jordan, UT (Hybrid)
12-month Contract-to-Hire
Pay rate: $53/hr W2

Interview Process:

  • 1st Round: Zoom
  • 2nd Round: Onsite

Experience & Education:

  • 2–5 years of relevant experience
  • Bachelor's Degree required

Shifts:

  • Morning: 8:00 AM – 5:00 PM
  • Evening: 12:30 PM – 8:00 AM
  • Weekend: On-call (Remote)

Key Responsibilities:

  • Monitor and support production systems across OS, applications, and network
  • Troubleshoot incidents, perform root cause analysis, and resolve live issues
  • Collaborate with Dev teams to reduce recurring issues and improve system reliability
  • Automate repetitive tasks using Python/scripting
  • Maintain SOPs and support operational readiness activities
  • Participate in on-call rotation and critical event support

Must-Have Skills:

  • Strong hands-on experience with Linux/Unix (OS-level troubleshooting)
  • Production support experience (incident management, debugging live systems)
  • Python scripting (automation-focused, not development-heavy)
  • SQL knowledge
  • Experience with ServiceNow (ticketing)
  • Understanding of ITIL principles
  • Excellent communication skills

Nice to Have:

  • Exposure to Java, Go, C++, Scala
  • Monitoring tools (Grafana, Splunk, Dynatrace, etc.)
  • Cloud experience
  • Snowflake knowledge
  • CI/CD, Kafka, Docker, or distributed systems exposure
  • Awareness of SRE concepts or Agentic AI

Role Overview:

This role supports Production Support / SRE (Site Reliability Engineering) functions, with a focus on system stability, incident resolution, automation, and improving platform reliability in a large-scale Linux environment.

About the Company

Veterans Sourcing Group