What You Will Do* Monitoring and Incident Response: Monitor for, and respond to, failure modes across diverse IT infrastructure and software platforms impacting the reliable delivery of data; Requires breadth of knowledge and critical thinking to fault isolate and resolve complex technical problems* Performance Monitoring & Capacity Planning: Monitor and optimize system performance with tools like PRTG and Tanium; establish baselines and perform capacity planning for growth and resilience* Infrastructure Lifecycle Management: Participate in lifecycle management responsibilities, including patching, inventory, configuration management and documentation* Relationship Development: Develop and maintain relationships with key customers and stakeholdersWho You Are (Basic Qualifications)* Experience supporting production applications and underlying IT infrastructure, including compute systems, virtualization platforms, and cloud environments* Experience with incident management, performing root cause analysis, and maintaining system stability in a fast-paced, production support environment* Working knowledge of operating systems (Windows and/or Linux) with the ability to troubleshoot application and system-level issues, along with a solid understanding of basic networking concepts* Hands-on experience with system monitoring and observability tools (e.g., Splunk, Grafana, or similar), including alerting, log analysis, and performance tuning to ensure application availability and reliability* Exposure to scripting or automation (e.g., PowerShell, Python, or Bash) to streamline support tasks, improve operational efficiency, and reduce manual intervention* Experience collaborating across technical teams (development, infrastructure, and operations) and effectively communicating with business stakeholders to resolve issues and support application needs* This role is not eligible for visa sponsorshipWhat Will Put You Ahead* Bachelor's degree in Information Technology, Computer Science, or a related field (or equivalent practical experience)* Familiarity with data pipelines or ETL processes, including monitoring, troubleshooting, and support of data workflows* Familiarity with Grafana, Splunk, DataDog or native platform logging aggregation and visualization tools (e.g AWS Cloudwatch)* Hands-on experience in process control or industrial environments, especially petrochemical or manufacturing sectors* Experience managing complex system, network, storage, and cloud infrastructure in 24x7x365 industrial or enterprise environmentsAt Koch companies, we are entrepreneurs. You will contribute to the monitoring, lifecycle management, configuration, and incident response of a diverse technology environment, including on-premises servers and virtualization platforms, cloud-based systems, and internally developed ETL applications, helping to ensure the reliability, performance, and availability of production data pipelines and underlying infrastructure through proactive issue identification, root cause analysis, and continuous improvement of system stability and scalability.