Production Engineer

TPI Global (formerly Tech Providers, Inc.)

Plano, TX

JOB DETAILS
SKILLS
ABAP Programming Language, AWS Lambda, Access Authorization, Access Control, Accounting, Amazon Web Services (AWS), Apache, Application Programming Interface (API), Architectural Services, Automation, Autoscaling, Best Practices, Capacity Management, Change Control, Change Order Management, Cloud Computing, Continuous Deployment/Delivery, Continuous Integration, Cost Control, Cross-Functional, Cryptography, Data Management, Data Processing, DevOps, Disaster Recovery, Documentation, Failover, Finance, Functional Testing, Git, Help Desk, High Availability, Identify Issues, Incident Management, Incident Response, Java, JavaScript Frameworks, Jenkins, Microservices, Microsoft Windows Azure, OAuth, Operations Processes, Oracle, Oracle Database, Oracle Enterprise Manager, Oracle PL-SQL, Problem Solving Skills, Production Support, Production Systems, Python Programming/Scripting Language, Quality Assurance, REST (Representational State Transfer), Radiography, Reliability Engineering, Resource Utilization, Right-Sizing, Root Cause Analysis, SAP, SAP Administration, SAP FICO, SQL (Structured Query Language), Scripting (Scripting Languages), Service Level Agreement (SLA), Simple Queue Service (SQS), Snowflake Schema, Software Engineering, Software Patches, Stored Procedures, Support Documentation, System Architecture, System Validation, Systems Analysis, Systems Maintenance, Systems Reliability, Systems Scalability, Technical/Engineering Design, Test Automation, Testing, Warehousing
LOCATION
Plano, TX
POSTED
1 day ago

Description:
Production Engineer JL15
6 months, contract-to-hire
Plano, TX
Onsite


Responsibilities:
  • Apply 3-4 years of experience in production engineering and site reliability engineering (SRE) to design, implement, and maintain highly available, scalable, and resilient systems.
  • Own end-to-end operational responsibilities, including monitoring, incident response, root cause analysis, capacity planning, and automation, to ensure optimal system performance and reliability in production environments.
  • Collaborate cross-functionally with development, QA, and infrastructure teams to streamline CI/CD pipelines, automate deployments, and enforce best practices for security, compliance, and disaster recovery.
  • Use a broad set of tools and technologies to proactively detect, troubleshoot, and resolve production issues, minimizing downtime and improving performance against service-level objectives (SLOs) and service-level agreements (SLAs).

Requirements:
Snowflake Developer, Oracle, Python
  • Develop, maintain, and optimize data pipelines and workflows using Snowflake and Oracle databases to ensure reliable data availability in production.
  • Write and optimize advanced SQL and PL/SQL queries and stored procedures for efficient data processing and transformation.
  • Automate data ingestion, validation, and monitoring tasks using Python scripting and orchestration tools like Apache Airflow or Prefect.
  • Monitor database health, query performance, and resource utilization using Snowflake Resource Monitors, Oracle Enterprise Manager, and cloud monitoring tools.
  • Troubleshoot and resolve production incidents related to data inconsistencies, pipeline failures, or performance degradation.
  • Implement security best practices including role-based access control, data masking, and encryption in Snowflake and Oracle environments.
  • Collaborate with DevOps teams to integrate database changes into CI/CD pipelines using Git, Jenkins, or Azure DevOps.
  • Perform root cause analysis for recurring issues and implement automation to reduce manual intervention.
  • Manage cloud resource costs by tuning Snowflake warehouse sizes and Oracle instance configurations.
  • Document operational procedures, runbooks, and system architecture for knowledge sharing and compliance.

Requirements:
Java, JavaScript, Cloud-based Microservices, Spring Boot, AWS
  • Build, deploy, and maintain cloud-native microservices using Java, Spring Boot, and JavaScript frameworks, ensuring high availability and scalability.
  • Design and implement RESTful APIs and event-driven architectures using AWS services such as Lambda, ECS/EKS, SQS, and SNS.
  • Develop and maintain CI/CD pipelines with Jenkins, GitLab CI, or AWS CodePipeline for automated testing and deployment.
  • Monitor application and infrastructure health using AWS CloudWatch, Prometheus, Grafana, and distributed tracing tools like Jaeger or AWS X-Ray.
  • Troubleshoot production issues, perform root cause analysis, and implement fixes to improve system reliability.
  • Implement security controls including IAM roles, OAuth2, JWT, and encryption for data in transit and at rest.
  • Collaborate with cross-functional teams to design fault-tolerant, resilient systems with automated failover and recovery.
  • Optimize cloud resource usage and cost through rightsizing and autoscaling configurations.
  • Automate operational tasks and incident response using scripting and infrastructure as code (Terraform, CloudFormation).
  • Maintain detailed documentation of system architecture, deployment processes, and operational runbooks.

SAP Finance and Accounting Techno-Functional
  • Provide production support and incident management for SAP FI/CO modules, ensuring minimal downtime and business continuity.
  • Analyze and troubleshoot system issues related to configuration, custom code (ABAP), and interfaces with external systems.
  • Use SAP Solution Manager for incident management, change requests, transport management, and deployments across development, QA, and production landscapes, ensuring smooth coordination and control of SAP system changes.
  • Monitor batch jobs, system performance, and error logs using SAP CCMS and transaction codes such as ST22 (ABAP dump analysis).
  • Automate routine operational tasks and workflows using SAP Business Workflow and background job scheduling.
  • Collaborate with functional teams to validate system changes and support end-user issue resolution.
  • Participate in SAP upgrades, patches, and integration projects, ensuring smooth transitions and minimal impact.
  • Implement and maintain security roles, authorizations, and compliance controls within SAP.
  • Document support procedures, configuration changes, and troubleshooting guides.
  • Use monitoring and alerting tools to proactively detect and resolve production issues.

About the Company

TPI Global (formerly Tech Providers, Inc.)