Data Engineer with DevOps

System One

Dallas, TX

JOB DETAILS
SALARY
$120,000–$120,000 Per Hour
SKILLS
Agile Programming Methodologies, Amazon Simple Storage Service (S3), Apache, Apache Hadoop, Apache Hive, Apache Spark, Bash Scripting, Big Data, Communication Skills, Continuous Deployment/Delivery, Continuous Integration, Cross-Functional, Data Analysis, Data Management, Data Modeling, Data Processing, Data Quality, Data Sets, DevOps, Docker, Documentation, Elasticsearch, Error Handling, GitHub, Gradle, HDFS (Hadoop Distributed File System), Jenkins, Linux Operating System, Maven, Metadata, Microsoft SQL Server, Microsoft Windows Azure, Model Review, Model Validation, Oracle, Oracle Database, Performance Tuning/Optimization, Process Modeling, Python Programming/Scripting Language, Risk, SQL (Structured Query Language), SQLite, Scalable System Development, Scripting (Scripting Languages), Structured Data, Team Player, Unix Shell Programming, Validation Testing, Windows PowerShell
LOCATION
Dallas, TX
POSTED
2 days ago

Job Title: Senior Data Engineer with DevOps

Duration: Full Time
Location: Pittsburgh, PA; Cleveland, OH; or Dallas, TX

Work Mode: 5 Days Onsite
Years of Experience: 8+ Years

Job Description:
We are seeking a Data Engineer with 5 years of experience to design and maintain scalable data pipelines supporting analytics, reporting, and operational needs. The role involves collaborating with cross-functional teams to ensure that data aligns with business requirements and enterprise standards.

Duties and Responsibilities:

  • Design and build scalable data pipelines aligned with business needs
  • Process large datasets (batch and, at times, near-real-time)
  • Ensure data quality, consistency, and governance standards across systems
  • Support data integration and transformation efforts for analytics and reporting platforms
  • Maintain data dictionaries, metadata, and documentation
  • Participate in data architecture reviews and model validation processes
  • Support analytics, reporting, and risk platforms

Required Qualifications
  • 5+ years of experience in data engineering and big data processing
  • Strong expertise in Apache Spark (Spark Core, Spark SQL) and PySpark for large-scale batch processing
  • Experience working with structured and semi-structured data, including complex transformations and performance tuning
  • Proficiency in data ingestion and integration from sources such as Oracle, SQL Server, Hive, HDFS, and S3, and in transforming data into curated data models
  • Experience writing data to Hive tables, Data Lakes (Iceberg), and downstream reporting systems
  • Strong knowledge of SQL and data modeling concepts
  • Hands-on experience with Apache Airflow for workflow orchestration (DAG design, scheduling expectations, monitoring)
  • Proficiency in shell scripting for job automation, file validation, dependency handling, and logging: triggering Spark jobs, performing file checks and validation, archiving and purging data, and managing job dependencies, logging, and error handling
  • Strong understanding of batch processing and batch job scheduling frameworks
  • Experience migrating from CA7/Control-M to Airflow (daily, hourly, weekly schedules)
  • CI/CD for data pipelines
  • Fundamentals in Linux and Networking
  • Docker, OpenShift Container Platform (OCP) containerization, and Kubernetes
  • Knowledge of CI/CD pipeline tools such as Jenkins, GitHub Actions, Azure DevOps, GitLab CI, Maven, and Gradle
  • Automate operational tasks using Python, Bash/Shell, and PowerShell
  • Implement monitoring and alerting (e.g., Application Insights) and enable centralized logging with tools such as ELK
  • Experience ensuring data quality, reliability, and compliance in regulated environments
  • Good communication, documentation, and collaboration skills in cross-functional teams
  • Agile/SAFe methodologies
Must Have Skills
  • Airflow
  • Containerization
  • DevOps
  • Elastic Stack & Elasticsearch
  • GitHub
  • Hadoop Hive
  • Jenkins
  • JSON Web Token (JWT)
  • Kubernetes
  • OpenShift
  • Oracle
  • Python
  • Shell Script
  • SQLite


Ref: #404-IT Pittsburgh


About the Company

System One

Every day, System One focuses on services and solutions that require a high degree of specialization, in-demand technical skills, and large-scale operational expertise. We are essential partners to those on the front lines of our nation’s most critical infrastructure, technology, and life sciences initiatives. 

Founded more than 40 years ago as a staffing partner to the engineering industry, today System One is a diversified organization operating in over 50 locations and putting more than 9,000 people to work in the United States, Canada, and the United Kingdom.

COMPANY SIZE
2,500 to 4,999 employees
INDUSTRY
Staffing/Employment Agencies
WEBSITE
https://systemone.com