Data Engineer (Data Pipelines & Modeling)

Katalyst Healthcares & Life Sciences

Warrendale, PA

JOB DETAILS
SKILLS
Amazon Web Services (AWS), Apache Spark, Application Programming Interface (API), Big Data, Cloud Computing, Computer Science, Continuous Deployment/Delivery, Continuous Integration, Cross-Functional, Data Management, Data Migration, Data Modeling, Data Quality, Data Warehousing, Database Extract Transform and Load (ETL), Enterprise Data Integration, Forecasting, GCP (Google Cloud Platform), GitHub, Healthcare, Jenkins, Microsoft Azure, Python Programming/Scripting Language, SQL (Structured Query Language), Workforce Planning
LOCATION
Warrendale, PA
POSTED
30+ days ago
Responsibilities:
  • Design and implement robust data ingestion pipelines from multiple sources (APIs, databases, files, streaming systems).
  • Support C4C offline database migration, ensuring data accuracy and consistency.
  • Integrate data from enterprise systems into centralized data platforms.
  • Design and implement data models for workforce planning and service operations forecasting.
  • Develop optimized schemas for reporting and analytics.
  • Ensure data quality, integrity, and consistency across models.
Requirements:
  • Strong experience in data engineering and pipeline development.
  • Proficiency in Python and SQL.
  • Hands-on experience with Apache Spark or similar big data tools.
  • Strong understanding of ETL/ELT concepts and data warehousing.
  • Ability to work independently and in cross-functional teams.
  • Bachelor's or Master's degree in Computer Science, IT, or a related field.
Good to Have:
  • Exposure to CI/CD tools like Jenkins or GitHub Actions.
  • Knowledge of cloud platforms (AWS / Azure / GCP).
  • Experience in healthcare or regulated environments.

About the Company

Katalyst Healthcares & Life Sciences