Cloud Computing, Cost Effectiveness Analysis, Data Analysis, Data Management, Data Processing, Data Quality, Data Warehousing, Database Extract Transform and Load (ETL), GCP (Good Clinical Practices), Performance Tuning/Optimization, Quality Monitoring, SQL (Structured Query Language), Scala Programming Language, Scalable System Development, Streaming Technology
Job Title: Data Engineer
Location: Bentonville, Arkansas (onsite)
# of Positions:7 ( total was 13 but they closed 6)
We are seeking a Data Engineer with Spark & Streaming skills builds real-time, scalable data pipelines using tools like Spark, Kafka, and cloud services (GCP) to ingest, transform, and deliver data for analytics and ML.
Responsibilities:
Design, develop, and maintain ETL/ELT data pipelines for batch and real-time data ingestion, transformation, and loading using Spark (PySpark/Scala) and streaming technologies (Kafka, Flink).
Build and optimize scalable data architectures, including data lakes, data warehouses (BigQuery), and streaming platforms.
Performance Tuning: Optimize Spark jobs, SQL queries, and data processing workflows for speed, efficiency, and cost-effectiveness
Data Quality: Implement data quality checks, monitoring, and alerting systems to ensure data accuracy and consistency.