Data Engineer - Deb

Sumeru Solutions

Frisco, TX

JOB DETAILS
SKILLS
Application Programming Interface (API), Best Practices, Business Intelligence, Cisco Unity, Cloud Computing, Communication Skills, Computer Programming, Data Analysis, Data Cleaning, Data Management, Data Processing, Data Quality, Data Sets, Data Warehousing, Dimensional Modeling, Ecosystems, Error Handling, Metadata, Microsoft Product Family, Microsoft Windows Azure, Performance Tuning/Optimization, Power BI, Production Control, Production Support, Python Programming/Scripting Language, Reconciliation, Reporting Dashboards, Requirements Management, SQL (Structured Query Language), Scalable System Development, Snowflake Schema, Software Engineering, Star Schema, Structured Data, Training Data Sets, Use Cases, Warehousing
LOCATION
Frisco, TX
POSTED
3 days ago

Job Description:

We are seeking a hands-on Data Engineer or Associate Data Engineer with strong programming skills in Python and SQL and practical experience building scalable data ingestion, transformation, and quality pipelines using Azure ecosystem.

This role will focus on building scalable ingestion pipelines into the Bronze layer, developing Silver and Gold layer patterns, with strong focus on data quality, and enabling governed semantic models through Microsoft Fabric for analytics and reporting.

Technical Skills: Python, SQL, PySpark, Pandas, Azure Databricks, Azure Data Factory, Snowflake, Star Schema, Dimensional Modeling and Microsoft Fabric.

Key Responsibilities

Implement medallion lakehouse architecture patterns across Bronze, Silver, and Gold layers.

Build and maintain ingestion pipelines that bring structured, semi-structured data, files, API sources into the Bronze layer of the lakehouse.

Develop Silver layer patterns focused on data cleaning, standardization, deduplication, validation, and business-rule application.

Develop Gold layer datasets that are curated, analytics-ready, and suitable for enterprise reporting and semantic model consumption.

Build reliable data pipelines using Azure Databricks, SQL, and Python.

Design and support orchestration workflows using Azure Data Factory Implement data quality checks, reconciliation logic, exception handling, and monitoring across pipelines.

Support enterprise data warehouse design, including dimensional modeling, star schemas, fact tables, and dimension tables.

Prepare curated datasets that support semantic models, governed reporting, dashboards, and business intelligence use cases.

Enable semantic layers and semantic models through Microsoft Fabric, including Fabric Lakehouse, Fabric Warehouse, OneLake, and Power BI semantic model integration.

Optimize data pipelines, transformations, models, and processing jobs for performance, scalability, and maintainability.

Collaborate with data analysts, BI developers, business stakeholders, and platform teams to translate requirements into trusted, reusable data products.

Communicate technical concepts clearly to both technical and non-technical stakeholders.

Required Skills and Experience

Data Engineer 3-5 years of industry experience in data engineering, analytics engineering and enterprise data platforms

Strong hands-on experience with SQL and Python for data pipeline development, transformation, validation, and performance optimization.

Experience building ingestion pipelines into a lakehouse, cloud data platform, or enterprise data warehouse.

Experience designing and implementing Bronze, Silver, and Gold layers in a medallion lakehouse architecture.

Hands-on experience with Azure Databricks and distributed data processing.

Experience with Snowflake, including data loading, transformation, performance tuning, and integration with cloud platforms.

Experience implementing data cleaning, standardization, deduplication, validation, reconciliation, and data quality rules.

Working experience with enterprise data warehouses, dimensional modeling, star schemas, fact tables, and dimension tables.

Hands on experience with Microsoft Fabric a plus Experience creating curated datasets for reporting, analytics, dashboards, and semantic model consumption.

Understanding of data governance, metadata management, lineage, monitoring, and production support best practices.

Ability to own specific data domains, act as a subject matter expert, and guide implementation patterns.

Preferred Qualifications

Azure, Microsoft Fabric, Databricks, or Snowflake certification preferred.

Experience with Delta Lake, Unity Catalog, OneLake, or similar lakehouse governance technologies.

Experience implementing reusable ingestion, Silver transformation, and Gold curation patterns.

Experience supporting Power BI or Microsoft Fabric semantic model development.

Experience building production-grade data pipelines for enterprise analytics, reporting, or data platform modernization.

About the Company

S

Sumeru Solutions