Databricks Engineer

InterSources Inc.

MD, MD

JOB DETAILS
SKILLS
Access Control, Agile Programming Methodologies, Amazon Simple Storage Service (S3), Amazon Web Services (AWS), Apache Spark, Application Programming Interface (API), Artificial Intelligence (AI), Automation, Best Practices, Business Intelligence, Centralized Operations/Management, Cisco Unity, Cloud Applications, Cloud Architecture, Cloud Computing, Cryptography, Data Analysis, Data Lake, Data Management, Data Mart, Data Modeling, Data Quality, Data Science, Data Sets, Data Storage, Data Warehousing, Database Extract Transform and Load (ETL), Documentation, ERP (Enterprise Resource Planning), Enterprise Data Integration, Error Handling, Family Educational Rights and Privacy Act (FERPA), ISO (International Organization for Standardization), Information Technology Consulting, Information/Data Security (InfoSec), International Electro-Technical Commission (IEC), Internet Security, JDBC (Java Database Connectivity), Machine Learning, Metadata, Metrics, Microsoft Windows Azure, Online Marketing, Oracle, Peoplesoft, Performance Analysis, Predictive Modeling, Privacy Controls, Privacy Regulations, Progress Reports, Python Programming/Scripting Language, Quality Assurance, Quality Monitoring, Regulatory Compliance, SQL (Structured Query Language), Sales Pipeline, Salesforce.com, Scala Programming Language, Security Auditing, Snowflake Schema, Software Development, Star Schema, Storage Architecture, System Integration (SI), Team Player, Technical Writing, Unstructured Data, User Experience Design (UXD), User Interface Design, Validation Testing, Web Programming
LOCATION
MD, MD
POSTED
23 days ago
Title:Databricks Engineer
Location:MD
On-site/Remote/Hybrid: REMOTE
Duration: 6-12+ Months
Interview Process: 2 Rounds
No of submissions:
No of Positions:

We are seeking a Databricks Engineer to design, build, and operate a Data & AI platform with a strong foundation in the Medallion Architecture (raw/bronze, curated/silver, and mart/gold layers). This platform will orchestrate complex data workflows and scalable ELT pipelines to integrate data from enterprise systems such as PeopleSoft, D2L, and Salesforce, delivering high-quality, governed data for machine learning, AI/BI, and analytics at scale.

You will play a critical role in engineering the infrastructure and workflows that enable seamless data flow across the enterprise, ensure operational excellence, and provide the backbone for strategic decision-making, predictive modeling, and innovation.

Responsibilities:
1. Data & AI Platform Engineering (Databricks-Centric): x Design, implement, and optimize end-to-end data pipelines on Databricks, following the Medallion Architecture principles. x Build robust and scalable ETL/ELT pipelines using Apache Spark and Delta Lake to transform raw (bronze) data into trusted curated (silver) and analytics-ready (gold) data layers. x Operationalize Databricks Workflows for orchestration, dependency management, and pipeline automation. x Apply schema evolution and data versioning to support agile data development.
2. Platform Integration & Data Ingestion: x Connect and ingest data from enterprise systems such as PeopleSoft, D2L, and Salesforce using APIs, JDBC, or other integration frameworks. x Implement connectors and ingestion frameworks that accommodate structured, semistructured, and unstructured data. x Design standardized data ingestion processes with automated error handling, retries, and alerting.
3. Data Quality, Monitoring, and Governance: x Develop data quality checks, validation rules, and anomaly detection mechanisms to ensure data integrity across all layers. x Integrate monitoring and observability tools (e.g., Databricks metrics, Grafana) to track ETL performance, latency, and failures. x Implement Unity Catalog or equivalent tools for centralized metadata management, data lineage, and governance policy enforcement.
4. Security, Privacy, and Compliance: x Enforce data security best practices including row-level security, encryption at rest/in transit, and fine-grained access control via Unity Catalog. x Design and implement data masking, tokenization, and anonymization for compliance with privacy regulations (e.g., GDPR, FERPA). x Work with security teams to audit and certify compliance controls.
5. AI/ML-Ready Data Foundation: x Enable data scientists by delivering high-quality, feature-rich data sets for model training and inference. x Support AIOps/MLOps lifecycle workflows using MLflow for experiment tracking, model registry, and deployment within Databricks. x Collaborate with AI/ML teams to create reusable feature stores and training pipelines.
6. Cloud Data Architecture and Storage: x Architect and manage data lakes on Azure Data Lake Storage (ADLS) or Amazon S3, and design ingestion pipelines to feed the bronze layer. x Build data marts and warehousing solutions using platforms like Databricks. x Optimize data storage and access patterns for performance and cost-efficiency.
7. Documentation & Enablement: x Maintain technical documentation, architecture diagrams, data dictionaries, and runbooks for all pipelines and components. x Provide training and enablement sessions to internal stakeholders on the Databricks platform, Medallion Architecture, and data governance practices. x Conduct code reviews and promote reusable patterns and frameworks across teams.
8. Reporting and Accountability: x Submit a weekly schedule of hours worked and progress reports outlining completed tasks, upcoming plans, and blockers. x Track deliverables against roadmap milestones and communicate risks or dependencies.

Required Qualifications:
x Hands-on experience with Databricks, Delta Lake, and Apache Spark for large-scale data engineering. x Deep understanding of ELT pipeline development, orchestration, and monitoring in cloud-native environments.
x Experience implementing Medallion Architecture (Bronze/Silver/Gold) and working with data versioning and schema enforcement in enterprise grade environments.
x Strong proficiency in SQL, Python, or Scala for data transformations and workflow logic.
x Proven experience integrating enterprise platforms (e.g., PeopleSoft, Salesforce, D2L) into centralized data platforms.
x Familiarity with data governance, lineage tracking, and metadata management tools.

Preferred Qualifications:
x Experience with Databricks Unity Catalog for metadata management and access control.
x Experience deploying ML models at scale using MLFlow or similar MLOps tools.
x Familiarity with cloud platforms like Azure or AWS, including storage, security, and networking aspects.
x Knowledge of data warehouse design and star/snowflake schema modeling.

About Us:
InterSources Inc, is a Small, Woman, and Minority-Owned Business Enterprise, ISO/IEC 27001, SOC 2 Type 2 certified company with massive 18+ years of diversified experience in providing IT Consulting Services, Artificial Intelligence, Data Analysis, Application Development, Cloud Services, Cybersecurity, Digital Marketing, ERP Management, Custom Software Development, Web Development, UI/ UX Design, System Integration, QA Support etc. We make reasonable accommodations for clients and employees, and we do not discriminate based on any protected attribute including race, religion, color, national origin, gender sexual orientation, gender identity, age, or marital status. We also are a Google Cloud and Oracle partner company.

About the Company

I

InterSources Inc.

It’s all about harnessing the real power of data. InterSources Inc was founded in 2007 providing intelligent data solutions to clients across industries and geographies.

Over the years, we have built products on Business Intelligence & Big Data platform simplifying and transforming the way business intelligence and real-time data analytics empower Corporations and end-users using Softwares like Tableau, Business Objects, MicroStrategy, etc.

In the process, we have enabled companies to use data analytics to help better understand, predict and influence consumer behavior, identify new market opportunities as they emerge, provide to users the data they need, alert the user when and why key business metrics have changed and enable them to make smart decisions.

COMPANY SIZE
100 to 499 employees
INDUSTRY
Computer/IT Services
FOUNDED
2007
WEBSITE
https://www.intersourcesinc.com/