Databricks Architect/ADMIN

Ampcus Incorporated

Hartford, CT

JOB DETAILS
SKILLS
Access Control, Administrative Skills, Amazon Elastic Compute Cloud (EC2), Amazon Simple Storage Service (S3), Amazon Web Services (AWS), Ansible, Apache, Apache Spark, Architectural Services, Artificial Intelligence (AI), Atlassian JIRA, Auditing, Automation, Autoscaling, Bash Scripting, Capacity Management, Change Order Management, Chargebacks, Unity Catalog, Cloud Computing, Cloud Storage, Communication Skills, Computer Systems, Configuration Management, Consulting, Continuous Deployment/Delivery, Continuous Integration, Cost Control, Cost Modeling, Cron Job Scheduling, Cryptography, Data Analysis, Data Management, Data Processing, Data Quality, Data Recovery, Data Science, Database Design, Database Extract Transform and Load (ETL), Database Programming, Disaster Recovery, Documentation, Ecosystems, Enterprise Architecture, Enterprise Protection, File Management, File Systems, Finance, Financial Services, Forecasting, Google Cloud Platform (GCP), Git, GitHub, IT Service Management (ITSM), Identify Issues, Information Technology & Information Systems, Information Technology Consulting, Insurance, Internet Security, Linux Administration, Linux Operating System, Local Government, Machine Learning, Machine Tool, Manufacturing, Microsoft Azure, Multiplatform/Cross-Platform, Onboarding, Oracle Database, Presentation/Verbal Skills, Process Management, Programming Tools, Project Tracking, Python Programming/Scripting Language, Recruiting Strategy, Regulatory Compliance, Reliability Engineering, SQL (Structured Query Language), Scripting (Scripting Languages), ServiceNow, Software Engineering, Source Code/Configuration Management (SCM), Sprint Planning, Standards Development, System Operations, Systems Administration/Management, Topology, Unix Operating Systems, Unix Shell Programming, Unix System Administration, Virtual Machine (VM), Warehousing, Writing Skills
LOCATION
Hartford, CT
POSTED
14 days ago

Location: Hartford, CT (Hybrid)

Job Type: Full-Time

Per the hiring manager: This is not an end-to-end management role; it is more of a consulting role. We have an extensive Databricks team with a lead already. I need this type of expertise for consultation, pipelines, and some automation work, as well as for forecasting usage.

POSITION SUMMARY

The Databricks Architect/ADMIN is a senior individual contributor responsible for the design, implementation, and continuous optimization of the enterprise Databricks platform. This role serves as the technical authority for all aspects of the Databricks environment — including workspace governance, Unity Catalog, cluster and compute strategy, data pipeline architecture, and cost management. The Architect works in close partnership with data engineering, analytics, and infrastructure teams, and operates within a broader multi-platform data ecosystem that includes Ab Initio and Fivetran. A strong background in Unix/Linux systems administration and scripting is essential, as the role requires deep engagement with the underlying compute infrastructure supporting the platform.

KEY RESPONSIBILITIES

Platform Architecture & Design

  • Architect and govern the enterprise Databricks environment, including workspace topology, Unity Catalog structure, and access control frameworks.
  • Define and enforce standards for cluster configuration, runtime versions, instance pool utilization, and auto-scaling policies.
  • Design scalable, performant data pipeline patterns using Delta Live Tables, Databricks Workflows, and Structured Streaming.
  • Establish architectural standards for Delta Lake — including table formats, partitioning strategies, Z-ordering, and OPTIMIZE/VACUUM scheduling.
  • Lead platform integration design with upstream ingestion tools including Fivetran and Ab Initio, ensuring reliable, governed data delivery.

Unix/Linux Infrastructure & Operations

  • Administer and troubleshoot Unix/Linux environments underpinning Databricks compute nodes, init scripts, and cluster lifecycle management.
  • Develop and maintain shell scripts (Bash) and Python automation for platform operations, monitoring, log aggregation, and maintenance tasks.
  • Manage file system operations, permission structures, and data movement tasks in Linux-based storage and compute environments.
  • Support EC2/VM-level diagnostics and tuning in coordination with infrastructure and cloud engineering teams.

Cost Management & Optimization

  • Own DBU consumption tracking and reporting; proactively identify optimization opportunities across jobs, interactive clusters, and SQL warehouses.
  • Implement and maintain cost attribution models to support chargeback or showback reporting by team, product, or LOB.
  • Partner with the Senior Director on capacity planning, contract utilization forecasting, and multi-year commitment management.

Governance, Security & Compliance

  • Design and implement data governance frameworks within Unity Catalog, including lineage, tagging, and access auditing.
  • Collaborate with Cybersecurity to ensure platform configurations satisfy enterprise security controls, including secrets management, network isolation, and encryption.
  • Support audit and compliance activities by maintaining documentation of platform configurations, access policies, and data classification standards.

Automation & Artificial Intelligence

  • Design and implement end-to-end automation frameworks for platform operations, including cluster lifecycle management, job scheduling, alerting, and self-healing workflows.
  • Leverage Databricks AutoML, MLflow, and Model Serving capabilities to support the operationalization of machine learning models within the enterprise data platform.
  • Integrate AI-assisted development tooling (e.g., Databricks Assistant, GitHub Copilot) into engineering workflows to accelerate pipeline development and reduce manual effort.
  • Identify and drive automation opportunities across ingestion, transformation, data quality, and governance processes — reducing toil and improving platform reliability.
  • Collaborate with data science and advanced analytics teams to architect scalable feature engineering pipelines and model deployment patterns on Databricks.
  • Evaluate and recommend emerging AI/ML platform capabilities, including generative AI integrations and LLM-backed data workflows, in alignment with enterprise strategy.
  • Serve as the primary technical escalation point for Databricks platform issues across data engineering and analytics teams.
  • Contribute to sprint planning and project tracking within Jira; manage platform change requests and incidents through ServiceNow.
  • Produce and maintain architectural documentation, runbooks, and onboarding materials for platform consumers.
  • Evaluate and recommend new Databricks features, partner integrations, and tooling investments in support of the platform roadmap.

REQUIRED QUALIFICATIONS

  • 7 years of experience in data engineering or data platform roles, with a minimum of 4 years of hands-on Databricks implementation experience.
  • Demonstrated expertise with Databricks platform capabilities: Unity Catalog, Delta Lake, Databricks Workflows, Delta Live Tables, and SQL Warehouses.
  • Strong Unix/Linux proficiency — shell scripting, process management, file system operations, cron scheduling, and environment configuration.
  • Proficiency in Python and PySpark for distributed data processing, pipeline development, and platform automation.
  • Experience with cloud infrastructure (AWS, Azure, or GCP), including compute, storage, networking, and IAM/security constructs.
  • Demonstrated ability to design for scale, cost efficiency, and operational reliability in an enterprise data environment.
  • Demonstrated experience designing automation frameworks for data platform operations — including job orchestration, monitoring, alerting, and pipeline self-healing.
  • Familiarity with AI/ML concepts and tooling within the Databricks ecosystem, including MLflow, AutoML, and Model Serving; exposure to generative AI or LLM-integrated workflows is a plus.
  • Experience with Oracle database environments, including SQL development, schema design, and integration patterns for data extraction and pipeline sourcing.
  • Proficiency in Git-based version control — branching strategies, pull request workflows, repository management, and CI/CD pipeline integration for data platform code.
  • Experience working within ITSM and project delivery frameworks such as ServiceNow and Jira.
  • Strong written and verbal communication skills, with the ability to convey complex architectural concepts to both technical and non-technical audiences.

PREFERRED QUALIFICATIONS

  • Hands-on experience with MLflow experiment tracking, model registry, and deployment patterns within Databricks.
  • Exposure to generative AI frameworks (LangChain, LlamaIndex) or experience building LLM-integrated data pipelines and retrieval-augmented generation (RAG) workflows.
  • Experience with workflow automation tools such as Apache Airflow, Databricks Workflows, or comparable orchestration platforms at enterprise scale.
  • Experience integrating Databricks with ETL/ELT platforms including Fivetran or Ab Initio; hands-on Ab Initio development or administration experience is a strong plus.
  • Familiarity with enterprise data governance frameworks and catalog tools (e.g., Collibra, Alation, or Unity Catalog advanced features).
  • Experience supporting Databricks in regulated industries (financial services, insurance) with associated audit and compliance requirements.
  • Working knowledge of Infrastructure-as-Code tooling (Terraform, Ansible) for platform provisioning and configuration management.
  • Background in disaster recovery design and resiliency planning for cloud-hosted data platforms.

CORE TECHNICAL COMPETENCIES

Platform & Data Engineering

  • Databricks (Unity Catalog, DLT, Workflows)
  • Delta Lake / Delta Live Tables
  • Apache Spark / PySpark
  • MLflow, AutoML & Model Serving
  • Generative AI / LLM-Integrated Workflows
  • Fivetran, Ab Initio Integration
  • Cloud Storage (S3, ADLS, GCS)
  • SQL / SparkSQL

Infrastructure & Systems

  • Unix/Linux Administration & Scripting
  • Bash / Shell Scripting
  • EC2 / VM Compute Management
  • Python Automation & Orchestration
  • Git Version Control & Repository Management
  • Oracle Database / SQL Development
  • Secrets Management & Security Hardening
  • ServiceNow

Since 1995, iTech Solutions Inc. has been providing IT Consulting and Direct Hire Services to the Insurance, Financial, Communications, Manufacturing, and Government sectors, with local offices in Connecticut, Minnesota, Colorado, Massachusetts, Tennessee, North Carolina, and the New Jersey/Pennsylvania area.

Our recruiting strategy is simple: if you want to find qualified IT professionals, use IT professionals to find them. At iTech Solutions, our personnel are all career IT professionals with a wide range of IT experience. We can honestly say our staff understands the technologies, the complexities of finding and selecting the appropriate personnel, and the pressures of running successful IT projects.

Employer will not sponsor applicants for any employment visas, at hiring or in the future, including but not limited to H-1B visas. Corp-to-Corp or subcontract personnel will not be considered for this position.

iTech Solutions, Inc. is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, protected veteran status, or disability.

About the Company

Ampcus Incorporated

Ampcus Inc is a global technology and business consulting firm specializing in Digital Transformation, Big Data, Analytics, Cyber Security, Testing, IV&V, Infrastructure Management, and Enterprise Solutions. Ampcus Inc is an SBA 8(a) certified Women and Minority Owned global provider of a broad range of consulting services. From strategy to execution, our disciplined yet flexible approach starts and ends with our clients. By listening hard and working harder, their goals become our goals. We are an ISO 9000, ISO 20000, ISO 27000 and CMMi Level certified company.

Ampcus consultants have significant business, engineering and technology experience. Our consultants have over 20 years of business experience and an average of over 10 years of engineering and technology experience. This means that the project teams understand how systems work and how the technology impacts the business processes of organizations.

We believe that the success of an engagement is determined by strong project management, clear communication, and mutual commitment to working collaboratively. Our methodology begins by listening to the customer's needs, then working with their teams to gain a clear understanding of the requirements, while providing a knowledge transfer of best practices for the organization. As a recognized leader providing customized software services, management, and engineering solutions to companies around the world, our ability to deliver is a "granted" that makes companies put their trust in us to answer their day-to-day business challenges and put them on a path to greater success. We are the choice for our clients because we look at our clients' business from a growth perspective.

Industry: Information Technology and Services

Specialties: Digital Transformation, Big Data and Analytics, Infrastructure Management Services, Testing and IV&V, Cyber Security, Active Directory and E-mail Infrastructure, Project Management, Training, ERP, CRM, EAI, and BI

COMPANY SIZE
500 to 999 employees
INDUSTRY
Staffing/Employment Agencies
WEBSITE
http://www.ampcus.com