PySpark / Python Data Engineer

Cardinal Integrated Technologies Inc

Los Angeles, CA

JOB DETAILS
SKILLS
Amazon Web Services (AWS), Apache Spark, Application Programming Interface (API), Automation, Best Practices, Career Development, Computer Science, Computer Skills, Continuous Deployment/Delivery, Continuous Integration, Data Analysis, Data Management, Data Mapping, Database Design, Database Triggers, Ecosystems, Java, Leadership, Management of Information Systems/Technology (MIS), Ontology, Power BI, Process Improvement, Project Tracking, Python Programming/Scripting Language, Requirements Management, SQL Databases, Software Engineering, Stored Procedures, Tableau, Technical Leadership, Testing
LOCATION
Los Angeles, CA
POSTED
30+ days ago

Role PySpark Python Data Engineer - Tech Lead roleDuration 6-12 Months ContractLocation California - Remote Some visits may require in futureNote Prefer candidates from PST Time ZonePlease find the job details belowJob SummaryWe are seeking a seasoned professional with expertise in building data engineering and analytics solutions within AWS ecosystems. The ideal candidate should have deep experience in PySpark Python and end‐to‐end data pipeline development including job orchestration workflow design and data mapping. The role requires the ability to translate complex business logic stored procedures and SQL triggers into scalable PySpark implementations. Experience with data streaming on Spark clusters and API design is highly desirable. Knowledge of Palantir Foundry is a strong plus. Details - Time Zone Must be able to work in PST hours.- Minimum Qualifications MS or equivalent experience in Computer Science MIS or related technical fields 10-15 years of overall experience with 5 years in data engineeringETL ecosystems using PySpark Python and Java. Key Responsibilities - Translate business requirements into technical solutions using PySpark and Python frameworks.- Lead data engineering initiatives for complex analytics challenges.- Plan and execute tasks track progress and document work following best practices.- Identify and implement process improvements including scalable infrastructure design and workflow automation.- Participate in AgileScrum ceremonies.- Provide technical guidance to team members across functional and technical domains.- Build infrastructure for large‐scale data access and ensure data qualitymetadata management.- Collaborate with leadership to strengthen data‐driven decision‐making. Required Skills - Strong expertise in PySpark and Python.- Experience with Pandas APIs and Spark Streaming.- Solid understanding of database design fundamentals.- Familiarity with CICD tools and infrastructure‐as‐code frameworks.- Experience writing production‐grade code including unitintegration tests and schema validations.- Knowledge of Palantir Foundry Ontology modeling API configuration Foundry Typescript and exposure to Power BI or Tableau are significant advantages.

About the Company

C

Cardinal Integrated Technologies Inc