Key Responsibilities
Required Qualifications
Strong experience with Databricks, Apache Spark, and PySpark for large-scale data processing
Experience building and optimizing data pipelines at scale, including parallelization and performance tuning
Experience with near real-time or streaming data systems
Proficiency in Python and SQL for data engineering and transformation workflows
Experience with ETL/ELT processes and tools
Hands-on experience with cloud data platforms (Azure, AWS, or GCP)
Solid understanding of data modeling and dataset design for analytics and downstream applications
Experience tuning queries and optimizing compute performance
Knowledge of data governance, security, and compliance practices
Preferred Qualifications
Experience with cloud platforms (Azure, AWS, or GCP)
Experience with vector databases and embedding-based systems
Experience with streaming frameworks and data quality tools
Familiarity with knowledge graphs and graph-based data modeling
Experience with CI/CD pipelines and deployment automation
Familiarity with BI tools and machine learning pipelines
Education & Experience
Bachelor's degree or equivalent experience.
37 years of data engineering experience.
US Persons only (Citizens/ Green card)
At Cyient, we work towards improving the daily lives of people with unwavering focus. From a quieter flight to a safer train journey, a more reliable energy supply, or a quicker internet connection, we provide engineering, manufacturing, geospatial, network and operations management services to industry leaders across the globe. Our 15,000 associates are located in more than 21 countries, supporting 12 industries, including aerospace, rail transportation, power generation, telecommunications and medical technology. With a sound track record of growth and profitability, we are committed to developing a sustainable society and actively promoting education and inclusive growth initiatives in our local communities.