li>Design and implement scalable ingestion and transformation pipelines across structured (SQL, relational) and unstructured (documents, images, audio, email, call transcripts) data sources, applying OCR, NLP preprocessing, and document chunking strategies optimized for LLM consumption. The engineer works across structured and unstructured data domains - including documents, images, audio, and transactional records - to unlock analytical value through scalable pipelines, RAG architectures, vector databases, and knowledge graphs.