Research & Cross-Functional Collaboration: Lead experimentation with new architectures, prompt-engineering techniques, or retrieval systems, and collaborate with product, data, and ML operations teams to translate research into production features.

Systems Integration & Deployment: Implement scalable inference pipelines, optimize serving infrastructure (e.g., quantization, caching, distillation), and integrate models into applications or APIs.