$180,000–$200,000 Per Year
Artificial Intelligence (AI), Artificial Intelligence (AI) Agents, CUDA (Compute Unified Device Architecture), Cloud Architecture, Cloud Computing, Continuous Deployment/Delivery, Continuous Improvement, Data Modeling, Ethernet, Field Trials, GPU (Graphics Processing Unit), Healthcare, Manufacturing, Process Improvement, Product Demonstration, Product Planning, Retail, Safety Compliance, Use Cases
Must Have Technical/Functional Skills
AI / GenAI
- Hands‑on experience with LLMs, RAG, agentic workflows, and GenAI architectures
- Frameworks: PyTorch, TensorFlow
- NVIDIA stack: NeMo, NIM, Triton, TensorRT‑LLM, RAPIDS
- Custom LLM development experience (LoRA, QLoRA, distillation, hyperparameter tuning)
- Experience using NVIDIA Nemotron models
GPU & Systems
- GPU acceleration, CUDA fundamentals, performance profiling
- Distributed training and inference on multi‑node GPU clusters
- AI networking and storage concepts (InfiniBand / Ethernet)
Nice to Have
- Experience with LangGraph, LlamaIndex, CrewAI
- Industry expertise (BFSI, Healthcare, Retail, Manufacturing)
- NVIDIA Certifications (AI Infrastructure, GenAI, AI Operations)
- Enables scalable, governed adoption of GenAI and AI agents
Roles & Responsibilities
Solution Architecture & Delivery
- Design end-to-end AI / GenAI and agentic architectures using NVIDIA GPUs, DGX/HGX platforms, networking, and NVIDIA AI stack (NeMo, NIM, Triton, TensorRT‑LLM, RAPIDS)
- Build PoCs and reference architectures for LLMs, RAG, agentic AI, and industry-specific use cases
- Optimize training and inference performance across distributed GPU clusters
AI Agent Lifecycle (NeMo‑Powered)
- Enable the full AI agent lifecycle: data preparation, model selection, agent orchestration, deployment, and continuous optimization
- Use NeMo Curator & Data Designer for AI‑ready and synthetic data
- Apply Nemotron models, NeMo Retriever, and NeMo Evaluator for RAG and validation
- Build and optimize agents using NeMo Agent Toolkit across LangChain, CrewAI, LangGraph, and custom frameworks
- Deploy high‑performance inference using NVIDIA NIM
- Enforce grounding, safety, and compliance using NeMo Retriever and NeMo Guardrails
- Drive continuous improvement using NeMo Customizer, NeMo RL, and NeMo Evaluator
Customer & Partner Engagement
- Act as a trusted technical advisor to enterprise customers, GSIs, ISVs, and cloud partners
- Lead architecture workshops and deep‑dive sessions with CXOs, architects, and engineering teams
- Translate business problems into scalable NVIDIA‑based solutions with measurable outcomes
Ecosystem & Enablement
- Enable partners on NVIDIA AI Enterprise and cloud reference architectures
- Create reusable assets: demos, reference architectures, and enablement material
- Provide field feedback to influence NVIDIA product roadmap
Salary Range: $180,000 to $200,000 per year
T
Tata Consultancy Services Ltd