Senior GenAi/ Agentic Lead

Cardinal Integrated Technologies Inc

Santa Clara, CA

JOB DETAILS
SKILLS
Amazon Web Services (AWS), Application Programming Interface (API), Artificial Intelligence (AI), Automation, Cloud Architecture, Cloud Computing, Computer Science, Data Science, Documentation, Ecosystems, Enterprise Applications, Finance, GCP (Good Clinical Practices), Government, Healthcare, Information Technology & Information Systems, Kernel Programming, Maintain Compliance, Microsoft .NET, Microsoft C# (C Sharp), Microsoft Product Family, Microsoft Windows Azure, Node.js, Operations Processes, Performance Tuning/Optimization, Production Systems, Python Programming/Scripting Language, Safety Compliance, Search Ranking, Search Technology, Snowflake Schema, Software Engineering, User Interface/Experience (UI/UX), Web Client Plug-ins
LOCATION
Santa Clara, CA
POSTED
30+ days ago
Role: Senior GenAi/ Agentic Lead
Duration: 6-12+ Months Contract
Location – Santa Clara, CA
Onsite Requirement – Yes
Number of days onsite – (Onsite 5 days/week)


Must Have Skills
Skill 1 – seeking a highly skilled Cloud Architect with expertise in Generative AI, Copilot Studio, and multi cloud platforms spanning Azure (including Azure AI Foundry), AWS, and Google Cloud.
Skill 2 – Architect end-to-end Generative AI solutions, including model serving (vLLM, TGI), API integration, and user interaction layers
Skill 3 – Proven experience with Azure AI Foundry, Azure OpenAI, and Copilot Studio (actions, connectors, governance, M365 integration).

Good To have Skills –
Skill 1 – Familiarity with AI Ops / MLOps tools such as Prompt Flow, MLflow, SageMaker Pipelines, or Vertex Pipelines

We are seeking a highly skilled Cloud Architect with expertise in Generative AI, Copilot Studio, and multi cloud platforms spanning Azure (including Azure AI Foundry), AWS, and Google Cloud. This role will design scalable, secure, and production ready AI systems, enabling RAG, agentic workflows, and enterprise copilots.
________________________________________
Core Responsibilities:
1. Architect end to end Generative AI solutions, including model serving (vLLM, TGI), API integration, and user interaction layers.
2. Design and implement RAG architecture using vector stores, embeddings, hybrid search, and re ranking to embed enterprise knowledge into LLMs.
3. Create agentic systems, enabling multi agent collaboration for complex, stateful workflows and reasoning driven automation.
4. Develop and govern Copilots in Copilot Studio, including connectors, actions, plugins, DLP rules, environment strategy, and integration with Microsoft 365 and enterprise systems.
5. Leverage Azure AI Foundry (prompt flow, evaluators, safety, model orchestration) to operationalize LLM applications at scale.
6. Evaluate and optimize AI system performance, balancing quality, latency, throughput, cost efficiency, and safety compliance.
7. Implement Responsible AI, security, and HITL (HumanintheLoop) controls, ensuring compliance in regulated environments. in the Loop) controls, ensuring compliance in regulated environments.
8. Produce clear, maintainable documentation for architecture, patterns, and operational processes.
________________________________________
Required Qualifications:
• 8–10 years of experience in cloud architecture or enterprise software engineering.
• 3+ years of hands on experience designing or delivering Generative AI or LLM applications.
• Proven experience with Azure AI Foundry, Azure OpenAI, and Copilot Studio (actions, connectors, governance, M365 integration).
• Experience deploying AI solutions on AWS (Bedrock, SageMaker) and/or GCP (Vertex AI).
• Hands on experience with RAG, vector databases (Azure AI Search, Pinecone, OpenSearch, Vertex Matching Engine), embeddings, and hybrid search.
• Deep understanding of cloud security (IAM/RBAC, Key Vault/KMS, VPC/PrivateLink, token safety).
• Experience with Kubernetes (AKS/EKS/GKE), containerization, API frameworks (FastAPI, Node.js, .NET), Python, TypeScript, or C#/.NET.
• Working knowledge of transformer architectures and model adaptation techniques (fine tuning, LoRA, prompt engineering).
• Familiarity with AI Ops / MLOps tools such as Prompt Flow, MLflow, SageMaker Pipelines, or Vertex Pipelines.
________________________________________
Preferred Qualifications:
• Experience implementing agent based systems using frameworks like LangChain, LlamaIndex, Semantic Kernel, or AutoGen.
• Background working with enterprise data ecosystems (Databricks, Snowflake, BigQuery, Redshift).
• Knowledge of Responsible AI frameworks, guardrails, safety filters, PII redaction, and evaluation methodologies.
• Experience in regulated industries (healthcare, finance, government), with understanding of compliance controls.
• Experience with observability (OpenTelemetry, Prometheus/Grafana, App Insights) for AI workloads.
________________________________________
Education:
• Bachelor's/ Masters in Computer Science, Engineering, Information Systems, Data Science, or related field (required).

About the Company

C

Cardinal Integrated Technologies Inc