Artificial Intelligence (AI), Caching, Cloud Computing, Computer Programming, Data Recovery, Distributed Computing, Large-Scale Systems, MCP (Model Context Protocol), Memory Hardware, Programming Languages, System Architecture, Technical Leadership
Role: Systems Architect 4 AI / Distributed Systems (AI Systems Architect)
Locations
- Dallas
- Charlotte
- San Francisco Bay Area (Onsite)
Must-Have Skills
- Experience in designing and implementing high-performance, large-scale distributed systems
- Experience implementing and deploying AI/ML platforms at scale
- Experience building agent architectures
- Knowledge of evaluation frameworks
- Experience with prompt/context engineering
- Experience with MCP servers
- Hands-on experience in LLM inference optimization
- Knowledge of batching and caching strategies
- Experience in Kubernetes and cloud infrastructure
- Strong programming skills in at least one programming language
- Expertise in designing agent data stacks and retrieval systems, including:
  - Vector databases
  - Hybrid search
  - Data freshness
  - Memory systems
  - Graph reasoning
  - BM25
Key Responsibilities
- Architect and deliver scalable, high-performance distributed systems
- Design and deploy AI/ML and GenAI platforms at enterprise scale
- Build agent-based architectures, including:
  - Prompt/context engineering
  - MCP servers
  - Evaluation frameworks
- Optimize LLM inference pipelines:
  - Batching
  - Caching
  - Latency
  - Throughput optimization
- Design agent data and retrieval stacks:
  - Vector databases
  - Hybrid search
  - Memory systems
  - Graph-based systems
- Lead Kubernetes-based, cloud-native deployments
- Provide technical leadership, architecture governance, and hands-on guidance