Aberdeen, Maryland · 30 days ago
This role focuses on evaluating, tuning, benchmarking, and operationalizing Large Language Models (LLMs), embedding models, and AI inference services for constrained hardware platforms, including the X9 Spider Mission Computer architecture and other edge compute systems that support operational missions using ReadiChat.
Evaluate candidate LLMs, embedding models, and AI inference solutions for quality, latency, memory utilization, reliability, and operational performance on embedded GPU-enabled edge compute platforms, including the X9 Spider Mission Computer architecture.