Santa Rosa, California (posted 6 days ago)
You'll own end-to-end performance: profiling training workloads on multi-GPU clusters, writing custom CUDA kernels and LibTorch C++ extensions for hot paths, and optimizing inference for embedding in production software where every millisecond matters.

Our ~15,000 employees create world-class solutions in communications, 5G, automotive, energy, quantum, aerospace, defense, and semiconductor markets for customers in over 100 countries.