Senior ML Infrastructure Engineer, Inference Platform

General Motors

Austin, TX

JOB DETAILS
SALARY
$155,420–$395,900 Per Year
SKILLS
Application Programming Interface (API), Artificial Intelligence (AI), Audiovisual, Automotive Repair and Maintenance, Best Practices, C++ Programming Language, Caching, Cloud Computing, Communication Skills, Concurrency, Data Mining, Data Processing, Distributed Computing, Ecosystems, GPU (Graphics Processing Unit), Machine Learning, Metrics, Multiplatform/Cross-Platform, Multitasking, Open Source, Performance Tuning/Optimization, Problem Solving Skills, Python Programming/Scripting Language, State Laws and Regulations, Technical Leadership, Telemetry, Usability Engineering, Use Cases, User Interface/Experience (UI/UX)
LOCATION
Austin, TX
POSTED
1 day ago

About the Team:The ML Inference Platform is part of the AV ML Infrastructure organization. Our team owns the cloud-agnostic, reliable, and cost-efficient platform that powers GM's AI efforts. We're proud to serve teams developing autonomous vehicles (L3/L4/L5), as well as other groups building AI-driven products for GM and its customers. We enable rapid innovation and feature development by optimizing for high-priority, ML-centric use cases. Our platform supports the serving of state-of-the-art (SOTA) machine learning models for experimental, online and bulk inference, with a focus on performance, availability, concurrency, and scalability. We're committed to maximizing GPU utilization across platforms (B200, H100, A100, and more) while maintaining reliability and cost efficiency.About the Role:We are seeking a Senior ML Infrastructure engineer to help build and scale robust platforms for ML Inference workflows. In this role, you'll work closely with ML engineers and researchers to ensure efficient model serving and inference in production, for workflows such as data mining, labeling, model distillation, evaluations, simulations and more. This is a high-impact opportunity to influence the future of AI infrastructure at GM. You will play a key role in shaping the architecture, roadmap and user-experience of a robust ML inference service supporting real‑time, batch, and experimental inference needs. The ideal candidate brings experience in designing distributed systems for ML, strong problem‑solving skills, and a product mindset focused on platform usability and reliability.What you'll be doing:Design and implement core platform backend software components.Collaborate with ML engineers and researchers to understand critical workflows, parse them to platform requirements, and deliver incremental value.Lead technical decision‑making on model serving strategies, orchestration, caching, model versioning, and auto‑scaling mechanisms for highly optimized use of accelerators.Drive the development of monitoring, observability, and metrics to ensure reliability, performance, and resource optimization of inference services.Proactively research and integrate state‑of‑the‑art model serving frameworks, hardware accelerators, and distributed computing techniques.Lead technical initiatives across GM's ML ecosystem.Raise the engineering bar through technical leadership, establishing best practices.Contribute to open source projects; represent GM in relevant communities.Minimum Requirements5+ years of industry experience, with focus on machine learning systems or high performance backend services.Expertise in either Python, C++ or other relevant coding languages.Expertise in ML inference, model serving frameworks (triton, rayserve, vLLM etc).Strong communication skills and a proven ability to drive cross‑functional initiatives.Ability to thrive in a dynamic, multi‑tasking environment with ever‑evolving priorities.Preferred QualificationsDeep expertise building zero‑to‑one ML infrastructure platforms.Experience working with or designing interfaces, APIs and clients for ML workflows.Experience with Ray framework, and/or vLLM.Experience with distributed systems, and handling large‑scale data processing.Familiarity with telemetry, and other feedback loops to inform product improvements.Familiarity with hardware acceleration (GPUs) and optimizations for inference workloads.CompensationThe compensation information is a good faith estimate only. It is based on what a successful applicant might be paid in accordance with applicable state laws. The compensation may not be representative for positions located outside of New York, Colorado, California, or Washington.$155,420 to $395,900. The actual base salary a successful candidate will be offered within this range will vary based on factors relevant to the position.Bonus Potential: An incentive pay program offers payouts based on company performance, job level, and individual performance.Benefits: GM offers a variety of health and wellbeing benefit programs. Benefit options include medical, dental, vision, Health Savings Account, Flexible Spending Accounts, retirement savings plan, sickness and accident benefits, life insurance, paid vacation & holidays, tuition assistance programs, employee assistance program, GM vehicle discounts and more.RelocationThis job may be eligible for relocation benefits.Remote/HybridThis role is based remotely but if you live within a 50-mile radius of Mountain View, you are expected to report to that location three times a week, at minimum.#J-18808-Ljbffr

About the Company

G

General Motors

LEADING THE CHARGE INTO TOMORROW
AtGeneral Motors, we are passionate about earning customers for life. This vision unites us as a team each and every day and is the hallmark of our customer-driven culture. In fact, there are a lot of exciting things to share about our company.

Our story starts on November 18, 2010, when we completed the world’s largest initial public offering, emerging with a solid financial foundation that enables us to produce great vehicles for our customers and build a bright future for employees, partners and shareholders.Leading the way is our seasoned leadership team who set high standards for our company so that we can give you the best cars and trucks.
COMPANY SIZE
10,000 employees or more
INDUSTRY
Automotive Sales and Repair Services
FOUNDED
1908