Nuro IncSenior/Staff Machine Learning Research Scientist: Generative Models for Behavior Modeling Nuro IncSenior/Staff Machine Learning Research Scientist: Generative Models for Behavior Modelingmountain view, CA$202,350–$303,050 / yearYou have subject matter expertise and research in one or more of the following areas: scalable generative and diffusion models, generative model alignment with reward/cost models, controllability, distillation, mode recovery, fine-tuning with closed-loop RL, applications to autonomous driving and robotics, and other experiences such as video generation, text-to-image generation, in-painting/out-painting and so on. The first commercial application of the Nuro Driver is autonomous goods delivery with our custom, electric, zero-occupant vehicles in partnership with some of the most recognized brands in the world including Uber and FedEx.
Swish AnalyticsSenior Quantitative Researcher - Risk Modeling Swish AnalyticsSenior Quantitative Researcher - Risk ModelingSan Francisco, CaliforniaYou'll lead research initiatives that generate alpha and improve execution quality, mentor junior researchers, and collaborate closely with our Trading desk to translate quantitative insights into profitable systematic strategies while maintaining rigorous risk management. As we expand our presence on betting exchanges, we're building infrastructure and strategies akin to those found in traditional financial markets.
Advanced Micro Devices, IncNewSystem Performance Modeling Engineer Advanced Micro Devices, IncSystem Performance Modeling EngineerSanta Clara, CaliforniaWe are seeking a highly experienced Senior Modeling Engineer to join our engineering team focused on next-generation ASIC and system architecture development. The ideal candidate brings deep expertise in C/C++, SystemC modeling, and ARM fast models, along with strong debugging and cross-functional collaboration skills.
Spark Tek IncBSA - Anaplan/Pigment (Model Builders) [Pigment is very important, if you don t get it send with Ana Spark Tek IncBSA - Anaplan/Pigment (Model Builders) [Pigment is very important, if you don t get it send with AnaSan Jose, CAData Integration & Management: Develop, monitor, and troubleshoot data integration pipelines (e.g., Anaplan Connect, CloudWorks, or Pigment integrations) from ERPs/CRMs like NetSuite, Salesforce, or SAP. Model Building & Design: Configure and maintain Anaplan lists, modules, complex calculations, dashboards, and user experience (UX) pages, applying best practices (e.g., DISCO methodology) to ensure scalability.
Artech LLCNewSimulation and Modeling Engineer Artech LLCSimulation and Modeling EngineerCupertino, CA$95–$95.23 / hourWe are seeking a highly motivated CAE Engineer with expertise in non-linear static and dynamic Finite Element Analysis (FEA) to support the design and development of high-volume consumer electronic products. This role will focus on structural, thermomechanical, and heat transfer simulations while collaborating cross-functionally to drive robust, optimized designs from concept through production.
Tesla MotorsNewInternship, Robotics Modeling & Simulation Engineer, Optimus (Fall 2026) Tesla MotorsInternship, Robotics Modeling & Simulation Engineer, Optimus (Fall 2026)Palo Alto, CA$20–$55 / hourWelcome to Tesla's Optimus Robotics Team, the epicenter of innovation, where we're not just designing humanoid bi-pedal robots like the Tesla Bot; we're creating a revolution in automation for repetitive tasks. Join us in this thrilling journey, where you'll find yourself at the intersection of mechanical, electrical, controls, software, and manufacturing engineering disciplines, working collaboratively to drive the future of robotics.
Tesla IncCell Technology Modeling Engineer Tesla IncCell Technology Modeling Engineerpalo alto, CA$84,000–$204,000 / yearAlong with competitive pay, as a full-time Tesla employee, you are eligible for the following benefits at day 1 of hire: Aetna PPO and HSA plans > 2 medical plan options with $0 payroll deduction. For quick access to screen reading technology compatible with this site click here to download a free compatible screen reader (free step by step tutorial can be found here).
ArcherCell Modeling Engineer ArcherCell Modeling EngineerSan Jose, CA$144,000–$198,000 / yearAs a member of the Battery Cell Engineering team, you will be responsible for the development, validation, and maintenance of models that describe the electrical behavior of cells over their operational life. Archer is an aerospace company based in San Jose, California building an all-electric vertical takeoff and landing aircraft with a mission to advance the benefits of sustainable air mobility.
Tesla IncStaff Cell Thermal Modeling Engineer Tesla IncStaff Cell Thermal Modeling EngineerPalo Alto, CA$128,000–$228,000 / yearYou will act as a primary technical lead, providing high-level guidance to unblock the team on complex theoretical challenges while managing project timelines and resource allocation across multiple high-priority programs. Provide deep technical guidance to Senior Engineers to resolve roadblocks in numerical methods, code architecture, or complex thermal phenomena (e.g., hysteresis in non-ideal chemistries).
GoogleHead of Energy Risk Management and Grid Modeling GoogleHead of Energy Risk Management and Grid ModelingSan Francisco, CAFull timeFrom software to hardware our teams are shaping the future of world-leading hyperscale computing, with key teams working on the development of our TPUs, Vertex AI for Google Cloud, Google Global Networking, Data Center operations, systems research, and much more. We're the driving force behind Google's groundbreaking innovations, empowering the development of our cutting-edge AI models, delivering unparalleled computing power to global services, and providing the essential platforms that enable developers to build the future.
Tesla IncVehicle Dynamics Modeling Engineer Tesla IncVehicle Dynamics Modeling Engineerpalo alto, CA$84,000–$204,000 / yearUsing this powerful suite of in-house and commercial tools, we predict and shape the lateral, longitudinal, and vertical dynamic behaviors from the earliest architectural concepts to the final production tuning. Create and implement objective metrics to correlate simulation with real-world performance, designing and supporting vehicle tests for high-quality data collection.
Intelliswift Software IncBusiness Model Strategy Senior Product Manager Intelliswift Software IncBusiness Model Strategy Senior Product ManagerSan Jose, CADevelop and conduct qualitative customer research and complete quantitative market research including one or more of the following research methods: Build-Your-Own Conjoint, Discrete Choice Conjoint, Gabor-Granger. We are a small, highly leveraged team that defines monetization strategy for the entire company and this particular role will be passionate about opportunities within the Digital Experience business, cantered around Experience Cloud.
LER TechForceSummer Intern Connected Driving World Models LER TechForceSummer Intern Connected Driving World ModelsMountain View, CAOur Client is seeking a PhD student in Computer Science, Electrical Engineering, Mechanical Engineering, or a related engineering discipline to work on Connected Driving World Models. D. in Computer Science, Electrical Engineering, Mechanical Engineering, or a related engineering discipline with a focus on AI / machine learning.
GoogleSoftware Engineer III, AI/ML, Image Recommendation Modeling GoogleSoftware Engineer III, AI/ML, Image Recommendation ModelingMountain View, CAFull timeWe're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a part of the Ember team, you will help reimagine Google Images, a platform serving over a billion daily interactions, to build a queryless, hyper-personalized feed that fuels creativity worldwide.
quadric, IncData Scientist - Model Optimization quadric, IncData Scientist - Model OptimizationBurlingame, CA$110,000–$270,000Quadric's co-optimized software and hardware is targeted to run neural network (NN) inference workloads in a wide variety of edge and endpoint devices, ranging from battery operated smart-sensor systems to high-performance automotive or autonomous vehicle systems. The actual base salary offered will depend on a number of factors, including the specific level of the role, years and depth of relevant experience, technical skills and competencies, the criticality of the role to the business, internal equity, and work location.
PeopleNTech LLCData Scientist II Model Validation and Monitoring PeopleNTech LLCData Scientist II Model Validation and MonitoringSanta Clara, CA$65–$70 / hourEssential FunctionsLead model monitoring activities, including tracking performance metrics, detecting model and data drift, identifying data quality issues, providing root cause analysis, and recommending remediation strategies. This includes providing effective challenges to model development, conduct model monitoring and performance tracking, provide root cause analysis of model performance, exploring, building, validating, and deploying models.
GoogleStaff SWE, Compiler Architect, System Performance Modeling GoogleStaff SWE, Compiler Architect, System Performance ModelingSunnyvale, CAFull timeFrom software to hardware our teams are shaping the future of world-leading hyperscale computing, with key teams working on the development of our TPUs, Vertex AI for Google Cloud, Google Global Networking, Data Center operations, systems research, and much more. Note: By applying to this position you will have an opportunity to share your preferred working location from the following: Sunnyvale, CA, USA; Kirkland, WA, USA; New York, NY, USA; Seattle, WA, USA.Minimum qualifications: Bachelor's degree or equivalent practical experience.
Ursus, Inc.Business Model Strategy Senior Product Manager Ursus, Inc.Business Model Strategy Senior Product ManagerSan Jose, CA$90–$126.75 / hourDevelop and conduct qualitative customer research and complete quantitative market research, including one or more of the following research methods: Build-Your-Own Conjoint, Discrete Choice Conjoint, Gabor-Granger. We are a small, highly leveraged team that defines monetization strategy for the entire company, and this particular role will be passionate about opportunities within the Digital Experience business, centered around Experience Cloud.
Tesla IncVehicle Dynamics Modeling Engineer, Simulation Infrastructure Tesla IncVehicle Dynamics Modeling Engineer, Simulation Infrastructurepalo alto, CAThis role is a unique opportunity to merge your passion for vehicle dynamics, software development, and data science to directly impact products that accelerate the world's transition to sustainable energy. Apply statistical analysis and machine learning techniques to correlate simulation models with real-world performance data, identifying key performance indicators and areas for improvement.
NVIDIA CorpSenior Architecture Energy Modeling Engineer NVIDIA CorpSenior Architecture Energy Modeling Engineersanta clara, CAAs a member of the Power Modeling, Methodology and Analysis Team, you will collaborate with Architects, ASIC Design Engineers, Low Power Engineers, Performance Engineers, Software Engineers, and Physical Design teams to study and implement energy modeling techniques for NVIDIA's next generation GPUs, CPUs and Tegra SOCs. Our team is responsible for researching, developing, and deploying methodologies to help NVIDIA's products become more energy efficient; and is responsible for building energy models that integrate into architectural simulators, RTL simulation, emulation and silicon platforms.
Robert Bosch GmbHSenior AI Research Scientist- Time-Series Foundational Models Robert Bosch GmbHSenior AI Research Scientist- Time-Series Foundational ModelsSunnyvale, CA$165,000–$195,000 / yearD. in Computer Science, Electrical Engineering, Information Technology or a related discipline OR Masters degree with 2-3 years of preferred professional experience Expertise with Time Series FMs (beyond the time series task of forecasting) In-depth experience in signal processing for sensor data and their integration with deep-learning methods Proficiency in Python, PyTorch (including libraries such as torchaudio, torchvision, torchmetrics), familiarity with PyTorch Lightning A strong publication record in relevant venues such as ICASSP, NeurIPS, InterSpeech, ICML, ICLR, KDD, ICRA, CVPR, ICCV, ECCV or equivalent contributions to the field such as patents or significant open-source projects Strong interpersonal, communication, and teamwork capabilities. Preferred Qualifications 3+ years of experience in industrial research Experience with one or more of the following areas: data-centric AI, synthetic data generation, agentic AI Proficiency with version control systems (Git), integrated development environment (VSCode or PyCharm) and experience with experiment tracking tools (MLFlow) Familiarity with high-performance computing systems and job schedulers (Slurm, LSF) Hands-on experience in product development in the above-mentioned areas for consumer/enterprise markets Experience leading projects with small teams, demonstrating the ability to mentor junior researchers and interns, manage project timelines, and deliver results within time constraints.
Unity SoftwareSenior Machine Learning Engineer, Vector Core Modeling Unity SoftwareSenior Machine Learning Engineer, Vector Core ModelingMountain View, CAWhile specific benefits vary, here are some of the ways we strive to take care of our eligible team members globally: Comprehensive health, life, and disability insurance | Commute subsidy | Employee stock ownership | Competitive retirement/pension plans | Generous vacation and personal days | Support for new parents through leave and family-care programs | Office food snacks | Mental Health and Wellbeing programs and support | Employee Resource Groups | Global Employee Assistance Program | Training and development programs | Volunteering and donation matching program. These models serve as the core intelligence behind delivering the right ad to the right user at the right time, maximizing both user experience and advertiser outcomes.
Unity SoftwareMachine Learning Engineer, Vector Core Modeling Unity SoftwareMachine Learning Engineer, Vector Core ModelingMountain View, CAWhile specific benefits vary, here are some of the ways we strive to take care of our eligible team members globally: Comprehensive health, life, and disability insurance | Commute subsidy | Employee stock ownership | Competitive retirement/pension plans | Generous vacation and personal days | Support for new parents through leave and family-care programs | Office food snacks | Mental Health and Wellbeing programs and support | Employee Resource Groups | Global Employee Assistance Program | Training and development programs | Volunteering and donation matching program. These models serve as the core intelligence behind delivering the right ad to the right user at the right time, maximizing both user experience and advertiser outcomes.
Unity SoftwareStaff Machine Learning Engineer, Vector Core Modeling Unity SoftwareStaff Machine Learning Engineer, Vector Core ModelingMountain View, CAWhile specific benefits vary, here are some of the ways we strive to take care of our eligible team members globally: Comprehensive health, life, and disability insurance | Commute subsidy | Employee stock ownership | Competitive retirement/pension plans | Generous vacation and personal days | Support for new parents through leave and family-care programs | Office food snacks | Mental Health and Wellbeing programs and support | Employee Resource Groups | Global Employee Assistance Program | Training and development programs | Volunteering and donation matching program. These models serve as the core intelligence behind delivering the right ad to the right user at the right time, maximizing both user experience and advertiser outcomes.
Tesla IncSoftware Engineer, Metrics, GenAI Model Evaluation Tesla IncSoftware Engineer, Metrics, GenAI Model Evaluationpalo alto, CA$132,000–$390,000 / yearWorking alongside our AI team, you will design metrics that utilize fleet data and run on large inference clusters to help drive key decisions about end-to-end model architecture, data integrity, and exported model performance. For quick access to screen reading technology compatible with this site click here to download a free compatible screen reader (free step by step tutorial can be found here).
GoogleProduct Manager, Search Ads Quality Modeling GoogleProduct Manager, Search Ads Quality ModelingMountain View, CAFull timeAdvertisers worldwide use Google Ads to promote their products; publishers use AdSense to serve relevant ads on their website; and business around the world use our products (like Google Shopping, and Google Wallet) to support their online businesses and bring users into their offline stores. As a Product Manager in the Ad Relevance and User Quality team, you will play a key role in building the ML models that are responsible for showing useful ads to users across Google Search and AI Mode.
GoogleHardware Architecture Modeling Engineer, PhD, University Graduate GoogleHardware Architecture Modeling Engineer, PhD, University GraduateSunnyvale, CAFull timeFrom software to hardware our teams are shaping the future of world-leading hyperscale computing, with key teams working on the development of our TPUs, Vertex AI for Google Cloud, Google Global Networking, Data Center operations, systems research, and much more. We're the driving force behind Google's groundbreaking innovations, empowering the development of our cutting-edge AI models, delivering unparalleled computing power to global services, and providing the essential platforms that enable developers to build the future.
Varite, IncNewBusiness Model Strategy Senior Product Manager Varite, IncBusiness Model Strategy Senior Product ManagerSan Jose, CA$115–$126.76 / hourDevelop and conduct qualitative customer research and complete quantitative market research including one or more of the following research methods: Build-Your-Own Conjoint, Discrete Choice Conjoint, Gabor-Granger. Duties: Job Responsibilities: We are a small, highly leveraged team that defines monetization strategy for the entire company, and this role will be passionate about opportunities within the Digital Experience business, centered around Experience Cloud.
d-MatrixNewPrincipal Architect, Performance Analysis and Modeling d-MatrixPrincipal Architect, Performance Analysis and ModelingSanta Clara, CAYour day-to-day work will include (1) analyzing the properties of emerging machine learning algorithms and workloads and identifying functional, performance implications (2) Creating analytical models to project performance on current and future generations of d-matrix hardware (3) proposing new HW/SW features to enable or accelerate these algorithms . Experience with developing analytical performance models, architecture simulators for performance analysis, Research background with publication record in top-tier architecture, or machine learning venues is a huge plus (such as ISCA, MICRO, ASPLOS, HPCA, DAC, MLSys etc.).
quadric, IncData Scientist, New Grad - Model Optimization quadric, IncData Scientist, New Grad - Model OptimizationBurlingame, CA$120,000–$160,000Quadric's co-optimized software and hardware is targeted to run neural network (NN) inference workloads in a wide variety of edge and endpoint devices, ranging from battery operated smart-sensor systems to high-performance automotive or autonomous vehicle systems. The actual base salary offered will depend on a number of factors, including the specific level of the role, years and depth of relevant experience, technical skills and competencies, the criticality of the role to the business, internal equity, and work location.
GoogleStaff Software Engineer, Applied Research, Foundation User Models GoogleStaff Software Engineer, Applied Research, Foundation User ModelsMountain View, CAFull timeWe're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. From software to hardware our teams are shaping the future of world-leading hyperscale computing, with key teams working on the development of our TPUs, Vertex AI for Google Cloud, Google Global Networking, Data Center operations, systems research, and much more.
Crusoe Energy Systems LLCStaff Software Engineer, Model LifeCycle Crusoe Energy Systems LLCStaff Software Engineer, Model LifeCycleSan Francisco, CA$204,000–$247,000 / yearAbout this role: The Staff Software Engineer for the Model LifeCycle team will play a key role in building a comprehensive managed platform for the entire application development lifecycle, with a specific focus on leveraging Machine Learning models, including Large Language Models (LLMs). Collaboration and impact: Work closely with Principal Engineers, product, business, and platform teams to implement the core abstractions and APIs of the system.
Advanced Micro Devices, IncDirector Software Development, AI Models and Research Advanced Micro Devices, IncDirector Software Development, AI Models and ResearchSan Jose, CaliforniaWe push the limits of innovation to solve the world’s most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary.
Amazon.com IncPre-Silicon SoC Modeling Engineer, Annapurna Labs Machine Learning Accelerators, AWS Amazon.com IncPre-Silicon SoC Modeling Engineer, Annapurna Labs Machine Learning Accelerators, AWSCupertino, CAWhat youll do: - Build and own models of SoC subsystems - translating architecture specs and RTL behavior into accurate, testable C++ models - Work directly with RTL design and verification teams to validate model behavior against RTL, debug discrepancies, and support pre-silicon verification flows - Develop model-based test infrastructure: regression suites, RTL correlation checks, and coverage-driven testing - Contribute to performance modeling efforts - building cycle-approximate models that help architects evaluate design trade-offs before RTL exists - Improve modeling methodology and infrastructure: how models are structured, integrated, tested, and released to DV and architecture teams - Collaborate with chip architects to understand upcoming designs and plan modeling work ahead of RTL availability Why this role is interesting: - Your models are used to verify silicon before its built - bugs you catch save months of schedule and millions of dollars - Youll work at the intersection of software engineering and chip design, with deep visibility into how custom ML accelerators are architected - As the team scales, theres a clear path into architectural modeling - using your models to influence chip design decisions, not just validate them - Small team, high ownership, direct impact on AWSs most strategic silicon programs You will thrive in this role if you: - Have built functional or performance models of SoCs, ASICs, GPUs, CPUs, or IP blocks - Are comfortable working with architectural / design specifications or reference implementations and translating them into C++ or SystemC models - Understand verification concepts and have worked with DV teams or in pre-silicon validation environments - Care about model fidelity and have experience correlating models against RTL or silicon - Are interested in expanding into architectural performance modeling as the team grows - Enjoy working on a small, high-impact team where you own significant pieces of the stack No ML background needed. Our team builds C++ models of these custom SoCs that RTL designers, verification engineers, and software teams depend on throughout the silicon development lifecycle.
Amazon.com IncSenior Pre-Silicon SoC Modeling Engineer, Annapurna Labs Machine Learning Accelerators, AWS Amazon.com IncSenior Pre-Silicon SoC Modeling Engineer, Annapurna Labs Machine Learning Accelerators, AWSCupertino, CAOur team builds C++ models of these custom SoCs that RTL designers, verification engineers, and software teams depend on throughout the silicon development lifecycle. Contribute to performance modeling efforts - building cycle-approximate models that help architects evaluate design trade-offs before RTL exists.
quadric, IncData Science Intern - Model Optimization quadric, IncData Science Intern - Model OptimizationBurlingame, CAQuadric's co-optimized software and hardware is targeted to run neural network (NN) inference workloads in a wide variety of edge and endpoint devices, ranging from battery operated smart-sensor systems to high-performance automotive or autonomous vehicle systems. Unlike other NPUs or neural network accelerators in the industry today that can only accelerate a portion of a machine learning graph, the Quadric GPNPU executes both NN graph code and conventional C++ DSP and control code.
NVIDIA CorpTechnical Marketing Engineer, World Models - AV Physical AI NVIDIA CorpTechnical Marketing Engineer, World Models - AV Physical AISanta Clara, CAPrototype and iterate rapidly on experiments across powerful AI domains, including agentic systems, reinforcement learning, reasoning, and video generation in partnership with customer/partner teams. We are seeking a hardworking and technically skilled engineer to join our Physical AI Technical Marketing team, focusing on building world-class technical materials for world models within various industries.
SoundThinking IncSr. Data Engineer - Ontology & Semantic Modeling SoundThinking IncSr. Data Engineer - Ontology & Semantic ModelingFremont, CAData Engineer (Ontology & Semantic Modeling) to design scalable data pipelines, contribute to an ontology-driven semantic layer, and help improve database schema and performance across our platform. This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation and training.\n \nSoundThinking expressly prohibits any form of workplace harassment based on race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.
Tesla MotorsNewSoftware Engineer, Model Hardware CoDesign Tesla MotorsSoftware Engineer, Model Hardware CoDesignPalo Alto, CA$140,000–$390,000 / yearExpected Compensation $140,000 - $390,000/annual salary + cash and stock awards + benefits Pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. What You'll DoDesign, train, and iterate on neural network architectures for autonomous driving and robotics, with a focus on efficiency-aware model design (architecture search, distillation, pruning, quantization-aware training) .
Databricks IncStaff Engineer, Model Serving Databricks IncStaff Engineer, Model Servingsan francisco, CAContribute directly to key components across the serving infrastructure \u2014 from model container builds and deployment workflows to runtime systems like routing, caching, observability, and intelligent autoscaling \u2014 ensuring smooth and efficient operations at scale. You will design and build systems that enable high-throughput, low-latency inference across CPU and GPU workloads, influence architectural direction, and collaborate closely across platform, product, infrastructure, and research teams to deliver a world-class serving platform.
Waymo LLCSenior Staff Software Engineer, Model Post Training Waymo LLCSenior Staff Software Engineer, Model Post TrainingMountain View, CA$81,000–$356,000 / yearYou will provide technical leadership to influence senior engineers and researchers across ML infra and data teams, raising the technical bar for how Waymo trains, evaluates, and deploys LLM models in the autonomous driving technical stack. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver-The Worlds Most Experienced Driver-to improve access to mobility while saving thousands of lives now lost to traffic crashes.
Baseten Labs IncSoftware Engineer - Model API''s Baseten Labs IncSoftware Engineer - Model API''sSan FranciscoRESPONSIBILITIES:Design, build, and operate the Model APIs surface with focus on advanced inference capabilities: structured outputs (JSON mode, grammar-constrained generation), tool/function calling and multi-modal servingProfile and optimize TensorRT-LLM kernels, analyze CUDA kernel performance, implement custom CUDA operators, tune memory allocation patterns for maximum throughput and optimize communication patterns across multi-GPU setupsProductionize performance improvements across runtimes with deep understanding of their internals: speculative decoding implementations, guided generation for structured outputs, custom scheduling and routing algorithms for high-performance servingBuild comprehensive benchmarking frameworks that measure real-world performance across different model architectures, batch sizes, sequence lengths, and hardware configurationsProductionize performance improvements across runtimes (e.g. NICE TO HAVE:Experience with LLM runtimes (vLLM, SGLang, TensorRT‑LLM) or contributions to open-source inference engines (vLLM, TensorRT-LLM, SGLang, TGI)Knowledge of Kubernetes, service meshes, API gateways, or distributed scheduling.
Databricks IncSenior Engineer, Model Serving Databricks IncSenior Engineer, Model Servingsan francisco, CAContribute directly to key components across the serving infrastructure \u2014 from model container builds and deployment workflows to runtime systems like routing, caching, observability, and intelligent autoscaling \u2014 ensuring smooth and efficient operations at scale. You will design and build systems that enable high-throughput, low-latency inference across CPU and GPU workloads, influence architectural direction, and collaborate closely across platform, product, infrastructure, and research teams to deliver a world-class serving platform.
inferenceNewSenior Software Engineer - Model Performance inferenceSenior Software Engineer - Model PerformanceSan Francisco, CA$220,000–$320,000 / yearYour work spans from implementing known optimization techniques to experimenting with novel approaches, always with the goal of serving models faster and cheaper at scale. If you love squeezing every last drop of performance out of GPUs, diving deep into CUDA kernels, and turning optimization techniques into production systems, we'd love to meet you.
PathwayMachine Learning Researcher / Engineer (Foundational Models) PathwayMachine Learning Researcher / Engineer (Foundational Models)Palo Alto, CARemote€100,000Pathway is led by co-founder & CEO Zuzanna Stamirowska, a complexity scientist who created a team consisting of AI pioneers, including CTO Jan Chorowski who was the first person to apply Attention to speech and worked with Nobel laureate Geoff Hinton at Google Brain, as well as CSO Adrian Kosowski, a leading computer scientist and quantum physicist who obtained his PhD at the age of 20. The company is backed by leading investors and advisors, including Lukasz Kaiser, co-author of the Transformer (“the T” in ChatGPT) and a key researcher behind OpenAI’s reasoning models.
Tenstorrent IncC++/Machine Learning Engineer, AI Models Training Tenstorrent IncC++/Machine Learning Engineer, AI Models Trainingsanta clara, CAThe ideal candidate will not only possess strong technical skills but also exhibit a deep curiosity, investigative mindset, and a passion for staying up-to-date with the latest advancements in the AI landscape. Understand how models map onto the Tenstorrent devices through compilation steps and kernels, investigate any gaps in functionality and performance, propose innovative solutions to mitigate any issues.
Advanced Micro Devices IncSr. Manager Software Development, AI Models and Applications Advanced Micro Devices IncSr. Manager Software Development, AI Models and ApplicationsSan Jose, CAStrong technical expertise in Gen AI model training and inference, and familiarity working with deep learning frameworks like Pytorch/JAX/vLLM/SGLang • Strong technical expertise in algorithmic innovation towards efficient Gen AI application for both training and inferencing; Expertise/publications in one of the areas preferred - efficient model architectures, optimized training, innovative parallelism strategies, low-precision training, model quantization • Additional plus if publications include conferences such as NeuRIPS, CVPR, ECCV/ICCV, ICML, ICLR, etc. The ideal candidate has deep technical understanding of the latest generative AI applications like large language models (LLMs), large multimodal models (LMMs), image/video generation and is passionate about innovating efficient approaches to enable on AMD devices.
Advanced Micro Devices, IncSr. Staff Engineer, AI Models and Applications Advanced Micro Devices, IncSr. Staff Engineer, AI Models and ApplicationsSan Jose, CaliforniaThe ideal candidate has deep technical understanding of the latest generative AI applications like large language models (LLMs), large multimodal models (LMMs), image/video generation, has experience training models at scale and is passionate about innovating efficient approaches to enable distributed training and inference at scale on AMD devices. Impactful Work: Your contributions will directly influence how cutting-edge gen AI models across the industry are efficiently trained at scale as well as inferencing deployed to serve millions of customers, making a significant difference in various industries and applications.
Microsoft CorpPrincipal Software Engineer - CoreAI Model Inference & Serving Microsoft CorpPrincipal Software Engineer - CoreAI Model Inference & ServingMountain View, CA$139,900–$274,800 / yearThere is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $188,000 - $304,200 per year. Join our team within CoreAI, where we are building the AI data-plane that powers all LLM inferencing workloads across Microsoft and Azure customers-from cutting-edge startups to Fortune 500 enterprises.
Microsoft CorpSenior Software Engineer - CoreAI Model Inference & Serving Microsoft CorpSenior Software Engineer - CoreAI Model Inference & ServingMountain View, CA$119,800–$234,700 / yearThere is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $158,400 - $258,000 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $188,000 - $304,200 per year.