96 Results for

Model Jobs in Daly City, CA

Jobs

mountain view, CA

$202,350–$303,050 / year

You have subject matter expertise and research in one or more of the following areas: scalable generative and diffusion models, generative model alignment with reward/cost models, controllability, distillation, mode recovery, fine-tuning with closed-loop RL, applications to autonomous driving and robotics, and other experiences such as video generation, text-to-image generation, in-painting/out-painting and so on. The first commercial application of the Nuro Driver is autonomous goods delivery with our custom, electric, zero-occupant vehicles in partnership with some of the most recognized brands in the world including Uber and FedEx.

30+ days ago

palo alto, CA

$84,000–$204,000 / year

Along with competitive pay, as a full-time Tesla employee, you are eligible for the following benefits at day 1 of hire: Aetna PPO and HSA plans > 2 medical plan options with $0 payroll deduction. For quick access to screen reading technology compatible with this site click here to download a free compatible screen reader (free step by step tutorial can be found here).

30+ days ago

San Francisco, California

You'll lead research initiatives that generate alpha and improve execution quality, mentor junior researchers, and collaborate closely with our Trading desk to translate quantitative insights into profitable systematic strategies while maintaining rigorous risk management. As we expand our presence on betting exchanges, we're building infrastructure and strategies akin to those found in traditional financial markets.

30+ days ago

Palo Alto, CA

$128,000–$228,000 / year

You will act as a primary technical lead, providing high-level guidance to unblock the team on complex theoretical challenges while managing project timelines and resource allocation across multiple high-priority programs. Provide deep technical guidance to Senior Engineers to resolve roadblocks in numerical methods, code architecture, or complex thermal phenomena (e.g., hysteresis in non-ideal chemistries).

30+ days ago

palo alto, CA

This role is a unique opportunity to merge your passion for vehicle dynamics, software development, and data science to directly impact products that accelerate the world's transition to sustainable energy. Apply statistical analysis and machine learning techniques to correlate simulation models with real-world performance data, identifying key performance indicators and areas for improvement.

30+ days ago

palo alto, CA

$132,000–$390,000 / year

Working alongside our AI team, you will design metrics that utilize fleet data and run on large inference clusters to help drive key decisions about end-to-end model architecture, data integrity, and exported model performance. For quick access to screen reading technology compatible with this site click here to download a free compatible screen reader (free step by step tutorial can be found here).

30+ days ago

San Francisco, CA

Full time

From software to hardware our teams are shaping the future of world-leading hyperscale computing, with key teams working on the development of our TPUs, Vertex AI for Google Cloud, Google Global Networking, Data Center operations, systems research, and much more. We're the driving force behind Google's groundbreaking innovations, empowering the development of our cutting-edge AI models, delivering unparalleled computing power to global services, and providing the essential platforms that enable developers to build the future.

16 days ago

San Jose, California

We push the limits of innovation to solve the world’s most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary.

30+ days ago

Mountain View, CA

While specific benefits vary, here are some of the ways we strive to take care of our eligible team members globally: Comprehensive health, life, and disability insurance | Commute subsidy | Employee stock ownership | Competitive retirement/pension plans | Generous vacation and personal days | Support for new parents through leave and family-care programs | Office food snacks | Mental Health and Wellbeing programs and support | Employee Resource Groups | Global Employee Assistance Program | Training and development programs | Volunteering and donation matching program. These models serve as the core intelligence behind delivering the right ad to the right user at the right time, maximizing both user experience and advertiser outcomes.

8 days ago

Mountain View, CA

8 days ago

Mountain View, CA

8 days ago

Santa Clara, CA

Your day-to-day work will include (1) analyzing the properties of emerging machine learning algorithms and workloads and identifying functional, performance implications (2) Creating analytical models to project performance on current and future generations of d-matrix hardware (3) proposing new HW/SW features to enable or accelerate these algorithms . Experience with developing analytical performance models, architecture simulators for performance analysis, Research background with publication record in top-tier architecture, or machine learning venues is a huge plus (such as ISCA, MICRO, ASPLOS, HPCA, DAC, MLSys etc.).

1 day ago

Mountain View, CA

Full time

We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a part of the Ember team, you will help reimagine Google Images, a platform serving over a billion daily interactions, to build a queryless, hyper-personalized feed that fuels creativity worldwide.

10 days ago

Cupertino, CA

What youll do: - Build and own models of SoC subsystems - translating architecture specs and RTL behavior into accurate, testable C++ models - Work directly with RTL design and verification teams to validate model behavior against RTL, debug discrepancies, and support pre-silicon verification flows - Develop model-based test infrastructure: regression suites, RTL correlation checks, and coverage-driven testing - Contribute to performance modeling efforts - building cycle-approximate models that help architects evaluate design trade-offs before RTL exists - Improve modeling methodology and infrastructure: how models are structured, integrated, tested, and released to DV and architecture teams - Collaborate with chip architects to understand upcoming designs and plan modeling work ahead of RTL availability Why this role is interesting: - Your models are used to verify silicon before its built - bugs you catch save months of schedule and millions of dollars - Youll work at the intersection of software engineering and chip design, with deep visibility into how custom ML accelerators are architected - As the team scales, theres a clear path into architectural modeling - using your models to influence chip design decisions, not just validate them - Small team, high ownership, direct impact on AWSs most strategic silicon programs You will thrive in this role if you: - Have built functional or performance models of SoCs, ASICs, GPUs, CPUs, or IP blocks - Are comfortable working with architectural / design specifications or reference implementations and translating them into C++ or SystemC models - Understand verification concepts and have worked with DV teams or in pre-silicon validation environments - Care about model fidelity and have experience correlating models against RTL or silicon - Are interested in expanding into architectural performance modeling as the team grows - Enjoy working on a small, high-impact team where you own significant pieces of the stack No ML background needed. Our team builds C++ models of these custom SoCs that RTL designers, verification engineers, and software teams depend on throughout the silicon development lifecycle.

30+ days ago

Cupertino, CA

Our team builds C++ models of these custom SoCs that RTL designers, verification engineers, and software teams depend on throughout the silicon development lifecycle. Contribute to performance modeling efforts - building cycle-approximate models that help architects evaluate design trade-offs before RTL exists.

30+ days ago

Sunnyvale, CA

Full time

From software to hardware our teams are shaping the future of world-leading hyperscale computing, with key teams working on the development of our TPUs, Vertex AI for Google Cloud, Google Global Networking, Data Center operations, systems research, and much more. Note: By applying to this position you will have an opportunity to share your preferred working location from the following: Sunnyvale, CA, USA; Kirkland, WA, USA; New York, NY, USA; Seattle, WA, USA.Minimum qualifications: Bachelor's degree or equivalent practical experience.

30 days ago

Santa Clara, CA

$65–$70 / hour

Essential FunctionsLead model monitoring activities, including tracking performance metrics, detecting model and data drift, identifying data quality issues, providing root cause analysis, and recommending remediation strategies. This includes providing effective challenges to model development, conduct model monitoring and performance tracking, provide root cause analysis of model performance, exploring, building, validating, and deploying models.

30+ days ago

San Jose, California

The ideal candidate has deep technical understanding of the latest generative AI applications like large language models (LLMs), large multimodal models (LMMs), image/video generation, has experience training models at scale and is passionate about innovating efficient approaches to enable distributed training and inference at scale on AMD devices. Impactful Work: Your contributions will directly influence how cutting-edge gen AI models across the industry are efficiently trained at scale as well as inferencing deployed to serve millions of customers, making a significant difference in various industries and applications.

30+ days ago

San Jose, California

30+ days ago

Mountain View, CA

Our Client is seeking a PhD student in Computer Science, Electrical Engineering, Mechanical Engineering, or a related engineering discipline to work on Connected Driving World Models. D. in Computer Science, Electrical Engineering, Mechanical Engineering, or a related engineering discipline with a focus on AI / machine learning.

8 days ago

Mountain View, CA

$43.59–$43.59

Position: CW Summer Intern Connected Driving World Models Location: Mountain View, CA 94043 Duration: 3+ Months Job Type: Temporary Assignment Work Type: Onsite Job Description Client is seeking a PhD student in Computer Science, Electrical Engineering, Mechanical Engineering, or a related engineering discipline to work on Connected Driving World Models. Overview: TekWissen is a global workforce management provider headquartered in Ann Arbor, Michigan that offers strategic talent solutions to our clients worldwide.

8 days ago

Mountain View, CA

Full time

Advertisers worldwide use Google Ads to promote their products; publishers use AdSense to serve relevant ads on their website; and business around the world use our products (like Google Shopping, and Google Wallet) to support their online businesses and bring users into their offline stores. As a Product Manager in the Ad Relevance and User Quality team, you will play a key role in building the ML models that are responsible for showing useful ads to users across Google Search and AI Mode.

8 days ago

Sunnyvale, CA

Full time

30+ days ago

Palo Alto, CA

$180,000–$440,000 / year

Domain expertise in multimodal applications such as graphics engines, rendering techniques, image/video understanding and generation, world models, real-time simulation, or controllable/long-horizon visual content creation (audio/speech processing or music/audio generation experience is a plus where it supports video). ABOUT THE ROLE: As a multimodal engineer on the Imagine Model Team, you will develop cutting-edge AI experiences beyond text, with a strong focus on enabling high-fidelity understanding and generation across image and video modalities, while also incorporating audio where it enhances visual content (e.g., synchronized audio for video).

30+ days ago

San Francisco, CA

The Role: As a Research Engineer - Brain Computer Interface Models , you will be a core contributor to Zyphra's BCI work, building the next generation of open-source EEG and brain–computer interface models. You will be involved across the full model lifecycle, from data collection and preprocessing to designing novel architectures and training methodologies.

1 day ago

San Francisco, CA

$120,000–$170,000 / year

As a Product Manager on our Hive Models team, you will work cross-functionally with all stakeholders to define product requirements and see the implementation through to completion, leading development efforts between our Machine Learning, Core Infrastructure, and Product teams. Own our Hive Models portfolio, understanding the needs of the product development teams, and envisioning and executing the creation of our deep learning models that can provide human-like interpretation of video, image, audio and text.

30+ days ago

Burlingame, CA

$110,000–$270,000

Quadric's co-optimized software and hardware is targeted to run neural network (NN) inference workloads in a wide variety of edge and endpoint devices, ranging from battery operated smart-sensor systems to high-performance automotive or autonomous vehicle systems. The actual base salary offered will depend on a number of factors, including the specific level of the role, years and depth of relevant experience, technical skills and competencies, the criticality of the role to the business, internal equity, and work location.

30+ days ago

san leandro, CA

$100–$150 / hour

To ensure accurate garment fitting and proportion alignment for our product development process, photos should be taken in fitted, non-branded clothing (e.g., tank top and leggings or similar). In the event a recruiter or agency submits a resume or candidate without a previously signed Agreement, Ariat explicitly reserves the right to pursue and hire those candidate(s) without any financial obligation to the recruiter or agency.

30+ days ago

Mountain View, CA

Full time

We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. From software to hardware our teams are shaping the future of world-leading hyperscale computing, with key teams working on the development of our TPUs, Vertex AI for Google Cloud, Google Global Networking, Data Center operations, systems research, and much more.

22 days ago

Redwood City, CA

Together, these groups are responsible for transforming powerful pretrained language models into intelligent, engaging, safely aligned, and highly scalable products-working across data, compute, algorithms, infrastructure, and user insights to improve model performance and ensure reliable delivery. Program ownership: Lead planning and execution of cross-functional programs spanning data collection, annotation pipelines, alignment workflows (RLHF, DPO, Constitutional AI), safety guardrails (adversarial testing, red-teaming), and model serving.

1 day ago

San Francisco, CA

$204,000–$247,000 / year

About this role: The Staff Software Engineer for the Model LifeCycle team will play a key role in building a comprehensive managed platform for the entire application development lifecycle, with a specific focus on leveraging Machine Learning models, including Large Language Models (LLMs). Collaboration and impact: Work closely with Principal Engineers, product, business, and platform teams to implement the core abstractions and APIs of the system.

30+ days ago

Burlingame, CA

$120,000–$160,000

15 days ago

Fremont, CA

$177,000–$230,000 / year

Our team is differentiated by its expertise in imagining, engineering, and delivering robots with advanced mobility, dexterity, intelligence, and efficiency -- robots specifically designed to work alongside people, in spaces built for people. Anticipated Base Salary Range$177,000—$230,000 USDIn addition to base pay, our competitive total rewards package consists of the following for full-time employees:

30+ days ago

Palo Alto, CA

$140,000–$390,000 / year

Expected Compensation $140,000 - $390,000/annual salary + cash and stock awards + benefits Pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. What You'll DoDesign, train, and iterate on neural network architectures for autonomous driving and robotics, with a focus on efficiency-aware model design (architecture search, distillation, pruning, quantization-aware training) .

1 day ago

San Francisco, CA

$220,000–$320,000 / year

Your work spans from implementing known optimization techniques to experimenting with novel approaches, always with the goal of serving models faster and cheaper at scale. If you love squeezing every last drop of performance out of GPUs, diving deep into CUDA kernels, and turning optimization techniques into production systems, we'd love to meet you.

1 day ago

Palo Alto, CA

Remote

€100,000

Pathway is led by co-founder & CEO Zuzanna Stamirowska, a complexity scientist who created a team consisting of AI pioneers, including CTO Jan Chorowski who was the first person to apply Attention to speech and worked with Nobel laureate Geoff Hinton at Google Brain, as well as CSO Adrian Kosowski, a leading computer scientist and quantum physicist who obtained his PhD at the age of 20. The company is backed by leading investors and advisors, including Lukasz Kaiser, co-author of the Transformer (“the T” in ChatGPT) and a key researcher behind OpenAI’s reasoning models.

30+ days ago

san francisco, CA

Contribute directly to key components across the serving infrastructure \u2014 from model container builds and deployment workflows to runtime systems like routing, caching, observability, and intelligent autoscaling \u2014 ensuring smooth and efficient operations at scale. You will design and build systems that enable high-throughput, low-latency inference across CPU and GPU workloads, influence architectural direction, and collaborate closely across platform, product, infrastructure, and research teams to deliver a world-class serving platform.

30+ days ago

san francisco, CA

30+ days ago

San Francisco

RESPONSIBILITIES:Design, build, and operate the Model APIs surface with focus on advanced inference capabilities: structured outputs (JSON mode, grammar-constrained generation), tool/function calling and multi-modal servingProfile and optimize TensorRT-LLM kernels, analyze CUDA kernel performance, implement custom CUDA operators, tune memory allocation patterns for maximum throughput and optimize communication patterns across multi-GPU setupsProductionize performance improvements across runtimes with deep understanding of their internals: speculative decoding implementations, guided generation for structured outputs, custom scheduling and routing algorithms for high-performance servingBuild comprehensive benchmarking frameworks that measure real-world performance across different model architectures, batch sizes, sequence lengths, and hardware configurationsProductionize performance improvements across runtimes (e.g. NICE TO HAVE:Experience with LLM runtimes (vLLM, SGLang, TensorRT‑LLM) or contributions to open-source inference engines (vLLM, TensorRT-LLM, SGLang, TGI)Knowledge of Kubernetes, service meshes, API gateways, or distributed scheduling.

30+ days ago

santa clara, CA

Ways to stand out from the crowd: Master's or PhD's degree in Computer Science, Robotics, Engineering, or a related field; Demonstrated Tech Lead experience, coordinating a team of engineers and driving projects from conception to deployment; Strong experience at building large-scale LLM and multimodal LLM training infrastructure; Contributions to popular open-source AI frameworks or research publications in top-tier AI conferences, such as NeurIPS, ICRA, ICLR, CoRL. What we need to see: Bachelor's degree in Computer Science, Robotics, Engineering, or a related field; 10+ years of full-time industry experience in large-scale MLOps and AI infrastructure; Proven experience designing and optimizing distributed training systems with frameworks like PyTorch, JAX, or TensorFlow.

30+ days ago

San Francisco, California

Remote

$15–$20 / hour

Generate high-quality human evaluation data by identifying response strengths, areas for improvement, and factual inaccuracies. For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome.

1 day ago

San Francisco, California

Remote

$15–$20 / hour

1 day ago

San Francisco

You may be a good fit for the Model Efficiency team if you have:5+ years of experience writing high-performance, production-quality codeStrong programming skills in C++ or Python (Rust/Go also welcome)Experience working with large language models and familiarity with the LLM inference ecosystem (e.g., vLLM, SGLang, etc.)Ability to diagnose and resolve performance bottlenecks across the model execution stackA strong bias for action - you ship fast, measure impact, and iterateIt's a big plus if you have experience with:GPU programming, CUDA, or low-level systems optimizationLanguage modeling with transformers (MoE, speculative decoding, KV-cache optimizations)Scaling performance-critical distributed systems (e.g., computation, search, storage)If some of the above doesn't line up perfectly with your experience, we still encourage you to apply! Full-Time Employees at Cohere enjoy these Perks: An open and inclusive culture and work environment ‍ Work closely with a team on the cutting edge of AI research Weekly lunch stipend, in-office lunches & snacks Full health and dental benefits, including a separate budget to take care of your mental health 100% Parental Leave top-up for up to 6 months Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend️ 6 weeks of vacation (30 working days!).

30+ days ago

San Francisco, CA

In addition to your customers, network engineers, you'll partner closely with two research engineers who have deep ML backgrounds and a clear picture of what training data needs to look like. When a network engineer looks at a set of device stats and figures out it's upstream packet loss - not a hardware failure, not a misconfiguration, specifically upstream packet loss - that reasoning lives in their head.

1 day ago

Burlingame, CA

Quadric's co-optimized software and hardware is targeted to run neural network (NN) inference workloads in a wide variety of edge and endpoint devices, ranging from battery operated smart-sensor systems to high-performance automotive or autonomous vehicle systems. Unlike other NPUs or neural network accelerators in the industry today that can only accelerate a portion of a machine learning graph, the Quadric GPNPU executes both NN graph code and conventional C++ DSP and control code.

23 days ago

San Mateo, CA

$295,250–$345,040 / year

Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences– all created by our global community of developers and creators. A career at Roblox means you’ll be working to shape the future of human interaction, solving unique technical challenges at scale, and helping to create safer, more civil shared experiences for everyone.

29 days ago

San Francisco, CA

$250,000–$300,000 / year

Our platform powers multi-tenant server-less workloads and dedicated endpoints, enabling developers, enterprises, and researchers to harness the latest LLMs, multimodal models, image, audio, video, and reasoning models at scale. You will be in charge of designing and scaling our ML processes & tooling at production scale – optimizing operations to ensure availability and reliability for our services, across differing tenants and user loads, and in a multi-cluster deployment.

30+ days ago

Foster City, CA

We are looking for experts with hands-on experience in compressing, accelerating, and deploying complex models (LLMs, VLMs, or FMs) for power- and thermal-constrained vehicle SOCs. You will optimize the ML models, write custom CUDA kernels, and build highly concurrent inference code to ensure real-time, deterministic execution on edge devices.

30+ days ago

San Francisco

Backed by top investors including IVP, Spark Capital, Greylock, and Conviction, we're trusted by leading AI-driven innovators like Writer, Abridge, Bland, Patreon, Descript, Retool, and Zed to deliver industry-leading performance, security, and reliability for their mission-critical workloads. This role is highly technical and hands-on: you'll own key components of our training stack, collaborate with product and infra teams to surface customer needs, and push forward the state of the art in scalable training infrastructure.

30+ days ago

Fremont, CA

We value people who can bridge research and production, and who care about robustness, scalability, efficiency, and practical deployment in large-scale autonomous driving systems. Work on distributed training systems and large-scale model optimization using frameworks such as: PyTorch Distributed.

30+ days ago

Resume Resources

Free Resume Templates Free Resume Builder

Get noticed by top employers!

Upload your resume to let employers know you're open to Model job opportunities. Plus, receive relevant job recommendations in your inbox.

Create A Free Account

Model Jobs in Daly City, CA

Senior/Staff Machine Learning Research Scientist: Generative Models for Behavior Modeling Nuro Inc

Cell Technology Modeling Engineer Tesla Inc

Senior Quantitative Researcher - Risk Modeling Swish Analytics

Staff Cell Thermal Modeling Engineer Tesla Inc

Vehicle Dynamics Modeling Engineer, Simulation Infrastructure Tesla Inc

Software Engineer, Metrics, GenAI Model Evaluation Tesla Inc

Head of Energy Risk Management and Grid Modeling Google

Director Software Development, AI Models and Research Advanced Micro Devices, Inc

Staff Machine Learning Engineer, Vector Core Modeling Unity Software

Senior Machine Learning Engineer, Vector Core Modeling Unity Software

Machine Learning Engineer, Vector Core Modeling Unity Software

Principal Architect, Performance Analysis and Modeling d-Matrix

Software Engineer III, AI/ML, Image Recommendation Modeling Google

Pre-Silicon SoC Modeling Engineer, Annapurna Labs Machine Learning Accelerators, AWS Amazon.com Inc

Senior Pre-Silicon SoC Modeling Engineer, Annapurna Labs Machine Learning Accelerators, AWS Amazon.com Inc

Staff SWE, Compiler Architect, System Performance Modeling Google

Data Scientist II Model Validation and Monitoring PeopleNTech LLC

Sr. Staff Engineer, AI Models and Applications Advanced Micro Devices, Inc

Sr. Manager Software Development, AI Models and Applications Advanced Micro Devices, Inc

Summer Intern Connected Driving World Models LER TechForce

CW Summer Intern Connected Driving World Models TekWissen LLC

Product Manager, Search Ads Quality Modeling Google

Hardware Architecture Modeling Engineer, PhD, University Graduate Google

Member of Technical Staff - Imagine Model xAI

Research Engineer - Brain Computer Interface Models Zyphra

Product Manager, Models Hive

Data Scientist - Model Optimization quadric, Inc

Men's Fit Model Part-Time Contractor Ariat International Inc

Staff Software Engineer, Applied Research, Foundation User Models Google

Technical Program Manager, Model Alignment and Deployment Character AI

Staff Software Engineer, Model LifeCycle Crusoe Energy Systems LLC

Data Scientist, New Grad - Model Optimization quadric, Inc

Senior Product Manager - AI Models & Tools Agility Robotics

Software Engineer, Model Hardware CoDesign Tesla Motors

Senior Software Engineer - Model Performance inference

Machine Learning Researcher / Engineer (Foundational Models) Pathway

Senior Engineer, Model Serving Databricks Inc

Staff Engineer, Model Serving Databricks Inc

Software Engineer - Model API''s Baseten Labs Inc

Senior Research Engineer, Foundation Model Training Infrastructure NVIDIA Corp

Language Model Analyst - Fully Remote Mercor

Language Model Evaluator - Fully Remote Mercor

Member of Technical Staff, Model Efficiency Cohere

Software Engineer, Models Meter Service

Data Science Intern - Model Optimization quadric, Inc

Principal Model Optimization Engineer Roblox

Engineering Manager, Model Serving Together AI

Senior AI Inference Engineer - Model Optimization & Deployment Zoox

Senior Software Engineer - Model Training Baseten Labs Inc

Member of Technical Staff (MTS) - Multimodal Foundation Models Deeproute.ai

Resume Resources

Similar Job Searches