Nuro Inc · Senior/Staff Machine Learning Research Scientist: Generative Models for Behavior Modeling · Mountain View, CA · $202,350–$303,050 / year · You have subject matter expertise and research in one or more of the following areas: scalable generative and diffusion models, generative model alignment with reward/cost models, controllability, distillation, mode recovery, fine-tuning with closed-loop RL, applications to autonomous driving and robotics, and other experiences such as video generation, text-to-image generation, in-painting/out-painting and so on. The first commercial application of the Nuro Driver is autonomous goods delivery with our custom, electric, zero-occupant vehicles in partnership with some of the most recognized brands in the world, including Uber and FedEx.
Tesla Inc · Cell Technology Modeling Engineer · Palo Alto, CA · $84,000–$204,000 / year · Along with competitive pay, as a full-time Tesla employee, you are eligible for the following benefits at day 1 of hire: Aetna PPO and HSA plans (2 medical plan options with $0 payroll deduction).
Swish Analytics · Senior Quantitative Researcher - Risk Modeling · San Francisco, California · You'll lead research initiatives that generate alpha and improve execution quality, mentor junior researchers, and collaborate closely with our Trading desk to translate quantitative insights into profitable systematic strategies while maintaining rigorous risk management. As we expand our presence on betting exchanges, we're building infrastructure and strategies akin to those found in traditional financial markets.
Tesla Inc · Staff Cell Thermal Modeling Engineer · Palo Alto, CA · $128,000–$228,000 / year · You will act as a primary technical lead, providing high-level guidance to unblock the team on complex theoretical challenges while managing project timelines and resource allocation across multiple high-priority programs. Provide deep technical guidance to Senior Engineers to resolve roadblocks in numerical methods, code architecture, or complex thermal phenomena (e.g., hysteresis in non-ideal chemistries).
Tesla Inc · Vehicle Dynamics Modeling Engineer, Simulation Infrastructure · Palo Alto, CA · This role is a unique opportunity to merge your passion for vehicle dynamics, software development, and data science to directly impact products that accelerate the world's transition to sustainable energy. Apply statistical analysis and machine learning techniques to correlate simulation models with real-world performance data, identifying key performance indicators and areas for improvement.
Tesla Inc · Software Engineer, Metrics, GenAI Model Evaluation · Palo Alto, CA · $132,000–$390,000 / year · Working alongside our AI team, you will design metrics that utilize fleet data and run on large inference clusters to help drive key decisions about end-to-end model architecture, data integrity, and exported model performance.
Google · Head of Energy Risk Management and Grid Modeling · San Francisco, CA · Full time · From software to hardware, our teams are shaping the future of world-leading hyperscale computing, with key teams working on the development of our TPUs, Vertex AI for Google Cloud, Google Global Networking, Data Center operations, systems research, and much more. We're the driving force behind Google's groundbreaking innovations, empowering the development of our cutting-edge AI models, delivering unparalleled computing power to global services, and providing the essential platforms that enable developers to build the future.
Advanced Micro Devices, Inc · Director Software Development, AI Models and Research · San Jose, California · We push the limits of innovation to solve the world's most important challenges, striving for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary.
Unity Software · Staff Machine Learning Engineer, Vector Core Modeling · Mountain View, CA · While specific benefits vary, here are some of the ways we strive to take care of our eligible team members globally: Comprehensive health, life, and disability insurance | Commute subsidy | Employee stock ownership | Competitive retirement/pension plans | Generous vacation and personal days | Support for new parents through leave and family-care programs | Office food snacks | Mental Health and Wellbeing programs and support | Employee Resource Groups | Global Employee Assistance Program | Training and development programs | Volunteering and donation matching program. These models serve as the core intelligence behind delivering the right ad to the right user at the right time, maximizing both user experience and advertiser outcomes.
Unity Software · Senior Machine Learning Engineer, Vector Core Modeling · Mountain View, CA · While specific benefits vary, here are some of the ways we strive to take care of our eligible team members globally: Comprehensive health, life, and disability insurance | Commute subsidy | Employee stock ownership | Competitive retirement/pension plans | Generous vacation and personal days | Support for new parents through leave and family-care programs | Office food snacks | Mental Health and Wellbeing programs and support | Employee Resource Groups | Global Employee Assistance Program | Training and development programs | Volunteering and donation matching program. These models serve as the core intelligence behind delivering the right ad to the right user at the right time, maximizing both user experience and advertiser outcomes.
Unity Software · Machine Learning Engineer, Vector Core Modeling · Mountain View, CA · While specific benefits vary, here are some of the ways we strive to take care of our eligible team members globally: Comprehensive health, life, and disability insurance | Commute subsidy | Employee stock ownership | Competitive retirement/pension plans | Generous vacation and personal days | Support for new parents through leave and family-care programs | Office food snacks | Mental Health and Wellbeing programs and support | Employee Resource Groups | Global Employee Assistance Program | Training and development programs | Volunteering and donation matching program. These models serve as the core intelligence behind delivering the right ad to the right user at the right time, maximizing both user experience and advertiser outcomes.
d-Matrix · Principal Architect, Performance Analysis and Modeling · Santa Clara, CA · Your day-to-day work will include (1) analyzing the properties of emerging machine learning algorithms and workloads and identifying functional and performance implications, (2) creating analytical models to project performance on current and future generations of d-Matrix hardware, and (3) proposing new HW/SW features to enable or accelerate these algorithms. Experience with developing analytical performance models and architecture simulators for performance analysis; a research background with a publication record in top-tier architecture or machine learning venues (such as ISCA, MICRO, ASPLOS, HPCA, DAC, MLSys, etc.) is a huge plus.
Google · Software Engineer III, AI/ML, Image Recommendation Modeling · Mountain View, CA · Full time · We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a part of the Ember team, you will help reimagine Google Images, a platform serving over a billion daily interactions, to build a queryless, hyper-personalized feed that fuels creativity worldwide.
Amazon.com Inc · Pre-Silicon SoC Modeling Engineer, Annapurna Labs Machine Learning Accelerators, AWS · Cupertino, CA
What you'll do:
- Build and own models of SoC subsystems, translating architecture specs and RTL behavior into accurate, testable C++ models
- Work directly with RTL design and verification teams to validate model behavior against RTL, debug discrepancies, and support pre-silicon verification flows
- Develop model-based test infrastructure: regression suites, RTL correlation checks, and coverage-driven testing
- Contribute to performance modeling efforts, building cycle-approximate models that help architects evaluate design trade-offs before RTL exists
- Improve modeling methodology and infrastructure: how models are structured, integrated, tested, and released to DV and architecture teams
- Collaborate with chip architects to understand upcoming designs and plan modeling work ahead of RTL availability
Why this role is interesting:
- Your models are used to verify silicon before it's built; bugs you catch save months of schedule and millions of dollars
- You'll work at the intersection of software engineering and chip design, with deep visibility into how custom ML accelerators are architected
- As the team scales, there's a clear path into architectural modeling: using your models to influence chip design decisions, not just validate them
- Small team, high ownership, direct impact on AWS's most strategic silicon programs
You will thrive in this role if you:
- Have built functional or performance models of SoCs, ASICs, GPUs, CPUs, or IP blocks
- Are comfortable working with architectural / design specifications or reference implementations and translating them into C++ or SystemC models
- Understand verification concepts and have worked with DV teams or in pre-silicon validation environments
- Care about model fidelity and have experience correlating models against RTL or silicon
- Are interested in expanding into architectural performance modeling as the team grows
- Enjoy working on a small, high-impact team where you own significant pieces of the stack
No ML background needed. Our team builds C++ models of these custom SoCs that RTL designers, verification engineers, and software teams depend on throughout the silicon development lifecycle.
Amazon.com Inc · Senior Pre-Silicon SoC Modeling Engineer, Annapurna Labs Machine Learning Accelerators, AWS · Cupertino, CA · Our team builds C++ models of these custom SoCs that RTL designers, verification engineers, and software teams depend on throughout the silicon development lifecycle. Contribute to performance modeling efforts, building cycle-approximate models that help architects evaluate design trade-offs before RTL exists.
Google · Staff SWE, Compiler Architect, System Performance Modeling · Sunnyvale, CA · Full time · From software to hardware, our teams are shaping the future of world-leading hyperscale computing, with key teams working on the development of our TPUs, Vertex AI for Google Cloud, Google Global Networking, Data Center operations, systems research, and much more. Note: By applying to this position you will have an opportunity to share your preferred working location from the following: Sunnyvale, CA, USA; Kirkland, WA, USA; New York, NY, USA; Seattle, WA, USA. Minimum qualifications: Bachelor's degree or equivalent practical experience.
PeopleNTech LLC · Data Scientist II Model Validation and Monitoring · Santa Clara, CA · $65–$70 / hour · Essential Functions: Lead model monitoring activities, including tracking performance metrics, detecting model and data drift, identifying data quality issues, providing root cause analysis, and recommending remediation strategies. This includes providing effective challenges to model development, conducting model monitoring and performance tracking, providing root cause analysis of model performance, and exploring, building, validating, and deploying models.
Advanced Micro Devices, Inc · Sr. Staff Engineer, AI Models and Applications · San Jose, California · The ideal candidate has a deep technical understanding of the latest generative AI applications, such as large language models (LLMs), large multimodal models (LMMs), and image/video generation, has experience training models at scale, and is passionate about innovating efficient approaches to enable distributed training and inference at scale on AMD devices. Impactful Work: Your contributions will directly influence how cutting-edge gen AI models across the industry are efficiently trained at scale and deployed for inference to serve millions of customers, making a significant difference in various industries and applications.
Advanced Micro Devices, Inc · Sr. Manager Software Development, AI Models and Applications · San Jose, California
LER TechForce · Summer Intern Connected Driving World Models · Mountain View, CA · Our Client is seeking a PhD student in Computer Science, Electrical Engineering, Mechanical Engineering, or a related engineering discipline to work on Connected Driving World Models. Ph.D. in Computer Science, Electrical Engineering, Mechanical Engineering, or a related engineering discipline with a focus on AI / machine learning.
TekWissen LLC · CW Summer Intern Connected Driving World Models · Mountain View, CA · $43.59 · Position: CW Summer Intern Connected Driving World Models; Location: Mountain View, CA 94043; Duration: 3+ Months; Job Type: Temporary Assignment; Work Type: Onsite. Job Description: Client is seeking a PhD student in Computer Science, Electrical Engineering, Mechanical Engineering, or a related engineering discipline to work on Connected Driving World Models. Overview: TekWissen is a global workforce management provider headquartered in Ann Arbor, Michigan that offers strategic talent solutions to our clients worldwide.
Google · Product Manager, Search Ads Quality Modeling · Mountain View, CA · Full time · Advertisers worldwide use Google Ads to promote their products; publishers use AdSense to serve relevant ads on their websites; and businesses around the world use our products (like Google Shopping and Google Wallet) to support their online businesses and bring users into their offline stores. As a Product Manager in the Ad Relevance and User Quality team, you will play a key role in building the ML models that are responsible for showing useful ads to users across Google Search and AI Mode.
Google · Hardware Architecture Modeling Engineer, PhD, University Graduate · Sunnyvale, CA · Full time · From software to hardware, our teams are shaping the future of world-leading hyperscale computing, with key teams working on the development of our TPUs, Vertex AI for Google Cloud, Google Global Networking, Data Center operations, systems research, and much more. We're the driving force behind Google's groundbreaking innovations, empowering the development of our cutting-edge AI models, delivering unparalleled computing power to global services, and providing the essential platforms that enable developers to build the future.
xAI · Member of Technical Staff - Imagine Model · Palo Alto, CA · $180,000–$440,000 / year · Domain expertise in multimodal applications such as graphics engines, rendering techniques, image/video understanding and generation, world models, real-time simulation, or controllable/long-horizon visual content creation (audio/speech processing or music/audio generation experience is a plus where it supports video). ABOUT THE ROLE: As a multimodal engineer on the Imagine Model Team, you will develop cutting-edge AI experiences beyond text, with a strong focus on enabling high-fidelity understanding and generation across image and video modalities, while also incorporating audio where it enhances visual content (e.g., synchronized audio for video).
Zyphra · Research Engineer - Brain Computer Interface Models · San Francisco, CA · The Role: As a Research Engineer - Brain Computer Interface Models, you will be a core contributor to Zyphra's BCI work, building the next generation of open-source EEG and brain–computer interface models. You will be involved across the full model lifecycle, from data collection and preprocessing to designing novel architectures and training methodologies.
Hive · Product Manager, Models · San Francisco, CA · $120,000–$170,000 / year · As a Product Manager on our Hive Models team, you will work cross-functionally with all stakeholders to define product requirements and see the implementation through to completion, leading development efforts between our Machine Learning, Core Infrastructure, and Product teams. Own our Hive Models portfolio, understanding the needs of the product development teams, and envisioning and executing the creation of our deep learning models that can provide human-like interpretation of video, image, audio and text.
quadric, Inc · Data Scientist - Model Optimization · Burlingame, CA · $110,000–$270,000 · Quadric's co-optimized software and hardware is targeted to run neural network (NN) inference workloads in a wide variety of edge and endpoint devices, ranging from battery-operated smart-sensor systems to high-performance automotive or autonomous vehicle systems. The actual base salary offered will depend on a number of factors, including the specific level of the role, years and depth of relevant experience, technical skills and competencies, the criticality of the role to the business, internal equity, and work location.
Ariat International Inc · Men's Fit Model Part-Time Contractor · San Leandro, CA · $100–$150 / hour · To ensure accurate garment fitting and proportion alignment for our product development process, photos should be taken in fitted, non-branded clothing (e.g., tank top and leggings or similar). In the event a recruiter or agency submits a resume or candidate without a previously signed Agreement, Ariat explicitly reserves the right to pursue and hire those candidate(s) without any financial obligation to the recruiter or agency.
Google · Staff Software Engineer, Applied Research, Foundation User Models · Mountain View, CA · Full time · We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. From software to hardware, our teams are shaping the future of world-leading hyperscale computing, with key teams working on the development of our TPUs, Vertex AI for Google Cloud, Google Global Networking, Data Center operations, systems research, and much more.
Character AI · Technical Program Manager, Model Alignment and Deployment · Redwood City, CA · Together, these groups are responsible for transforming powerful pretrained language models into intelligent, engaging, safely aligned, and highly scalable products, working across data, compute, algorithms, infrastructure, and user insights to improve model performance and ensure reliable delivery. Program ownership: Lead planning and execution of cross-functional programs spanning data collection, annotation pipelines, alignment workflows (RLHF, DPO, Constitutional AI), safety guardrails (adversarial testing, red-teaming), and model serving.
Crusoe Energy Systems LLC · Staff Software Engineer, Model LifeCycle · San Francisco, CA · $204,000–$247,000 / year · About this role: The Staff Software Engineer for the Model LifeCycle team will play a key role in building a comprehensive managed platform for the entire application development lifecycle, with a specific focus on leveraging Machine Learning models, including Large Language Models (LLMs). Collaboration and impact: Work closely with Principal Engineers, product, business, and platform teams to implement the core abstractions and APIs of the system.
quadric, Inc · Data Scientist, New Grad - Model Optimization · Burlingame, CA · $120,000–$160,000 · Quadric's co-optimized software and hardware is targeted to run neural network (NN) inference workloads in a wide variety of edge and endpoint devices, ranging from battery-operated smart-sensor systems to high-performance automotive or autonomous vehicle systems. The actual base salary offered will depend on a number of factors, including the specific level of the role, years and depth of relevant experience, technical skills and competencies, the criticality of the role to the business, internal equity, and work location.
Agility Robotics · Senior Product Manager - AI Models & Tools · Fremont, CA · $177,000–$230,000 / year · Our team is differentiated by its expertise in imagining, engineering, and delivering robots with advanced mobility, dexterity, intelligence, and efficiency: robots specifically designed to work alongside people, in spaces built for people. Anticipated Base Salary Range: $177,000–$230,000 USD. In addition to base pay, our competitive total rewards package consists of the following for full-time employees:
Tesla Motors · Software Engineer, Model Hardware CoDesign · Palo Alto, CA · $140,000–$390,000 / year · Expected Compensation: $140,000 - $390,000/annual salary + cash and stock awards + benefits. Pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. What You'll Do: Design, train, and iterate on neural network architectures for autonomous driving and robotics, with a focus on efficiency-aware model design (architecture search, distillation, pruning, quantization-aware training).
inference · Senior Software Engineer - Model Performance · San Francisco, CA · $220,000–$320,000 / year · Your work spans from implementing known optimization techniques to experimenting with novel approaches, always with the goal of serving models faster and cheaper at scale. If you love squeezing every last drop of performance out of GPUs, diving deep into CUDA kernels, and turning optimization techniques into production systems, we'd love to meet you.
Pathway · Machine Learning Researcher / Engineer (Foundational Models) · Palo Alto, CA · Remote · €100,000 · Pathway is led by co-founder & CEO Zuzanna Stamirowska, a complexity scientist who created a team consisting of AI pioneers, including CTO Jan Chorowski, who was the first person to apply Attention to speech and worked with Nobel laureate Geoff Hinton at Google Brain, as well as CSO Adrian Kosowski, a leading computer scientist and quantum physicist who obtained his PhD at the age of 20. The company is backed by leading investors and advisors, including Lukasz Kaiser, co-author of the Transformer (“the T” in ChatGPT) and a key researcher behind OpenAI’s reasoning models.
Databricks Inc · Senior Engineer, Model Serving · San Francisco, CA · Contribute directly to key components across the serving infrastructure, from model container builds and deployment workflows to runtime systems like routing, caching, observability, and intelligent autoscaling, ensuring smooth and efficient operations at scale. You will design and build systems that enable high-throughput, low-latency inference across CPU and GPU workloads, influence architectural direction, and collaborate closely across platform, product, infrastructure, and research teams to deliver a world-class serving platform.
Databricks Inc · Staff Engineer, Model Serving · San Francisco, CA · Contribute directly to key components across the serving infrastructure, from model container builds and deployment workflows to runtime systems like routing, caching, observability, and intelligent autoscaling, ensuring smooth and efficient operations at scale. You will design and build systems that enable high-throughput, low-latency inference across CPU and GPU workloads, influence architectural direction, and collaborate closely across platform, product, infrastructure, and research teams to deliver a world-class serving platform.
Baseten Labs Inc · Software Engineer - Model APIs · San Francisco
RESPONSIBILITIES:
- Design, build, and operate the Model APIs surface with focus on advanced inference capabilities: structured outputs (JSON mode, grammar-constrained generation), tool/function calling and multi-modal serving
- Profile and optimize TensorRT-LLM kernels, analyze CUDA kernel performance, implement custom CUDA operators, tune memory allocation patterns for maximum throughput and optimize communication patterns across multi-GPU setups
- Productionize performance improvements across runtimes with deep understanding of their internals: speculative decoding implementations, guided generation for structured outputs, custom scheduling and routing algorithms for high-performance serving
- Build comprehensive benchmarking frameworks that measure real-world performance across different model architectures, batch sizes, sequence lengths, and hardware configurations
- Productionize performance improvements across runtimes (e.g.
NICE TO HAVE:
- Experience with LLM runtimes (vLLM, SGLang, TensorRT-LLM) or contributions to open-source inference engines (vLLM, TensorRT-LLM, SGLang, TGI)
- Knowledge of Kubernetes, service meshes, API gateways, or distributed scheduling.
NVIDIA Corp · Senior Research Engineer, Foundation Model Training Infrastructure · Santa Clara, CA · Ways to stand out from the crowd: Master's or PhD degree in Computer Science, Robotics, Engineering, or a related field; Demonstrated Tech Lead experience, coordinating a team of engineers and driving projects from conception to deployment; Strong experience building large-scale LLM and multimodal LLM training infrastructure; Contributions to popular open-source AI frameworks or research publications in top-tier AI conferences, such as NeurIPS, ICRA, ICLR, CoRL. What we need to see: Bachelor's degree in Computer Science, Robotics, Engineering, or a related field; 10+ years of full-time industry experience in large-scale MLOps and AI infrastructure; Proven experience designing and optimizing distributed training systems with frameworks like PyTorch, JAX, or TensorFlow.
Mercor · Language Model Analyst - Fully Remote · San Francisco, California · Remote · $15–$20 / hour · Generate high-quality human evaluation data by identifying response strengths, areas for improvement, and factual inaccuracies. For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome.
Mercor · Language Model Evaluator - Fully Remote · San Francisco, California · Remote · $15–$20 / hour · Generate high-quality human evaluation data by identifying response strengths, areas for improvement, and factual inaccuracies. For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome.
Cohere · Member of Technical Staff, Model Efficiency · San Francisco
You may be a good fit for the Model Efficiency team if you have:
- 5+ years of experience writing high-performance, production-quality code
- Strong programming skills in C++ or Python (Rust/Go also welcome)
- Experience working with large language models and familiarity with the LLM inference ecosystem (e.g., vLLM, SGLang, etc.)
- Ability to diagnose and resolve performance bottlenecks across the model execution stack
- A strong bias for action: you ship fast, measure impact, and iterate
It's a big plus if you have experience with:
- GPU programming, CUDA, or low-level systems optimization
- Language modeling with transformers (MoE, speculative decoding, KV-cache optimizations)
- Scaling performance-critical distributed systems (e.g., computation, search, storage)
If some of the above doesn't line up perfectly with your experience, we still encourage you to apply!
Full-Time Employees at Cohere enjoy these Perks:
- An open and inclusive culture and work environment
- Work closely with a team on the cutting edge of AI research
- Weekly lunch stipend, in-office lunches & snacks
- Full health and dental benefits, including a separate budget to take care of your mental health
- 100% Parental Leave top-up for up to 6 months
- Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
- Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
- 6 weeks of vacation (30 working days!)
Meter Service · Software Engineer, Models · San Francisco, CA · In addition to your customers, network engineers, you'll partner closely with two research engineers who have deep ML backgrounds and a clear picture of what training data needs to look like. When a network engineer looks at a set of device stats and figures out it's upstream packet loss - not a hardware failure, not a misconfiguration, specifically upstream packet loss - that reasoning lives in their head.
quadric, Inc · Data Science Intern - Model Optimization · Burlingame, CA · Quadric's co-optimized software and hardware is targeted to run neural network (NN) inference workloads in a wide variety of edge and endpoint devices, ranging from battery-operated smart-sensor systems to high-performance automotive or autonomous vehicle systems. Unlike other NPUs or neural network accelerators in the industry today that can only accelerate a portion of a machine learning graph, the Quadric GPNPU executes both NN graph code and conventional C++ DSP and control code.
Roblox · Principal Model Optimization Engineer · San Mateo, CA · $295,250–$345,040 / year · Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences, all created by our global community of developers and creators. A career at Roblox means you’ll be working to shape the future of human interaction, solving unique technical challenges at scale, and helping to create safer, more civil shared experiences for everyone.
Together AI · Engineering Manager, Model Serving · San Francisco, CA · $250,000–$300,000 / year · Our platform powers multi-tenant serverless workloads and dedicated endpoints, enabling developers, enterprises, and researchers to harness the latest LLMs, multimodal models, image, audio, video, and reasoning models at scale. You will be in charge of designing and scaling our ML processes and tooling at production scale, optimizing operations to ensure availability and reliability for our services across differing tenants and user loads, and in a multi-cluster deployment.
Zoox · Senior AI Inference Engineer - Model Optimization & Deployment · Foster City, CA · We are looking for experts with hands-on experience in compressing, accelerating, and deploying complex models (LLMs, VLMs, or FMs) for power- and thermal-constrained vehicle SoCs. You will optimize the ML models, write custom CUDA kernels, and build highly concurrent inference code to ensure real-time, deterministic execution on edge devices.
Baseten Labs Inc · Senior Software Engineer - Model Training · San Francisco · Backed by top investors including IVP, Spark Capital, Greylock, and Conviction, we're trusted by leading AI-driven innovators like Writer, Abridge, Bland, Patreon, Descript, Retool, and Zed to deliver industry-leading performance, security, and reliability for their mission-critical workloads. This role is highly technical and hands-on: you'll own key components of our training stack, collaborate with product and infra teams to surface customer needs, and push forward the state of the art in scalable training infrastructure.
Deeproute.ai · Member of Technical Staff (MTS) - Multimodal Foundation Models · Fremont, CA · We value people who can bridge research and production, and who care about robustness, scalability, efficiency, and practical deployment in large-scale autonomous driving systems. Work on distributed training systems and large-scale model optimization using frameworks such as PyTorch Distributed.