Software Engineer, Perception Attributes Autolabeling Pipeline

Ursus, Inc.

Foster City, CA

JOB DETAILS
SALARY
$76–$86.16 Per Hour
SKILLS
AWS Lambda, Amazon Simple Storage Service (S3), Amazon Web Services (AWS), Analysis Skills, Application Programming Interface (API), Automotive Automation, Buses, C++ Programming Language, Calibration, Cost Modeling, Data Management, Data Sets, Documentation, Engineering, Experiment Design, Machine Tool, Metrics, Production Systems, Python Programming/Scripting Language, Reporting Dashboards, Sales Pipeline, Software Engineering, Stewardship, Systems Engineering, Team Lead/Manager, Vehicle Fleets, Writing Skills
LOCATION
Foster City, CA
POSTED
3 days ago
JOB TITLE: Software Engineer, Perception Attributes Autolabeling Pipeline
LOCATION: Foster City, CA (Hybrid; 3 days in office/week)
PAY RANGE: $76 - $86/hr.
DURATION: 6 Months

TOP 3 SKILLS:
  • 3+ years of backend/data pipeline engineering experience; Strong Python; comfort with C++; Large-dataset experience with PySpark or equivalent.
  • ML fundamentals understanding of model inference, embeddings, structured output, and common eval metrics (precision, recall, calibration); able to reason about ML data shapes and integration patterns.
  • Experience integrating foundation models (Gemini, OpenAI, Anthropic) at production scale; Excellent written communication for design docs and runbooks.

Job Description:
The Perception Attribute Flywheel team is looking for a Software Engineer to build and operate the autolabeling pipeline that accelerates human annotation throughput on vehicle attribute classification tasks. The company is building a future for Riders, not drivers. The accuracy of our perception attribute models recognizing emergency vehicles, school buses, brake lights, hazard signals, and more depends on a steady flow of high-quality labeled examples drawn from our fleet's drive data. Today, every label is produced by a human annotator from scratch. We are building a pipeline that uses off-the-shelf foundation models (Gemini, SigLIP, CLIP) to pre-label tasks, so human reviewers verify and correct rather than labeling from scratch.

This role owns the pipeline engineering for that system: ingesting queued tasks from our annotator service, calling foundation-model APIs at fleet scale, writing structured predictions back into the labeling workflow, and operating the whole thing reliably. The team lead and supporting ML engineers own model selection, prompt design, and evaluation methodology; this role partners closely with them but is not expected to own those decisions. If you take pride in building reliable, observable, well-tested data pipelines and want to ship a system that visibly accelerates an autonomous vehicle program, you will excel in this role.

Responsibilities:
  • Build the autolabeling pipeline: ingest queued tasks from the annotator service, dispatch them to foundation-model APIs (Gemini and others), parse structured outputs, and write pre-labels back to the labeling workflow.
  • Build the observability layer: per-task latency, per-model cost, per-attribute coverage, and error-mode dashboards.
  • Run experiments designed by the team lead set up the inputs, execute, and collect outputs in formats the ML engineers can analyze.
  • Integrate the pipeline cleanly with existing systems, partnering with the data infrastructure team.
  • Document the system, write runbooks, and ensure a clean handoff at end of the engagement.

Qualifications:
  • 3+ years of backend/data pipeline engineering experience.
  • Strong Python; comfort with C++.
  • Large-dataset experience with PySpark or equivalent.
  • ML fundamentals understanding of model inference, embeddings, structured output, and common eval metrics (precision, recall, calibration); able to reason about ML data shapes and integration patterns.
  • Experience integrating foundation models (Gemini, OpenAI, Anthropic) at production scale.
  • Excellent written communication for design docs and runbooks.

Bonus Qualities:
Experience with any of the following:
  • Databricks.
  • End-to-end ML pipeline stewardship owned an ML system in production from data ingest through inference through monitoring.
  • Annotation tooling or human-in-the-loop ML workflows.
  • Autonomous-systems data pipelines.
  • AWS, especially S3, ECS/EKS, Lambda.
  • Working in a codebase shared with ML engineers (proto schemas, joint deploys).


BENEFITS SUMMARY: Individual compensation is determined by skills, qualifications, experience, and location. Compensation details listed in this posting reflect the base hourly rate or annual salary only, unless otherwise stated. In addition to base compensation, full-time roles are eligible for Medical, Dental, Vision, Commuter and 401K benefits with company matching.

IND123

About the Company

U

Ursus, Inc.

Similar Job Searches