Palo Alto, California11 days ago
Candidates have a strong background in one or more of the following areas:Agentic AI and Reasoning: Large language model (LLM)-powered agents, reinforcement learning (RL), reasoning and planning, autonomous workflows, self-evolving agent systems, computer use agents, deep research agents, and coding agents. Core Modeling and Post-Training: Machine learning methodology, pre-training and post-training techniques including reinforcement learning from human feedback (RLHF) and direct preference optimization (DPO), model distillation, evaluation, causal inference, and time series modeling.