In this role, you will work cross-functionally with Researchers and Engineers on Agent Evals and Quality to ensure that we have the best quality of agents for GenAI model improvement and product developments. You will collaborate with agent platform and model teams to leverage user signals and metrics to improve model performance. focusing on the development, evaluation, and optimization of AI agentic systems—specifically LLM-based agents designed to perform complex, multi-step tasks and workflows.
Artificial intelligence will be one of humanity’s most transformative inventions. At Google DeepMind, we are a pioneering AI lab with exceptional interdisciplinary teams focused on advancing AI development to solve complex global challenges and accelerate high-quality product innovation for billions of users. We use our technologies for widespread public benefit and scientific discovery, ensuring safety and ethics are always our highest priority.