Apple is where individual imaginations gather together, committing to the values that lead to great work. Every new product we build, service we create, or Apple Store experience we deliver is the result of us making each other's ideas stronger. That happens because every one of us shares a belief that we can make something wonderful and share it with the world, changing lives for the better. It's the diversity of our people and their thinking that inspires the innovation that runs through everything we do. When we bring everybody in, we can do the best work of our lives. Here, you'll do more than join something - you'll add something.
Apple's AIML Evaluation team is looking for a seasoned, technical leader to lead our Data Science and Insights team. The organization leads Evaluation for Apple Intelligence, Siri and a large portfolio of other billion+ user facing features in SWE.
Successful candidates will have deep experience in traditional human evaluation methodology, logging, and A/B testing, in addition to hands-on experience building and deploying LLM-based autograders and rubrics, and using these tools to proactively drive improvements in models and agentic features. As the head of Data Science and Insights, youll influence the direction of a wide variety of software features, models, and platforms, in close collaboration with teams across the company. Your experience will enable you to thoughtfully balance the various tradeoffs involved in creating successful features that meet Apples high customer expectations for both quality and privacy.Setting the evaluation strategy that determines how Apple measures quality for Apple Intelligence, Siri, and the broader SWE portfolio of billion+ user features. Leading a large team data scientists and machine learning engineers - recruiting, developing, and retaining strong technical talent across both disciplines. Driving the methodological agenda across human evaluation, logging, AB testing, and LLM-based autograders and rubrics, and ensuring those methods translate into measurable model and agentic feature improvements. Partnering with Apple Intelligence, Siri, and other SWE product and engineering teams to embed evaluation into product development cycles and turn evaluation results into shipped quality gains. Partnering with peer leaders inside AIML Evaluation platforms (AB, Annotation, Synthetic Data), Apple Foundation Models, Machine Translation, etc. Representing the team to executive leadership across AIML and SWE, including Senior Vice Presidents.10+ years of experience in data science and machine learning evaluation, including 6+ years leading large technical teams Advanced degree in a quantitative field such as Statistics, Computer Science, Machine Learning, or similar Demonstrated track record of running organizations of 50+ data scientists and/or machine learning engineers Deep experience in human evaluation methodology, logging, and AB testing for consumer-facing products at scale Hands-on experience building and deploying LLM-based autograders and rubrics, and using them to drive proactive improvements in models and agentic features Strong written and verbal communication skills, able to communicate effectively with engineers and senior leaders, including Senior Vice PresidentsExperience evaluating large consumer AI products such as conversational assistants, search systems, or agentic features Experience with logging infrastructure and instrumentation for AI product quality measurement Track record of growing senior leaders from within your organization and where needed recruiting senior data science and machine learning talent in competitive hiring markets Familiarity with evaluation frameworks for agentic systems and tool-use Strong written and verbal communication skills, able to communicate effectively with engineers and senior leaders, including Senior Vice Presidents
We’re a diverse collection of thinkers and doers, continually reimagining what’s possible to help us all do what we love in new ways. The people who work here have reinvented entire industries with the Mac, iPhone, iPad, and Apple Watch, as well as with services, including iTunes, the App Store, Apple Music, and Apple Pay. And the same passion for innovation that goes into our products also applies to our practices — strengthening our commitment to leave the world better than we found it.
There’s a place here for every kind of brilliant. Everyone here is an innovator, or an innovator-to-be, no matter what your team or your role. So bring your passion, courage, and original thinking and get ready to share it, because every new product, service, or feature we invent is the result of people working together to make each others’ ideas stronger. Innovation at this level depends on people who represent the variety of the human experience and inspire us with their own fresh perspectives. Together, we’ll do amazing work that can make a difference in people’s lives. Including your own. Learn more about working at Apple.