Key job responsibilities NPI - New Product Introduction - Own the end-to-end NPI lifecycle for storage and/or accelerator (AI/ML/GPU) server platforms - from architecture definition through design, qualification, manufacturing ramp, and launch - Lead technical solutions for complex server and rack system architectural challenges - Work with ODM/manufacturing partners to develop, validate, and manufacture server products at scale - Develop functional specifications, design verification plans, and test procedures - Drive qualification and readiness milestones, ensuring new platforms meet performance, reliability, and cost targets before fleet deployment - Identify and resolve technical risks early in the development cycle - don't let problems reach production Fleet Health, Diagnostics & Automation - Own fleet health for the server platforms you launch - reliability doesn't end at ship - Design and implement predictive failure detection systems using telemetry, sensor data, error trending, and log correlation to identify hardware issues before they cause customer impact - Drive toward zero-touch operations - help build detection, diagnoses, and remediation of faults without human intervention - Debug complex system failures in time-sensitive settings - personally diving deep when the problem demands it - Perform root cause analysis correlating across firmware, kernel, driver, thermal, power, and physical layers Systems Design & Technical Depth - Apply expertise across hardware, software, system design, x86 architecture, processes, and operations (compute, storage, network, GPU) - Design and implement solutions to address system-level issues at large scale - Decompose complex server system problems (testability, reliability, diagnostics) into deliverable tasks and features - Collaborate with hardware, software, manufacturing, supply chain, and product management teams Cross-Team Collaboration - Work closely with internal customers to ensure new server hardware meets data path and control path requirements - Identify early any potential problems onboarding new servers into customer ecosystems - Collaborate across Hardware Engineering, component, firmware, test, qualification, and integration teams - Partner with datacenter operations to close the loop between field failures and design improvements A day in the life Your day-to-day responsibilities include interfacing with internal and external customers to understand product requirements and facilitate system development on top of your server designs. To deliver your products, you will work with an interdisciplinary team of component, firmware, power, mechanical, electrical, test, qualification, manufacturing engineers, and lead our ODM (design and manufacturing partners) to bring these servers to the data center.