Network Engineer

Super Micro Computer Inc

San Jose, CA

JOB DETAILS
SKILLS
ARM (Advanced RISC Machine), Analysis Skills, Apache Hadoop, Artificial Intelligence (AI), Benchmarking, Big Data, Blog, Business Solutions, CCIE - Cisco Certified Internetwork Expert, CUDA (Compute Unified Device Architecture), Cloud Computing, Communication Skills, Computer Engineering, Computer Science, Customer Support/Service, Customer/Client Research, Deep Learning, DevOps, Diversity, Docker, Electrical Engineering, Embedded Systems, Enterprise Computing, GPU (Graphics Processing Unit), Genetics, Hardware Upgrades, Intel Product Family, Internet of Things, JNCIE - Juniper Networks Certified Internet Expert, Linux Operating System, Machine Learning, Network Architecture/Engineering, Network Configuration Management, Network Debugging, Network Operations Center, Network Routing, Network Switching, Network Testing, On Site Support, Problem Solving Skills, Process Development, Product/Service Launch, Programming Tools, Proof of Concept, Quality Assurance Methodology, Reliability Testing, Resolve Customer Issues, Schedule Development, Simulation, Software Administration, Software Upgrades, Stress Testing, System Test, Systems Administration/Management, Team Player, Technical Writing, Telecommunications, Test Design, Test Plan/Schedule, Test Scripts, Testing, Time Management, Unix Shell Programming, Windows PowerShell
LOCATION
San Jose, CA
POSTED
3 days ago

Job Req ID: 27692

About Supermicro:

Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop/Big Data, Hyperscale, HPC and IoT/Embedded customers worldwide. We are the #5 fastest growing company among the Silicon Valley Top 50 technology firms. Our unprecedented global expansion has provided us with the opportunity to offer a large number of new positions to the technology community. We seek talented, passionate, and committed engineers, technologists, and business leaders to join us.

Job Summary:

As a Network Engineer, you will help roll out and maintain business-critical applications and services for Supermicro. You will work with senior engineers to resolve escalated service issues and collaborate with other engineers to engineer and implement complex projects.

Essential Duties and Responsibilities:

Includes the following essential duties and responsibilities (other duties may also be assigned):

  • Execute system-level rack tests on the latest NVIDIA and AMD GPUs and on ARM-based, Intel Xeon, and AMD EPYC processors, encompassing functionality, compatibility, performance, stress, and reliability testing, leveraging proprietary in-house tools.
  • Apply familiarity with HPC/AI applications and benchmarks to address customer support issues, demonstrating innovative problem-solving skills and building robust processes and procedures for HPC/AI solutions.
  • May conduct proof-of-concept design and testing, providing optimized benchmarks for HPC/AI applications in a timely manner. Fine-tune BIOS settings, optimize OS/network configurations, and develop diverse simulation configurations to enhance efficiency across various workloads.
  • Deliver on-site deployment services, ensuring customer acceptance verification and providing post-deployment Level 1 and Level 2 support. Create and maintain technical documentation, including technical notes, blogs, and diagrams, to facilitate knowledge dissemination.
  • Identify and document hardware and software quality issues and collaborate with Product Management and other Engineering teams to integrate customer feedback into future product enhancements.
  • Proactively engage in HPC roadmap development, planning software and hardware upgrades to sustain exceptional HPC infrastructure performance.
  • Document and analyze test plans, reports, and logs, and actively contribute to the development of test utilities and automation scripts to streamline testing processes.

Qualifications:

  • BS/MS in Electrical Engineering, Computer Engineering or Computer Science
  • 1+ years of work-related experience in Deep Learning and Machine Learning
  • Familiarity with Linux and network debugging and testing, or relevant experience, preferred
  • Familiarity with routing and switching technologies in data center, enterprise, or telecommunications environments
  • Knowledge of DevOps or cloud environments, including but not limited to Docker/Containers and Kubernetes
  • Hands-on experience with workload managers/schedulers (Slurm) for racks/clusters
  • Familiar with MLPerf Training/Inference benchmarks, LLM, HPL-AI or RCCL/NCCL
  • Programming experience with Windows and Linux shell scripting
  • Strong sense of teamwork and strong communication skills

Desired Skills:

  1. Familiar with Intel/AMD/NVIDIA development tool kits such as CUDA, oneAPI, ROCm

  2. Relevant certifications such as CCIE, JNCIE, or Arista ACE are highly desirable

  3. Experience with server/network hardware debugging and troubleshooting

  4. Familiar with CCNA, OpenStack, OpenShift, Azure, or AWS

Salary Range

EEO Statement

Supermicro is an Equal Opportunity Employer and embraces diversity in our employee population. It is the policy of Supermicro to provide equal opportunity to all qualified applicants and employees without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, protected veteran status or special disabled veteran, marital status, pregnancy, genetic information, or any other legally protected status.

About the Company

Super Micro Computer Inc

Super Micro Computer, Inc. or Supermicro® (NASDAQ: SMCI), a global leader in high-performance, high-efficiency server technology and innovation, is a premier provider of end-to-end green computing solutions for Enterprise IT, Datacenter, Cloud Computing, HPC and Embedded Systems worldwide. Supermicro's advanced server Building Block Solutions® offers a vast array of modular, interoperable components for building energy-efficient, application-optimized computing solutions. This broad line of products includes servers, blades, GPU systems, workstations, motherboards, chassis, power supplies, storage technologies, networking solutions and SuperRack® cabinets/accessories. Architecture innovations include Twin Architecture, SuperServer®, SuperBlade®, MicroCloud, Super Storage Bridge Bay (SBB), Double-Sided Storage™, Universal I/O (UIO) and WIO expansion technology, all of which deliver unrivaled performance and value.
COMPANY SIZE
1,500 to 1,999 employees
INDUSTRY
Computer Hardware
FOUNDED
1993
WEBSITE
https://www.supermicro.com/