Amazon Web Services (AWS), Ansible, Automation, Cloud Computing, Docker, Identify Issues, Linux Operating System, Open Source, QoS (Quality of Service), Reporting Dashboards, RoCE (RDMA over Converged Ethernet), Telemetry, VLAN (Virtual Local Area Network)
Position : Senior Implementation Specialist
Location : 275 S Hillview Dr, Milpitas, CA 95035-5417
Duration : 12 months
Requirements:
- Containers: Kubernetes, Docker, etc.
- Automation and monitoring tools: Ansible, Grafana
- Cluster Management
- Open-source data tools: Kafka
- Cloud Databases: AWS Databases
- Linux
- HPC-related tools
Core Focus:
Ongoing health, observability, and operational stability of the network fabric
Network Context
- Leaf-spine architecture (currently VLAN-based segmentation)
- Mix of:
  - Low-speed control/management networks
  - High-speed data acquisition paths
- RDMA / RoCE not currently used, but a future consideration
- Possible evolution toward an L3 routed fabric at larger scale (beta phase)
Key Responsibilities
- Build post-handoff network observability
- Implement and enhance:
  - Telemetry
  - Advanced monitoring
  - Health dashboards
- Detect and alert on:
  - Fiber/optic degradation
  - Port failures
  - Link state changes
  - Environmental impacts (construction activity, cable damage)
- Work with:
  - Spectrum-X telemetry
  - Advanced QoS policies
- Establish operational workflows:
  - Detection -> alerting -> remediation
- Maintain fabric stability after PS team exits
- Support integration troubleshooting across: