Are you ready to lead the development of cutting-edge Data Protection platforms to ensure a highly available and secure Cloud infrastructure? Are you skilled at mentoring junior staff and collaborating with cross-functional teams across a large enterprise? Then working as a Lead Site Reliability Engineer at Spectrum may be for you. BE PART OF THE CONNECTION This position is responsible for leading design, development and implementation efforts of Data Protection platform, policy, and services. In this role, you will use your development and operations knowledge to identify and prioritize issues, find solutions to common problems and mentor and support junior staff to help support our Cloud infrastructure enterprise wide. This includes working with our entire engineering organization and Enterprise Architecture. You will work closely with our engineering, development, operations, and security teams to design, implement, and maintain a highly available and secure Data Protect solution. The Data Protect Lead will align Data Protect strategy to the needs of the business focusing no criticality and SLAs of backup and restoration of applications. WHAT OUR LEAD SITE RELIABILITY ENGINEERS ENJOY THE MOST Own and ensure product/site reliability. Develop and implement monitoring and alerting systems for Data Protect services. Automate processes and develop tools to improve operational efficiency and reduce manual intervention. Create and test disaster recovery and business continuity plans. Participate in capacity planning and performance optimization. Install, upgrade, and implement Rubrik Data Protect systems. Analyze code, components, and infrastructure for reliability issues and identify failure points with stakeholders. WHAT YOU'LL BRING TO SPECTRUM Required Qualifications Experience: Five years (5+) Network, System Administration, Troubleshooting, Data Protect / Backup, Visualization (VMWare, OpenStack, Kubernetes) experience Three years (3+) Scripting Two years (2+) Container Services Education: Bachelor's degree in Computer Science or related field, or equivalent experience Technical Skills: Advanced experience with: The VMWare suite of products Managing both physical and Virtual infrastructure Operating systems (e.g. Windows and Linux) Data Protect platforms (e.g. Rubrik or Cohesity) Compute Servers (e.g. Dell, Cisco, HPE) Storage Arrays and SAN (e.g. Dell, Pure, VAST, Cisco) Implementing a variety of cloud service models (e.g. Private, Public, Multi-Cloud) Hands-on experience in one or more of cloud computing services (e.g. AWS, Microsoft Azure, Google Cloud Platforms, IBM, etc.) Proficient scripting in one or more languages (e.g. Python, Shell, PowerShell, Ansible or Perl) Advanced experience with CI/CD tools (Puppet, Ansible, Jenkins) Advanced experience managing monitoring and alerting tools Skills and Abilities: Prior experience working in an Agile environment Familiar with containerized workloads (e.g. Kubernetes, Openshift, TKGI) Advanced experienced with firewalls, routing and load balancing Skilled in troubleshooting methodologies Must have excellent written and oral communications, including technical documents, and process documents. Requires attention to detail and excellent organizational skills Ability to contribute independently as well as be a team player Advanced experience managing small projects Self-starter, ability to manage tasks with little supervision Work Environment: Office Environment On Call support, on a rotation basis Preferred Education Eight years (8+) VMware, OpenStack, or Nutanix System Administration experience Eight years (8+) Experience with Data Protection in a Cloud Environment using Bare Metal and VMware platforms with specific experience in Rubrik or Coh Five years (5+) Experience with Infrastructure as a Service technologies Five years (5+) years Experience with Unix/Linux or Windows systems administration Five years (5+) Experience with Compute in a Cloud Environment using Rack Mount a d Blade Servers Three years (3+) vROPs, Log Insight, vRNI, vRIL Three years (3+) Cisco networking Three years (3+) Firewall configuration management Three years (3+) Load Balancer configuration management Three years (3+) Experience as a Site-Reliability/DevOps System Engineer Two years (2+) TKGI Enterprise Pivotal Container Services Two years (2+) VMware NSX-T One year (1+) CI/CD experience in a customer facing, production environment SPECTRUM CONNECTS YOU TO MORE Dynamic Growth: The growth of our industry and evolving technology powers our employees' careers as they move up or around the company Learning Culture: With a dedicated focus on training and development, employees can have confidence that day one is truly just the beginning of a dynamic career Innovation: We move businesses forward by delivering high-speed data and fiber technology solutions that power today's evolving network demands Supportive Teams: Be part of a strong community that gives you opportunities to network and grow and wants to see you succeed Total Rewards: See all the ways we invest in you-at work... For full info follow application link. EOE, including disability/vets