Senior Site Reliability Engineer

Wells Fargo & Co

Charlotte, NC

JOB DETAILS
SKILLS
Agile Programming Methodologies, Analysis Skills, Ansible, Artificial Intelligence (AI), Best Practices, Bootstrap, Budgeting, CA Workload Automation AE (AutoSys Edition), Cloud Computing, Consulting, Data Modeling, Embedded Systems, IBM Product Family, IT Service Management (ITSM), Incident Response, JavaScript Frameworks, Jenkins, Large-Scale Systems, Machine Tool, Mentoring, Metrics, Micromuse Netcool, Operational Audit, Operations Planning, Operations Processes, Presentation/Verbal Skills, React.js, Regulatory Compliance, Reliability Engineering, Remedy, Root Cause Analysis, ServiceNow, Software Administration, Splunk, Structured Data, System Operations, Systems Engineering, Technical Support, Unified Communications, Unstructured Data, Writing Skills
LOCATION
Charlotte, NC
POSTED
14 days ago

Description

Title: Senior Site Reliability Engineer

Location: Charlotte, NC

Duration: 12 months

Work Engagement: W2

Work Schedule: Hybrid 3 days in office/2 days remote

Benefits on offer for this contract position: Health Insurance, Life insurance, 401K and Voluntary Benefits

Summary:

In this contingent resource assignment, you may: Consult on complex initiatives with broad impact and large-scale planning for Systems Operations Engineering. Review and analyze complex multi-faceted, larger scale or longer-term Systems Operations Engineering challenges that require in-depth evaluation of multiple factors including intangibles or unprecedented factors. Contribute to the resolution of complex and multi-faceted situations requiring solid understanding of the function, policies, procedures, and compliance requirements that meet deliverables. Strategically collaborate and consult with client personnel. Required Qualifications: Systems Engineering or Technology Architecture experience, or equivalent demonstrated through one or a combination of the following: work or consulting experience, training, military experience, education.

Key Responsibilities:

  • Design and implement automated tooling to eliminate manual toil and optimize operations.
  • Build and enhance monitoring, alerting and overall observability.
  • Champion the SRE practice within COO Technology by modeling best practices, mentoring peers, and collaborating with embedded platform SRE teams.
  • Enhance system availability in a multi-cloud environment by evolving resiliency patterns.
  • Introduce and scale AIOps, including self-healing and autonomic systems using AI/ML, RPA, and unified communications.
  • Automate key SRE metrics and IT service operations processes, including customer impact analysis, availability tracking, SLO/SLI adherence, error budgeting, and incident response.
  • Support critical applications and customer journeys, lead Agile-based remediation efforts, conduct blameless postmortems, and drive root cause analysis to eliminate recurring issues.
  • Implement and guide through Non-Functional Requirements (NFRs) during modernization and uplift initiatives.
  • Help define, govern and enforce Permit to Operate

Key Requirements:

  • Applicants must be authorized to work for ANY employer in the U.S. This position is not eligible for visa sponsorship.

  • leading, operating and performing within an SRE team.

  • Strong soft-skills including written and verbal communication.

  • Understanding of AutoSys

  • Proficiency in Python and/or Java/J2EE

  • Experience with REST APIs, microservices, messaging technologies (Kafka, MQ)

  • Familiarity with JavaScript frameworks (React, Bootstrap)

  • Strong SQL and database design skills

  • Expertise in Linux and container platforms (Kubernetes)

  • Experience with cloud platforms: PCF, AWS, GCP, or Azure

  • Tools: Jenkins, GitLab, SonarQube, Artifactory, Ansible

  • Tools: Grafana, Prometheus, ELK/Splunk, AppDynamics, Cloud Google Logger, Elastic, Thousandeyes, Aternity

  • AIOps platforms: Moogsoft, AI/ML frameworks

  • ITSM tools: ServiceNow, Remedy, IBM Netcool

  • Data platforms: Oracle, DB2, SQL, MongoDB, Hadoop, Cloudera, Spark, Teradata

  • Understanding of key AI terminologies.

  • Understanding of core AI/ML concepts such as classification, regression, clustering, and anomaly detection

  • Awareness of ethical considerations and limitations in AI-driven operations (e.g., bias, explainability)

  • Ability to work with structured and unstructured data for model training and evaluation

  • Some experience in integrating AI models into automation workflows (e.g., for predictive alerting or self-healing systems)

About the Company

W

Wells Fargo & Co

We believe in our vision and values just as strongly today as we did the first time we put them on paper more than 20 years ago. Staying true to them will guide us toward continued growth and success for decades to come. As you read more about our vision and values, you will learn about who we are, where we’re headed and how every Wells Fargo team member can help us get there.

COMPANY SIZE
10,000 employees or more
INDUSTRY
Financial Services
FOUNDED
1852