Site Reliability Engineer 2940

inSync Staffing

Berkeley Heights, NJ

Apply

JOB DETAILS

SKILLS

Amazon Web Services (AWS), Ansible, Automation, Banking Services, Business Solutions, Capacity Management, Cloud Computing, Communication Skills, Continuous Deployment/Delivery, Continuous Improvement, Continuous Integration, Customer Relations, DevOps, Documentation, Finance, High Availability, Identify Issues, Incident Response, Information Technology & Information Systems, Java, Linux Administration, Linux Operating System, Microsoft Windows Azure, Microsoft Windows Server, Microsoft Windows System Administration, Network Operations Center, On Call, Onboarding, Operational Improvement, Performance Analysis, Performance Management, Performance Metrics, Problem Solving Skills, Process Improvement, Production Control, Production Support, Production Systems, Public Cloud, Python Programming/Scripting Language, Reliability Engineering, Root Cause Analysis, Scalable System Development, Scripting (Scripting Languages), Software Administration, Software Development, Software Engineering, Splunk, Systems Administration/Management, Systems Reliability, Team Player, Technical Recruiting, Windows PowerShell

LOCATION

Berkeley Heights, NJ

POSTED

6 days ago

JOB TITLE: SRE (only W2, no, c2c)
LOCATION: Onsite in Berkeley Heights, NJ
INDUSTRY: Financial Technology

JOB DESCRIPTION: Theoris is seeking an experienced Site Reliability Engineer (SRE) to help build, maintain, and optimize highly available, scalable, and resilient platforms that support critical business applications. This role combines software engineering principles with operational excellence to improve system reliability, automate processes, and drive continuous improvement across enterprise environments.
The ideal candidate will have a strong background in cloud technologies, Kubernetes, Linux administration, automation, observability, and production support within large-scale environments.

RESPONSIBILITIES:

Drive incident response activities, including triage, troubleshooting, root cause analysis, post-mortems, and long-term remediation planning.
Partner closely with application development and operations teams to improve system performance, stability, and reliability.
Monitor production environments using observability and monitoring tools to proactively identify and resolve issues.
Participate in an on-call rotation supporting critical production systems.

Design, develop, and maintain automation solutions to eliminate manual operational tasks and improve system efficiency.
Automate health checks, deployments, operational workflows, and infrastructure management processes.
Leverage tools such as Ansible, Azure DevOps, Azure Runbooks, PowerShell, and Python to build scalable automation solutions.
Support application onboarding and operational readiness initiatives.

Support and manage hybrid infrastructure environments spanning on-premises data centers and public cloud platforms.
Collaborate with cloud platform teams, application teams, and business stakeholders on infrastructure design, platform management, and capacity planning.
Assist with architecture reviews and reliability-focused engineering improvements.
Work extensively within Azure and AWS environments, with a stronger emphasis on Azure.

Monitor system health using tools such as Dynatrace, Splunk, Grafana, and other observability platforms.
Analyze performance metrics, identify reliability gaps, and drive continuous improvement initiatives.
Partner with engineering teams to optimize application and infrastructure performance.
Documentation & Process Improvement

REQUIREMENTS:

5+ years of experience in Site Reliability Engineering (SRE), Production Engineering, DevOps, or Infrastructure Engineering.
4+ years of experience with automation and scripting tools such as: Ansible (highly preferred), Python, PowerShell, Java, Azure DevOps and Azure Runbooks.
4+ years of experience with observability and monitoring solutions, including: Dynatrace, Splunk, Moogsoft and Grafana.
Strong troubleshooting experience within Linux environments.
Hands-on experience supporting Kubernetes container platforms.
Experience supporting cloud environments, including Azure and/or AWS.
Strong communication skills and the ability to collaborate effectively across engineering, operations, platform, and business teams.

PREFERRED:

Windows server administration and troubleshooting experience.
FinTech, payments, banking, or highly regulated industry experience.
Experience managing CI/CD pipelines and DevOps toolchains.
Experience with: GitLab, Harness, Nexus, Terraform and SonarQube
Experience supporting large-scale, mission-critical production environments.

About Theoris
Our goal is to Fuel Your Career! As a Theoris team member, you join a culture based on people-centered values and an environment that fosters both personal and professional growth. We build long-term relationships with our clients and our consultants. With over 30 years of building strong relationships in the industry, we’re uniquely positioned to make the right connections. Our recruiting teams are experts dedicated to the information technology and engineering staffing space and are highly respected by our client base.

About the Company

inSync Staffing

We recognize the VMS program management team is our customer and needs to be serviced with integrity, so we built and continue to improve upon our delivery methods as we strive to provide the highest quality service possible. inSync Staffing’s management team recognized ten years ago the inevitable changes to the staffing industry being brought about by technology and the growing trend of Fortune 1000 corporations to outsource management of their contingent workforces to meet compliance and cost control goals. Rather than swim upstream against the changes, inSync Staffing has embraced MSP and VMS programs as our customers, not competitors. We asked program managers how they want to be serviced. The result of their input is that we have structured inSync Staffing as a recruiting and customer service organization, unlike traditional staffing companies who sell directly to the end client. Our delivery model allows us concentrates our resources on how to best supply candidates in a very competitive MSP/VMS program environment.

COMPANY SIZE

50 to 99 employees

INDUSTRY

Staffing/Employment Agencies

FOUNDED

2014

WEBSITE

http://www.insyncstaffing.com/default.html

Resume Resources

Free Resume Templates Free Resume Builder