Senior Site Reliability Engineer (SRE)

Varite, Inc

Atlanta, GA

JOB DETAILS
SALARY
$130,005–$135,001 Per Year
SKILLS
Apache Kafka, Apache Spark, Automation, Aviation Industry, Bash Scripting, Best Practices, Cloud Computing, Consulting, Continuous Deployment/Delivery, Continuous Integration, Docker, Financial Control, Fortune 1000 Customers, Functional Programming Languages, GCP (Good Clinical Practices), Government, Hardware Virtualization, Haskell, High Availability, High Tech Industry, Identify Issues, Java, Jenkins, Large-Scale Systems, Linux Operating System, Microservices, Objective Caml, Performance Tuning/Optimization, Production Support, Production Systems, Prolog Programming Language, Python Programming/Scripting Language, Reliability Engineering, Scripting (Scripting Languages), Software Administration, Software Installation, Splunk, Systems Scalability, Unix Operating Systems, VMWare, nginx Web Server
LOCATION
Atlanta, GA
POSTED
3 days ago
VARITE is looking for a qualified Senior Site Reliability Engineer (SRE) – 619374 in Atlanta, GA
 
About the client:
An American Software company that provides a suite of tools intended to support the development and deployment of large-scale service-oriented software installations.
 
What do we do?
Established in the Year 2000, VARITE is an award-winning minority business enterprise providing global consulting & staffing services to Fortune 1000 companies and government agencies. With 850+ global consultants, VARITE is committed to delivering excellence to its customers by leveraging its global experience and expertise in providing comprehensive scientific, engineering, technical, and non-technical staff augmentation and talent acquisition services.
 
Job Title: Senior Site Reliability Engineer (SRE)
Job ID: 619374
Location: Atlanta, GA
Duration: FULL TIME
 
Salary: $130 k/ Yr - $135k/Yr
 
Fulltime
 
Overview:
  • We are seeking an experienced and results-driven Senior Site Reliability Engineer (SRE) to join a high impact aviation technology project. This role requires a strong background in Java development, cloud infrastructure, and site reliability best practices. The ideal candidate will bring a deep understanding of system scalability, fault tolerance, observability, and hands-on production support in Kubernetes-based environments running on Google Cloud Platform (GCP)
 
Core Responsibilities:
  • Design, implement, and maintain Java-based microservices ensuring high availability, scalability, and performance.
  • Collaborate with development and infrastructure teams to support and optimize production systems using SRE principles.
  • Manage and maintain Kubernetes clusters, including deployments, scaling, networking, and storage.
  • Develop and maintain robust CI/CD pipelines using tools like GitLab CI/CD and Jenkins. Build automation for system health monitoring, alerting, log aggregation, and recovery using tools such as Prometheus, Datadog, Splunk, and Kiali.
  • Integrate and operate event-driven systems leveraging Kafka, KSQLDB, Spark Streams, and cluster federation.
  • Deploy and manage service mesh technologies such as Istio and Anthos Service Mesh.
  • Utilize EBPF for advanced observability and system tracing.
  • Support containerized applications using Docker, and infrastructure provisioning with Terraform.
  • Administer storage solutions in Kubernetes environments using Portworx.
 
Required Qualifications:
  • 10+ years of experience in SRE.
  • Strong proficiency in Java is mandatory.
  • Solid experience in scripting languages like Python, Go, and Bash.
  • Deep understanding of Linux/Unix operating systems and system-level troubleshooting.
  • Proven experience with Kubernetes, Docker, and infrastructure as code tools like Terraform.
  • Strong background in CI/CD, monitoring, alerting, and performance tuning.
  • Hands-on experience with virtualization platforms including VMware.
  • Familiarity with tools like Nginx Controller, Seesaw, and service mesh technologies. Proficient in handling large-scale systems and capable of automating repetitive operational tasks.
  • Experience with functional programming languages such as Prolog, Haskell, or OCaml is a plus.
  • Certification in Kubernetes & GCP is required.
  • Hands-on experience working in GCP environments is strongly required.
 
BENEFITS:
We offer a comprehensive benefits package designed to support the health, well-being, and financial security of our employees and their families. Eligible employees may receive:
Health Insurance: Medical, dental, and vision coverage
Retirement Plans: Participation in a company-sponsored retirement savings plan
Legal Service Plans – Offering access to attorneys for legal advice and representation
 
If this opportunity interests you, please respond by clicking on EasyApply.
 
In case you are not currently available or interested in pursuing this opportunity, please forward this email among your circle who might be suitable and interested in any of our open requirements. VARITE has a robust Candidate Referral Fee plan where you will receive a one-time referral fee if the referred candidate successfully works with VARITE on an assignment for at least 1 month or 160 Hours.
 
VARITE is committed to creating a diverse environment and is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status.

About the Company

V

Varite, Inc