DevOps Engineer

Careflow

Miami, FL(remote)

Apply

JOB DETAILS

SKILLS

Access Control, Administrative Skills, Automation, Best Practices, Cloud Computing, Cloud Storage, Communication Skills, Computer Networks, Computer Security, Continuous Deployment/Delivery, Continuous Integration, Cross-Functional, Data Analysis, Data Management, Debugging Skills, DevOps, Disaster Recovery, Diving, Docker, Expense Management, Expense Tracking, GCP (Good Clinical Practices), Healthcare, Healthcare Software, High Availability, Identify Issues, Identity Data Management, Incident Response, Leadership, Linux Administration, Machine Tool, Metrics, Multiplatform/Cross-Platform, Network Administration/Management, Node.js, On Call, Operational Support, Performance Tuning/Optimization, Presentation/Verbal Skills, Problem Solving Skills, Process Improvement, Production Control, Production Management, Production Systems, Python Programming/Scripting Language, Reliability Engineering, Reporting Dashboards, Root Cause Analysis, Software Administration, Software Debugging, Software Engineering, Startup, Systems Reliability, Technical Leadership, Writing Skills

LOCATION

Miami, FL(remote)

POSTED

Today

About the Role

We are looking for an experienced DevOps Engineer to own and improve our cloud infrastructure, security, observability, and operational reliability. This role is responsible for ensuring our platform remains secure, scalable, performant, and highly available as we continue to grow.

The ideal candidate is someone who enjoys wearing multiple hats—building infrastructure, improving deployment processes, monitoring production systems, and troubleshooting issues across the stack. As a bonus, we would love someone who is comfortable diving into the application codebase to diagnose and resolve bugs when needed.

This is a fully remote position. We are particularly interested in candidates who can provide weekend coverage on Saturdays and take another day off during the week in exchange.

What You'll Do

Cloud Infrastructure & Operations

Manage and maintain our Google Cloud Platform (GCP) environment.
Design, implement, and improve infrastructure for scalability, reliability, and cost efficiency.
Manage networking, compute resources, databases, storage, and cloud services.
Monitor system health and proactively address performance bottlenecks.

Monitoring, Logging & Observability

Build and maintain centralized logging and monitoring solutions.
Create dashboards and alerts for system health, application performance, and business-critical workflows.
Establish operational metrics and usage tracking across the platform.
Lead incident response and root cause analysis efforts.
Monitor and manage spend

Security & Compliance

Implement and maintain security best practices across infrastructure and applications.
Manage identity and access controls, secrets management, and environment security.
Conduct security reviews and vulnerability remediation.
Assist with compliance initiatives and audit readiness.

CI/CD & Automation

Improve deployment pipelines and release processes.
Automate infrastructure provisioning and operational workflows.
Enhance development environments and deployment reliability.
Reduce manual operational tasks through automation.

Reliability Engineering

Improve uptime, resiliency, backup strategies, and disaster recovery processes.
Establish service-level objectives and operational standards.
Drive improvements in platform stability and performance.

Cross-Functional Support

Partner with engineering, product, and leadership teams to support company initiatives.
Provide technical guidance on infrastructure and operational considerations.
Participate in an on-call and operational support rotation.

Bonus Responsibilities

Troubleshoot and fix application-level issues when needed.
Contribute code improvements and bug fixes across the platform.
Assist with performance optimization and debugging efforts.

What Success Looks Like

Within your first 90 days, you will:

Gain ownership of our GCP infrastructure and environments.
Establish visibility into system performance, reliability, and usage metrics.
Improve monitoring, alerting, and incident response processes.
Identify and address security and operational risks.
Reduce infrastructure-related issues and deployment friction.
Become a trusted technical resource for platform reliability and operational excellence.

Position Details

Role: DevOps Engineer
Employment Type: Full-Time
Location: Fully Remote
Schedule: Flexible, with availability to provide Saturday coverage and take another weekday off
Reports To: Lead Architect

This role is ideal for someone who enjoys both infrastructure ownership and hands-on problem solving, and wants to have a significant impact on the reliability, security, and scalability of a growing software platform.

Requirements:

Required Qualifications

5+ years of DevOps, Site Reliability Engineering, Cloud Engineering, or related experience.
Strong hands-on experience with Google Cloud Platform (GCP).
Experience building and maintaining CI/CD pipelines.
Strong understanding of infrastructure monitoring, logging, and alerting systems.
Experience with cloud security best practices.
Experience managing production environments and incident response.
Strong Linux administration skills.
Experience with Infrastructure as Code tools (Terraform preferred).
Experience with containerization technologies such as Docker and Kubernetes.
Strong troubleshooting and problem-solving abilities.
Excellent written and verbal communication skills.
Ability to work independently in a fully remote environment.

Nice-to-Have Qualifications

Experience working in startup or high-growth environments.
Experience with healthcare technology or regulated environments.
Ability to read and contribute to application code.
Experience with Python, TypeScript, Node.js, or similar technologies.
Experience building internal tooling and automation.
Experience with data pipelines and analytics infrastructure.

About the Company

Careflow

Resume Resources

Free Resume Templates Free Resume Builder