Logo
ZipRecruiter

Site Reliability Engineer - Intermediate Level

ZipRecruiter, Washington, District of Columbia, us, 20022

Save Job

Overview

As an intermediate level SRE, you'll support Kubernetes-based client environments and build the automation that keeps them secure, scalable, and compliant. Responsibilities

Monitor and support Kubernetes clusters in production (AWS and GCP). Contribute to CI/CD pipelines and deployment automation. Participate in on-call rotations and incident response. Troubleshoot infrastructure and app performance issues. Write scripts to automate tasks (audits, scaling, patching, etc.). Work directly with clients on environment integration and support. Ensure systems meet FedRAMP and high-assurance compliance standards. Collaborate with senior engineers and architects to deploy infrastructure, write scripts, maintain pipelines, and respond to operational issues. Qualifications

2-5 years in a DevOps/SRE/platform engineering or backend role. Experience managing or deploying Kubernetes environments. Familiarity with AWS and/or GCP. Scripting skills (Python, Bash, or similar). CI/CD and infrastructure automation fundamentals. Strong problem-solving and debugging abilities. Experience with infrastructure-as-code (Terraform, Pulumi). Observability stack knowledge (Prometheus, Grafana, Loki, etc.). Knowledge of FedRAMP or regulated cloud environments. Familiarity with GitOps (e.g., ArgoCD).

#J-18808-Ljbffr