ziprecruiter
Site Reliability Engineer - Intermediate Level
ziprecruiter, Washington, District of Columbia, us, 20022
Overview As an intermediate level SRE, you'll support Kubernetes-based client environments and build the automation that keeps them secure, scalable, and compliant.
Responsibilities
Monitor and support Kubernetes clusters in production (AWS and GCP). Contribute to CI/CD pipelines and deployment automation. Participate in on-call rotations and incident response. Troubleshoot infrastructure and app performance issues. Write scripts to automate tasks (audits, scaling, patching, etc.). Work directly with clients on environment integration and support. Ensure systems meet FedRAMP and high-assurance compliance standards. Collaborate with senior engineers and architects to deploy infrastructure, write scripts, maintain pipelines, and respond to operational issues. Qualifications
2-5 years in a DevOps/SRE/platform engineering or backend role. Experience managing or deploying Kubernetes environments. Familiarity with AWS and/or GCP. Scripting skills (Python, Bash, or similar). CI/CD and infrastructure automation fundamentals. Strong problem-solving and debugging abilities. Experience with infrastructure-as-code (Terraform, Pulumi). Observability stack knowledge (Prometheus, Grafana, Loki, etc.). Knowledge of FedRAMP or regulated cloud environments. Familiarity with GitOps (e.g., ArgoCD).
#J-18808-Ljbffr
Monitor and support Kubernetes clusters in production (AWS and GCP). Contribute to CI/CD pipelines and deployment automation. Participate in on-call rotations and incident response. Troubleshoot infrastructure and app performance issues. Write scripts to automate tasks (audits, scaling, patching, etc.). Work directly with clients on environment integration and support. Ensure systems meet FedRAMP and high-assurance compliance standards. Collaborate with senior engineers and architects to deploy infrastructure, write scripts, maintain pipelines, and respond to operational issues. Qualifications
2-5 years in a DevOps/SRE/platform engineering or backend role. Experience managing or deploying Kubernetes environments. Familiarity with AWS and/or GCP. Scripting skills (Python, Bash, or similar). CI/CD and infrastructure automation fundamentals. Strong problem-solving and debugging abilities. Experience with infrastructure-as-code (Terraform, Pulumi). Observability stack knowledge (Prometheus, Grafana, Loki, etc.). Knowledge of FedRAMP or regulated cloud environments. Familiarity with GitOps (e.g., ArgoCD).
#J-18808-Ljbffr