ZipRecruiter
Job Description

About the Role
We are seeking a highly motivated and skilled Site Reliability Engineer (SRE) to join our infrastructure team. As an SRE, you will be responsible for the scalability, reliability, and performance of our cloud-based services. You will work closely with IT, Engineering, and Security teams to design and maintain systems that are secure, observable, and cost-efficient, with a strong emphasis on automation and continuous improvement.

Requirements
5+ years of experience in Site Reliability Engineering, DevOps, or Cloud Infrastructure roles.
Strong proficiency with AWS (IAM, EC2, ECS/Fargate, S3, RDS, CloudFormation or Terraform).
Experience with Infrastructure as Code (Terraform).
Experience with GitHub (workflow automation, PR workflows, secrets management).
Hands-on experience with log aggregation and observability tools like Sumo Logic (or equivalent: Datadog, ELK, etc.).
Familiarity with incident management practices and SRE principles (SLAs, SLOs, error budgets).
Experience with Kubernetes (EKS), Helm, and container orchestration.
Prior experience in fast-paced SaaS or startup environments.
Familiarity with compliance frameworks (SOC2, HIPAA, etc.).

Benefits
Health Care Plan (Medical, Dental & Vision)
Retirement Plan (401k, IRA)
Life Insurance (Basic, Voluntary & AD&D)
Paid Time Off (Vacation, Sick & Public Holidays)
Family Leave (Maternity, Paternity)
Short Term & Long Term
Work From Home
Stock Option Plan