EVONA
Overview
Site Reliability Engineer (SRE). Are you motivated by building reliable systems that keep critical infrastructure running at its best? Were supporting a pioneering space-tech company looking to hire a
Site Reliability Engineer (SRE)
to strengthen their DevOps and cloud environments. In this role, youll work closely with a collaborative team of engineers to design and implement strategies that improve system resilience, automate monitoring, and respond to live incidents. If youre passionate about scaling infrastructure in high-impact environments, this could be the perfect opportunity. What Youll Be Doing Designing and implementing strategies to boost system reliability Monitoring and optimizing cloud systems (AWS, Kubernetes, Datadog) Developing custom monitoring and alerting solutions Supporting on-call rotations and incident response Championing SRE best practices across the organisation
What Were Looking For
2+ years of experience with cloud platforms (AWS preferred) Hands-on with Kubernetes and monitoring tools (Datadog, Prometheus, Grafana) Scripting and automation skills (Python or similar) Strong problem-solving, troubleshooting, and collaboration skills Exposure to DevOps methodologies, Terraform, or CI/CD pipelines is a plus
Location and Employment Type
Location:
Hybrid (3 days a week on site) Irvine Employment Type:
Full-time, permanent Seniority level
Entry level Job function
Engineering and Information Technology Industries
Defense and Space; Manufacturing and Space Research and Technology
#J-18808-Ljbffr
Site Reliability Engineer (SRE). Are you motivated by building reliable systems that keep critical infrastructure running at its best? Were supporting a pioneering space-tech company looking to hire a
Site Reliability Engineer (SRE)
to strengthen their DevOps and cloud environments. In this role, youll work closely with a collaborative team of engineers to design and implement strategies that improve system resilience, automate monitoring, and respond to live incidents. If youre passionate about scaling infrastructure in high-impact environments, this could be the perfect opportunity. What Youll Be Doing Designing and implementing strategies to boost system reliability Monitoring and optimizing cloud systems (AWS, Kubernetes, Datadog) Developing custom monitoring and alerting solutions Supporting on-call rotations and incident response Championing SRE best practices across the organisation
What Were Looking For
2+ years of experience with cloud platforms (AWS preferred) Hands-on with Kubernetes and monitoring tools (Datadog, Prometheus, Grafana) Scripting and automation skills (Python or similar) Strong problem-solving, troubleshooting, and collaboration skills Exposure to DevOps methodologies, Terraform, or CI/CD pipelines is a plus
Location and Employment Type
Location:
Hybrid (3 days a week on site) Irvine Employment Type:
Full-time, permanent Seniority level
Entry level Job function
Engineering and Information Technology Industries
Defense and Space; Manufacturing and Space Research and Technology
#J-18808-Ljbffr