Logo
Canonical

Site Reliability / Gitops Engineer

Canonical, San Jose, California, United States, 95199

Save Job

Join to apply for the

Site Reliability / Gitops Engineer

role at

Canonical . Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include leading public cloud and silicon providers, and industry leaders across sectors. The company is founder-led, profitable, and growing. We are hiring a

Site Reliability / Gitops Engineer

for our Information Systems (IS) team. This role is ideal for an automation-focused technologist passionate about Linux, aiming to build a career with Canonical and support those leveraging Ubuntu and open source products. Experience in IT operations automation, Infrastructure as Code, and a passion for technology are essential. Job Summary

The IS team supports and maintains all of Canonical's IT production services, serving over 60 million Ubuntu users. As an SRE & Gitops engineer, you will drive operations automation in private and public clouds using open source infrastructure as code, CI/CD pipelines, and Canonical's automation products. You will also provide feedback to developers on product operation at scale, contribute to open-source projects, and collaborate across teams. This role is remote and can be based in any timezone. Responsibilities

Develop infrastructure as code practices, increasing automation and process improvements. Automate software operations for reusability and consistency across clouds. Enhance the resilience and scalability of Canonical’s cloud and container services. Maintain operational responsibility for core services, networks, and infrastructure. Use observability tools like Prometheus, Grafana, Elasticsearch for troubleshooting, capacity planning, and performance monitoring. Collaborate on service architecture, documentation, and operational procedures. Support and work with globally distributed teams. Focus on larger projects and automation tasks during dedicated development time. Share knowledge through design sessions, mentorship, and collaborative work. Handle time-critical escalations responsibly. Requirements

Extensive experience defining operations in code, using version control, peer review, and CI/CD. Strong engineering background with experience in peer review, unit testing, SCM, CI/CD, Agile methodologies. Proficient in Python with large project experience. Practical Linux networking, routing, and firewall knowledge. Experience with Linux storage solutions like Ceph and databases. Hands-on enterprise Linux server administration. Deep understanding of cloud computing concepts and technologies. Bachelor’s degree or higher in computer science or related fields. Effective communication skills in English. Motivated troubleshoot from kernel to web, collaborative, and quick to learn. Flexible, adaptable to fast-changing environments, and comfortable working in distributed teams. Familiarity and passion for open source, especially Ubuntu or Debian. Additional Information

Canonical is committed to diversity and equal opportunity employment. We foster an inclusive environment free from discrimination, welcoming applicants of all backgrounds.

#J-18808-Ljbffr