GitLab
Intermediate Site Reliability Engineer, Database Operations
GitLab, Germantown, Ohio, United States
Overview
Intermediate Site Reliability Engineer, Database Operations – GitLab. The Database Operations team owns the lifecycle of the PostgreSQL database engine for GitLab.com, focusing on reliability, scalability, performance, and security of the database and its supporting services. The role emphasizes building services on Reliability::Foundations and cloud vendor managed products to reduce complexity and deliver new capabilities quickly. GitLab values AI-driven productivity and a high-performance culture. Responsibilities
Automate operational tasks across environments (e.g., package updates, configuration changes, provisioning tools for user-facing services). Respond to platform emergencies, alerts, and escalations from Customer Support. Design and maintain automated processes to minimize manual intervention for software lifecycle management (e.g., operating systems). Develop a fully automated, multi-environment observability stack and extend it to predict capacity based on usage patterns. Plan new service rollouts, capacity management, and collaborate with users to optimize resource consumption. What You Will Do as an SRE
Work on database reliability and performance for GitLab.com from within the SRE team and ship solutions with the product. Analyze solutions and implement best practices for PostgreSQL clusters and components. Improve observability of database metrics and meet database objectives. Collaborate with peer SREs to roll out changes and mitigate database-related production incidents. On-call support on rotation. Provide database expertise to engineering teams (e.g., reviews of migrations, queries, performance optimizations). Automate database infrastructure and provide self-service tools for engineering success. Use GitLab product to run GitLab.com as a first resort and improve the product. Plan growth of GitLab’s database infrastructure and design/maintain core database components to support high concurrency. Support and debug database production issues across services and stack levels. Design monitoring and alerting to detect symptoms and not just outages; document actions to build repeatable processes and automation. Qualifications
Experience running PostgreSQL in high-growth, large production environments using self-managed (VM, Kubernetes with PostgreSQL Operators) and DBaaS. Hands-on experience using PostgreSQL internals for design, construction, and troubleshooting of systems. Experience with infrastructure automation, orchestration, and configuration management (Chef, Ansible, Puppet, Terraform). Strong SQL and PL/pgSQL knowledge. Significant experience in large SaaS distributed systems production environments. Strong written and verbal English communication and collaboration skills, with ability to work asynchronously. Proactive, go-for-it attitude and ability to document learnings for repeatability and faster delivery. Solid data modeling and data structure design skills. Bonus: programming skills (Ruby and/or Go) and experience with ClickHouse or other modern OLAP databases. Projects You Could Work On
Review and implement database administration solutions (backups, performance tuning). Build automation using Ansible, Terraform, Chef; automate replica setup and backup/testing monitoring. Develop self-service tools for engineers using GitLab ChatOps. Provide technical assistance on database design methodologies, resource planning, and tuning. Review database-related changes (e.g., migrations) and recommend query/schema optimizations. Respond to production incidents and contribute to infrastructure design and scalability focusing on data storage. Plan and implement future database requirements including capacity planning and upgrades. Company and Employment Details
Country Hiring Guidelines: GitLab hires worldwide. All roles are remote; location eligibility may apply. Privacy Policy: Review Recruitment Privacy Policy. GitLab is an equal opportunity employer; policies are merit-based and protect against discrimination. Seniority level
Associate Employment type
Full-time Job function and Industries
Engineering and Information Technology IT Services and IT Consulting and Software Development
#J-18808-Ljbffr
Intermediate Site Reliability Engineer, Database Operations – GitLab. The Database Operations team owns the lifecycle of the PostgreSQL database engine for GitLab.com, focusing on reliability, scalability, performance, and security of the database and its supporting services. The role emphasizes building services on Reliability::Foundations and cloud vendor managed products to reduce complexity and deliver new capabilities quickly. GitLab values AI-driven productivity and a high-performance culture. Responsibilities
Automate operational tasks across environments (e.g., package updates, configuration changes, provisioning tools for user-facing services). Respond to platform emergencies, alerts, and escalations from Customer Support. Design and maintain automated processes to minimize manual intervention for software lifecycle management (e.g., operating systems). Develop a fully automated, multi-environment observability stack and extend it to predict capacity based on usage patterns. Plan new service rollouts, capacity management, and collaborate with users to optimize resource consumption. What You Will Do as an SRE
Work on database reliability and performance for GitLab.com from within the SRE team and ship solutions with the product. Analyze solutions and implement best practices for PostgreSQL clusters and components. Improve observability of database metrics and meet database objectives. Collaborate with peer SREs to roll out changes and mitigate database-related production incidents. On-call support on rotation. Provide database expertise to engineering teams (e.g., reviews of migrations, queries, performance optimizations). Automate database infrastructure and provide self-service tools for engineering success. Use GitLab product to run GitLab.com as a first resort and improve the product. Plan growth of GitLab’s database infrastructure and design/maintain core database components to support high concurrency. Support and debug database production issues across services and stack levels. Design monitoring and alerting to detect symptoms and not just outages; document actions to build repeatable processes and automation. Qualifications
Experience running PostgreSQL in high-growth, large production environments using self-managed (VM, Kubernetes with PostgreSQL Operators) and DBaaS. Hands-on experience using PostgreSQL internals for design, construction, and troubleshooting of systems. Experience with infrastructure automation, orchestration, and configuration management (Chef, Ansible, Puppet, Terraform). Strong SQL and PL/pgSQL knowledge. Significant experience in large SaaS distributed systems production environments. Strong written and verbal English communication and collaboration skills, with ability to work asynchronously. Proactive, go-for-it attitude and ability to document learnings for repeatability and faster delivery. Solid data modeling and data structure design skills. Bonus: programming skills (Ruby and/or Go) and experience with ClickHouse or other modern OLAP databases. Projects You Could Work On
Review and implement database administration solutions (backups, performance tuning). Build automation using Ansible, Terraform, Chef; automate replica setup and backup/testing monitoring. Develop self-service tools for engineers using GitLab ChatOps. Provide technical assistance on database design methodologies, resource planning, and tuning. Review database-related changes (e.g., migrations) and recommend query/schema optimizations. Respond to production incidents and contribute to infrastructure design and scalability focusing on data storage. Plan and implement future database requirements including capacity planning and upgrades. Company and Employment Details
Country Hiring Guidelines: GitLab hires worldwide. All roles are remote; location eligibility may apply. Privacy Policy: Review Recruitment Privacy Policy. GitLab is an equal opportunity employer; policies are merit-based and protect against discrimination. Seniority level
Associate Employment type
Full-time Job function and Industries
Engineering and Information Technology IT Services and IT Consulting and Software Development
#J-18808-Ljbffr