Babylist
Staff Software Engineer, Site Reliability
Babylist, Emeryville, California, United States, 94608
Overview
Babylist is the leading registry, e‑commerce, and content platform for growing families. We serve 9+ million customers annually with a focus on seamless purchasing, trusted guidance, and expert product recommendations. Our ecosystem includes the Babylist Shop, Babylist Health, and a flagship showroom in Los Angeles. With over $1B in annual GMV and substantial 2024 revenue, we are shaping the $320B baby product industry. Our mission is to connect growing families with everything they need to thrive. To learn more, visit www.babylist.com. What the Role Is
Staff Software Engineer, Site Reliability is a pivotal role on the Platform team. You will help ensure our systems and services are stable, scalable, and reliable. You will collaborate with all Babylist Engineering teams to support shared infrastructure and developer tools. Your expertise in site reliability engineering, AWS cloud infrastructure, and modern DevOps practices will be instrumental in optimizing our systems and driving continuous improvement. Responsibilities
Manage and build our AWS infrastructure using Infrastructure as Code (IaC) tools like Terraform, ensuring EKS clusters and databases run with up‑to‑date versions for performance and reliability. Improve the speed and reliability of our Continuous Integration (CI) systems to support the Engineering Team, enabling faster and more efficient development and deployment. Provide support to developers in troubleshooting issues across local development, staging, and production environments. Establish, communicate, and support best practices for monitoring and alerting, setting up effective monitoring systems and actionable alerts for proactive incident management. Qualifications
8+ years of Experience as a Site Reliability Engineer or similar role, with a background in maintaining highly available and scalable systems. Experience supporting high-traffic consumer-facing websites; understanding the challenges of maintaining such systems. Proficiency with Terraform and IaC for managing AWS infrastructure. Strong experience with AWS cloud-based infrastructure and services, focusing on reliability, performance, and security. Proficiency with Docker and Kubernetes, contributing to design, deployment, and management of containerized applications. Solid understanding of cloud‑native design, including CDNs, load balancers, cloud networking, DNS, caching, and distributed systems. Excellent troubleshooting and debugging skills across environments. Experience designing and supporting CI systems (e.g., CircleCI, Jenkins, GitHub Actions). Familiarity with monitoring/alerting tools (e.g., Datadog, Cronitor, Sentry, PagerDuty) for proactive incident management. On‑call management experience, including incident response, escalation, and post‑incident reviews. Strong communication and collaboration skills to work effectively with cross‑functional teams. Why You Will Love Working At Babylist
Investments in the infrastructure and tools you need to be successful, plus a stipend to help set up your home office. Products that have a positive impact on millions of people. Sustainable pace and real work/life balance. Belief that technology and data can solve hard problems, with AI meaningfully embedded in tools, systems, and decision‑making. Exceptional management and opportunities for career advancement. Competitive pay and meaningful benefits, including company‑paid health, dental, and vision insurance, 401(k) with company match, and generous parental leave. Perks supporting wellbeing, parenting, childcare, and financial planning. Compensation & Benefits
Babylist follows a market‑based approach to compensation. The starting salary range for this role is $199,200.00 to $239,040.00, representing the lowest to highest compensation we reasonably expect to offer. Final starting salary is determined by skills, experience, and location, with future adjustments based on growth, performance, and pay equity. In addition to salary, we offer equity, bonus opportunities, and a comprehensive benefits package. Notes
Official communications will come from an @babylist.com email address. We use AI to transcribe interviews for fair evaluation in compliance with data privacy laws. By applying, you acknowledge this use. For more information, see our careers page.
#J-18808-Ljbffr
Babylist is the leading registry, e‑commerce, and content platform for growing families. We serve 9+ million customers annually with a focus on seamless purchasing, trusted guidance, and expert product recommendations. Our ecosystem includes the Babylist Shop, Babylist Health, and a flagship showroom in Los Angeles. With over $1B in annual GMV and substantial 2024 revenue, we are shaping the $320B baby product industry. Our mission is to connect growing families with everything they need to thrive. To learn more, visit www.babylist.com. What the Role Is
Staff Software Engineer, Site Reliability is a pivotal role on the Platform team. You will help ensure our systems and services are stable, scalable, and reliable. You will collaborate with all Babylist Engineering teams to support shared infrastructure and developer tools. Your expertise in site reliability engineering, AWS cloud infrastructure, and modern DevOps practices will be instrumental in optimizing our systems and driving continuous improvement. Responsibilities
Manage and build our AWS infrastructure using Infrastructure as Code (IaC) tools like Terraform, ensuring EKS clusters and databases run with up‑to‑date versions for performance and reliability. Improve the speed and reliability of our Continuous Integration (CI) systems to support the Engineering Team, enabling faster and more efficient development and deployment. Provide support to developers in troubleshooting issues across local development, staging, and production environments. Establish, communicate, and support best practices for monitoring and alerting, setting up effective monitoring systems and actionable alerts for proactive incident management. Qualifications
8+ years of Experience as a Site Reliability Engineer or similar role, with a background in maintaining highly available and scalable systems. Experience supporting high-traffic consumer-facing websites; understanding the challenges of maintaining such systems. Proficiency with Terraform and IaC for managing AWS infrastructure. Strong experience with AWS cloud-based infrastructure and services, focusing on reliability, performance, and security. Proficiency with Docker and Kubernetes, contributing to design, deployment, and management of containerized applications. Solid understanding of cloud‑native design, including CDNs, load balancers, cloud networking, DNS, caching, and distributed systems. Excellent troubleshooting and debugging skills across environments. Experience designing and supporting CI systems (e.g., CircleCI, Jenkins, GitHub Actions). Familiarity with monitoring/alerting tools (e.g., Datadog, Cronitor, Sentry, PagerDuty) for proactive incident management. On‑call management experience, including incident response, escalation, and post‑incident reviews. Strong communication and collaboration skills to work effectively with cross‑functional teams. Why You Will Love Working At Babylist
Investments in the infrastructure and tools you need to be successful, plus a stipend to help set up your home office. Products that have a positive impact on millions of people. Sustainable pace and real work/life balance. Belief that technology and data can solve hard problems, with AI meaningfully embedded in tools, systems, and decision‑making. Exceptional management and opportunities for career advancement. Competitive pay and meaningful benefits, including company‑paid health, dental, and vision insurance, 401(k) with company match, and generous parental leave. Perks supporting wellbeing, parenting, childcare, and financial planning. Compensation & Benefits
Babylist follows a market‑based approach to compensation. The starting salary range for this role is $199,200.00 to $239,040.00, representing the lowest to highest compensation we reasonably expect to offer. Final starting salary is determined by skills, experience, and location, with future adjustments based on growth, performance, and pay equity. In addition to salary, we offer equity, bonus opportunities, and a comprehensive benefits package. Notes
Official communications will come from an @babylist.com email address. We use AI to transcribe interviews for fair evaluation in compliance with data privacy laws. By applying, you acknowledge this use. For more information, see our careers page.
#J-18808-Ljbffr