Site Reliability Engineering Manager
nJob Category: Product Engineering
nLocation: US - Massachusetts - Waltham
nMeet Our Team:
nJoin Pega's Services Engineering organization, where we build and support the infrastructure that powers our next-generation SaaS offerings. As an SRE Manager, you'll lead a globally distributed team focused on the reliability, scalability, and performance of Launchpad, our flagship SaaS product.
nPlease note this role is onsite in Waltham 3 times per week.
nPicture Yourself at Pega:
nAt Pega, we're redefining how enterprises build and run software. Our SRE team ensures our systems are robust, resilient, and ready to scale. You'll work alongside engineering leaders and cross-functional teams to drive operational excellence and deliver seamless customer experiences.
nWhat You'll Do at Pega:
n- n
- n
Lead and mentor a team of SREs supporting the Launchpad SaaS platform.
n n - n
Define and implement SLOs, SLIs, and error budgets to guide reliability goals.
n n - n
Collaborate with product and platform teams to design scalable, fault-tolerant systems.
n n - n
Drive incident response, root cause analysis, and continuous improvement.
n n - n
Champion automation, observability, and DevOps best practices.
n n - n
Foster a culture of collaboration across time zones and cultures, ensuring smooth handoffs and consistent service delivery.
n n
Who You Are:
n- n
- n
A seasoned engineering leader with a passion for reliability and operational excellence.
n n - n
Proven experience managing SRE or DevOps teams in a cloud-native environment.
n n - n
Comfortable working with globally distributed teams and navigating cultural and time zone differences.
n n - n
Strong background in systems engineering, distributed systems, and CI/CD pipelines.
n n - n
Excellent communicator and collaborator, able to influence across functions.
n n
What You've Accomplished:
n- n
- n
8+ years in software engineering or infrastructure roles, with 3+ years in a technical or people leadership capacity.
n n - n
Strong experience applying SRE principles in a SaaS company.
n n - n
Hands-on experience with cloud platforms (AWS, Azure, or GCP), Kubernetes, and monitoring tools (e.g., Prometheus, Grafana, Datadog).
n n - n
Demonstrated success in scaling systems and teams in a fast-paced environment.
n n
Education & Certifications
n- n
- Bachelor's degree in Computer Science, Engineering, or a related technical field n
Relevant certifications such as:
n- n
- n
AWS Certified DevOps Engineer or Solutions Architect
n n - n
Google Cloud Professional DevOps Engineer
n n - n
Certified Kubernetes Administrator (CKA)
n n - n
ITIL Foundation or SRE Foundation (optional but a plus)
n n
Pega Offers You:
n- n
- n
Analyst-acclaimed technology leadership.
n n - n
Continuous learning and development opportunities.
n n - n
An innovative, inclusive, agile, and fun work environment.
n n - n
Competitive compensation, bonus incentives, and equity participation.
n n
#LI-CL1
nJob ID: 22400
nIt is Pega's policy to engage, recruit, hire, promote, train, discipline, and compensate in all job classifications, without regard to race, color, sex, religion, national origin, age, disability, sexual orientation, gender identity, veteran status, or any other category protected by law.
n