Nordstrom
Senior Manager, Site Reliability Engineering (SRE) - Hybrid - Seattle
Nordstrom, Seattle, Washington, us, 98127
Overview
Senior Manager, Site Reliability Engineering (SRE) - Hybrid - Seattle. Nordstrom is seeking a strategic and hands-on Senior Manager of Site Reliability Engineering to lead our SRE team in delivering resilient, scalable, and high-performing systems. You’ll guide a team of engineers, champion automation, and collaborate across disciplines to ensure our infrastructure supports business growth and innovation. Responsibilities
Lead & Inspire: Build and mentor a high-performing SRE team. Foster a culture of ownership, innovation, and continuous learning. Drive Reliability: Ensure the availability and performance of critical services through proactive monitoring, incident response, and root cause analysis. Automate Everything: Reduce manual toil by implementing automation across deployment, recovery, and scaling processes. Monitor & Observe: Define and execute observability strategies using tools like New Relic and Splunk to detect and resolve issues before they impact users. Collaborate & Align: Partner with engineering, product, and operations teams to align reliability goals with business priorities. Plan for Scale: Lead capacity planning and performance tuning for services running on AWS EKS and other cloud-native platforms. Measure & Improve: Establish and track SLOs, SLAs, and error budgets; continuously refine processes to improve system reliability and team efficiency. Qualifications
Experience: 5+ years in SRE, DevOps, or infrastructure engineering, with 2+ years in a leadership role. Technical Depth: Expertise in cloud platforms (especially AWS), container orchestration (Kubernetes/EKS), and CI/CD pipelines. Programming Skills: Proficiency in Python, Go, or Java. Tool Mastery: Hands-on experience with New Relic, Splunk, Kubernetes. Problem Solver: Strong analytical skills and a passion for root cause analysis and continuous improvement. Communicator: Clear, concise, and collaborative communicator who thrives in cross-functional environments. Education: Bachelor’s degree in Computer Science, Engineering, or equivalent experience. Bonus Points: Experience with large-scale distributed systems; familiarity with ITIL or similar incident management frameworks; cloud certifications (e.g., AWS Solutions Architect, Google Cloud Professional Engineer). Benefits & Notices
We’ve got you covered… Nordstrom offers a variety of benefits to support employees and their families, including medical/vision/dental, retirement options, paid time away, life insurance, disability, and merchandise discounts. Nordstrom conducts background checks and considers qualified applicants with criminal histories in a manner consistent with all legal requirements. Applicants with disabilities who require assistance or accommodation should contact the nearest Nordstrom location.
#J-18808-Ljbffr
Senior Manager, Site Reliability Engineering (SRE) - Hybrid - Seattle. Nordstrom is seeking a strategic and hands-on Senior Manager of Site Reliability Engineering to lead our SRE team in delivering resilient, scalable, and high-performing systems. You’ll guide a team of engineers, champion automation, and collaborate across disciplines to ensure our infrastructure supports business growth and innovation. Responsibilities
Lead & Inspire: Build and mentor a high-performing SRE team. Foster a culture of ownership, innovation, and continuous learning. Drive Reliability: Ensure the availability and performance of critical services through proactive monitoring, incident response, and root cause analysis. Automate Everything: Reduce manual toil by implementing automation across deployment, recovery, and scaling processes. Monitor & Observe: Define and execute observability strategies using tools like New Relic and Splunk to detect and resolve issues before they impact users. Collaborate & Align: Partner with engineering, product, and operations teams to align reliability goals with business priorities. Plan for Scale: Lead capacity planning and performance tuning for services running on AWS EKS and other cloud-native platforms. Measure & Improve: Establish and track SLOs, SLAs, and error budgets; continuously refine processes to improve system reliability and team efficiency. Qualifications
Experience: 5+ years in SRE, DevOps, or infrastructure engineering, with 2+ years in a leadership role. Technical Depth: Expertise in cloud platforms (especially AWS), container orchestration (Kubernetes/EKS), and CI/CD pipelines. Programming Skills: Proficiency in Python, Go, or Java. Tool Mastery: Hands-on experience with New Relic, Splunk, Kubernetes. Problem Solver: Strong analytical skills and a passion for root cause analysis and continuous improvement. Communicator: Clear, concise, and collaborative communicator who thrives in cross-functional environments. Education: Bachelor’s degree in Computer Science, Engineering, or equivalent experience. Bonus Points: Experience with large-scale distributed systems; familiarity with ITIL or similar incident management frameworks; cloud certifications (e.g., AWS Solutions Architect, Google Cloud Professional Engineer). Benefits & Notices
We’ve got you covered… Nordstrom offers a variety of benefits to support employees and their families, including medical/vision/dental, retirement options, paid time away, life insurance, disability, and merchandise discounts. Nordstrom conducts background checks and considers qualified applicants with criminal histories in a manner consistent with all legal requirements. Applicants with disabilities who require assistance or accommodation should contact the nearest Nordstrom location.
#J-18808-Ljbffr