Microsoft
Senior Site Reliability Engineering Manager - CTJ - Top Secret
Microsoft, Reston, Virginia, United States, 22090
Are you interested in working on cutting‑edge cloud security products? Would you like to be part of one of the world’s most advanced cyber‑security solutions and protect millions of computers from thousands of active attack attempts every month? Look no further than the Microsoft Defender engineering team.

We are looking for a Senior Site Reliability Engineering (SRE) Manager. You will build and deliver cloud solutions at a scale that few companies in the industry are required to support. The Microsoft Defender team is responsible for delivering a constantly evolving set of services and solutions to counter ever‑evolving attackers. This team provides on‑call operational support and improves the operational posture of the Microsoft Defender products within U.S. Government clouds. You will operate production services and work closely with other engineering teams to ensure services and systems are highly stable, meet performance SLAs, and meet the expectations of internal and external customers and users.

Responsibilities
Lead Reliability Strategy: Drive the vision and execution of reliability, performance, and security across critical systems and services. Influence product design and engineering decisions to ensure resilient, scalable infrastructure.
Build and Scale Automation: Champion intelligent automation (AI/ML‑powered) for monitoring, deployment, and incident response to reduce manual overhead and accelerate safe delivery.
Drive Operational Excellence: Use telemetry and service‑level data to guide improvements in availability, efficiency, and cost. Lead post‑incident reviews and service improvement plans that restore customer trust and drive long‑term resilience.
Foster Engineering Partnerships: Collaborate deeply with product engineering and security teams from early development through production to align on reliability goals and prevent recurrence of issues.
Grow and Empower Teams: Attract, mentor, and develop high‑performing SRE talent. Create a culture of inclusion, learning, and accountability that supports career growth and innovation.
Shape Technical Direction: Guide architecture and tooling decisions across distributed systems and cloud infrastructure. Promote adoption of best practices and scalable solutions across teams.

Qualifications
Master’s Degree in Computer Science, Information Technology, or related field AND 2+ years technical experience in software engineering, network engineering, or systems administration
OR Bachelor’s Degree in Computer Science, Information Technology, or related field AND 4+ years technical experience in software engineering, network engineering, or systems administration
OR equivalent experience.
1+ year(s) of people‑management experience.
3+ years technical experience working with large‑scale cloud or distributed systems.

Security Clearance Requirements
Candidates must have an active U.S. Government Top‑Secret Security Clearance and meet Microsoft, customer and/or government security screening requirements. Failure to maintain or obtain the appropriate clearance may result in employment action up to and including termination.

Preferred Qualifications
Doctorate Degree in Computer Science, Information Technology, or related field AND 3+ years technical experience in software engineering, network engineering, or systems administration
OR equivalent.
6+ years technical experience in software engineering, network engineering, or systems administration (depending on degree).
8+ years technical experience alternative.

Compensation and Benefits
Site Reliability Engineering M4 – base pay range USD $119,800 – $234,700 per year; ranges differ by location (San Francisco Bay Area and New York City: USD $158,400 – $258,000 per year). Benefits and other compensation may also apply.
Microsoft will accept applications for the role until November 1, 2025.

Equal Opportunity
Microsoft is an equal opportunity employer. Consistent with applicable law, all qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.