Google Inc.

Software Engineering Manager, Emerging On-prem AI Infrastructure

Google Inc., Sunnyvale, California, United States, 94087


By applying to this position you will have an opportunity to share your preferred working location from the following: Kirkland, WA, USA; Sunnyvale, CA, USA.

Experience level: Advanced. Experience owning outcomes and decision making, solving ambiguous problems, and influencing stakeholders; deep domain expertise.

Experience Requirements

Bachelor’s degree, or equivalent practical experience.

8 years of experience in software development.

3 years of experience with developing large-scale infrastructure, distributed systems or networks, or experience with compute technologies, storage or hardware architecture.

3 years of experience in a technical leadership role.

2 years of experience in a people management or team leadership role.

Preferred Qualifications

Experience in end-to-end (E2E) diagnostics, troubleshooting, and supportability, combined with leading "SWAT team" efforts for issues and developing long-term, sustainable solutions.

Experience building cloud or systems-level infrastructure spanning the entire hardware and software stack, or a passion for building deep system skills.

Understanding of deep systems and the ability to guide a team in building sustainable systems.

Familiarity with SLO and metrics measurement, and with integrating logs, telemetry, and metrics into tools that improve the operator experience.

Ability to demonstrate comfort with changing priorities and ambiguity, and a track record of delivering solutions for subtle or technical problems.

Responsibilities

Drive project success by setting the technical goals and roadmap for diagnostics/operational tooling and repair automation.

Lead and manage the team, which includes a 40-60 split of Software Development Engineers (SDEs) and Software Engineers (SWEs), ensuring the right skill sets are in place for frontline support, debugging, and building sustainable systems.

Ensure the team takes central responsibility for diagnosing and troubleshooting end-to-end supportability issues, uncovering and addressing technical problems, and building repair automation systems.

Implement and govern the success metrics for the team, spanning Operational Plane metrics (e.g., support case metrics, Global Services Operations (GSO) case handling) and Return Merchandise Authorization (RMA) or Spares metrics (e.g., swap and repair rate).

Google is proud to be an equal opportunity and affirmative action employer. We are committed to building a workforce that is representative of the users we serve, creating a culture of belonging, and providing an equal employment opportunity regardless of race, creed, color, religion, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition (including breastfeeding), expecting or parents-to-be, criminal histories consistent with legal requirements, or any other basis protected by law. See also Google's EEO Policy, Know your rights: workplace discrimination is illegal, Belonging at Google, and How we hire.

United States base salary range for this full-time position is $197,000-$291,000 plus bonus, equity and benefits. Individual pay is determined by work location and additional factors, including job-related skills, experience and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process. Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus and equity or benefits. Learn more about benefits at Google.
