Logo
NVIDIA

Senior Engineering Manager - Compute Server Bring Up

NVIDIA, Santa Clara, California, us, 95053

Save Job

Senior Engineering Manager - Compute Server Bring Up Join NVIDIA as a Senior Engineering Manager specializing in Compute Server Bring‑Up. This role leads the bring‑up, integration, validation, and troubleshooting of compute tray platforms of GPU racks, ensuring servers meet requirements before mass deployment in data centers.

Responsibilities

Own initial power‑on and board bring‑up: lead functional validation of compute trays (CPU, GPU, NIC, storage, cooling) internally and with customers.

Form and lead a virtual team across NVIDIA software & firmware teams; provide status reporting.

Oversee flashing, updating, and validation of firmware; perform boundary, stress, and regression testing; document pain points and recovery flows.

Factory & manufacturing support: firmware updates, diagnostic procedures, BOM change sign‑off, process optimization.

Debug, issue resolution & customer support: lead root cause analysis for bring‑up failures; collaborate with partners, ODMs, and customers.

Documentation & knowledge transfer: maintain platform design guides, bring‑up checklists, install instructions; provide training internally and externally.

Product ownership: drive product life cycles with QA teams, ensuring robust bring‑up, productization, and delivery.

Performance management: conduct evaluations, develop excellence culture, ensure high productivity.

Qualifications

5+ years relevant experience managing systems/platform software teams, ideally in server bring‑up, firmware development, or data center solutions.

Deep experience operating successfully in a matrix environment, leading high‑impact virtual teams.

BS, MS, or PhD in EE/CS or related field (or equivalent experience); 12+ overall years of experience.

Strong knowledge of compute tray designs, firmware enablement, and system‑level architecture.

Proven track record delivering scalable server products & solutions for large‑scale data centers.

Experience collaborating with hardware, firmware, manufacturing, diagnostics, and QA teams.

Experience with SCM (Git, Perforce) and project management tools (Jira).

Excellent written & oral communication; strong work ethic and dedication to teamwork.

Hands‑on experience with x86/ARM system architecture & coding (C/C++, Python).

Self‑starter, creative problem‑solver.

Proven excellence in server architecture, collaborating across teams to deliver products per KPIs.

Ways to Stand Out

Experience leading bring‑up for sophisticated compute architectures like GB200 NVL72.

Benefits & Compensation Base salary range: $272,000 – $425,500 (USD). Eligibility for equity and additional benefits.

EEO Statement NVIDIA is committed to fostering a diverse work environment and is a proud equal‑opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.

Locations Sunnyvale, CA and San Jose, CA (details available upon request).

Application Deadline Applications accepted until December 5, 2025.

#J-18808-Ljbffr