NVIDIA

Senior Software Engineer - Bare Metal Provisioning

NVIDIA, California, Missouri, United States, 65018

Overview

The NVIDIA DGX Cloud organization is looking for software engineering talent to build NVIDIA’s accelerated compute infrastructure. This includes software to assist in the rapid bring-up, operation, configuration, and troubleshooting of compute hardware and networking equipment. As a software engineer, you will work with other software engineers, product architects, and product managers as a collaborative team to deliver and support end-to-end software solutions that manage complex cloud infrastructure deployments. You will write services and software that align with the broad architectural vision for the NVIDIA Cloud Platform, working with other teams to develop a robust and scalable system. You own your code from development to commit to test to production, including operational support. We expect you to be passionate about code quality, testing, deployment efficiency and simplicity, and bringing amazing products to market.

Responsibilities

Work with NVIDIA internal customers.

Design and build scalable software systems to manage NVIDIA’s cloud infrastructure.

Participate in responses to real-time operational events.

Build network and systems automation software for managing a multi-tenant cloud infrastructure.

Participate in the open-source communities for the software we use and build.

Present to internal stakeholders and NVIDIA leadership on roadmaps, vision, and demos.

Qualifications

5+ years of experience with designing and building distributed software systems.

Track record of directly supporting systems with external customers or demanding internal customers.

BS/MS degree in Computer Science or a related area (or equivalent experience).

Demonstrated ability to write code in a mainstream systems programming language such as C, C++, Golang, or Rust.

Demonstrated ability to design and implement maintainable APIs for consumers.

Practical experience with asynchronous programming, type safety, threading models, state machines, and data structures.

Background in data persistence (SQL or similar).

Understanding of secure communication protocols (mutual-TLS, IPsec, or similar).

Knowledge of SRE principles (observability, SLOs, logging, etc.).

Ways To Stand Out From The Crowd

Experience at a hyperscale cloud service provider (public-facing or not).

Understanding of networking protocols such as IP, IPv6, BGP, HTTP, ICMP, tunneling protocols (VXLAN, Geneve, FoU, GRE), etc.

Familiarity with InfiniBand networking.

Background with host management systems (DHCP, Redfish, UEFI) and host security services such as TPM, TXT, and Secure Boot.

Experience with Kubernetes and/or distributed task scheduling.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. We do not discriminate in our hiring and promotion practices on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law. Applications for this job will be accepted at least until August 11, 2025.
