Logo
NVIDIA Corporation

Senior Full-Stack Software Engineer

NVIDIA Corporation, Santa Clara, California, us, 95053

Save Job

Senior Full-Stack Software Engineer page is loaded Senior Full-Stack Software Engineer Apply locations US, CA, Santa Clara US, WA, Seattle time type Full time posted on Posted 5 Days Ago job requisition id JR1999978 NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. Today, we're at the forefront of AI innovation powering breakthroughs in research, autonomous vehicles, robotics, and more. The DGX Cloud team builds and operates the AI infrastructure that fuels this progress. We’re looking for a Senior Full-Stack Software Engineer to join our DGX Cloud AI Infrastructure team and help deliver the next-generation user experience for NVIDIA’s GPU clusters and AI infrastructure. In this role, you’ll design and build a unified, self-service portal that serves as the front door to our AI compute platform enabling researchers to efficiently manage, monitor, and optimize their use of GPU clusters. You’ll work across the stack to build intuitive interfaces, streamline workflows, and surface actionable insights that improve the productivity of AI research teams across the company. What You’ll Be Doing: Design, develop, and deploy full-stack web applications to support large-scale AI infrastructure operations and workflows

Collaborate with AI and ML research teams to identify pain points and deliver tools that accelerate their work

Develop APIs, backend services, and UIs to improve visibility, observability, and control over large-scale GPU clusters

Develop backend services to manage job schedulers and cluster operations.

Define and track metrics that measure efficiency, resiliency, and developer productivity across the platform

Drive engineering excellence in testing, CI/CD, code quality, and performance

Lead architectural discussions and mentor junior engineers on design and implementation

Stay ahead of AI/ML infrastructure trends and drive adoption of best practices within the team

What We Need To See: 8+ years of experience in developing software infrastructure for large scale AI systems.

Bachelor's degree or higher in Computer Science or a related technical field (or equivalent experience).

Proficiency with full-stack development: JavaScript (Vue or React), Node.js, Python, and/or Golang, script languages

Experience with distributed systems and cloud-native technologies (Docker, Kubernetes, microservices)

Familiarity with observability stacks: ELK, OpenSearch, Prometheus, Grafana, or Loki

Strong debugging and root cause analysis skills across application and infrastructure layers

Experience with large-scale AI training, inference, or data infrastructure services

Excellent communication, collaboration, problem solving and a growth mindset

Ways to Stand Out from the crowd: Experience building developer platforms or self-service internal infrastructure tools for efficiency, resiliency, or observability.

Hands-on experience as a Machine Learning Engineer (MLE) or deep familiarity with DL frameworks (e.g., PyTorch, TensorFlow, JAX, Ray).

Hands-on experience operating at datacenter scale, including GPU cluster debugging and root cause analysis.

Experience with MongoDB, Hadoop, or Spark.

At NVIDIA, you’ll be immersed in a diverse, supportive environment where you’re empowered to do your best work. The DGX Cloud AI Infrastructure team is at the core of NVIDIA’s AI efforts building the software that makes scalable research possible. Join us and help power the next wave of innovation. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5. You will also be eligible for equity and benefits . Applications for this job will be accepted at least until August 25, 2025.NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law. Similar Jobs (5)

Senior Staff Software Engineer - Full Stack locations US, CA, Santa Clara time type Full time posted on Posted 8 Days Ago Senior DevOps Engineer locations 2 Locations time type Full time posted on Posted 11 Days Ago Senior Software Engineer, Cloud-Native Stack – CSP Engagements locations 4 Locations time type Full time posted on Posted 4 Days Ago NVIDIA is the world leader in accelerated computing. NVIDIA pioneered accelerated computing to tackle challenges no one else can solve. Our work in AI and digital twins is transforming the world's largest industries and profoundly impacting society.

#J-18808-Ljbffr