Senior Software Engineer - Parallel Computing Systems
NVIDIA - California
Work at NVIDIA
Overview
- View job
Overview
Senior Software Engineer - Parallel Computing Systems
Join to apply for the Senior Software Engineer - Parallel Computing Systems role at NVIDIA
Senior Software Engineer - Parallel Computing Systems
Join to apply for the Senior Software Engineer - Parallel Computing Systems role at NVIDIA
Direct message the job poster from NVIDIA
Passionate About AI & Machine Learning - NVIDIA
Do you have expertise in CUDA kernel optimization, C++ systems programming, or compiler infrastructure? Join NVIDIA's nvFuser team to build the next-generation fusion compiler that automatically optimizes deep learning models for workloads scaling to thousands of GPUs! We're looking for engineers who excel at parallel programming and systems-level performance work and want to directly impact the future of AI compilation.
The Deep Learning Frameworks Team @ NVIDIA is responsible for building nvFuser , an advanced compiler that sits at the intersection of compiler technology and high-performance computing. You'll work closely with the PyTorch Core team and collaborate with Lightning-AI/Thunder , which integrates nvFuser to accelerate PyTorch workloads. We collaborate with hardware architects, framework maintainers, and optimization experts to create compiler infrastructure that advances GPU performance, developing manual optimization techniques into systematic, automated compiler optimizations.
What You'll Be Doing
As an nvFuser engineer, you'll work on exciting challenges in compiler technology and performance optimization! You'll design algorithms that generate highly optimized code from deep learning programs and build GPU-aware CPU runtime systems that coordinate kernel execution for maximum performance. Working directly with NVIDIA's hardware engineers, you'll master the latest GPU architectures while collaborating with optimization specialists to develop innovative techniques for emerging AI workloads. From debugging performance bottlenecks in thousand-GPU distributed systems to influencing next-generation hardware design, we push the boundaries of what's possible in AI compilation.
What We Need To See
- MS or PhD in Computer Science, Computer Engineering, Electrical Engineering, or related field (or equivalent experience).
- 4+ years advanced C++ programming with large codebase development, template meta-programming, and performance-critical code.
- Strong parallel programming experience with multi-threading, OpenMP, CUDA, MPI, NCCL, NVSHMEM, or other parallel computing technologies.
- Shown experience with low-level performance optimization and systematic bottleneck identification beyond basic profiling.
- Performance analysis skills: experience analyzing high-level programs to identify performance bottlenecks and develop optimization strategies.
- Collaborative problem-solving approach with adaptability in ambiguous situations, first-principles based thinking, and a sense of ownership.
- Excellent verbal and written communication skills.
- Experience with HPC/Scientific Computing: CUDA optimization, GPU programming, numerical libraries (cuBLAS, NCCL), or distributed computing.
- Compiler engineering background: LLVM, GCC, domain-specific language design, program analysis, or IR transformations and optimization passes.
- Deep technical foundation in CPU/GPU architectures, numeric libraries, modular software design, or runtime systems.
- Experience with large software projects, performance profiling, and demonstrated track record of rapid learning.
- Expertise with distributed parallelism techniques, tensor operations, auto-tuning, or performance modeling.
You will also be eligible for equity and benefits . NVIDIA accepts applications on an ongoing basis.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
JR1998609
Seniority level
Seniority level
Mid-Senior level
Employment type
Employment type
Full-time
Job function
Industries
Computer Hardware Manufacturing, Software Development, and Computers and Electronics Manufacturing
Referrals increase your chances of interviewing at NVIDIA by 2x
Get notified about new Senior System Software Engineer jobs in California, United States .
Senior Engineer - Warehouse Management System
Brisbane, CA $140,000 - $180,000 2 weeks ago
Los Angeles, CA $160,000 - $190,000 3 weeks ago
Mountain View, CA $160,000 - $230,000 1 year ago
Senior Systems Engineer (R&D and Product Development) - Remote
Founding Team Senior Software Engineer Manager for a Social Travel Platform targeting Digital Nomads
San Francisco, CA
$150,000.00
-
$200,000.00
4 months ago
Sr. Software Engineer - Marketing (SFMC)
Beverly Hills, CA
$128,000.00
-
$160,000.00
2 weeks ago
Custom IC & System Power Management Architect
Los Angeles, CA
$140,000.00
-
$170,000.00
1 month ago
Senior Full Stack Software Engineer (Billing)
Senior Software Engineer, Infrastructure
Senior Full Stack Software Engineer (Vault)
Santa Barbara, CA
$140,000.00
-
$200,000.00
1 month ago
Lancaster, CA
$175,000.00
-
$230,000.00
14 hours ago
San Francisco, CA
$150,000.00
-
$207,000.00
5 months ago
San Francisco, CA
$154,000.00
-
$210,000.00
4 weeks ago
California, United States
$164,500.00
-
$247,600.00
1 week ago
San Francisco, CA $130,000 - $200,000 11 months ago
San Francisco, CA $150,000 - $175,000 2 months ago
San Francisco, CA $150,000 - $170,000 3 weeks ago
Senior Software Engineer, Backend - Fintech
Backend Staff Software Engineer, On Trip
San Francisco, CA $204,000 - $260,000 2 weeks ago
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
#J-18808-Ljbffr