Logo
NVIDIA

Distinguished Software Architect - Deep Learning and HPC Communications

NVIDIA, Germantown, Ohio, United States

Save Job

Overview

NVIDIA is leading in Artificial Intelligence, High Performance Computing and Visualization. The GPU is central to our products and services, enabling creativity and discovery across AI, autonomous vehicles and more. We are looking for a Distinguished Software Architect to help co-design our next generation data center platforms for DL and HPC communications, with emphasis on performance across large GPU deployments and high-speed interconnects (e.g., NVLink, PCIe) and networking (e.g., InfiniBand, Ethernet). Join to apply for the

Distinguished Software Architect - Deep Learning and HPC Communications

role at

NVIDIA . What You Will Be Doing

Research new communication technologies (e.g., expand the GPUDirect technology portfolio) and design new features for our communication libraries Propose innovative HW/SW solutions for next-gen platforms and co-design these with GPU, Networking, and SW architects; ensure seamless integration with software stacks Inspire changes based on quantitative data from proof-of-concepts or detailed analysis/modeling Drive adoption of new communication technologies across application verticals Collaborate with diverse teams (internal and external), including DL researchers and customers What We Need To See

PHD in Computer Science, Computer Engineering or related field or strong equivalent experience; 15+ years of relevant experience in academia or industry Expertise in HPC, parallel programming models (MPI, SHMEM), at least one communication runtime (MPI, NCCL, NVSHMEM, OpenSHMEM, UCX, UCC), computer and system architecture, GPU architecture and CUDA Deep understanding of high-performance networking: InfiniBand, Ethernet, network design/topologies, debugging and performance analysis Strong in several areas: ML/DL fundamentals related to communications, parallel algorithms, fault tolerance, performance analysis and optimization for large clusters, DL frameworks (PyTorch, TensorFlow) Programming fluency with C or C++ for systems software Ability to work and communicate across HW/SW teams and timezones Ways To Stand Out From The Crowd

Industry-recognized leader in HPC/DL communications with patents, publications and conference talks Influential in industry standards (MPI, OpenSHMEM) and open source software (PyTorch, UCX, Open MPI) About NVIDIA and Perks

NVIDIA is widely regarded as a top technology employer with forward-thinking teams. If you are creative and autonomous, we want to hear from you! JR1997938 Seniority level

Mid-Senior level Employment type

Full-time Job function

Industries Industries

Computer Hardware Manufacturing Software Development Computers and Electronics Manufacturing Referrals increase your chances of interviewing at NVIDIA by 2x. Get notified about new Software Architect jobs in

Germany . Cologne, North Rhine-Westphalia, Germany

#J-18808-Ljbffr