NVIDIA
Distinguished Software Architect - Deep Learning and HPC Communications
NVIDIA, Germantown, Ohio, United States
Overview
NVIDIA is a leader in artificial intelligence, high-performance computing and visualization. The GPU is central to our products and services, enabling creativity and discovery across AI, autonomous vehicles and more. We are looking for a Distinguished Software Architect to help co-design our next-generation data center platforms for DL and HPC communications, with emphasis on performance across large GPU deployments, high-speed interconnects (e.g., NVLink, PCIe) and networking (e.g., InfiniBand, Ethernet).
What You Will Be Doing
Research new communication technologies (e.g., expand the GPUDirect technology portfolio) and design new features for our communication libraries
Propose innovative HW/SW solutions for next-gen platforms and co-design these with GPU, Networking, and SW architects; ensure seamless integration with software stacks
Inspire changes based on quantitative data from proof-of-concepts or detailed analysis/modeling
Drive adoption of new communication technologies across application verticals
Collaborate with diverse teams (internal and external), including DL researchers and customers
What We Need To See
PhD in Computer Science, Computer Engineering or a related field, or strong equivalent experience; 15+ years of relevant experience in academia or industry
Expertise in HPC, parallel programming models (MPI, SHMEM), at least one communication runtime (MPI, NCCL, NVSHMEM, OpenSHMEM, UCX, UCC), computer and system architecture, GPU architecture and CUDA
Deep understanding of high-performance networking: InfiniBand, Ethernet, network design/topologies, debugging and performance analysis
Strength in several of these areas: ML/DL fundamentals related to communications, parallel algorithms, fault tolerance, performance analysis and optimization for large clusters, DL frameworks (PyTorch, TensorFlow)
Programming fluency in C or C++ for systems software
Ability to work and communicate across HW/SW teams and time zones
Ways To Stand Out From The Crowd
Industry-recognized leader in HPC/DL communications with patents, publications and conference talks
Influential in industry standards (MPI, OpenSHMEM) and open source software (PyTorch, UCX, Open MPI)
About NVIDIA and Perks
NVIDIA is widely regarded as a top technology employer with forward-thinking teams. If you are creative and autonomous, we want to hear from you!
JR1997938
Seniority level: Mid-Senior level
Employment type: Full-time
Industries: Computer Hardware Manufacturing, Software Development, Computers and Electronics Manufacturing