Voltage Park
Infrastructure Engineer (InfiniBand / NCCL)
Voltage Park, San Francisco, California, United States, 94199
Base pay range: $140,000.00/yr - $170,000.00/yr. This range is provided by Voltage Park. Your actual pay will be based on your skills and experience; talk with your recruiter to learn more.
We are seeking an Infrastructure Engineer with a focus on InfiniBand/NCCL to join our Infrastructure Engineering team. Our engineers design and build automation, tooling, and systems that bridge the gap between physical infrastructure and the platforms that power large-scale AI/ML and HPC workloads.
This role combines the breadth of a core infrastructure engineer with a specialty in high-performance networking and GPU communication. You’ll help ensure our InfiniBand fabric and NCCL stack are tuned, reliable, and efficient at scale — supporting some of the world’s largest GPU clusters.
This is a fully remote position, although candidates must be based in the continental United States. Unfortunately, we are unable to provide sponsorship for this role.
Responsibilities
- Design, build, and maintain automation, APIs, and frameworks to manage physical infrastructure at scale.
- Develop and extend systems for server lifecycle management.
- Implement and tune InfiniBand networking and NCCL configurations for multi-GPU communication.
- Collaborate with Network, Platform, and Infrastructure Operations teams to support new infrastructure rollouts.
- Diagnose and improve performance across GPU, NVSwitch, PCIe, and InfiniBand layers.
- Write clear design documents and technical documentation to capture best practices.
Qualifications
- 8+ years of professional experience in infrastructure engineering, HPC, or related domains.
- Strong experience with Linux in production environments.
- Proficiency in Python or similar languages for automation.
- Deep understanding of InfiniBand networking (CX7 HCAs, fabrics, partitioning, GPUDirect).
- Familiarity with NCCL, CUDA, and GPU topology optimization.
- Knowledge of containerization and orchestration concepts.
- Strong written and verbal communication skills.
Ideal Experiences
- Experience with Dell PowerEdge XE9680 or other GPU-dense servers.
- Prior work with NVIDIA H100s, NVSwitch, and large-scale NCCL testing.
- Familiarity with Mellanox OFED, UCX, and Redfish/iDRAC for management.
- Broader experience across infrastructure areas (storage, virtualization, networking).
Culture
- Enjoy collaborating with a motivated, execution-focused team.
- Comfortable operating with autonomy while aligning to company objectives.
- Value precision, documentation, and knowledge-sharing.
- Excited to grow as both a domain specialist (InfiniBand/NCCL) and a generalist infrastructure engineer.
Voltage Park is an equal opportunity employer and makes employment decisions on the basis of merit. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, protected veteran status, or any other characteristic under federal, state, or local law. If you require an accommodation during the job application process, please notify your recruiter.
Compensation Range: $140K - $170K
Seniority level: Mid-Senior level
Employment type: Full-time
Job function: Information Technology
Industries: Technology, Information and Internet