Logo
Fluidstack

Head of Infrastructure

Fluidstack, New York, New York, us, 10261

Save Job

Overview

Head of Infrastructure at Fluidstack. Fluidstack is the AI Cloud Platform. We build GPU supercomputers for top AI labs, governments, and enterprises. Our customers include Mistral, Poolside, Black Forest Labs, Meta, and more. Our team is small, highly motivated, and focused on providing a world class supercomputing experience. We hold ourselves to high standards and value a growth mindset, ownership from inception to delivery, and a customer-first approach. About the Role

FluidStack is hiring a Head of Infrastructure to lead deployments of 100,000+ GPU supercomputers globally. You will lead engagements with OEMs, data centers, ISPs, and other infrastructure partners. You will own sourcing, procurement, and the timely deployment of large-scale GPU clusters. You will build a world-class deployment team to deliver multi-thousand GPU clusters in a matter of days. This is an opportunity to shape the infrastructure function from the ground up in a fast-paced environment. You are expected to have exceptional technical and interpersonal communication skills and be able to share knowledge concisely and accurately with teammates, customers, and suppliers. Responsibilities

Own the entire supply chain from original component sourcing to handover of the burned-in cluster to customers. Manage relationships with OEMs and continuously improve delivery timelines and costs across the supply chain. Design and build AI clusters to meet high-level customer requirements and leverage deployment learnings. Sourcing of additional data center capacity to support scaling of the supercomputer business. Hire and manage a small deployment engineering team responsible for fast setup, burn-in, and delivery of reliable GPU clusters. Partner with engineering, sales, finance, and legal to keep infrastructure ready ahead of customer needs. Travel significantly to conferences, data centers, customer sites, and OEM factories. About You

An ideal candidate meets at least the following requirements: 3+ years deploying GPU clusters; 5+ years deploying infrastructure at global scale. On-site experience installing hardware in data centers. Strong relationships with compute and storage OEMs, data centers, ISPs, and others. Experience with InfiniBand or RoCE networking deployments. Understanding of software running on these clusters: Kubernetes/SLURM, PyTorch/Jax, etc. Extreme attention to detail with ability to prioritize in a fast-paced environment. Proactive with a sense of urgency and ownership. Strong engineering background (e.g., Computer/Electrical/Software/CS/Math/OR/Logistics or similar). Exceptional candidates have one or more of the following experiences: Designed, built, and operated a 4000+ GPU cluster. Built tooling to manage bare metal hardware via MaaS, Netbox, or similar tooling. Deployed and managed petabyte-scale all-flash storage systems (DDN, VAST, Weka; or Ceph/LUSTRE, or similar tools). Benefits

Competitive total compensation package (cash + equity). Retirement or pension plan, in line with local norms. Health, dental, and vision insurance. Generous PTO policy, in line with local norms. Fluidstack is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability or protected veterans’ status, or any other characteristic protected by law. Fluidstack will consider qualified applicants with arrest and conviction records pursuant to applicable law. Seniority level

Director Employment type

Full-time Job function

Information Technology Industries: Technology, Information and Internet

#J-18808-Ljbffr