FluidStack
About FluidStack
Fluidstack is the AI Cloud Platform. We build GPU supercomputers for top AI labs, governments, and enterprises. Our customers include Mistral, Poolside, Black Forest Labs, Meta, and more. Our team is small, highly motivated, and focused on providing a world class supercomputing experience. We put our customers first in everything we do, working hard to not just win the sale, but to win repeated business and customer referrals. We hold ourselves and each other to high standards. We expect you to care deeply about the work you do, the products you build, and the experience our customers have in every interaction with us. You must work hard, take ownership from inception to delivery, and approach every problem with an open mind and a positive attitude. We value effectiveness, competence, and a growth mindset.
About the Role
We are seeking a DCIM Lead Software Engineer to join our Data Center Operations team. In this role, you will architect and develop comprehensive data center infrastructure management solutions that power our GPU supercomputing platform. You'll lead the technical implementation of our DCIM strategy, building systems that monitor, manage, and optimize our global data center footprint. This position requires both deep technical expertise and the ability to translate complex infrastructure requirements into scalable software solutions..
Focus
Design and implement comprehensive DCIM solutions including digital twin capabilities, real-time power monitoring, and environmental tracking systems
Lead the development of asset maintenance management systems that track hardware lifecycle, predictive maintenance schedules, and automated alerting workflows
Build workforce management tools that optimize technician scheduling, work order routing, and capacity planning across multiple data center locations
Create automation-first solutions that reduce manual intervention in routine data center operations, with a focus on API-driven integrations and self-healing systems
Collaborate closely with the infrastructure team to identify opportunities for tooling improvements and contribute written reports on automation initiatives
About You
5+ years of software engineering experience with at least 2 years focused on DCIM, IoT, or infrastructure management systems
Strong programming skills in Python, Go, or Java with experience building scalable microservices and RESTful APIs
Deep understanding of data center operations including power distribution, cooling systems, rack management, and environmental monitoring protocols
Experience with time-series databases, real-time data processing, and building monitoring dashboards for critical infrastructure
Benefits
Experience with Network Source of Truth Tooling (Netbox, ...)
Experience with digital twin technologies and 3D visualization frameworks for data center modeling
Previous experience in hyperscale data center environments or managing infrastructure for AI/GPU workloads
Fluidstack is the AI Cloud Platform. We build GPU supercomputers for top AI labs, governments, and enterprises. Our customers include Mistral, Poolside, Black Forest Labs, Meta, and more. Our team is small, highly motivated, and focused on providing a world class supercomputing experience. We put our customers first in everything we do, working hard to not just win the sale, but to win repeated business and customer referrals. We hold ourselves and each other to high standards. We expect you to care deeply about the work you do, the products you build, and the experience our customers have in every interaction with us. You must work hard, take ownership from inception to delivery, and approach every problem with an open mind and a positive attitude. We value effectiveness, competence, and a growth mindset.
About the Role
We are seeking a DCIM Lead Software Engineer to join our Data Center Operations team. In this role, you will architect and develop comprehensive data center infrastructure management solutions that power our GPU supercomputing platform. You'll lead the technical implementation of our DCIM strategy, building systems that monitor, manage, and optimize our global data center footprint. This position requires both deep technical expertise and the ability to translate complex infrastructure requirements into scalable software solutions..
Focus
Design and implement comprehensive DCIM solutions including digital twin capabilities, real-time power monitoring, and environmental tracking systems
Lead the development of asset maintenance management systems that track hardware lifecycle, predictive maintenance schedules, and automated alerting workflows
Build workforce management tools that optimize technician scheduling, work order routing, and capacity planning across multiple data center locations
Create automation-first solutions that reduce manual intervention in routine data center operations, with a focus on API-driven integrations and self-healing systems
Collaborate closely with the infrastructure team to identify opportunities for tooling improvements and contribute written reports on automation initiatives
About You
5+ years of software engineering experience with at least 2 years focused on DCIM, IoT, or infrastructure management systems
Strong programming skills in Python, Go, or Java with experience building scalable microservices and RESTful APIs
Deep understanding of data center operations including power distribution, cooling systems, rack management, and environmental monitoring protocols
Experience with time-series databases, real-time data processing, and building monitoring dashboards for critical infrastructure
Benefits
Experience with Network Source of Truth Tooling (Netbox, ...)
Experience with digital twin technologies and 3D visualization frameworks for data center modeling
Previous experience in hyperscale data center environments or managing infrastructure for AI/GPU workloads