FluidStack

DCIM Product Engineer

FluidStack, San Francisco, California, United States, 94102

About FluidStack

Fluidstack is the AI Cloud Platform. We build GPU supercomputers for top AI labs, governments, and enterprises. Our customers include Mistral, Poolside, Black Forest Labs, Meta, and more. Our team is small, highly motivated, and focused on providing a world-class supercomputing experience. We put our customers first in everything we do, working hard not just to win the sale but to win repeat business and customer referrals. We hold ourselves and each other to high standards. We expect you to care deeply about the work you do, the products you build, and the experience our customers have in every interaction with us. You must work hard, take ownership from inception to delivery, and approach every problem with an open mind and a positive attitude. We value effectiveness, competence, and a growth mindset.

About the Role

We are seeking a DCIM Lead Software Engineer to join our Data Center Operations team. In this role, you will architect and develop comprehensive data center infrastructure management solutions that power our GPU supercomputing platform. You'll lead the technical implementation of our DCIM strategy, building systems that monitor, manage, and optimize our global data center footprint. This position requires both deep technical expertise and the ability to translate complex infrastructure requirements into scalable software solutions.

Focus

Design and implement comprehensive DCIM solutions including digital twin capabilities, real-time power monitoring, and environmental tracking systems

Lead the development of asset maintenance management systems that track hardware lifecycle, predictive maintenance schedules, and automated alerting workflows

Build workforce management tools that optimize technician scheduling, work order routing, and capacity planning across multiple data center locations

Create automation-first solutions that reduce manual intervention in routine data center operations, with a focus on API-driven integrations and self-healing systems

Collaborate closely with the infrastructure team to identify opportunities for tooling improvements and contribute written reports on automation initiatives

About You

5+ years of software engineering experience with at least 2 years focused on DCIM, IoT, or infrastructure management systems

Strong programming skills in Python, Go, or Java with experience building scalable microservices and RESTful APIs

Deep understanding of data center operations including power distribution, cooling systems, rack management, and environmental monitoring protocols

Experience with time-series databases, real-time data processing, and building monitoring dashboards for critical infrastructure

Nice to Have

Experience with network source-of-truth tooling (NetBox, ...)

Experience with digital twin technologies and 3D visualization frameworks for data center modeling

Previous experience in hyperscale data center environments or managing infrastructure for AI/GPU workloads