Logo
Vast.ai

Senior Infrastructure Engineer

Vast.ai, San Francisco, California, United States, 94199

Save Job

About Us

Vast.ai’s cloud powers AI projects and businesses all over the world. We are democratizing and decentralizing AI computing—reshaping our future for the benefit of humanity. Job Title

Staff Infrastructure Engineer Location

On-site at our office in San Francisco or Westwood, Los Angeles. About the Role

As a Senior Infrastructure Engineer, you will help design and scale the core systems that play the role of Vast.ai’s global GPU marketplace. You’ll work closely with our founders and core engineering team to extend the underlying compute infrastructure — from GPU provisioning and scheduling to billing, orchestration, and marketplace dynamics. We’re looking for someone who has previously built large‑scale infrastructure platforms — systems with similarities to Vast.ai, or distributed compute orchestration frameworks. Tech Stack

Python, C++, PostgreSQL, Linux, Docker, KVM, Redis, Terraform, AWS, REST/gRPC APIs Ideal Experience

Distributed Systems: Experience building high‑throughput backend systems or compute clouds Compute Orchestration: Familiarity with Docker, or custom scheduling frameworks GPU Infrastructure: Understanding of GPU provisioning, driver management, and workload scheduling Billing & Metering: Implemented or integrated usage‑based billing and account credit systems Marketplace Dynamics: Knowledge of dynamic pricing, spot instances, or supply‑demand balancing mechanisms Security & Multi‑Tenancy: Experience designing secure, multi‑tenant systems in cloud environments Programming: Strong programming skills in Python and C++; ability to write performant, maintainable, well‑architected code Database Expertise: Comfortable designing schemas and queries for large‑scale data systems (PostgreSQL preferred) Bonus Points

Experience with GPU security, virtualization, or zero‑trust compute isolation Prior startup experience or end‑to‑end product ownership Key Responsibilities

Improve the backend systems that power Vast.ai’s compute marketplace Integrate GPU provider onboarding, usage tracking, billing, and orchestration APIs Develop scalable infrastructure for workload scheduling and resource management Optimize pricing and marketplace logic for efficiency and transparency Benchmark, profile, and harden systems for performance, reliability, and fault tolerance Collaborate with product and infrastructure teams to shape the future of decentralized compute Interview Process (≈ 1 week)

15 min – Initial screening with member of your future team (virtual) 40 min – Systems and architectures (virtual) 1 hour – LLM‑assisted coding assessment (virtual) 2 hours – Meet and greet with coding assessment (on‑site) Annual Salary Range

$180,000 – $300,000 + equity + benefits Benefits

Comprehensive health, dental, vision, and life insurance 401(k) with company match Meaningful early‑stage equity Onsite meals, snacks, and close collaboration with founders and tech leads Ambitious, fast‑paced startup culture where initiative is rewarded

#J-18808-Ljbffr