Montauk Climate
Head of ML Infrastructure and Operations
Montauk Climate, Washington, District of Columbia, us, 20022
Head of ML Infrastructure and OperationsHead of ML Infrastructure and Operations Get AI-powered advice on this job and more exclusive features.
Please make sure you read the following details carefully before making any applications. Direct message the job poster from Montauk Climate Head of Machine Learning Infrastructure and Operations - Stealth AI Infrastructure Company Montauk Climate
operates a venture studio and an investment platform, both focused on climate-centric opportunities. Our studio platform identifies untapped technological potential within the climate sector and develops these concepts into ventures by assembling specialized teams and collaborating with Fortune 500 companies. Our investment platform functions as an AI and software venture firm, leading Series A and B funding rounds for external companies in the energy and infrastructure sectors. Stealth AI Infrastructure Company The internet’s new bottleneck isn’t bandwidth—it’s affordable, responsive GPU compute. Stealth AI infrastructure platform is building predictive GPU autoscaling that cuts cloud costs by 30‑40 %, wipes out inference‑time latency spikes, and drives down energy waste. Backed by Montauk Ventures, we’re assembling a founding team to create the control plane that will power AI workloads at global scale. Head of Machine Learning Infrastructure and Operations role Partner with the CEO to build the industry’s first scalable autonomous AI platform that monitors, predicts and optimizes GPU workloads. You’ll shape the product roadmap, architect production systems, and lead engineering to deliver a scalable autoscaling engine that responds in real time to AI compute demand. You’ll also help recruit a high-caliber technical team, support early design partners, and evolve the platform from MVP to a commercial product Impact Infra that matters
– Every AI application depends on affordable GPUs; you’ll solve that pain at its root. Co‑founder equity
– Join at formation, shape the cap table, and grow into the CTO role. Mentorship & capital
– Work beside a founder who has scaled global infra; leverage Montauk’s data‑center network and studio resources. Mission + impact
– Lower compute costs, shrink emissions, and keep the internet fast as AI demand explodes. Technical Expertise: 7–12 years building distributed systems or ML infrastructure at cloud scale (e.g., GPU schedulers, K8s controllers, low‑latency trading, or CDN backbones). Familiarity with GPU architectures, CUDA programming, GPU observability frameworks (e.g. DCGM, Prometheus, Gragana), and GPU-accelerated applications Familiar with cloud‑provider spot/RI markets, cluster autoscaler internals, and FinOps tooling Expert-level knowledge of major cloud platforms (AWS, GCP, Azure), with a focus on GPU-enabled services and Kubernetes Experience working with HPC-like workloads or large-scale GPU clusters where performance, cost, and scheduling must be carefully balanced. Deep understanding of ML workflows, model training, and inference optimization techniques Experience designing and implementing large-scale, fault-tolerant distributed systems Strong coding skills in languages with experience in high-performance computing DevOps and MLOps: Expertise in Continuous Integration / Continuous Delivery pipelines, infrastructure-as-code, and MLOps best practices Experience with advanced forecasting techniques and their application to resource allocation Understanding of cloud security best practices and data protection regulations Qualifications Bachelor's or Master's degree in Computer Science, Engineering, or related field; Ph.D. is a plus 10+ years of experience in impactful technology roles, with a focus on cloud computing and machine learning Proven track record of successfully delivering complex, scalable technology solutions Seniority level
Seniority levelDirector Employment type
Employment typeFull-time Job function
Job functionEngineering and Information Technology IndustriesVenture Capital and Private Equity Principals, Energy Technology, and Technology, Information and Media Referrals increase your chances of interviewing at Montauk Climate by 2x Get notified about new Head of Machine Learning jobs in
Washington, DC . District of Columbia, United States $225,000.00-$250,000.00 1 week ago We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
#J-18808-Ljbffr
Please make sure you read the following details carefully before making any applications. Direct message the job poster from Montauk Climate Head of Machine Learning Infrastructure and Operations - Stealth AI Infrastructure Company Montauk Climate
operates a venture studio and an investment platform, both focused on climate-centric opportunities. Our studio platform identifies untapped technological potential within the climate sector and develops these concepts into ventures by assembling specialized teams and collaborating with Fortune 500 companies. Our investment platform functions as an AI and software venture firm, leading Series A and B funding rounds for external companies in the energy and infrastructure sectors. Stealth AI Infrastructure Company The internet’s new bottleneck isn’t bandwidth—it’s affordable, responsive GPU compute. Stealth AI infrastructure platform is building predictive GPU autoscaling that cuts cloud costs by 30‑40 %, wipes out inference‑time latency spikes, and drives down energy waste. Backed by Montauk Ventures, we’re assembling a founding team to create the control plane that will power AI workloads at global scale. Head of Machine Learning Infrastructure and Operations role Partner with the CEO to build the industry’s first scalable autonomous AI platform that monitors, predicts and optimizes GPU workloads. You’ll shape the product roadmap, architect production systems, and lead engineering to deliver a scalable autoscaling engine that responds in real time to AI compute demand. You’ll also help recruit a high-caliber technical team, support early design partners, and evolve the platform from MVP to a commercial product Impact Infra that matters
– Every AI application depends on affordable GPUs; you’ll solve that pain at its root. Co‑founder equity
– Join at formation, shape the cap table, and grow into the CTO role. Mentorship & capital
– Work beside a founder who has scaled global infra; leverage Montauk’s data‑center network and studio resources. Mission + impact
– Lower compute costs, shrink emissions, and keep the internet fast as AI demand explodes. Technical Expertise: 7–12 years building distributed systems or ML infrastructure at cloud scale (e.g., GPU schedulers, K8s controllers, low‑latency trading, or CDN backbones). Familiarity with GPU architectures, CUDA programming, GPU observability frameworks (e.g. DCGM, Prometheus, Gragana), and GPU-accelerated applications Familiar with cloud‑provider spot/RI markets, cluster autoscaler internals, and FinOps tooling Expert-level knowledge of major cloud platforms (AWS, GCP, Azure), with a focus on GPU-enabled services and Kubernetes Experience working with HPC-like workloads or large-scale GPU clusters where performance, cost, and scheduling must be carefully balanced. Deep understanding of ML workflows, model training, and inference optimization techniques Experience designing and implementing large-scale, fault-tolerant distributed systems Strong coding skills in languages with experience in high-performance computing DevOps and MLOps: Expertise in Continuous Integration / Continuous Delivery pipelines, infrastructure-as-code, and MLOps best practices Experience with advanced forecasting techniques and their application to resource allocation Understanding of cloud security best practices and data protection regulations Qualifications Bachelor's or Master's degree in Computer Science, Engineering, or related field; Ph.D. is a plus 10+ years of experience in impactful technology roles, with a focus on cloud computing and machine learning Proven track record of successfully delivering complex, scalable technology solutions Seniority level
Seniority levelDirector Employment type
Employment typeFull-time Job function
Job functionEngineering and Information Technology IndustriesVenture Capital and Private Equity Principals, Energy Technology, and Technology, Information and Media Referrals increase your chances of interviewing at Montauk Climate by 2x Get notified about new Head of Machine Learning jobs in
Washington, DC . District of Columbia, United States $225,000.00-$250,000.00 1 week ago We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
#J-18808-Ljbffr