CoreWeave
Operations Engineering Manager, Fleet Reliability
Join to apply for the
Operations Engineering Manager, Fleet Reliability
role at
CoreWeave Overview
CoreWeave is the AI Hyperscaler, providing a cloud platform with cutting-edge services for AI workloads. Since 2017, we've operated data centers across the US and Europe, ranked among TIME's 100 most influential companies of 2024. Role Description
The Fleet Reliability Operations Team manages capacity delivery and maintenance, provisioning, updating, and troubleshooting server nodes. The Manager will lead a 24/7 team, improve processes, automate remediation, and ensure high reliability and customer satisfaction as we scale. Responsibilities
Build and lead a 24/7 team of reliability-focused engineers. Standardize and document processes for node provisioning, validation, and troubleshooting. Advocate for automation and process improvements, focusing on event-driven remediation. Provide high-criticality support for node delivery and maintenance. Enhance onboarding, documentation, and team performance. Foster a positive team culture and support internal communication. Qualifications
7+ years in software or infrastructure engineering, with 2+ years in leadership. Knowledge of SRE principles, incident management, observability, and change management. Passion for automation, reliability, and cross-team collaboration. Strong interpersonal skills to influence partners, peers, and leadership. Compensation & Benefits
Base salary ranges from $210,000 to $230,000, based on experience and location. Benefits include health insurance, life and disability insurance, 401(k), paid parental leave, tuition reimbursement, wellness benefits, and flexible work options. Work Environment
CoreWeave operates as a hybrid workplace, supporting flexible in-office and remote work arrangements. Onboarding is conducted at hubs if remote, with quarterly team gatherings to foster collaboration. Equal Opportunity
We are committed to diversity and inclusion, providing reasonable accommodations for applicants with disabilities. Contact: careers@coreweave.com Additional Info
Seniority level: Mid-Senior level Employment type: Full-time Job function: Engineering and IT Industries: Technology, Internet
#J-18808-Ljbffr
Join to apply for the
Operations Engineering Manager, Fleet Reliability
role at
CoreWeave Overview
CoreWeave is the AI Hyperscaler, providing a cloud platform with cutting-edge services for AI workloads. Since 2017, we've operated data centers across the US and Europe, ranked among TIME's 100 most influential companies of 2024. Role Description
The Fleet Reliability Operations Team manages capacity delivery and maintenance, provisioning, updating, and troubleshooting server nodes. The Manager will lead a 24/7 team, improve processes, automate remediation, and ensure high reliability and customer satisfaction as we scale. Responsibilities
Build and lead a 24/7 team of reliability-focused engineers. Standardize and document processes for node provisioning, validation, and troubleshooting. Advocate for automation and process improvements, focusing on event-driven remediation. Provide high-criticality support for node delivery and maintenance. Enhance onboarding, documentation, and team performance. Foster a positive team culture and support internal communication. Qualifications
7+ years in software or infrastructure engineering, with 2+ years in leadership. Knowledge of SRE principles, incident management, observability, and change management. Passion for automation, reliability, and cross-team collaboration. Strong interpersonal skills to influence partners, peers, and leadership. Compensation & Benefits
Base salary ranges from $210,000 to $230,000, based on experience and location. Benefits include health insurance, life and disability insurance, 401(k), paid parental leave, tuition reimbursement, wellness benefits, and flexible work options. Work Environment
CoreWeave operates as a hybrid workplace, supporting flexible in-office and remote work arrangements. Onboarding is conducted at hubs if remote, with quarterly team gatherings to foster collaboration. Equal Opportunity
We are committed to diversity and inclusion, providing reasonable accommodations for applicants with disabilities. Contact: careers@coreweave.com Additional Info
Seniority level: Mid-Senior level Employment type: Full-time Job function: Engineering and IT Industries: Technology, Internet
#J-18808-Ljbffr