CoreWeave
Software Engineer, Inference AI/ML
CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enable innovators to build and scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises, CoreWeave combines superior infrastructure performance with deep technical expertise to accelerate breakthroughs and turn compute into capability.
What You’ll Do
Join the Inference team to ship production features that improve latency, reliability, and cost for model serving on our GPU platform. As an IC1, you’ll implement well‑scoped changes, learn our operational practices, and grow quickly with mentorship from experienced engineers.
About the Role
Implement well‑scoped features and fixes in Python/Go/C++ for model‑serving services (e.g., Triton, vLLM, TensorRT‑LLM, Ray Serve).
Write tests, code comments, and short design docs; participate in code reviews.
Add basic metrics and dashboards; assist with alarms and runbooks.
Follow on‑call runbooks and learn incident response in a guided rotation.
Contribute to performance experiments (e.g., request batching, concurrency, caching) with guidance.
Who You Are
BS/MS in CS, EE, or related field, or equivalent practical experience.
Foundations in data structures, algorithms, and networked services. Experience with Python or Go (C++ a plus) and Linux fundamentals; Git/CI basics.
Exposure to containers and Kubernetes (coursework or projects welcome). Curiosity about GPU inference concepts (micro‑batching, KV cache, streaming).
Preferred
Internship or project that deployed a microservice or ML inference demo.
Coursework/research with PyTorch or TensorFlow; simple CUDA projects a plus.
Familiarity with Grafana/Prometheus/OpenTelemetry or similar tooling.
Benefits
Medical, dental, and vision insurance – 100% paid for by CoreWeave
Company‑paid Life Insurance
Voluntary supplemental life insurance
Short and long‑term disability insurance
Flexible Spending Account
Health Savings Account
Tuition Reimbursement
Employee Stock Purchase Program (ESPP)
Mental Wellness Benefits through Spring Health
Family‑forming support provided by Carrot
Paid Parental Leave
Full‑service childcare support with Kinside
401(k) with a generous employer match
Flexible PTO
Catered lunch each day in our office and data center locations
A casual work environment
Work culture focused on innovative disruption
We support and encourage an entrepreneurial outlook and independent thinking. We foster an environment that encourages collaboration and provides the opportunity to develop innovative solutions to complex problems. As we get set for take‑off, the growth opportunities within the organization are constantly expanding.
Our Workplace
While we prioritize a hybrid work environment, remote work may be considered for candidates located more than 30 miles from an office, based on role requirements for specialized skill sets. New hires will be invited to attend onboarding at one of our hubs within their first month. Teams also gather quarterly to support collaboration.
Export Control Compliance
This position requires access to export‑controlled information. To conform to U.S. Government export regulations applicable to that information, applicants must be one of the following:
A U.S. person (U.S. citizen or national, lawful permanent resident, refugee, or asylee).
Eligible to access the export‑controlled information without a required export authorization, or
Eligible and reasonably likely to obtain the required export authorization from the applicable U.S. government agency. CoreWeave may, for legitimate business reasons, decline to pursue any export licensing process.
Salary: The base salary range for this role is $92,000 to $135,000. The starting salary will be determined based on job‑related knowledge, skills, experience, and market location. In addition to base salary, our total rewards package includes a discretionary bonus, equity awards, and a comprehensive benefits program (all based on eligibility).
CoreWeave is an equal opportunity employer, committed to fostering an inclusive and supportive workplace. All qualified applicants and candidates will receive consideration for employment without regard to race, color, religion, sex, disability, age, sexual orientation, gender identity, national origin, veteran status, or genetic information.
As part of this commitment and consistent with the Americans with Disabilities Act (ADA), CoreWeave will ensure that qualified applicants and candidates with disabilities are provided reasonable accommodations for the hiring process, unless such accommodation would cause an undue hardship. If reasonable accommodation is needed, please contact: careers@coreweave.com.