Red Hat
Overview
Machine Learning Engineer, vLLM Inference role at Red Hat. Red Hat’s Inference team accelerates AI for the enterprise and provides a stable platform for open-source LLM deployments. This role focuses on vLLM, working to improve model performance and efficiency within our open-source software stack.
What You Will Do
Write robust Python and C++, working on vLLM systems, high-performance machine learning primitives, performance analysis and modeling, and numerical methods
Contribute to the design, development, and testing of various inference optimization algorithms
Participate in technical design discussions and provide innovative solutions to complex problems
Give thoughtful and prompt code reviews
Proactively use AI-assisted development tools for code generation, auto-completion, and intelligent suggestions to accelerate development cycles and enhance code quality
Mentor and guide other engineers and foster a culture of continuous learning and innovation
What You Will Bring
Extensive experience writing high-performance code for GPUs and deep knowledge of GPU hardware
Strong understanding of computer architecture, parallel processing, and distributed computing concepts
Experience with tensor math libraries such as PyTorch
Experience with modern C++, CUDA, Triton, and CUTLASS
Experience with mathematical software, especially linear algebra or signal processing
Experience optimizing kernels for deep neural networks
Experience with NVIDIA Nsight is a plus
Strong communication skills with both technical and non-technical team members
BS or MS in computer science, computer engineering, or a related field. A PhD in an ML-related domain is considered a plus
Pay Transparency
The salary range for this position is $133,650.00 - $220,680.00. Actual offer will be based on your qualifications. This position may also be eligible for bonus, commission, and/or equity. For Remote-US locations, the actual salary range may differ by location but will be commensurate with job duties and relevant experience.
About Red Hat
Red Hat is the world’s leading provider of enterprise open source software solutions, using a community-powered approach to deliver high-performing Linux, cloud, container, and Kubernetes technologies. We hire creative, passionate people ready to contribute their ideas, help solve complex problems, and make an impact. We are committed to an open, inclusive environment where ideas come from people with diverse backgrounds and perspectives.
Benefits
Comprehensive medical, dental, and vision coverage
Flexible Spending Account - healthcare and dependent care
Health Savings Account - high deductible medical plan
Retirement 401(k) with employer match
Paid time off and holidays
Paid parental leave plans for all new parents
Leave benefits including disability, paid family medical leave, and paid military leave
Employee stock purchase plan, tuition reimbursement, and other benefits
Equal Opportunity Policy (EEO)
Red Hat is proud to be an equal opportunity workplace and an affirmative action employer. We review applications without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, citizenship, age, veteran status, genetic information, disability, medical condition, marital status, or any other basis prohibited by law. Red Hat provides reasonable accommodations to applicants and will respond to inquiries regarding application status via the designated channels.