NVIDIA
Senior Research Engineer - Enterprise Products
Join to apply for the
Senior Research Engineer - Enterprise Products
role at
NVIDIA .
What You Will Be Doing
Developing new models and algorithms focused on Large Language Models, Natural Language Processing, and Deep Learning.
Design and implement multi-node serving architectures disaggregated serving and distributed LLM inference.
Optimize multi-LoRA (and other PEFT technique) inference serving systems.
Apply sophisticated quantization techniques (FP4/INT4, FP8) to reduce model footprint while preserving quality.
Implement speculative decoding (draft target, eagle, medusa etc) and other latency optimization strategies.
Demonstrate good engineering practices and mentor other team members to do the same.
Collaborate with engineering teams across NVIDIA to ensure software integrates seamlessly up and down the NVIDIA accelerated serving stack.
What We Need To See
Understanding of modern techniques in Machine Learning, Deep Neural Networks, Natural Language Processing, or Speech Recognition.
8+ years of industry experience in Deep Learning frameworks (PyTorch or TensorFlow).
Passion for software engineering, with excellent C++ and Python development skills and meaningful contributions to major open-source projects.
Strong communication and interpersonal skills, ability to work in a dynamic and distributed team. History of mentoring junior engineers and interns is a plus.
Bachelor's degree or equivalent experience.
A desire to constantly grow and learn new things.
Strong computer science fundamentals — algorithms and data structures, computational complexity, parallel and distributed computing, system software.
Ways To Stand Out From a Crowd
Experience architecting or developing large-scale distributed systems for deep learning.
Knowledge of CPU and/or GPU architecture.
GPU programming (CUDA).
Salary information: Your base salary will be determined based on location, experience, and pay of employees in similar positions. The base salary range is 184,000 USD - 299,000 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5. You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until September 30, 2025. NVIDIA is committed to fostering a diverse work environment and is an equal opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.
Job identifiers: JR2005190
Seniority level
Mid-Senior level
Employment type
Full-time
Job function
Engineering and Information Technology
Industries
Computer Hardware Manufacturing, Software Development, and Computers and Electronics Manufacturing
Seattle, WA
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
#J-18808-Ljbffr
Senior Research Engineer - Enterprise Products
role at
NVIDIA .
What You Will Be Doing
Developing new models and algorithms focused on Large Language Models, Natural Language Processing, and Deep Learning.
Design and implement multi-node serving architectures disaggregated serving and distributed LLM inference.
Optimize multi-LoRA (and other PEFT technique) inference serving systems.
Apply sophisticated quantization techniques (FP4/INT4, FP8) to reduce model footprint while preserving quality.
Implement speculative decoding (draft target, eagle, medusa etc) and other latency optimization strategies.
Demonstrate good engineering practices and mentor other team members to do the same.
Collaborate with engineering teams across NVIDIA to ensure software integrates seamlessly up and down the NVIDIA accelerated serving stack.
What We Need To See
Understanding of modern techniques in Machine Learning, Deep Neural Networks, Natural Language Processing, or Speech Recognition.
8+ years of industry experience in Deep Learning frameworks (PyTorch or TensorFlow).
Passion for software engineering, with excellent C++ and Python development skills and meaningful contributions to major open-source projects.
Strong communication and interpersonal skills, ability to work in a dynamic and distributed team. History of mentoring junior engineers and interns is a plus.
Bachelor's degree or equivalent experience.
A desire to constantly grow and learn new things.
Strong computer science fundamentals — algorithms and data structures, computational complexity, parallel and distributed computing, system software.
Ways To Stand Out From a Crowd
Experience architecting or developing large-scale distributed systems for deep learning.
Knowledge of CPU and/or GPU architecture.
GPU programming (CUDA).
Salary information: Your base salary will be determined based on location, experience, and pay of employees in similar positions. The base salary range is 184,000 USD - 299,000 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5. You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until September 30, 2025. NVIDIA is committed to fostering a diverse work environment and is an equal opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.
Job identifiers: JR2005190
Seniority level
Mid-Senior level
Employment type
Full-time
Job function
Engineering and Information Technology
Industries
Computer Hardware Manufacturing, Software Development, and Computers and Electronics Manufacturing
Seattle, WA
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
#J-18808-Ljbffr