NVIDIA
Overview
Senior Research Engineer at NVIDIA. NVIDIA is seeking a research engineer who is passionate about open-source and excited to create the next-generation post-training software stack. You will work at the intersection of research and engineering, collaborating with the Post-Training and Frameworks teams to invent, implement, and scale the core technologies behind our Nemotron models. What You’ll Be Doing
Work with applied researchers to design, implement and test next generation of RL and post-training algorithms Contribute and advance open source by developing NeMo-RL, Megatron Core, and NeMo Framework and related software Engaged as part of one team during Nemotron models post-training Solve large-scale, end-to-end AI training and inference challenges, spanning the full model lifecycle from orchestration and data pre-processing to model training, tuning, and deployment Work at the intersection of computer-architecture, libraries, frameworks, AI applications and the entire software stack Performance tuning and optimizations, model training with mixed precision on next-gen NVIDIA GPU architectures Publish and present results at academic and industry conferences What We Need To See
BS, MS or PhD in Computer Science, AI, Applied Math, or related fields or equivalent experience 3+ years of proven experience in machine learning, systems, distributed computing, or large-scale model training Experience with AI Frameworks such as PyTorch or JAX Experience with at least one inference and deployment environments such as vLLM, SGLang or TRT-LLM Proficient in Python programming, software design, debugging, performance analysis, test design and documentation Strong understanding of AI/Deep-Learning fundamentals and their practical applications Ways To Stand Out From The Crowd
Contributions to open source deep learning libraries Hands-on experience in large-scale AI training with understanding of core compute system concepts and performance tuning Expertise in distributed computing, model parallelism, and mixed precision training Prior experience with Generative AI techniques applied to LLM and Multi-Modal learning (Text, Image, and Video) Knowledge of GPU/CPU architecture and numerical software NVIDIA is committed to fostering a diverse work environment and is proud to be an equal opportunity employer. We value diversity in our current and future employees and do not discriminate in hiring or promotion on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law. Your base salary will be determined based on location, experience, and pay of employees in similar positions. The base salary range is 160,000 USD - 258,750 USD for Level 3, and 184,000 USD - 299,000 USD for Level 4. You will also be eligible for equity and benefits. Applications for this job will be accepted at least until October 13, 2025. JR2005274
#J-18808-Ljbffr
Senior Research Engineer at NVIDIA. NVIDIA is seeking a research engineer who is passionate about open-source and excited to create the next-generation post-training software stack. You will work at the intersection of research and engineering, collaborating with the Post-Training and Frameworks teams to invent, implement, and scale the core technologies behind our Nemotron models. What You’ll Be Doing
Work with applied researchers to design, implement and test next generation of RL and post-training algorithms Contribute and advance open source by developing NeMo-RL, Megatron Core, and NeMo Framework and related software Engaged as part of one team during Nemotron models post-training Solve large-scale, end-to-end AI training and inference challenges, spanning the full model lifecycle from orchestration and data pre-processing to model training, tuning, and deployment Work at the intersection of computer-architecture, libraries, frameworks, AI applications and the entire software stack Performance tuning and optimizations, model training with mixed precision on next-gen NVIDIA GPU architectures Publish and present results at academic and industry conferences What We Need To See
BS, MS or PhD in Computer Science, AI, Applied Math, or related fields or equivalent experience 3+ years of proven experience in machine learning, systems, distributed computing, or large-scale model training Experience with AI Frameworks such as PyTorch or JAX Experience with at least one inference and deployment environments such as vLLM, SGLang or TRT-LLM Proficient in Python programming, software design, debugging, performance analysis, test design and documentation Strong understanding of AI/Deep-Learning fundamentals and their practical applications Ways To Stand Out From The Crowd
Contributions to open source deep learning libraries Hands-on experience in large-scale AI training with understanding of core compute system concepts and performance tuning Expertise in distributed computing, model parallelism, and mixed precision training Prior experience with Generative AI techniques applied to LLM and Multi-Modal learning (Text, Image, and Video) Knowledge of GPU/CPU architecture and numerical software NVIDIA is committed to fostering a diverse work environment and is proud to be an equal opportunity employer. We value diversity in our current and future employees and do not discriminate in hiring or promotion on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law. Your base salary will be determined based on location, experience, and pay of employees in similar positions. The base salary range is 160,000 USD - 258,750 USD for Level 3, and 184,000 USD - 299,000 USD for Level 4. You will also be eligible for equity and benefits. Applications for this job will be accepted at least until October 13, 2025. JR2005274
#J-18808-Ljbffr