ByteDance
Research Scientist - Seed Multimodal Interaction and World Model - Reinforcement
ByteDance, Seattle, Washington, us, 98127
Research Scientist - Seed Multimodal Interaction and World Model - Reinforcement Learning Focus
Responsibilities Design and implement reinforcement learning (RL) training systems for large-scale multimodal foundation models Develop unified modeling frameworks that integrate video, audio, and language, with a focus on visual latent reasoning Explore Reinforcement Learning-based approaches to bridge understanding and generation for multimodal visual reasoning Collaborate with researchers to evaluate models on tasks involving world modeling, reasoning, and instruction-conditioned generation Qualifications Master or PhD in Software Development, Computer Science, Computer Engineering, or a related technical discipline Publications in top-tier venues, such as CVPR, ECCV, ICCV, NeurIPS, ICLR, ICML, or other leading conferences in AI and ML Strong research background in at least one of the following: reinforcement learning, multimodal learning, video understanding, or vision-language modeling Preferred Qualifications Experience with reinforcement learning in multimodal or interactive environments Familiarity with video generation or diffusion-based generative models Experience with large-scale model training (e.g., distributed training, curriculum learning, or memory-augmented transformers) Solid programming and engineering skills, with experience building training or evaluation pipelines for ML models Job Information The base salary range for this position is $198,360 - $416,100 annually. Compensation may vary depending on a number of factors, including a candidate's qualifications, skills, competencies, and experience, and location. Benefits include day one access to medical, dental, and vision insurance, a 401(k) savings plan with company match, paid parental leave, short-term and long-term disability coverage, life insurance, wellbeing benefits, and more. ByteDance is an equal opportunities employer and is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. ByteDance is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws.
#J-18808-Ljbffr
Responsibilities Design and implement reinforcement learning (RL) training systems for large-scale multimodal foundation models Develop unified modeling frameworks that integrate video, audio, and language, with a focus on visual latent reasoning Explore Reinforcement Learning-based approaches to bridge understanding and generation for multimodal visual reasoning Collaborate with researchers to evaluate models on tasks involving world modeling, reasoning, and instruction-conditioned generation Qualifications Master or PhD in Software Development, Computer Science, Computer Engineering, or a related technical discipline Publications in top-tier venues, such as CVPR, ECCV, ICCV, NeurIPS, ICLR, ICML, or other leading conferences in AI and ML Strong research background in at least one of the following: reinforcement learning, multimodal learning, video understanding, or vision-language modeling Preferred Qualifications Experience with reinforcement learning in multimodal or interactive environments Familiarity with video generation or diffusion-based generative models Experience with large-scale model training (e.g., distributed training, curriculum learning, or memory-augmented transformers) Solid programming and engineering skills, with experience building training or evaluation pipelines for ML models Job Information The base salary range for this position is $198,360 - $416,100 annually. Compensation may vary depending on a number of factors, including a candidate's qualifications, skills, competencies, and experience, and location. Benefits include day one access to medical, dental, and vision insurance, a 401(k) savings plan with company match, paid parental leave, short-term and long-term disability coverage, life insurance, wellbeing benefits, and more. ByteDance is an equal opportunities employer and is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. ByteDance is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws.
#J-18808-Ljbffr