Snap Inc.
Research Scientist, Animation Direction, Level 4
Snap Inc., San Francisco, California, United States, 94199
Overview
What you'll do
Propose and develop cutting-edge multimodal techniques for 3D human animation and video generation, enabling expressive Bitmoji avatars and next-gen Snapchat video generation experiences
Build models that map text, audio, video, and music to 3D animation and drive downstream video generation tasks, such as background video generation for animation rendering and audio-driven lip-synced video generation
Advance avatar technologies including animatable 3D Gaussian avatars, head/body reconstruction, and animation control
Collaborate with product teams to deploy research at scale and publish at top-tier academic venues

Knowledge, Skills, & Abilities
Ability to define impactful research problems and deliver practical solutions in both academic and product contexts
Deep expertise in human motion modeling, facial/body animation, and generative modeling (e.g., text-to-motion, text-to-image, text-to-video)
Familiarity with video generation, avatar representations (e.g., mesh, NeRF, Gaussian), and rendering pipelines
Strong prototyping and engineering skills (Python, PyTorch, C++) and familiarity with large-scale distributed ML training on GCP/AWS clusters
Proven ability to lead and mentor interns, PhD students, and junior researchers, and to collaborate effectively with product teams
Excellent communication and cross-functional collaboration skills

Minimum Qualifications
PhD in a related technical field such as computer science, statistics, mathematics, or machine learning, or equivalent years of experience
Strong theoretical foundations in generative AI and practical experience training, tuning, and modifying generative models
Hands-on experience with mainstream generative models (Diffusion, Transformers, GANs, VAEs) for animation or video
Research or product experience in any of the following: 3D human motion generation from signals such as text or audio; image/video generation; 3D avatar reconstruction and animation; or multimodal generation

Preferred Qualifications
Publications in top-tier venues as the main contributor (e.g., CVPR, SIGGRAPH, NeurIPS, ICCV, ECCV, ICLR); contributions to popular open-source projects (code or dataset releases)
Hands-on experience in large-scale dataset curation and distributed ML model training, such as pre-training or post-training of image/video generation models
Strong foundation in computer vision, 3D graphics, and multimodal learning

Compensation and Benefits
In the United States, work locations are assigned a pay zone, which determines the salary range for the position. The starting pay may be negotiable within the salary range. Zone A: $173,000-$259,000; Zone B: $164,000-$246,000; Zone C: $147,000-$220,000. This position is eligible for equity in the form of RSUs. Benefits include paid parental leave, medical coverage, emotional and mental health support, and compensation packages that enable you to share in Snap’s long-term success.

Covered under all applicable equal opportunity laws, Snap Inc. is an equal opportunity employer. We provide employment opportunities regardless of race, religious creed, color, national origin, ancestry, physical or mental disability, medical condition, genetic information, marital status, sex, gender, gender identity, gender expression, pregnancy, childbirth and breastfeeding, age, sexual orientation, military or veteran status, or any other protected classification. We are an Equal Opportunity Employer and will consider qualified applicants with criminal histories in a manner consistent with applicable law.