Evolve Group
Machine Learning Scientist

We’ve partnered with one of the most ambitious and technically rigorous AI research labs in the world. Based in San Francisco, this team is building foundation models entirely from scratch. They are now hiring ML Scientists to design and scale the systems that power large-scale, distributed model training.

If you’ve built infrastructure that runs across hundreds of GPUs, thrive under technical complexity, and want to work side-by-side with elite AI researchers, this is the role.

Key Responsibilities:
- Build and scale distributed training systems for large-scale model training across LLMs, vision, and robotics.
- Set up and run large-scale training across many GPUs using tools like Kubernetes, DeepSpeed, and FSDP.
- Troubleshoot system issues (GPU errors, network problems) and build tools to monitor and recover from failures.
- Optimize PyTorch pipelines, sharding, and sampling strategies.
- Collaborate closely with researchers to support novel model training at scale.

Requirements:
- 3–15 years in ML infrastructure, systems, or research engineering roles.
- Proven experience scaling distributed training for large models.
- Strong with PyTorch, CUDA, NCCL, and Kubernetes.
- Familiar with setting up distributed training clusters.
- Deep understanding of PyTorch dataloaders, data sharding, and sampling.
- Strong communicator with a collaborative, mission-driven mindset.

This is a fully in-person role based in San Francisco. It's ideal for engineers excited to build at the edge of what's possible in AI.

Seniority level: Mid-Senior level
Employment type: Full-time
Job function: Information Technology, Research, and Engineering
Industries: Research Services