Scale AI
Machine Learning Research Scientist / Engineer, Reasoning
Scale AI, Lakewood, Washington, us, 98496
About Scale
At Scale AI, our mission is to accelerate the development of AI applications. For 8 years, Scale has been the leading AI data foundry, fueling advancements in AI, including generative AI, defense applications, and autonomous vehicles. With our Series F round, we’re expanding access to high-quality data to drive progress toward Artificial General Intelligence (AGI). We continue to set new standards for both public and private evaluations. About This Role
This role operates at the forefront of AI research and real-world implementation, with a strong focus on reasoning within large language models (LLMs). The ideal candidate will study data types critical for advancing LLM-based agents, including browser and software engineering (SWE) agents. You will help shape Scale’s data strategy by identifying effective data sources and methodologies for improving LLM reasoning. Success requires a deep understanding of LLMs, planning algorithms, and novel approaches to agentic reasoning, as well as creativity in tackling data generation, model interaction, and evaluation. You will contribute to impactful research on language model reasoning, collaborate with external researchers, and work with engineering teams to bring state-of-the-art advancements into scalable, real-world solutions. Ideally, you’d have: Practical experience with LLMs, proficiency in frameworks like PyTorch, JAX, or TensorFlow, and the ability to rapidly interpret research literature and translate ideas into working prototypes.
A track record of published research in top ML/NLP venues (e.g., ACL, EMNLP, NAACL, NeurIPS, ICML, ICLR, CoLLM, etc.).
At least three years of experience solving complex ML challenges, in research or product development, particularly related to LLM capabilities and reasoning.
Strong written and verbal communication skills and the ability to work effectively across teams.
Nice to have: Hands-on experience fine-tuning open-source LLMs or leading bespoke LLM fine-tuning projects using PyTorch/JAX.
Experience building applications and evaluations related to LLM-based agents (tool-use, text-to-SQL, browser agents, coding agents, GUI agents).
Experience with agent frameworks such as OpenHands, Swarm, LangGraph, or similar.
Familiarity with advanced agentic reasoning techniques such as STaR and PLANSEARCH.
Proficiency in cloud-based ML development, with AWS or GCP experience.
Our research interviews assess candidates’ ability to prototype and debug ML models, depth of understanding in research concepts, and alignment with our culture. We do not conduct LeetCode-style problem-solving assessments. Compensation and benefits:
Compensation packages for eligible roles include base salary, equity, and benefits. The salary range shown reflects the minimum and maximum target for new hire salaries, determined by location and other factors including skills, experience, education, and interview performance. Scale employees in eligible roles are granted equity-based compensation, subject to Board approval. Your recruiter can share the salary range for your location and confirm equity eligibility during the hiring process. Benefits include comprehensive health, dental, and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additional benefits such as a commuter stipend may apply. Please reference the job posting’s subtitle for the role’s location. For pay transparency, the base salary range for this full-time position in San Francisco, New York, and Seattle is $220,000—$325,000 USD. Important notes
Our policy requires a 90-day waiting period before reconsidering candidates for the same role to ensure a fair evaluation of all applicants. About Us
At Scale, we believe the transition from traditional software to AI is a major shift. Our mission is to accelerate this transition across industries, powering the world’s most advanced LLMs, generative models, and computer vision models. We are trusted by leading generative AI companies, government agencies, and enterprises. We are expanding our team to accelerate the development of AI applications. We are committed to an inclusive and equal opportunity workplace. We provide equal employment opportunities regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or veteran status. We provide reasonable accommodations for applicants with disabilities. If you need an accommodation, please contact accommodations@scale.com. We comply with the U.S. Department of Labor’s Pay Transparency and Know Your Rights posters and policies. We collect and use personal data in accordance with our privacy policy.
#J-18808-Ljbffr
At Scale AI, our mission is to accelerate the development of AI applications. For 8 years, Scale has been the leading AI data foundry, fueling advancements in AI, including generative AI, defense applications, and autonomous vehicles. With our Series F round, we’re expanding access to high-quality data to drive progress toward Artificial General Intelligence (AGI). We continue to set new standards for both public and private evaluations. About This Role
This role operates at the forefront of AI research and real-world implementation, with a strong focus on reasoning within large language models (LLMs). The ideal candidate will study data types critical for advancing LLM-based agents, including browser and software engineering (SWE) agents. You will help shape Scale’s data strategy by identifying effective data sources and methodologies for improving LLM reasoning. Success requires a deep understanding of LLMs, planning algorithms, and novel approaches to agentic reasoning, as well as creativity in tackling data generation, model interaction, and evaluation. You will contribute to impactful research on language model reasoning, collaborate with external researchers, and work with engineering teams to bring state-of-the-art advancements into scalable, real-world solutions. Ideally, you’d have: Practical experience with LLMs, proficiency in frameworks like PyTorch, JAX, or TensorFlow, and the ability to rapidly interpret research literature and translate ideas into working prototypes.
A track record of published research in top ML/NLP venues (e.g., ACL, EMNLP, NAACL, NeurIPS, ICML, ICLR, CoLLM, etc.).
At least three years of experience solving complex ML challenges, in research or product development, particularly related to LLM capabilities and reasoning.
Strong written and verbal communication skills and the ability to work effectively across teams.
Nice to have: Hands-on experience fine-tuning open-source LLMs or leading bespoke LLM fine-tuning projects using PyTorch/JAX.
Experience building applications and evaluations related to LLM-based agents (tool-use, text-to-SQL, browser agents, coding agents, GUI agents).
Experience with agent frameworks such as OpenHands, Swarm, LangGraph, or similar.
Familiarity with advanced agentic reasoning techniques such as STaR and PLANSEARCH.
Proficiency in cloud-based ML development, with AWS or GCP experience.
Our research interviews assess candidates’ ability to prototype and debug ML models, depth of understanding in research concepts, and alignment with our culture. We do not conduct LeetCode-style problem-solving assessments. Compensation and benefits:
Compensation packages for eligible roles include base salary, equity, and benefits. The salary range shown reflects the minimum and maximum target for new hire salaries, determined by location and other factors including skills, experience, education, and interview performance. Scale employees in eligible roles are granted equity-based compensation, subject to Board approval. Your recruiter can share the salary range for your location and confirm equity eligibility during the hiring process. Benefits include comprehensive health, dental, and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additional benefits such as a commuter stipend may apply. Please reference the job posting’s subtitle for the role’s location. For pay transparency, the base salary range for this full-time position in San Francisco, New York, and Seattle is $220,000—$325,000 USD. Important notes
Our policy requires a 90-day waiting period before reconsidering candidates for the same role to ensure a fair evaluation of all applicants. About Us
At Scale, we believe the transition from traditional software to AI is a major shift. Our mission is to accelerate this transition across industries, powering the world’s most advanced LLMs, generative models, and computer vision models. We are trusted by leading generative AI companies, government agencies, and enterprises. We are expanding our team to accelerate the development of AI applications. We are committed to an inclusive and equal opportunity workplace. We provide equal employment opportunities regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or veteran status. We provide reasonable accommodations for applicants with disabilities. If you need an accommodation, please contact accommodations@scale.com. We comply with the U.S. Department of Labor’s Pay Transparency and Know Your Rights posters and policies. We collect and use personal data in accordance with our privacy policy.
#J-18808-Ljbffr