MLabs
Overview Senior Research Engineer role at MLabs. Our client is a research lab that provides post-training data and RL environments to foundation model labs and frontier applied AI companies. They have raised significant funding from top-tier VCs and are growing rapidly. As a Senior Research Engineer, you will drive cutting-edge research at the intersection of scalable infrastructure and modern reinforcement learning frameworks. This is an opportunity to join an early-stage team with high autonomy and direct exposure to projects that are used and validated in production by leading labs.
Scroll down the page to see all associated job requirements, and any responsibilities successful candidates can expect. Location San Francisco, CA Employment type Full-time On-site 5 days/week in-person Responsibilities
Design and implement scalable RL recipes for post-training task-specific models Develop modular environments, reward functions, and evaluator scaffolds for both internal and customer-facing tasks Drive research to enable RL-as-a-service and publish open-source environments and training data Build data generation and curation pipelines to support frontier post-training Collaborate with product teams to deliver a user-friendly interface for non-technical users to generate data Requirements
4-7 years of experience in an AI/LLM research capacity (excluding undergraduate experience) Master's or PhD in Computer Science or a related field Comfortable with core tooling like PyTorch and modern post-training techniques Experience with evaluations and reward engineering Published in top journals (ICLR, NeurIPS, ICML, etc.) Benefits
Salary: $200k - $275k (Based on quality and experience) Equity: 0.5% - 2% Visa sponsorship is available Commitment to Equality and Accessibility MLabs is committed to equal opportunities for all candidates. We ensure no discrimination, accessible job adverts, and information in accessible formats. If you need reasonable adjustments during any part of the hiring process or would like to view the job advert in an accessible format, please email human-resources@mlabs.city. MLabs Ltd collects and processes the personal information provided for recruitment purposes. This information is managed securely under MLabs Ltd\'s Privacy Policy and Information Security Policy, and in compliance with applicable data protection laws. Your data may be shared with clients and trusted partners where necessary for recruitment. You may request deletion of your data or withdraw consent at any time by contacting legal@mlabs.city.
#J-18808-Ljbffr
Scroll down the page to see all associated job requirements, and any responsibilities successful candidates can expect. Location San Francisco, CA Employment type Full-time On-site 5 days/week in-person Responsibilities
Design and implement scalable RL recipes for post-training task-specific models Develop modular environments, reward functions, and evaluator scaffolds for both internal and customer-facing tasks Drive research to enable RL-as-a-service and publish open-source environments and training data Build data generation and curation pipelines to support frontier post-training Collaborate with product teams to deliver a user-friendly interface for non-technical users to generate data Requirements
4-7 years of experience in an AI/LLM research capacity (excluding undergraduate experience) Master's or PhD in Computer Science or a related field Comfortable with core tooling like PyTorch and modern post-training techniques Experience with evaluations and reward engineering Published in top journals (ICLR, NeurIPS, ICML, etc.) Benefits
Salary: $200k - $275k (Based on quality and experience) Equity: 0.5% - 2% Visa sponsorship is available Commitment to Equality and Accessibility MLabs is committed to equal opportunities for all candidates. We ensure no discrimination, accessible job adverts, and information in accessible formats. If you need reasonable adjustments during any part of the hiring process or would like to view the job advert in an accessible format, please email human-resources@mlabs.city. MLabs Ltd collects and processes the personal information provided for recruitment purposes. This information is managed securely under MLabs Ltd\'s Privacy Policy and Information Security Policy, and in compliance with applicable data protection laws. Your data may be shared with clients and trusted partners where necessary for recruitment. You may request deletion of your data or withdraw consent at any time by contacting legal@mlabs.city.
#J-18808-Ljbffr