Prime Intellect
Research Engineer - Reasoning
Prime Intellect, San Francisco, California, United States, 94199
Research Engineer - Reinforcement Learning
Join to apply for the
Research Engineer - Reinforcement Learning
role at
Prime Intellect .
Prime Intellect is building the open superintelligence stack – from frontier agentic models to the infra that enables anyone to create, train, and deploy them. We aggregate and orchestrate global compute into a single control plane and pair it with the full RL post‑training stack: environments, secure sandboxes, verifiable evals, and our async RL trainer. We enable researchers, startups and enterprises to run end‑to‑end reinforcement learning at frontier scale, adapting models to real tools, workflows, and deployment contexts.
As a Research Engineer in our Reasoning team, you’ll play a crucial role in shaping our technological direction, focusing on test‑time compute scaling research ideas. If you love working with synthetic data and teaching LLMs reasoning abilities, this role is for you. For more details about the project you would be working on, check out our outlook on decentralized training in the inference‑compute paradigm.
Responsibilities
Lead and participate in novel research to build a massive scale synthetic data generation pipeline and orchestration solution
Optimize the performance, cost, and resource utilization of AI inference workloads by leveraging the most recent advances for compute & memory optimization techniques.
Contribute to the development of our open‑source libraries and frameworks for synthetic data generation and distributed RL frameworks.
Publish research in top‑tier AI conferences such as ICML & NeurIPS.
Distill highly technical project outcomes in layman‑approachable technical blogs to our customers and developers.
Stay up‑to‑date with the latest advancements in AI/ML infrastructure and tools, synthetic data gen research and proactively identify opportunities to enhance our platform’s capabilities and user experience.
Requirements
Strong background in AI/ML engineering, with extensive experience in designing and implementing end‑to‑end pipelines for the inference or training of large‑scale AI models.
Deep expertise in distributed inference techniques and frameworks (e.g. vllm, sglang) for optimizing the performance and scalability of AI workloads.
Solid understanding of MLOps best practices, including model versioning, experiment tracking, and continuous integration/deployment (CI/CD) pipelines.
Passion for advancing the state‑of‑the‑art in reasoning and democratizing access to AI capabilities for researchers, developers, and businesses worldwide.
If you’re not familiar with these, but feel like you can contribute to our mission and you’re a high‑energy person, get familiar with these resources (here, here and here) and please reach out!
Benefits & Perks
Competitive compensation, including equity incentives, aligning your success with the growth and impact of Prime Intellect.
Flexible work arrangements, with the option to work remotely or in‑person at our offices in San Francisco.
Visa sponsorship and relocation assistance for international candidates.
Quarterly team off‑sites, hackathons, conferences and learning opportunities.
Opportunity to work with a talented, hard‑working and mission‑driven team, united by a shared passion for leveraging technology to accelerate science and AI.
We recently raised $15 million in funding (total of $20 million raised) led by Founders Fund, with participation from Menlo Ventures and prominent angels including Andrej Karpathy (Eureka AI, Tesla, OpenAI), Tri Dao (Chief Scientific Officer of Together AI), Dylan Patel (SemiAnalysis), Clem Delangue (Huggingface), Emad Mostaque (Stability AI) and many others.
If you’re excited about the opportunity to build the foundation for the future of decentralized AI and create a platform that empowers developers and researchers to push the boundaries of what’s possible, we’d love to hear from you.
Seniority level Mid-Senior level
Employment type Full-time
Job function Engineering and Information Technology
Industry Software Development
Referrals increase your chances of interviewing at Prime Intellect by 2x.
#J-18808-Ljbffr
Research Engineer - Reinforcement Learning
role at
Prime Intellect .
Prime Intellect is building the open superintelligence stack – from frontier agentic models to the infra that enables anyone to create, train, and deploy them. We aggregate and orchestrate global compute into a single control plane and pair it with the full RL post‑training stack: environments, secure sandboxes, verifiable evals, and our async RL trainer. We enable researchers, startups and enterprises to run end‑to‑end reinforcement learning at frontier scale, adapting models to real tools, workflows, and deployment contexts.
As a Research Engineer in our Reasoning team, you’ll play a crucial role in shaping our technological direction, focusing on test‑time compute scaling research ideas. If you love working with synthetic data and teaching LLMs reasoning abilities, this role is for you. For more details about the project you would be working on, check out our outlook on decentralized training in the inference‑compute paradigm.
Responsibilities
Lead and participate in novel research to build a massive scale synthetic data generation pipeline and orchestration solution
Optimize the performance, cost, and resource utilization of AI inference workloads by leveraging the most recent advances for compute & memory optimization techniques.
Contribute to the development of our open‑source libraries and frameworks for synthetic data generation and distributed RL frameworks.
Publish research in top‑tier AI conferences such as ICML & NeurIPS.
Distill highly technical project outcomes in layman‑approachable technical blogs to our customers and developers.
Stay up‑to‑date with the latest advancements in AI/ML infrastructure and tools, synthetic data gen research and proactively identify opportunities to enhance our platform’s capabilities and user experience.
Requirements
Strong background in AI/ML engineering, with extensive experience in designing and implementing end‑to‑end pipelines for the inference or training of large‑scale AI models.
Deep expertise in distributed inference techniques and frameworks (e.g. vllm, sglang) for optimizing the performance and scalability of AI workloads.
Solid understanding of MLOps best practices, including model versioning, experiment tracking, and continuous integration/deployment (CI/CD) pipelines.
Passion for advancing the state‑of‑the‑art in reasoning and democratizing access to AI capabilities for researchers, developers, and businesses worldwide.
If you’re not familiar with these, but feel like you can contribute to our mission and you’re a high‑energy person, get familiar with these resources (here, here and here) and please reach out!
Benefits & Perks
Competitive compensation, including equity incentives, aligning your success with the growth and impact of Prime Intellect.
Flexible work arrangements, with the option to work remotely or in‑person at our offices in San Francisco.
Visa sponsorship and relocation assistance for international candidates.
Quarterly team off‑sites, hackathons, conferences and learning opportunities.
Opportunity to work with a talented, hard‑working and mission‑driven team, united by a shared passion for leveraging technology to accelerate science and AI.
We recently raised $15 million in funding (total of $20 million raised) led by Founders Fund, with participation from Menlo Ventures and prominent angels including Andrej Karpathy (Eureka AI, Tesla, OpenAI), Tri Dao (Chief Scientific Officer of Together AI), Dylan Patel (SemiAnalysis), Clem Delangue (Huggingface), Emad Mostaque (Stability AI) and many others.
If you’re excited about the opportunity to build the foundation for the future of decentralized AI and create a platform that empowers developers and researchers to push the boundaries of what’s possible, we’d love to hear from you.
Seniority level Mid-Senior level
Employment type Full-time
Job function Engineering and Information Technology
Industry Software Development
Referrals increase your chances of interviewing at Prime Intellect by 2x.
#J-18808-Ljbffr