Bespoke Labs
About Bespoke Labs
Bespoke Labs is an applied AI research lab pioneering data and RL environment curation for training and evaluating agents.
Recently, we curated Open Thoughts, one of the best open reasoning datasets used by multiple frontier labs, trained SOTA specialized models such as Bespoke-MiniChart-7B and Bespoke-MiniCheck, and taught agents to do multi-turn tool-calling with reinforcement learning.
Bespoke is uniquely positioned to capture a large market share of data and RL environment curation.
About the Role We're looking for our first Product Engineer — someone who bridges the gap between cutting-edge AI research and the tools that researchers actually want to use. You'll be building the interfaces and infrastructure that make RL environment curation accessible, intuitive, and powerful.
You possess the energy of a builder and the finesse of an artist. Maybe you've shipped side projects that solve real problems, or you've been the person turning ambitious ideas into working software. You care about the details, both in your code and in how your product feels to use. You can move quickly from concept to prototype, but you also know when it's time to invest in making something production-ready.
Working directly with the rest of our team, you'll create the platform that environment engineers use to build, test, and deploy training worlds for AI agents. This isn't just about infrastructure—it's about crafting experiences that make complex workflows feel simple.
What You'll Do
Build intuitive dashboards where environment engineers can upload, manage, and monitor their RL environments.
Create visualization tools that make complex metrics — environment quality, difficulty levels, training progress — clear and actionable.
Develop the backend infrastructure for scalable hosting and evaluation of RL environments.
Design and implement APIs that let researchers programmatically interact with the platform.
Work directly with our research team to understand their workflows and translate needs into product features.
Own the full stack, from database design to frontend polish, ensuring the platform is both powerful and pleasant to use.
Establish best practices for testing, deployment, and monitoring as we scale.
What We're Looking For Technical skills:
2+ years of full-stack development experience with strong fundamentals in both backend and frontend.
Proficiency in Python and modern web frameworks.
Experience with cloud platforms (GCP, AWS) and distributed systems.
Strong experience with CI/CD, testing, and production monitoring.
Ability to ship working prototypes quickly, then iterate toward production quality.
Product skills:
Good instincts for what makes software easy and enjoyable to use, particularly for technical users.
Ability to make product decisions with incomplete information—deciding what to build and what to defer.
Comfortable working closely with users (in this case, researchers) to understand pain points and validate solutions.
Experience designing systems that can start simple and scale as needs grow.
Logistics Location:
Mountain View, CA (Hybrid) or India (Remote)
Compensation:
Competitive salary and equity based on experience.
Benefits:
Health coverage, flexible work arrangements, and the opportunity to help define a new product category in AI infrastructure.
We encourage you to apply even if you don't meet every qualification. We value diverse perspectives and are happy to discuss how your unique background could be a great fit for this role.
#J-18808-Ljbffr
Recently, we curated Open Thoughts, one of the best open reasoning datasets used by multiple frontier labs, trained SOTA specialized models such as Bespoke-MiniChart-7B and Bespoke-MiniCheck, and taught agents to do multi-turn tool-calling with reinforcement learning.
Bespoke is uniquely positioned to capture a large market share of data and RL environment curation.
About the Role We're looking for our first Product Engineer — someone who bridges the gap between cutting-edge AI research and the tools that researchers actually want to use. You'll be building the interfaces and infrastructure that make RL environment curation accessible, intuitive, and powerful.
You possess the energy of a builder and the finesse of an artist. Maybe you've shipped side projects that solve real problems, or you've been the person turning ambitious ideas into working software. You care about the details, both in your code and in how your product feels to use. You can move quickly from concept to prototype, but you also know when it's time to invest in making something production-ready.
Working directly with the rest of our team, you'll create the platform that environment engineers use to build, test, and deploy training worlds for AI agents. This isn't just about infrastructure—it's about crafting experiences that make complex workflows feel simple.
What You'll Do
Build intuitive dashboards where environment engineers can upload, manage, and monitor their RL environments.
Create visualization tools that make complex metrics — environment quality, difficulty levels, training progress — clear and actionable.
Develop the backend infrastructure for scalable hosting and evaluation of RL environments.
Design and implement APIs that let researchers programmatically interact with the platform.
Work directly with our research team to understand their workflows and translate needs into product features.
Own the full stack, from database design to frontend polish, ensuring the platform is both powerful and pleasant to use.
Establish best practices for testing, deployment, and monitoring as we scale.
What We're Looking For Technical skills:
2+ years of full-stack development experience with strong fundamentals in both backend and frontend.
Proficiency in Python and modern web frameworks.
Experience with cloud platforms (GCP, AWS) and distributed systems.
Strong experience with CI/CD, testing, and production monitoring.
Ability to ship working prototypes quickly, then iterate toward production quality.
Product skills:
Good instincts for what makes software easy and enjoyable to use, particularly for technical users.
Ability to make product decisions with incomplete information—deciding what to build and what to defer.
Comfortable working closely with users (in this case, researchers) to understand pain points and validate solutions.
Experience designing systems that can start simple and scale as needs grow.
Logistics Location:
Mountain View, CA (Hybrid) or India (Remote)
Compensation:
Competitive salary and equity based on experience.
Benefits:
Health coverage, flexible work arrangements, and the opportunity to help define a new product category in AI infrastructure.
We encourage you to apply even if you don't meet every qualification. We value diverse perspectives and are happy to discuss how your unique background could be a great fit for this role.
#J-18808-Ljbffr