techire ai
Join to apply for the
Research Scientist
role at
techire ai
Want to push the boundaries of what reinforcement learning can achieve with frontier models? In this role you will be advancing reinforcement learning methods for large-scale AI systems. You’ll be applying RL techniques to enhance reasoning, planning, and decision-making in models that directly impact fields from biology to climate and materials science.
Your work will combine RL with large language models, experimenting with RLHF, PPO, and DPO, designing evaluation frameworks, and fine‑tuning models at scale. The aim is to go beyond benchmarks and deliver models that researchers can use to accelerate discovery.
You will be a driving force in a team that is building towards a broader superintelligence platform: models that don’t just generate text or data, but drive breakthroughs across multiple domains. As part of this, you’ll collaborate with domain experts to ensure your research translates into real‑world scientific progress.
Base pay range $250,000.00/yr - $400,000.00/yr
Responsibilities Advance reinforcement learning methods for large‑scale AI systems and apply them to improve reasoning, planning, and decision‑making in models across diverse domains.
Experiment with RLHF, PPO, DPO, and other RL techniques on large language models.
Design evaluation frameworks and fine‑tune models at scale.
Collaborate with domain experts to translate research into real‑world scientific progress.
Qualifications
Deep expertise in reinforcement learning (policy optimisation, value‑based, or model‑based methods).
Experience applying RL to large models (RLHF, PPO, DPO).
Hands‑on experience with model training and fine‑tuning at scale.
PhD in Computer Science, Machine Learning, Robotics, or related field, with contributions to top‑tier conferences (NeurIPS, ICML, ICLR, AAAI).
Experience with distributed computing platforms (cloud or HPC clusters).
Track record of running rigorous experiments and improving models based on results.
If you have experience with multi‑agent RL, hierarchical/offline RL, or domain‑specific work with scientific datasets you will be an ideal candidate for this position.
Compensation and Benefits Package: $250k - $400k base + bonus + stock.
Location: SF Bay area or potential for remote with travel to office when needed.
All applicants will receive a response.
Seniority level
Associate
Employment type
Full‑time
Job function
Engineering, Science, and Information Technology
IT Services and IT Consulting, Research Services, and Software Development
Referrals increase your chances of interviewing at techire ai by 2x.
Sign in to set job alerts for “Research Scientist” roles.
#J-18808-Ljbffr
Research Scientist
role at
techire ai
Want to push the boundaries of what reinforcement learning can achieve with frontier models? In this role you will be advancing reinforcement learning methods for large-scale AI systems. You’ll be applying RL techniques to enhance reasoning, planning, and decision-making in models that directly impact fields from biology to climate and materials science.
Your work will combine RL with large language models, experimenting with RLHF, PPO, and DPO, designing evaluation frameworks, and fine‑tuning models at scale. The aim is to go beyond benchmarks and deliver models that researchers can use to accelerate discovery.
You will be a driving force in a team that is building towards a broader superintelligence platform: models that don’t just generate text or data, but drive breakthroughs across multiple domains. As part of this, you’ll collaborate with domain experts to ensure your research translates into real‑world scientific progress.
Base pay range $250,000.00/yr - $400,000.00/yr
Responsibilities Advance reinforcement learning methods for large‑scale AI systems and apply them to improve reasoning, planning, and decision‑making in models across diverse domains.
Experiment with RLHF, PPO, DPO, and other RL techniques on large language models.
Design evaluation frameworks and fine‑tune models at scale.
Collaborate with domain experts to translate research into real‑world scientific progress.
Qualifications
Deep expertise in reinforcement learning (policy optimisation, value‑based, or model‑based methods).
Experience applying RL to large models (RLHF, PPO, DPO).
Hands‑on experience with model training and fine‑tuning at scale.
PhD in Computer Science, Machine Learning, Robotics, or related field, with contributions to top‑tier conferences (NeurIPS, ICML, ICLR, AAAI).
Experience with distributed computing platforms (cloud or HPC clusters).
Track record of running rigorous experiments and improving models based on results.
If you have experience with multi‑agent RL, hierarchical/offline RL, or domain‑specific work with scientific datasets you will be an ideal candidate for this position.
Compensation and Benefits Package: $250k - $400k base + bonus + stock.
Location: SF Bay area or potential for remote with travel to office when needed.
All applicants will receive a response.
Seniority level
Associate
Employment type
Full‑time
Job function
Engineering, Science, and Information Technology
IT Services and IT Consulting, Research Services, and Software Development
Referrals increase your chances of interviewing at techire ai by 2x.
Sign in to set job alerts for “Research Scientist” roles.
#J-18808-Ljbffr