kadence
Founding Research Scientist – Multimodal AI & LLMs
Founded by a team of MIT, Stanford, and Oxford PhDs with experience at leading AI labs and top tech companies, the team combines deep research expertise with real-world product impact.
About Us
Our client is an early‑stage AI research company developing next‑generation multimodal intelligence. Our focus is on creating systems that can understand and respond to human behavior in real time, not just through text but through tone, emotion, and expression.
Our mission is to close the gap between human and machine interaction by building models that truly “see,” “hear,” and “understand.”
The Role
As a Founding Research Scientist, you’ll join a small team of researchers and engineers building foundational technology for real‑time multimodal learning. This role blends deep research with high‑impact engineering and is ideal for someone who wants to turn bold ideas into tangible breakthroughs.
You’ll explore how large‑scale language models, vision transformers, and speech systems can converge into a single unified framework for human‑level interaction.
What You’ll Do
Design and train multimodal large language models that combine text, voice, and vision signals.
Prototype new architectures and data pipelines for real‑time inference and low‑latency response.
Conduct experiments, evaluate model behavior, and push the limits of expressive AI systems.
Work closely with engineers to transition research models into production‑grade implementations.
Contribute to publications and open research initiatives.
What You Bring
PhD or equivalent experience in Machine Learning, Computer Vision, NLP, or related fields.
Strong understanding of LLMs, diffusion models, or multimodal transformer architectures.
Experience with deep learning frameworks such as PyTorch or JAX.
Proven track record in training and evaluating large‑scale AI systems.
A love for uncharted technical challenges and independent problem‑solving.
Strong communication skills and the ability to collaborate across disciplines.
Why Join
Be part of the founding research team shaping a new frontier in multimodal AI.
Collaborate with world‑class peers and advisors from leading universities and labs.
Work on problems that haven’t been solved yet — and see your ideas shape real‑world products.
Backed by top‑tier investors in deep tech and AI.
Seniority level: Mid‑Senior level
Employment type: Full‑time
Job function: Research and Engineering
Industries: Software Development and Technology, Information and Media