David AI
Software Engineer (Internship, Summer 2026)
David AI, San Francisco, California, United States, 94199
About David AI
David AI is the world's first audio data research company. We bring an R&D approach to data-developing datasets with the same rigor AI labs bring to models. Our mission is to bring AI into the real world, and we believe audio is the gateway. Speech is versatile, accessible, and human-it fits naturally into everyday life. As audio AI advances and new use cases emerge, high-quality training data is the bottleneck. This is where David AI comes in.
David AI was founded in 2024 by a team of former Scale AI engineers and operators. In less than a year, we've brought on most FAANG companies and AI labs as customers. We recently raised a $50M Series B from Meritech, NVIDIA, Jack Altman (Alt Capital), Amplify Partners, First Round Capital and other Tier 1 investors.
Our team is sharp, humble, ambitious, and tight-knit. We're looking for the best research, engineering, product, and operations minds to join us on our mission to push the frontier of audio AI.
About our Engineering team
At David AI, our engineers build the pipelines, platforms, and models that transform raw audio into high-signal data for leading AI labs and enterprises. We're a tight-knit team of product engineers, infrastructure specialists, and machine learning experts focused on building the world's first audio data research company.
We move fast, own our work end-to-end, and ship to production daily. Our team designs real-time pipelines handling terabytes of speech data and deploys cutting-edge generative audio models.
About this role
As a
Software Engineering Intern
at David AI, you'll build cutting-edge tools that help our users make sense of the audio data they'll use to train their models, working closely with researchers to consistently iterate on how to best collect our data.
This internship will be for Summer 2026, and you will need to be able to work out of the San Francisco office during this time.
In this role, you will Ship full-stack features
that thousands of users will interact with daily. Build scalable systems
to create core data processing pipelines that derive actionable insights from terabytes of audio data every day. Build, deploy, and evaluate LLM and DSP based solutions
to increase our customers' understanding of nuanced features across our datasets. Get early insights into research direction and improving frontier audio model capabilities
months before the market. Example projects
These are some examples of projects that engineers on our team have worked on recently: Build robust fraud-detection systems to remove bad actors to protect our contributor base. Create optimized product and tooling used by 10K+ contributors monthly to complete audio related tasks. Develop new AI infrastructure products to visualize, query, and assess different shapes of audio data. Ship new tools that support and accelerate the growth of newly qualified contributors, so they can do their best work. Your background looks like Pursuing or completing a B.S. (or higher) in Computer Science or a related technical field with a graduation date by summer 2027. Track record of building full-stack web apps, integrating with relevant services and APIs, and shipping new features and products at a rapid pace. Strong coding fundamentals in at least one modern language (Python, Rust, TypeScript etc.). Some technologies we work with
Next.js, TypeScript, TailwindCSS, Node.js, tRPC, PostgreSQL, AWS, Trigger.dev, WebRTC, FFmpeg.
David AI is the world's first audio data research company. We bring an R&D approach to data-developing datasets with the same rigor AI labs bring to models. Our mission is to bring AI into the real world, and we believe audio is the gateway. Speech is versatile, accessible, and human-it fits naturally into everyday life. As audio AI advances and new use cases emerge, high-quality training data is the bottleneck. This is where David AI comes in.
David AI was founded in 2024 by a team of former Scale AI engineers and operators. In less than a year, we've brought on most FAANG companies and AI labs as customers. We recently raised a $50M Series B from Meritech, NVIDIA, Jack Altman (Alt Capital), Amplify Partners, First Round Capital and other Tier 1 investors.
Our team is sharp, humble, ambitious, and tight-knit. We're looking for the best research, engineering, product, and operations minds to join us on our mission to push the frontier of audio AI.
About our Engineering team
At David AI, our engineers build the pipelines, platforms, and models that transform raw audio into high-signal data for leading AI labs and enterprises. We're a tight-knit team of product engineers, infrastructure specialists, and machine learning experts focused on building the world's first audio data research company.
We move fast, own our work end-to-end, and ship to production daily. Our team designs real-time pipelines handling terabytes of speech data and deploys cutting-edge generative audio models.
About this role
As a
Software Engineering Intern
at David AI, you'll build cutting-edge tools that help our users make sense of the audio data they'll use to train their models, working closely with researchers to consistently iterate on how to best collect our data.
This internship will be for Summer 2026, and you will need to be able to work out of the San Francisco office during this time.
In this role, you will Ship full-stack features
that thousands of users will interact with daily. Build scalable systems
to create core data processing pipelines that derive actionable insights from terabytes of audio data every day. Build, deploy, and evaluate LLM and DSP based solutions
to increase our customers' understanding of nuanced features across our datasets. Get early insights into research direction and improving frontier audio model capabilities
months before the market. Example projects
These are some examples of projects that engineers on our team have worked on recently: Build robust fraud-detection systems to remove bad actors to protect our contributor base. Create optimized product and tooling used by 10K+ contributors monthly to complete audio related tasks. Develop new AI infrastructure products to visualize, query, and assess different shapes of audio data. Ship new tools that support and accelerate the growth of newly qualified contributors, so they can do their best work. Your background looks like Pursuing or completing a B.S. (or higher) in Computer Science or a related technical field with a graduation date by summer 2027. Track record of building full-stack web apps, integrating with relevant services and APIs, and shipping new features and products at a rapid pace. Strong coding fundamentals in at least one modern language (Python, Rust, TypeScript etc.). Some technologies we work with
Next.js, TypeScript, TailwindCSS, Node.js, tRPC, PostgreSQL, AWS, Trigger.dev, WebRTC, FFmpeg.