Stealth Startup
Senior Software Engineer – Foundational Data Systems for AI [32762]
Stealth Startup, Mountain View, California, us, 94039
Senior Software Engineer – Foundational Data Systems for AI
2 days ago Be among the first 25 applicants
Get AI-powered advice on this job and more exclusive features.
The company is an AI research and systems company building the infrastructure for a new kind of intelligence: one that is structured, efficient, and deeply integrated with data.
Our systems operate at
exabyte scale , processing
petabytes of data each day
for some of the world’s most prominent enterprises in finance, technology, and industry. These systems are already making a measurable difference in how global organizations use data to deploy AI safely and efficiently.
We believe that the next generation of enterprise AI will not come from larger models but from
more efficient data systems . By advancing the frontier of how data is represented, stored, and transformed, we aim to make large-scale intelligence creation sustainable and adaptive.
Our long-term vision is
Efficient Intelligence : AI that learns using fewer resources, generalizes from less data, and reasons through structure rather than scale. To reach that, we are first building the
Foundational Data Systems
that make structured AI possible.
The Mission
AI today is limited not only by model design but by the inefficiency of the data that feeds it. At scale, each redundant byte, each poorly organized dataset, and each inefficient data path slows progress and compounds into enormous cost, latency, and energy waste.
company’s mission is to remove that inefficiency. We combine new research in
information theory ,
probabilistic modeling , and
distributed systems
to design self-optimizing data infrastructure: systems that continuously improve how information is represented and used by AI.
This engineering team partners closely with the company's Research group led by
Prof. Andrea Montanari
(Stanford), bridging advances in information theory and learning efficiency with large-scale distributed systems. Together, we share a conviction that the next leap in AI will come from breakthroughs in efficient systems, not just larger models.
What You’ll Build
Global Metadata Substrate.
Architect the transactional and metadata substrate that supports time-travel, schema evolution, and atomic consistency across petabyte-scale tabular datasets.
Adaptive Engines.
Build systems that reorganize data autonomously, learning from access patterns and workloads to maintain peak efficiency without manual tuning.
Intelligent Data Layouts.
Optimize bit-level organization (encoding, compression, layout) to extract maximal signal per byte read.
Autonomous Compute Pipelines.
Develop distributed compute systems that scale predictably, adapt to dynamic load, and maintain reliability under failure.
Research to Production.
Implement new algorithms in compression, representation, and optimization emerging from ongoing research. Opportunities to publish and open-source are encouraged.
Latency as Intelligence.
Design for minimal time between question and insight, enabling models and humans to learn faster from data.
What You Bring
Experience with columnar formats such as
Parquet
or
ORC
and low-level encoding strategies.
Understanding of metadata-driven architectures and adaptive query planning.
Production experience with
Spark ,
Flink , or custom distributed engines on cloud object storage.
Proficiency in
Java, Rust, Go, or C++
with an emphasis on clarity and quality.
Curiosity about theory of the mathematics of compression, entropy, and learning efficiency.
A builder’s mindset: pragmatic, rigorous, and grounded in long-term systems thinking.
Bonus
Familiarity with
Iceberg ,
Delta Lake , or
Hudi .
Research or open-source contributions in compression, indexing, or distributed computation.
Interest in how data representation affects training dynamics and model reasoning efficiency.
Why us
Fundamental Research Meets Enterprise Impact.
Work at the intersection of science and engineering, turning foundational research into deployed systems serving enterprise workloads at exabyte scale.
AI by Design.
Build the infrastructure that defines how efficiently the world can create and apply intelligence.
Real Ownership.
Design primitives that will underpin the next decade of AI infrastructure.
High-Trust Environment.
Deep technical work, minimal bureaucracy, shared mission.
Enduring Horizon.
Backed by NEA, Bain Capital, and various luminaries from tech and business. We are building a generational company for decades, not quarters or a product cycle.
Competitive salary, meaningful equity, and substantial bonus for top performers
Flexible time off plus comprehensive health coverage for you and your family
Support for research, publication, and deep technical exploration
Join us to build the foundational data systems that power the future of enterprise AI.
Here you will shape the fundamental infrastructure that makes intelligence itself efficient, structured, and enduring.
Seniority level Mid-Senior level
Employment type Full-time
Job function Technology, Information and Internet
Referrals increase your chances of interviewing at Stealth Startup by 2x
#J-18808-Ljbffr
Get AI-powered advice on this job and more exclusive features.
The company is an AI research and systems company building the infrastructure for a new kind of intelligence: one that is structured, efficient, and deeply integrated with data.
Our systems operate at
exabyte scale , processing
petabytes of data each day
for some of the world’s most prominent enterprises in finance, technology, and industry. These systems are already making a measurable difference in how global organizations use data to deploy AI safely and efficiently.
We believe that the next generation of enterprise AI will not come from larger models but from
more efficient data systems . By advancing the frontier of how data is represented, stored, and transformed, we aim to make large-scale intelligence creation sustainable and adaptive.
Our long-term vision is
Efficient Intelligence : AI that learns using fewer resources, generalizes from less data, and reasons through structure rather than scale. To reach that, we are first building the
Foundational Data Systems
that make structured AI possible.
The Mission
AI today is limited not only by model design but by the inefficiency of the data that feeds it. At scale, each redundant byte, each poorly organized dataset, and each inefficient data path slows progress and compounds into enormous cost, latency, and energy waste.
company’s mission is to remove that inefficiency. We combine new research in
information theory ,
probabilistic modeling , and
distributed systems
to design self-optimizing data infrastructure: systems that continuously improve how information is represented and used by AI.
This engineering team partners closely with the company's Research group led by
Prof. Andrea Montanari
(Stanford), bridging advances in information theory and learning efficiency with large-scale distributed systems. Together, we share a conviction that the next leap in AI will come from breakthroughs in efficient systems, not just larger models.
What You’ll Build
Global Metadata Substrate.
Architect the transactional and metadata substrate that supports time-travel, schema evolution, and atomic consistency across petabyte-scale tabular datasets.
Adaptive Engines.
Build systems that reorganize data autonomously, learning from access patterns and workloads to maintain peak efficiency without manual tuning.
Intelligent Data Layouts.
Optimize bit-level organization (encoding, compression, layout) to extract maximal signal per byte read.
Autonomous Compute Pipelines.
Develop distributed compute systems that scale predictably, adapt to dynamic load, and maintain reliability under failure.
Research to Production.
Implement new algorithms in compression, representation, and optimization emerging from ongoing research. Opportunities to publish and open-source are encouraged.
Latency as Intelligence.
Design for minimal time between question and insight, enabling models and humans to learn faster from data.
What You Bring
Experience with columnar formats such as
Parquet
or
ORC
and low-level encoding strategies.
Understanding of metadata-driven architectures and adaptive query planning.
Production experience with
Spark ,
Flink , or custom distributed engines on cloud object storage.
Proficiency in
Java, Rust, Go, or C++
with an emphasis on clarity and quality.
Curiosity about theory of the mathematics of compression, entropy, and learning efficiency.
A builder’s mindset: pragmatic, rigorous, and grounded in long-term systems thinking.
Bonus
Familiarity with
Iceberg ,
Delta Lake , or
Hudi .
Research or open-source contributions in compression, indexing, or distributed computation.
Interest in how data representation affects training dynamics and model reasoning efficiency.
Why us
Fundamental Research Meets Enterprise Impact.
Work at the intersection of science and engineering, turning foundational research into deployed systems serving enterprise workloads at exabyte scale.
AI by Design.
Build the infrastructure that defines how efficiently the world can create and apply intelligence.
Real Ownership.
Design primitives that will underpin the next decade of AI infrastructure.
High-Trust Environment.
Deep technical work, minimal bureaucracy, shared mission.
Enduring Horizon.
Backed by NEA, Bain Capital, and various luminaries from tech and business. We are building a generational company for decades, not quarters or a product cycle.
Competitive salary, meaningful equity, and substantial bonus for top performers
Flexible time off plus comprehensive health coverage for you and your family
Support for research, publication, and deep technical exploration
Join us to build the foundational data systems that power the future of enterprise AI.
Here you will shape the fundamental infrastructure that makes intelligence itself efficient, structured, and enduring.
Seniority level Mid-Senior level
Employment type Full-time
Job function Technology, Information and Internet
Referrals increase your chances of interviewing at Stealth Startup by 2x
#J-18808-Ljbffr