Biohub
Senior Staff Data Scientist, Imaging, Data
Biohub, Redwood City, California, United States, 94061
Biohub is leading the new era of AI‑powered biology to cure or prevent disease through its 501(c)(3) medical research organization, supported by the Chan Zuckerberg Initiative.
The Team
Biohub supports the science and technology that will help scientists cure, prevent, or manage all diseases by the end of this century. While this may seem ambitious, biomedical science has made tremendous strides in understanding biological systems, advancing human health, and treating disease.
Achieving our mission will only be possible if scientists are able to better understand human biology. To that end, we have identified four grand challenges that will unlock the mysteries of the cell and how cells interact within systems—paving the way for new discoveries that will change medicine in the decades that follow.
Building an AI‑based virtual cell model to predict and understand cellular behavior
Developing novel imaging technologies to map, measure and model complex biological systems
Creating new tools for sensing and directly measuring inflammation within tissues in real time to better understand inflammation, a key driver of many diseases
Harnessing the immune system for early detection, prevention, and treatment of disease
The Opportunity
Single‑cell transcriptomic data covering billions of standardized cells, with a focus on measuring genetic and environmental perturbations
Tens of thousands of donor‑matched DNA & RNA samples
PB‑scale static and dynamic imaging datasets
TB‑scale mass spectrometry datasets
Diverse, large multi‑modal biological datasets that enable biological bridges across measurement types and facilitate multi‑modal model training to define how cells act.
After model training, we make all data products available through public resources like CELLxGENE Discover and the Cryo‑ET Portal, used by tens of thousands of scientists monthly to advance understanding of genetic variants, disease risk, drug toxicities, and therapeutic discovery.
As a Senior Staff Data Scientist, you’ll lead the creation of groundbreaking imaging datasets that decode cellular function at the molecular level, describe development, and predict responses to genetic or environmental changes. Working at the intersection of data science, biology, and AI, you’ll define data needs, format standards, analysis approaches, quality metrics, and our technical strategy—creating systems to ingest, transform, validate, and deploy data products.
Success in this role
Success for this role means delivering high‑quality, usable datasets that directly address modeling challenges and accelerate scientific progress.
What You’ll Do
Define the technical strategy for a robust imaging data ecosystem: build data ingestion pipelines, define data formats, and write validation tools, QC metrics, and analysis pipelines.
Collaborate with ML engineers, AI researchers, and data engineers to iteratively evaluate, refine and grow datasets to maximize model performance.
Discover and define new data generation opportunities, and manage the delivery of those data products to our AI team.
Collaborate with engineers, product managers, UX designers, and other data scientists to publish valuable datasets as part of CZI’s open data ecosystem.
What You’ll Bring
10+ years of experience with large‑scale biological imaging data.
Demonstrated delivery of multiple large biological data products.
Experience with big data: extraction, transport, loading, databases, standardization, validation, QC, and analysis.
Experience with processing and orchestration pipelines, such as Argo Workflows and Databricks.
Strong fundamentals in statistical reasoning and machine learning.
Experience with biological data analysis and QC best practices.
Excellent written and verbal communication skills.
Enthusiasm to ramp up on technologies and learn new domains.
Experience working in a multidisciplinary environment (engineering, product, AI research).
Compensation
The Redwood City, CA base pay range for a new hire in this role is $241,000 – $331,100. New hires are typically hired into the lower portion of the range, enabling employee growth in the range over time. Actual placement in range is based on job‑related skills and experience, as evaluated throughout the interview process.
Better Together
This role is a hybrid position requiring you to be onsite for at least 60% of the working month, approximately three days a week, with specific in‑office days determined by the team’s manager. The exact schedule will be at the hiring manager’s discretion and communicated during the interview process.
Benefits for the Whole You
Generous employer match on employee 401(k) contributions to support planning for the future.
Paid time off to volunteer at an organization of your choice.
Funding for select family‑forming benefits.
Relocation support for employees who need assistance moving.
If you’re interested in a role but your previous experience doesn’t perfectly align with each qualification in the job description, we still encourage you to apply as you may be the perfect fit for this or another role.
Seniority level Mid‑Senior level
Employment type Full‑time
Job function Engineering and Information Technology