Somite AI
Senior Bioinformatics Scientist – Transcriptomics
Somite AI, Boston, Massachusetts, us, 02298
Senior Bioinformatics Scientist – Transcriptomics
About Somite
Somite.ai is a venture‑backed company aiming to become the
OpenAI of stem cell biology , developing
AI foundation models
for human stem cells to drive novel cell therapies at scale.
We take a full‑stack approach – generating data at 1,000× lower cost, training large‑scale models with unmatched predictive power, and using these models to discover, refine, and optimize high‑impact applications in cell therapy.
Our founding team includes repeat AI entrepreneur Dr. Micha Breakstone (Chorus.ai, acquired for $575M), the Head of the Fundamental AI Group at MIT, and three Harvard Medical School professors and National Academy of Sciences/Medicine members, including the Chair of the Genetics Department at HMS. Based in Boston and launching operations in January 2024, Somite has raised close to $60 M to date.
Core Values
We show up – fully accountable, all‑in, doing whatever it takes
We act with urgency – swift, decisive, proactive
We support one another – collaborative, helpful, empathetic
Location Boston, MA (onsite, full‑time)
About the Role We seek a skilled
bioinformatician
to lead the design and execution of analysis pipelines for large‑scale transcriptomic datasets. You’ll be responsible for preprocessing, quality control, and biological interpretation of sequencing data, and for contributing to the development of novel sequencing protocols and technologies in close collaboration with wet‑lab scientists, computational biologists and machine‑learning researchers to support Somite’s therapeutic programs.
This is a hands‑on, data‑centric role ideal for someone who thrives in a fast‑paced, interdisciplinary environment and wants to help build transformative cell‑therapy technologies from the ground up.
Responsibilities
Build, maintain, and run robust, reproducible pipelines for single‑cell and bulk RNA‑seq data (QC, alignment, quantification, normalization, clustering, annotation, etc.).
Apply standard and advanced analyses using tools like Scanpy, Seurat, and R/Bioconductor packages.
Process and interpret data from state‑of‑the‑art single‑cell transcriptomic and multiomic platforms, alongside new in‑house protocols being developed to enable innovative microbiological sequencing workflows.
Collaborate with wet‑lab scientists to support experimental design and data interpretation.
Mentor and foster collaboration and knowledge sharing across disciplines.
Help design and validate cutting‑edge protocols for novel transcriptomic technologies, integrating new experimental modalities with bioinformatics pipeline.
Qualifications
PhD in Bioinformatics, Genomics, Computational Biology, or a related field (or equivalent industry experience).
3+ years of hands‑on experience analyzing large‑scale single‑cell or bulk transcriptomic datasets.
Proficiency with common genomics tools: SAMtools, STAR/STARsolo, Bowtie, Tophat, HTSeq, Clustal, bash scripting, etc.
Strong command of Scanpy or Seurat.
Demonstrated fluency in Nextflow/Snakemake and workflow reproducibility principles.
Strong R and/or Python programming skills.
Excellent organizational and documentation habits.
Preferred Qualifications
Experience working in a quantitative lab or with biotech/pharma pipelines.
Experience working in a fast‑paced startup or interdisciplinary team.
The opportunity to contribute to platform technologies with real‑world therapeutic impact.
Experience at the interface of technology development and data analysis, particularly involving novel molecular biology or sequencing techniques.
What We Offer
A mission‑driven team reshaping healthcare through innovation in AI and cellular engineering.
The opportunity to contribute to platform technologies with real‑world therapeutic impact.
Competitive compensation and equity in a high‑growth company.
Flexible work arrangements, generous benefits, and a collaborative environment.
Exceptional candidates who demonstrate outstanding capabilities and potential will be considered, even if they do not meet every qualification listed.
Join us and help unlock the full potential of AI for the benefit of human health!
#J-18808-Ljbffr
OpenAI of stem cell biology , developing
AI foundation models
for human stem cells to drive novel cell therapies at scale.
We take a full‑stack approach – generating data at 1,000× lower cost, training large‑scale models with unmatched predictive power, and using these models to discover, refine, and optimize high‑impact applications in cell therapy.
Our founding team includes repeat AI entrepreneur Dr. Micha Breakstone (Chorus.ai, acquired for $575M), the Head of the Fundamental AI Group at MIT, and three Harvard Medical School professors and National Academy of Sciences/Medicine members, including the Chair of the Genetics Department at HMS. Based in Boston and launching operations in January 2024, Somite has raised close to $60 M to date.
Core Values
We show up – fully accountable, all‑in, doing whatever it takes
We act with urgency – swift, decisive, proactive
We support one another – collaborative, helpful, empathetic
Location Boston, MA (onsite, full‑time)
About the Role We seek a skilled
bioinformatician
to lead the design and execution of analysis pipelines for large‑scale transcriptomic datasets. You’ll be responsible for preprocessing, quality control, and biological interpretation of sequencing data, and for contributing to the development of novel sequencing protocols and technologies in close collaboration with wet‑lab scientists, computational biologists and machine‑learning researchers to support Somite’s therapeutic programs.
This is a hands‑on, data‑centric role ideal for someone who thrives in a fast‑paced, interdisciplinary environment and wants to help build transformative cell‑therapy technologies from the ground up.
Responsibilities
Build, maintain, and run robust, reproducible pipelines for single‑cell and bulk RNA‑seq data (QC, alignment, quantification, normalization, clustering, annotation, etc.).
Apply standard and advanced analyses using tools like Scanpy, Seurat, and R/Bioconductor packages.
Process and interpret data from state‑of‑the‑art single‑cell transcriptomic and multiomic platforms, alongside new in‑house protocols being developed to enable innovative microbiological sequencing workflows.
Collaborate with wet‑lab scientists to support experimental design and data interpretation.
Mentor and foster collaboration and knowledge sharing across disciplines.
Help design and validate cutting‑edge protocols for novel transcriptomic technologies, integrating new experimental modalities with bioinformatics pipeline.
Qualifications
PhD in Bioinformatics, Genomics, Computational Biology, or a related field (or equivalent industry experience).
3+ years of hands‑on experience analyzing large‑scale single‑cell or bulk transcriptomic datasets.
Proficiency with common genomics tools: SAMtools, STAR/STARsolo, Bowtie, Tophat, HTSeq, Clustal, bash scripting, etc.
Strong command of Scanpy or Seurat.
Demonstrated fluency in Nextflow/Snakemake and workflow reproducibility principles.
Strong R and/or Python programming skills.
Excellent organizational and documentation habits.
Preferred Qualifications
Experience working in a quantitative lab or with biotech/pharma pipelines.
Experience working in a fast‑paced startup or interdisciplinary team.
The opportunity to contribute to platform technologies with real‑world therapeutic impact.
Experience at the interface of technology development and data analysis, particularly involving novel molecular biology or sequencing techniques.
What We Offer
A mission‑driven team reshaping healthcare through innovation in AI and cellular engineering.
The opportunity to contribute to platform technologies with real‑world therapeutic impact.
Competitive compensation and equity in a high‑growth company.
Flexible work arrangements, generous benefits, and a collaborative environment.
Exceptional candidates who demonstrate outstanding capabilities and potential will be considered, even if they do not meet every qualification listed.
Join us and help unlock the full potential of AI for the benefit of human health!
#J-18808-Ljbffr