SpatialData: an open and universal data framework for spatial omics.


Journal

Nature methods
ISSN: 1548-7105
Titre abrégé: Nat Methods
Pays: United States
ID NLM: 101215604

Informations de publication

Date de publication:
20 Mar 2024
Historique:
received: 15 05 2023
accepted: 14 02 2024
medline: 21 3 2024
pubmed: 21 3 2024
entrez: 21 3 2024
Statut: aheadofprint

Résumé

Spatially resolved omics technologies are transforming our understanding of biological tissues. However, the handling of uni- and multimodal spatial omics datasets remains a challenge owing to large data volumes, heterogeneity of data types and the lack of flexible, spatially aware data structures. Here we introduce SpatialData, a framework that establishes a unified and extensible multiplatform file-format, lazy representation of larger-than-memory data, transformations and alignment to common coordinate systems. SpatialData facilitates spatial annotations and cross-modal aggregation and analysis, the utility of which is illustrated in the context of multiple vignettes, including integrative analysis on a multimodal Xenium and Visium breast cancer study.

Identifiants

pubmed: 38509327
doi: 10.1038/s41592-024-02212-x
pii: 10.1038/s41592-024-02212-x
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Informations de copyright

© 2024. The Author(s).

Références

Asp, M., Bergenstråhle, J. & Lundeberg, J. Spatially resolved transcriptomes-next generation tools for tissue exploration. Bioessays 42, e1900221 (2020).
doi: 10.1002/bies.201900221 pubmed: 32363691
Rao, A., Barkley, D., França, G. S. & Yanai, I. Exploring tissue architecture using spatial transcriptomics. Nature 596, 211–220 (2021).
doi: 10.1038/s41586-021-03634-9 pubmed: 34381231 pmcid: 8475179
Vandereyken, K., Sifrim, A., Thienpont, B. & Voet, T. Methods and applications for single-cell and spatial multi-omics. Nat. Rev. Genet. 24, 494–515 (2023).
doi: 10.1038/s41576-023-00580-2 pubmed: 36864178
Seferbekova, Z., Lomakin, A., Yates, L. R. & Gerstung, M. Spatial biology of cancer evolution. Nat. Rev. Genet. 24, 295–313 (2023).
doi: 10.1038/s41576-022-00553-x pubmed: 36494509
Moses, L. & Pachter, L. Museum of spatial transcriptomics. Nat. Methods 19, 534–546 (2022).
doi: 10.1038/s41592-022-01409-2 pubmed: 35273392
Park, J. et al. Spatial omics technologies at multimodal and single cell/subcellular level. Genome Biol. 23, 256 (2022).
doi: 10.1186/s13059-022-02824-6 pubmed: 36514162 pmcid: 9746133
Moffitt, J. R., Lundberg, E. & Heyn, H. The emerging landscape of spatial profiling technologies. Nat. Rev. Genet. 23, 741–759 (2022).
doi: 10.1038/s41576-022-00515-3 pubmed: 35859028
Rood, J. E. et al. Toward a common coordinate framework for the human body. Cell 179, 1455–1467 (2019).
doi: 10.1016/j.cell.2019.11.019 pubmed: 31835027 pmcid: 6934046
Palla, G., Fischer, D. S., Regev, A. & Theis, F. J. Spatial components of molecular tissue biology. Nat. Biotechnol. 40, 308–318 (2022).
doi: 10.1038/s41587-021-01182-1 pubmed: 35132261
Wilkinson, M. D. et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data 3, 160018 (2016).
doi: 10.1038/sdata.2016.18 pubmed: 26978244 pmcid: 4792175
Moore, J. et al. OME-Zarr: a cloud-optimized bioimaging file format with international community support. Histochem. Cell Biol. 160, 223–251 (2023).
Moore, J. et al. OME-NGFF: a next-generation file format for expanding bioimaging data-access strategies. Nat. Methods 18, 1496–1498 (2021).
doi: 10.1038/s41592-021-01326-w pubmed: 34845388 pmcid: 8648559
Virshup, I. et al. The scverse project provides a computational ecosystem for single-cell omics data analysis. Nat. Biotechnol. 41, 604–606 (2023).
doi: 10.1038/s41587-023-01733-8 pubmed: 37037904
Wolf, F. A., Angerer, P. & Theis, F. J. SCANPY: large-scale single-cell gene expression data analysis. Genome Biol. 19, 15 (2018).
doi: 10.1186/s13059-017-1382-0 pubmed: 29409532 pmcid: 5802054
Palla, G. et al. Squidpy: a scalable framework for spatial omics analysis. Nat. Methods 19, 171–178 (2022).
doi: 10.1038/s41592-021-01358-2 pubmed: 35102346 pmcid: 8828470
Lopez, R., Regier, J., Cole, M. B., Jordan, M. I. & Yosef, N. Deep generative modeling for single-cell transcriptomics. Nat. Methods 15, 1053–1058 (2018).
doi: 10.1038/s41592-018-0229-2 pubmed: 30504886 pmcid: 6289068
Janesick, A. et al. High resolution mapping of the breast cancer tumor microenvironment using integrated single cell, spatial and in situ analysis of FFPE tissue. Nat. Commun. 14, 8353 (2023).
Gao, R. et al. Delineating copy number and clonal substructure in human tumors from single-cell transcriptomes. Nat. Biotechnol. 39, 599–608 (2021).
doi: 10.1038/s41587-020-00795-2 pubmed: 33462507 pmcid: 8122019
Wu, S. Z. et al. A single-cell and spatially resolved atlas of human breast cancers. Nat. Genet. 53, 1334–1347 (2021).
doi: 10.1038/s41588-021-00911-1 pubmed: 34493872 pmcid: 9044823
Kleshchevnikov, V. et al. Cell2location maps fine-grained cell types in spatial transcriptomics. Nat. Biotechnol. 40, 661–671 (2022).
doi: 10.1038/s41587-021-01139-4 pubmed: 35027729
Gentleman, R. C. et al. Bioconductor: open software development for computational biology and bioinformatics. Genome Biol. 5, R80 (2004).
doi: 10.1186/gb-2004-5-10-r80 pubmed: 15461798 pmcid: 545600
Keller, M. S. et al. Vitessce: a framework for integrative visualization of multi-modal and spatially-resolved single-cell data. Preprint at https://doi.org/10.31219/osf.io/y8thv (2023).
The MONAI Consortium. Project MONAI. Zenodo https://doi.org/10.5281/zenodo.4323059 (2020).
Gayoso, A. et al. A Python library for probabilistic analysis of single-cell omics data. Nat. Biotechnol. 40, 163–166 (2022).
doi: 10.1038/s41587-021-01206-w pubmed: 35132262
Virshup, I., Rybakov, S., Theis, F. J., Angerer, P. & Alexander Wolf, F. anndata: Annotated data. Preprint at bioRxiv https://doi.org/10.1101/2021.12.16.473007 (2021).
anndataR: AnnData interoperability in R. GitHub https://github.com/scverse/anndataR (2023).
Li, T. et al. WebAtlas pipeline for integrated single cell and spatial transcriptomic data. Preprint at bioRxiv https://doi.org/10.1101/2023.05.19.541329 (2023).
Moses, L., Jackson, K., Luebbert, L. & Pachter, L. Voyager: from geospatial to spatial omics. GitHub https://github.com/pachterlab/voyager (2023).
Righelli, D. et al. SpatialExperiment: infrastructure for spatially-resolved transcriptomics data in R using Bioconductor. Bioinformatics 38, 3128–3131 (2022).
Dries, R. et al. Giotto: a toolbox for integrative analysis and visualization of spatial expression data. Genome Biol. 22, 78 (2021).
doi: 10.1186/s13059-021-02286-2 pubmed: 33685491 pmcid: 7938609
Peters Couto, B. Z., Robertson, N., Patrick, E. & Ghazanfar, S. MoleculeExperiment enables consistent infrastructure for molecule-resolved spatial omics data in bioconductor. Bioinformatics 39, btad550 (2023).
doi: 10.1093/bioinformatics/btad550 pubmed: 37698995 pmcid: 10504467
Yuan, Z. et al. SODB facilitates comprehensive exploration of spatial omics data. Nat. Methods 20, 387–399 (2023).
doi: 10.1038/s41592-023-01773-7 pubmed: 36797409
Xu, Z. et al. STOmicsDB: a database of Spatial Transcriptomic data. Nucleic Acids Res. 52, D1053–D1061 (2024).
Baker, E. A. G. et al. emObject: domain specific data abstraction for spatial omics. Preprint at bioRxiv https://doi.org/10.1101/2023.06.07.543950 (2023).
Long, B., Miller, J. & The SpaceTx Consortium. SpaceTx: a roadmap for benchmarking spatial transcriptomics exploration of the brain. Preprint at https://doi.org/10.48550/arXiv.2301.08436 (2023).
Erickson, A. et al. Spatially resolved clonal copy number alterations in benign and malignant tissue. Nature 608, 360–367 (2022).
doi: 10.1038/s41586-022-05023-2 pubmed: 35948708 pmcid: 9365699

Auteurs

Luca Marconato (L)

European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany.
Division of Computational Genomics and System Genetics, German Cancer Research Center, Heidelberg, Germany.
Collaboration for joint PhD degree between EMBL and Heidelberg University, Faculty of Biosciences, Heidelberg, Germany.

Giovanni Palla (G)

Institute of Computational Biology, Helmholtz, Center Munich, Munich, Germany.
TUM School of Life Sciences Weihenstephan, Technical University of Munich, Munich, Germany.

Kevin A Yamauchi (KA)

Department of Biosystems, Science and Engineering, ETH Zürich, Basel, Switzerland.
Swiss Institute of Bioinformatics, Basel, Switzerland.

Isaac Virshup (I)

Institute of Computational Biology, Helmholtz, Center Munich, Munich, Germany.

Elyas Heidari (E)

European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany.
Division of Computational Genomics and System Genetics, German Cancer Research Center, Heidelberg, Germany.
Division of Artificial Intelligence in Oncology, German Cancer Research Center, Heidelberg, Germany.

Tim Treis (T)

Division of Computational Genomics and System Genetics, German Cancer Research Center, Heidelberg, Germany.
Institute of Computational Biology, Helmholtz, Center Munich, Munich, Germany.

Wouter-Michiel Vierdag (WM)

European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany.

Marcella Toth (M)

Institute of Computational Biology, Helmholtz, Center Munich, Munich, Germany.

Sonja Stockhaus (S)

Institute of Computational Biology, Helmholtz, Center Munich, Munich, Germany.
TUM School of Computation, Information and Technology, Technical University of Munich, Munich, Germany.

Rahul B Shrestha (RB)

Institute of Computational Biology, Helmholtz, Center Munich, Munich, Germany.

Benjamin Rombaut (B)

Data Mining and Modeling for Biomedicine, VIB Center for Inflammation Research, Ghent, Belgium.
Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Ghent, Belgium.
VIB Center for AI and Computational Biology, Ghent, Belgium.

Lotte Pollaris (L)

Data Mining and Modeling for Biomedicine, VIB Center for Inflammation Research, Ghent, Belgium.
Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Ghent, Belgium.
VIB Center for AI and Computational Biology, Ghent, Belgium.

Laurens Lehner (L)

Institute of Computational Biology, Helmholtz, Center Munich, Munich, Germany.
TUM School of Computation, Information and Technology, Technical University of Munich, Munich, Germany.

Harald Vöhringer (H)

European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany.
Molecular Medicine Partnership Unit, Heidelberg, Germany.
Department of Medicine V, Hematology, Oncology, and Rheumatology, University of Heidelberg, Heidelberg, Germany.

Ilia Kats (I)

Division of Computational Genomics and System Genetics, German Cancer Research Center, Heidelberg, Germany.

Yvan Saeys (Y)

Data Mining and Modeling for Biomedicine, VIB Center for Inflammation Research, Ghent, Belgium.
Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Ghent, Belgium.
VIB Center for AI and Computational Biology, Ghent, Belgium.

Sinem K Saka (SK)

European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany.

Wolfgang Huber (W)

European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany.

Moritz Gerstung (M)

Division of Artificial Intelligence in Oncology, German Cancer Research Center, Heidelberg, Germany.

Josh Moore (J)

German BioImaging - Gesellschaft für Mikroskopie und Bildanalyse e.V, Konstanz, Germany. josh@openmicroscopy.org.
Open Microscopy Environment Consortium, Munich, Germany. josh@openmicroscopy.org.

Fabian J Theis (FJ)

Institute of Computational Biology, Helmholtz, Center Munich, Munich, Germany. fabian.theis@helmholtz-munich.de.
TUM School of Life Sciences Weihenstephan, Technical University of Munich, Munich, Germany. fabian.theis@helmholtz-munich.de.
Department of Mathematics, Technical University of Munich, Munich, Germany. fabian.theis@helmholtz-munich.de.
Cellular Genetics Programme, Wellcome Sanger Institute, Cambridge, UK. fabian.theis@helmholtz-munich.de.

Oliver Stegle (O)

European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany. oliver.stegle@embl.de.
Division of Computational Genomics and System Genetics, German Cancer Research Center, Heidelberg, Germany. oliver.stegle@embl.de.
Cellular Genetics Programme, Wellcome Sanger Institute, Cambridge, UK. oliver.stegle@embl.de.

Classifications MeSH