Alignment and integration of spatial transcriptomics data.


Journal

Nature methods
ISSN: 1548-7105
Titre abrégé: Nat Methods
Pays: United States
ID NLM: 101215604

Informations de publication

Date de publication:
05 2022
Historique:
received: 16 07 2021
accepted: 17 03 2022
pubmed: 17 5 2022
medline: 24 5 2022
entrez: 16 5 2022
Statut: ppublish

Résumé

Spatial transcriptomics (ST) measures mRNA expression across thousands of spots from a tissue slice while recording the two-dimensional (2D) coordinates of each spot. We introduce probabilistic alignment of ST experiments (PASTE), a method to align and integrate ST data from multiple adjacent tissue slices. PASTE computes pairwise alignments of slices using an optimal transport formulation that models both transcriptional similarity and physical distances between spots. PASTE further combines pairwise alignments to construct a stacked 3D alignment of a tissue. Alternatively, PASTE can integrate multiple ST slices into a single consensus slice. We show that PASTE accurately aligns spots across adjacent slices in both simulated and real ST data, demonstrating the advantages of using both transcriptional similarity and spatial information. We further show that the PASTE integrated slice improves the identification of cell types and differentially expressed genes compared with existing approaches that either analyze single ST slices or ignore spatial information.

Identifiants

pubmed: 35577957
doi: 10.1038/s41592-022-01459-6
pii: 10.1038/s41592-022-01459-6
pmc: PMC9334025
mid: NIHMS1819659
doi:

Types de publication

Journal Article Research Support, N.I.H., Extramural

Langues

eng

Sous-ensembles de citation

IM

Pagination

567-575

Subventions

Organisme : NHGRI NIH HHS
ID : T32 HG003284
Pays : United States
Organisme : NCI NIH HHS
ID : U24 CA211000
Pays : United States
Organisme : NCI NIH HHS
ID : U24 CA248453
Pays : United States

Commentaires et corrections

Type : CommentIn

Informations de copyright

© 2022. The Author(s), under exclusive licence to Springer Nature America, Inc.

Références

Ståhl, P. L. et al. Visualization and analysis of gene expression in tissue sections by spatial transcriptomics. Science 353, 78–82 (2016).
pubmed: 27365449 doi: 10.1126/science.aaf2403
10x Genomics. Visium spatial gene expression: map the whole transcriptome within the tissue context. https://www.10xgenomics.com/products/spatial-gene-expression/ (accessed October 2020) (2019).
Zhao, E. et al. Spatial transcriptomics at subspot resolution with bayesspace.Nat. Biotechnol. 39, 1375–1384 (2021).
pubmed: 34083791 pmcid: 8763026 doi: 10.1038/s41587-021-00935-2
Berglund, E. et al. Spatial maps of prostate cancer transcriptomes reveal an unexplored landscape of heterogeneity. Nat. Commun. 9, 2419 (2018).
pubmed: 29925878 pmcid: 6010471 doi: 10.1038/s41467-018-04724-5
Thrane, K., Eriksson, H., Maaskola, J., Hansson, J. & Lundeberg, J. Spatially resolved transcriptomics enables dissection of genetic heterogeneity in stage iii cutaneous malignant melanoma. Cancer Res. 78, 5970–5979 (2018).
pubmed: 30154148
Moncada, R. et al. Integrating microarray-based spatial transcriptomics and single-cell RNA-seq reveals tissue architecture in pancreatic ductal adenocarcinomas. Nat. Biotechnol. 38, 333–342 (2020).
pubmed: 31932730 doi: 10.1038/s41587-019-0392-8
Ji, A. et al. Multimodal analysis of composition and spatial architecture in human squamous cell carcinoma. Cell 182, 1661–1662 (2020).
pubmed: 32946785 pmcid: 7505493 doi: 10.1016/j.cell.2020.08.043
Chen, W.-T. et al. Spatial transcriptomics and in situ sequencing to study Alzheimer’s disease. Cell 182, 976–991.e19 (2020).
pubmed: 32702314
Lundmark, A. et al. Gene expression profiling of periodontitis-affected gingival tissue by spatial transcriptomics. Sci. Rep. 8, 9370 (2018).
Asp, M. et al. Spatial detection of fetal marker genes expressed at low level in adult human heart tissue. Sci. Rep. 7, 12941 (2017).
pubmed: 29021611 pmcid: 5636908 doi: 10.1038/s41598-017-13462-5
Maniatis, S. et al. Spatiotemporal dynamics of molecular pathology in amyotrophic lateral sclerosis. Science 364, 89–93 (2019).
pubmed: 30948552 doi: 10.1126/science.aav9776
Maynard, K. R. et al. Transcriptome-scale spatial gene expression in the human dorsolateral prefrontal cortex. Nat. Neurosci. 24, 425–436 (2021).
pubmed: 33558695 pmcid: 8095368 doi: 10.1038/s41593-020-00787-0
Liu, R. et al. Modeling spatial correlation of transcripts with application to developing pancreas. Sci. Rep. 9, 5592 (2019).
Svensson, V., Teichmann, S. A. & Stegle, O. SpatialDE: identification of spatially variable genes. Nat. Meth. 15, 343 (2018).
doi: 10.1038/nmeth.4636
Arnol, D., Schapiro, D., Bodenmiller, B., Saez-Rodriguez, J. & Stegle, O. Modeling cell-cell interactions from spatial molecular data with spatial variance component analysis. Cell Rep. 29, 202–211 (2019).
pubmed: 31577949 pmcid: 6899515 doi: 10.1016/j.celrep.2019.08.077
Cang, Z. & Nie, Q. Inferring spatial and signaling relationships between cells from single cell transcriptomic data. Nat. Commun. 11, 2084 (2020).
pubmed: 32350282 pmcid: 7190659 doi: 10.1038/s41467-020-15968-5
Ji, N. & Oudenaarden, A. Single-molecule fluorescent in situ hybridization (smFISH) of C. elegans worms and embryos. In WormBook: The Online Review of C. elegans Biology (ed. WormBook) 1–16 (The C. elegans Research Community, 2012).
Eng, C.-H. L. et al. Transcriptome-scale super-resolved imaging in tissues by RNA seqFISH+. Nature 568, 235–239 (2019).
pubmed: 30911168 pmcid: 6544023 doi: 10.1038/s41586-019-1049-y
Wang, X. et al. Three-dimensional intact-tissue sequencing of single-cell transcriptional states. Science 361, eaat5691 (2018).
pubmed: 29930089 pmcid: 6339868 doi: 10.1126/science.aat5691
Stickels, R. R. et al. Highly sensitive spatial transcriptomics at near-cellular resolution with slide-seqv2. Nat. Biotechnol. 39, 313–319 (2021).
pubmed: 33288904 doi: 10.1038/s41587-020-0739-1
Elosua-Bayes, M., Nieto, P., Mereu, E., Gut, I. & Heyn, H. SPOTlight: seeded NMF regression to deconvolute spatial transcriptomics spots with single-cell transcriptomes. Nucleic Acids Res. 49, e50–e50 (2021).
pubmed: 33544846 pmcid: 8136778 doi: 10.1093/nar/gkab043
Bergenstråhle, J., Larsson, L. & Lundeberg, J. Seamless integration of image and molecular analysis for spatial transcriptomics workflows. BMC Genomics 21, 482 (2020).
pubmed: 32664861 pmcid: 7386244 doi: 10.1186/s12864-020-06832-3
Äijö, T. et al. Splotch: robust estimation of aligned spatial temporal gene expression data. Preprint at bioRxiv https://doi.org/10.1101/757096 (2019).
Haghverdi, L., Lun, A. T., Morgan, M. D. & Marioni, J. C. Batch effects in single-cell RNA-sequencing data are corrected by matching mutual nearest neighbors. Nat. Biotechnol. 36, 421–427 (2018).
pubmed: 29608177 pmcid: 6152897 doi: 10.1038/nbt.4091
Stuart, T. et al. Comprehensive integration of single-cell data. Cell 177, 1888–1902.e21 (2019).
pubmed: 31178118 pmcid: 6687398 doi: 10.1016/j.cell.2019.05.031
Hie, B., Bryson, B. & Berger, B. Efficient integration of heterogeneous single-cell transcriptomes using scanorama. Nat. Biotechnol. 37, 685–691 (2019).
pubmed: 31061482 pmcid: 6551256 doi: 10.1038/s41587-019-0113-3
Mandric, I., Hill, B. L., Freund, M. K., Thompson, M. & Halperin, E. Batman: fast and accurate integration of single-cell RNA-seq datasets via minimum-weight matching. iScience 23, 101185 (2020).
pubmed: 32504875 pmcid: 7276436 doi: 10.1016/j.isci.2020.101185
Demetci, P., Santorella, R., Sandstede, B., Noble, W. S. & Singh, R. Gromov-Wasserstein optimal transport to align single-cell multi-omics data. Preprint at bioRxiv https://doi.org/10.1101/2020.04.28.066787 (2020).
Tran, H. T. N. et al. A benchmark of batch-effect correction methods for single-cell RNA sequencing data. Genome Biol. 21, 12 (2020).
pubmed: 31948481 pmcid: 6964114 doi: 10.1186/s13059-019-1850-9
Titouan, V., Courty, N., Tavenard, R. & Flamary, R. Optimal transport for structured data with application on graphs. In International Conference on Machine Learning (eds Chaudhuri, K. & Salakhutdinov, R.) 6275–6284 (PMLR, 2019).
Lee, D. & Seung, H. Algorithms for non-negative matrix factorization. In Advances in Neural Information Processing Systems 13: Proceedings of the 2000 Conference, NIPS 2000 (Neural Information Processing Systems Foundation, 2001).
Shao, C. & Höfer, T. Robust classification of single-cell transcriptome data by nonnegative matrix factorization. Bioinformatics 33, 235–242 (2016).
pubmed: 27663498 doi: 10.1093/bioinformatics/btw607
Zhu, X., Ching, T., Pan, X., Weissman, S. M. & Garmire, L. Detecting heterogeneity in single-cell RNA-seq data by non-negative matrix factorization. PeerJ 5, e2888 (2017).
pubmed: 28133571 pmcid: 5251935 doi: 10.7717/peerj.2888
Elyanow, R. et al. STARCH: copy number and clone inference from spatial transcriptomics data.Phys. Biol. 18, 035001 (2021).
pubmed: 33022659 doi: 10.1088/1478-3975/abbe99
O’Neill, R. et al. Indices of landscape pattern. Landsc. Ecol. 1, 153–162 (1988).
doi: 10.1007/BF00162741
Maniatis, S. et al. Spatiotemporal dynamics of molecular pathology in amyotrophic lateral sclerosis. Science 364, 89–93 (2019).
pubmed: 30948552 doi: 10.1126/science.aav9776
Andersson, A. et al. Spatial deconvolution of her2-positive breast cancer delineates tumor-associated cell type interactions. Nat. Commun. 12, 6012 (2021).
pubmed: 34650042 pmcid: 8516894 doi: 10.1038/s41467-021-26271-2
Biancalani, T. et al. Deep learning and alignment of spatially resolved single-cell transcriptomes with Tangram.Nat. Methods 18, 1352–1362 (2021).
pubmed: 34711971 pmcid: 8566243 doi: 10.1038/s41592-021-01264-7
Yoosuf, N., Navarro, J., Salmén, F., Ståhl, P. L. & Daub, C. O. Identification and transfer of spatial transcriptomics signatures for cancer diagnosis. Breast Cancer Res. 22, 6 (2020).
pubmed: 31931856 pmcid: 6958738 doi: 10.1186/s13058-019-1242-9
Brown, L. G. A survey of image registration techniques. ACM Comput. Surv. 24, 325–376 (1992).
doi: 10.1145/146370.146374
Fatras, K., Zine, Y., Flamary, R., Gribonval, R. & Courty, N. Learning with minibatch Wasserstein: asymptotic and gradient properties. In AISTATS, 2131–2141 http://proceedings.mlr.press/v108/fatras20a.html (2020).
Feydy, J. et al. Interpolating between optimal transport and mmd using sinkhorn divergences. In The 22nd International Conference on Artificial Intelligence and Statistics, 2681–2690 (2019).
Marx, V. Method of the year: spatially resolved transcriptomics. Nat. Methods 18, 9–14 (2021).
pubmed: 33408395 doi: 10.1038/s41592-020-01033-y
Larsson, L., Frisén, J. & Lundeberg, J. Spatially resolved transcriptomics adds a new dimension to genomics. Nat. Methods 18, 15–18 (2021).
pubmed: 33408402 doi: 10.1038/s41592-020-01038-7
Wahba, G. A least squares estimate of satellite attitude. SIAM Rev. 7, 409–409 (1965).
doi: 10.1137/1007077
Kabsch, W. A solution for the best rotation to relate two sets of vectors. Acta Crystallogr. A 32, 922–923 (1976).
doi: 10.1107/S0567739476001873
Lin, P., Troup, M. & Ho, J. W. K. Cidr: Ultrafast and accurate clustering through imputation for single-cell RNA-seq data. Genome Biology 18, 59 (2017).
pubmed: 28351406 pmcid: 5371246 doi: 10.1186/s13059-017-1188-0
Mongia, A., Sengupta, D. & Majumdar, A. Mcimpute: matrix completion based imputation for single cell RNA-seq data. Frontiers in Genetics 10, 9 (2019).
pubmed: 30761179 pmcid: 6361810 doi: 10.3389/fgene.2019.00009
Hou, W., Ji, Z., Ji, H. & Hicks, S. C. A systematic evaluation of single-cell RNA-sequencing imputation methods. Genome Biology 21, 218 (2020).
pubmed: 32854757 pmcid: 7450705 doi: 10.1186/s13059-020-02132-x
Févotte, C. & Cemgil, A. T. Nonnegative matrix factorizations as probabilistic inference in composite models. In 2009 17th European Signal Processing Conference, 1913–1917 (IEEE, 2009).
Durif, G., Modolo, L., Mold, J. E., Lambert-Lacroix, S. & Picard, F. Probabilistic count matrix factorization for single cell expression data analysis. Bioinformatics 35, 4011–4019 (2019).
pubmed: 30865271 doi: 10.1093/bioinformatics/btz177
Townes, F. W., Hicks, S. C., Aryee, M. J. & Irizarry, R. A. Feature selection and dimension reduction for single-cell RNA-seq based on a multinomial model. Genome Biol. 20, 295 (2019).
pubmed: 31870412 pmcid: 6927135 doi: 10.1186/s13059-019-1861-6
Elyanow, R., Dumitrascu, B., Engelhardt, B. E. & Raphael, B. J. netnmf-sc: leveraging gene-gene interactions for imputation and dimensionality reduction in single-cell expression analysis. Genome Res.30, 195–204 (2020).
pubmed: 31992614 pmcid: 7050525 doi: 10.1101/gr.251603.119
Wolf, F. A., Angerer, P. & Theis, F. J. Scanpy: large-scale single-cell gene expression data analysis. Genome Biology 19, 15 (2018).
pubmed: 29409532 pmcid: 5802054 doi: 10.1186/s13059-017-1382-0
Flamary, R. & Courty, N. Pot Python Optimal Transport Library https://pythonot.github.io/ (2017).
Sun, S., Zhu, J., Ma, Y. & Zhou, X. Accuracy, robustness and scalability of dimensionality reduction methods for single-cell RNA-seq analysis. Genome Biol. 20, 269 (2019).
pubmed: 31823809 pmcid: 6902413 doi: 10.1186/s13059-019-1898-6
Chen, M. & Zhou, X. Viper: variability-preserving imputation for accurate gene expression recovery in single-cell RNA sequencing studies. Genome Biol. 19, 196 (2018).
pubmed: 30419955 pmcid: 6233584 doi: 10.1186/s13059-018-1575-1

Auteurs

Ron Zeira (R)

Department of Computer Science, Princeton University, Princeton, NJ, USA.

Max Land (M)

Department of Computer Science, Princeton University, Princeton, NJ, USA.

Alexander Strzalkowski (A)

Department of Computer Science, Princeton University, Princeton, NJ, USA.

Benjamin J Raphael (BJ)

Department of Computer Science, Princeton University, Princeton, NJ, USA. braphael@princeton.edu.

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages
1.00
Humans Magnetic Resonance Imaging Brain Infant, Newborn Infant, Premature
Arabidopsis Arabidopsis Proteins Osmotic Pressure Cytoplasm RNA, Messenger
Animals Lung India Sheep Transcriptome

Classifications MeSH