Accurate estimation of cell composition in bulk expression through robust integration of single-cell information.


Journal

Nature communications
ISSN: 2041-1723
Titre abrégé: Nat Commun
Pays: England
ID NLM: 101528555

Informations de publication

Date de publication:
24 04 2020
Historique:
received: 03 06 2019
accepted: 25 03 2020
entrez: 26 4 2020
pubmed: 26 4 2020
medline: 4 8 2020
Statut: epublish

Résumé

We present Bisque, a tool for estimating cell type proportions in bulk expression. Bisque implements a regression-based approach that utilizes single-cell RNA-seq (scRNA-seq) or single-nucleus RNA-seq (snRNA-seq) data to generate a reference expression profile and learn gene-specific bulk expression transformations to robustly decompose RNA-seq data. These transformations significantly improve decomposition performance compared to existing methods when there is significant technical variation in the generation of the reference profile and observed bulk expression. Importantly, compared to existing methods, our approach is extremely efficient, making it suitable for the analysis of large genomic datasets that are becoming ubiquitous. When applied to subcutaneous adipose and dorsolateral prefrontal cortex expression datasets with both bulk RNA-seq and snRNA-seq data, Bisque replicates previously reported associations between cell type proportions and measured phenotypes across abundant and rare cell types. We further propose an additional mode of operation that merely requires a set of known marker genes.

Identifiants

pubmed: 32332754
doi: 10.1038/s41467-020-15816-6
pii: 10.1038/s41467-020-15816-6
pmc: PMC7181686
doi:

Substances chimiques

RNA, Small Cytoplasmic 0

Types de publication

Journal Article Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov't Research Support, U.S. Gov't, Non-P.H.S.

Langues

eng

Sous-ensembles de citation

IM

Pagination

1971

Subventions

Organisme : NHGRI NIH HHS
ID : R01 HG010505
Pays : United States
Organisme : NHLBI NIH HHS
ID : P01 HL028481
Pays : United States
Organisme : NIDDK NIH HHS
ID : U01 DK105561
Pays : United States
Organisme : NHLBI NIH HHS
ID : R01 HL095056
Pays : United States
Organisme : Howard Hughes Medical Institute
Pays : United States
Organisme : NHLBI NIH HHS
ID : F31 HL142180
Pays : United States

Commentaires et corrections

Type : ErratumIn

Références

Tomczak, K., Czerwińska, P. & Wiznerowicz, M. The Cancer Genome Atlas (TCGA): an immeasurable source of knowledge. Contemp. Oncol. 19, A68–A77 (2015).
GTEx Consortium. Human genomics. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans. Science 348, 648–660 (2015).
doi: 10.1126/science.1262110
Bruning, O. et al. Confounding factors in the transcriptome analysis of an in-vivo exposure experiment. PLoS ONE 11, e0145252 (2016).
doi: 10.1371/journal.pone.0145252
Fridman, W. H., Pagès, F., Sautès-Fridman, C. & Galon, J. The immune contexture in human tumours: impact on clinical outcome. Nat. Rev. Cancer 12, 298–306 (2012).
doi: 10.1038/nrc3245
Rahier, J., Goebbels, R. M. & Henquin, J. C. Cellular composition of the human diabetic pancreas. Diabetologia 24, 366–371 (1983).
Shen-Orr, S. S. et al. Cell type–specific gene expression differences in complex tissues. Nat. Methods 7, 287–289 (2010).
doi: 10.1038/nmeth.1439
Westra, H.-J. et al. Cell specific eQTL analysis without sorting cells. PLoS Genet. 11, e1005223 (2015).
doi: 10.1371/journal.pgen.1005223
Hu, P., Zhang, W., Xin, H. & Deng, G. Single cell isolation and analysis. Front. Cell Dev. Biol. 4, 116 (2016).
Zheng, G. X. Y. et al. Massively parallel digital transcriptional profiling of single cells. Nat. Commun. 8, 14049 (2017).
doi: 10.1038/ncomms14049
Tasic, B. et al. Shared and distinct transcriptomic cell types across neocortical areas. Nature 563, 72–78 (2018).
doi: 10.1038/s41586-018-0654-5
Macosko, E. Z. et al. Highly parallel genome-wide expression profiling of individual cells using nanoliter droplets. Cell 161, 1202–1214 (2015).
doi: 10.1016/j.cell.2015.05.002
Wang, J. et al. Gene expression distribution deconvolution in single-cell RNA sequencing. Proc. Natl Acad. Sci. USA 115, E6437–E6446 (2018).
doi: 10.1073/pnas.1721085115
Mohammadi, S., Zuckerman, N., Goldsmith, A. & Grama, A. A critical survey of deconvolution methods for separating cell types in complex tissues. Proc. IEEE 105, 340–366 (2017).
doi: 10.1109/JPROC.2016.2607121
Newman, A. M. et al. Robust enumeration of cell subsets from tissue expression profiles. Nat. Methods 12, 453–457 (2015).
doi: 10.1038/nmeth.3337
Baron, M. et al. A Single-cell transcriptomic map of the human and mouse pancreas reveals inter- and intra-cell population structure. Cell Syst. 3, 346–360.e4 (2016).
doi: 10.1016/j.cels.2016.08.011
Wang, X., Park, J., Susztak, K., Zhang, N. R. & Li, M. Bulk tissue cell type deconvolution with multi-subject single-cell expression reference. Nat. Commun. 10, 380 (2019).
doi: 10.1038/s41467-018-08023-x
Ziegenhain, C. et al. Comparative analysis of single-cell RNA sequencing methods. Mol. Cell 65, 631–643.e4 (2017).
doi: 10.1016/j.molcel.2017.01.023
La Manno, G. et al. RNA velocity of single cells. Nature 560, 494–498 (2018).
doi: 10.1038/s41586-018-0414-6
Esteve Ràfols, M. Adipose tissue: cell heterogeneity and functional diversity. Endocrinol. Nutr. 61, 100–112 (2014).
doi: 10.1016/j.endonu.2013.03.011
Newman, A. M. et al. Determining cell type abundance and expression from bulk tissues with digital cytometry. Nat. Biotechnol. https://doi.org/10.1038/s41587-019-0114-2 (2019).
Rosen, E. D. & Spiegelman, B. M. What we talk about when we talk about fat. Cell 156, 20–44 (2014).
doi: 10.1016/j.cell.2013.12.012
Glastonbury, C. A., Alves, A. C., Moustafa, J. E.-S. & Small, K. S. Cell-type heterogeneity in adipose tissue is associated with complex traits and reveals disease-relevant cell-specific eQTLs. Am. J. Hum. Genet. 104, 1013–1024 (2019).
Spalding, K. L. et al. Dynamics of fat cell turnover in humans. Nature 453, 783–787 (2008).
doi: 10.1038/nature06902
Weisberg, S. P. et al. Obesity is associated with macrophage accumulation in adipose tissue. J. Clin. Investig. 112, 1796–1808 (2003).
doi: 10.1172/JCI200319246
McLaughlin, T. et al. T-cell profile in adipose tissue is associated with insulin resistance and systemic inflammation in humans. Arterioscler. Thromb. Vasc. Biol. 34, 2637–2643 (2014).
doi: 10.1161/ATVBAHA.114.304636
Gutch, M., Kumar, S., Razi, S., Gupta, K. & Gupta, A. Assessment of insulin sensitivity/resistance. Indian J. Endocrinol. Metab. 19, 160–164 (2015).
doi: 10.4103/2230-8210.146874
Mostafavi, S. et al. A molecular network of the aging human brain provides insights into the pathology and cognitive decline of Alzheimer’s disease. Nat. Neurosci. 21, 811–819 (2018).
doi: 10.1038/s41593-018-0154-9
Patrick, E. et al. Deconvolving the contributions of cell-type heterogeneity on cortical gene expression. Preprint at https://biorxiv.org/content/10.1101/566307v1 .
Yankner, B. A. Mechanisms of neuronal degeneration in Alzheimer’s disease. Neuron 16, 921–932 (1996).
doi: 10.1016/S0896-6273(00)80115-4
Hansen, D. V., Hanson, J. E. & Sheng, M. Microglia in Alzheimer’s disease. J. Cell Biol. 217, 459–472 (2018).
doi: 10.1083/jcb.201709069
Navarro, V. et al. Microglia in Alzheimer’s disease: activated, dysfunctional or degenerative. Front. Aging Neurosci. 10, 140 (2018).
Schelker, M. et al. Estimation of immune cell content in tumour tissue using single-cell RNA-seq data. Nat. Commun. 8, 2032 (2017).
doi: 10.1038/s41467-017-02289-3
Nguyen, Q. H., Pervolarakis, N., Nee, K. & Kessenbrock, K. Experimental considerations for single-cell RNA sequencing approaches. Front. Cell Dev. Biol. 6, 108 (2018).
doi: 10.3389/fcell.2018.00108
Wu, H., Kirita, Y., Donnelly, E. L. & Humphreys, B. D. Advantages of single-nucleus over single-cell RNA sequencing of adult kidney: rare cell types and novel cell states revealed in fibrosis. J. Am. Soc. Nephrol. 30, 23–32 (2019).
doi: 10.1681/ASN.2018090912
Bakken, T. E. et al. Single-nucleus and single-cell transcriptomes compared in matched cortical cell types. PLoS ONE 13, e0209648 (2018).
doi: 10.1371/journal.pone.0209648
Butler, A., Hoffman, P., Smibert, P., Papalexi, E. & Satija, R. Integrating single-cell transcriptomic data across different conditions, technologies, and species. Nat. Biotechnol. 36, 411–420 (2018).
doi: 10.1038/nbt.4096
Hafemeister, C. & Satija, R. Normalization and variance stabilization of single-cell RNA-seq data using regularized negative binomial regression. Genome Biol. 20, 296 (2019).
McInnes, L., Healy, J., Saul, N. & Großberger, L. UMAP: uniform manifold approximation and projection. J. Open Source Softw. 3, 861 (2018).
doi: 10.21105/joss.00861
Franzén, O., Gan, L.-M. & Björkegren, J. L. M. PanglaoDB: a web server for exploration of mouse and human single-cell RNA sequencing data. Database 2019, baz046 (2019).

Auteurs

Brandon Jew (B)

Bioinformatics Interdepartmental Program, UCLA, Los Angeles, CA, 90095, USA.

Marcus Alvarez (M)

Department of Human Genetics, David Geffen School of Medicine at UCLA, Los Angeles, CA, 90095, USA.

Elior Rahmani (E)

Department of Computer Science, School of Engineering, UCLA, Los Angeles, CA, 90095, USA.

Zong Miao (Z)

Bioinformatics Interdepartmental Program, UCLA, Los Angeles, CA, 90095, USA.
Department of Human Genetics, David Geffen School of Medicine at UCLA, Los Angeles, CA, 90095, USA.

Arthur Ko (A)

Department of Human Genetics, David Geffen School of Medicine at UCLA, Los Angeles, CA, 90095, USA.

Kristina M Garske (KM)

Department of Human Genetics, David Geffen School of Medicine at UCLA, Los Angeles, CA, 90095, USA.

Jae Hoon Sul (JH)

Bioinformatics Interdepartmental Program, UCLA, Los Angeles, CA, 90095, USA.
Department of Psychiatry and Biobehavioral Sciences, UCLA, Los Angeles, CA, 90095, USA.

Kirsi H Pietiläinen (KH)

Obesity Research Unit, Research Program for Clinical and Molecular Metabolism, University of Helsinki, Helsinki, 00014, Finland.
Obesity Center, Endocrinology Abdominal Center, Helsinki University Central Hospital and University of Helsinki, Helsinki, 00260, Finland.

Päivi Pajukanta (P)

Bioinformatics Interdepartmental Program, UCLA, Los Angeles, CA, 90095, USA. ppajukanta@mednet.ucla.edu.
Department of Human Genetics, David Geffen School of Medicine at UCLA, Los Angeles, CA, 90095, USA. ppajukanta@mednet.ucla.edu.
Institute for Precision Health, School of Medicine, UCLA, Los Angeles, CA, 90095, USA. ppajukanta@mednet.ucla.edu.

Eran Halperin (E)

Department of Human Genetics, David Geffen School of Medicine at UCLA, Los Angeles, CA, 90095, USA. ehalperin@cs.ucla.edu.
Department of Computer Science, School of Engineering, UCLA, Los Angeles, CA, 90095, USA. ehalperin@cs.ucla.edu.
Institute for Precision Health, School of Medicine, UCLA, Los Angeles, CA, 90095, USA. ehalperin@cs.ucla.edu.
Department of Anesthesiology, UCLA Health, Los Angeles, CA, 90095, USA. ehalperin@cs.ucla.edu.
Department of Computational Medicine, School of Medicine, UCLA, Los Angeles, CA, 90095, USA. ehalperin@cs.ucla.edu.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH