Effective methods for bulk RNA-seq deconvolution using scnRNA-seq transcriptomes.


Journal

Genome biology
ISSN: 1474-760X
Titre abrégé: Genome Biol
Pays: England
ID NLM: 100960660

Informations de publication

Date de publication:
01 08 2023
Historique:
received: 11 01 2023
accepted: 17 07 2023
medline: 3 8 2023
pubmed: 2 8 2023
entrez: 1 8 2023
Statut: epublish

Résumé

RNA profiling technologies at single-cell resolutions, including single-cell and single-nuclei RNA sequencing (scRNA-seq and snRNA-seq, scnRNA-seq for short), can help characterize the composition of tissues and reveal cells that influence key functions in both healthy and disease tissues. However, the use of these technologies is operationally challenging because of high costs and stringent sample-collection requirements. Computational deconvolution methods that infer the composition of bulk-profiled samples using scnRNA-seq-characterized cell types can broaden scnRNA-seq applications, but their effectiveness remains controversial. We produced the first systematic evaluation of deconvolution methods on datasets with either known or scnRNA-seq-estimated compositions. Our analyses revealed biases that are common to scnRNA-seq 10X Genomics assays and illustrated the importance of accurate and properly controlled data preprocessing and method selection and optimization. Moreover, our results suggested that concurrent RNA-seq and scnRNA-seq profiles can help improve the accuracy of both scnRNA-seq preprocessing and the deconvolution methods that employ them. Indeed, our proposed method, Single-cell RNA Quantity Informed Deconvolution (SQUID), which combines RNA-seq transformation and dampened weighted least-squares deconvolution approaches, consistently outperformed other methods in predicting the composition of cell mixtures and tissue samples. We showed that analysis of concurrent RNA-seq and scnRNA-seq profiles with SQUID can produce accurate cell-type abundance estimates and that this accuracy improvement was necessary for identifying outcomes-predictive cancer cell subclones in pediatric acute myeloid leukemia and neuroblastoma datasets. These results suggest that deconvolution accuracy improvements are vital to enabling its applications in the life sciences.

Sections du résumé

BACKGROUND
RNA profiling technologies at single-cell resolutions, including single-cell and single-nuclei RNA sequencing (scRNA-seq and snRNA-seq, scnRNA-seq for short), can help characterize the composition of tissues and reveal cells that influence key functions in both healthy and disease tissues. However, the use of these technologies is operationally challenging because of high costs and stringent sample-collection requirements. Computational deconvolution methods that infer the composition of bulk-profiled samples using scnRNA-seq-characterized cell types can broaden scnRNA-seq applications, but their effectiveness remains controversial.
RESULTS
We produced the first systematic evaluation of deconvolution methods on datasets with either known or scnRNA-seq-estimated compositions. Our analyses revealed biases that are common to scnRNA-seq 10X Genomics assays and illustrated the importance of accurate and properly controlled data preprocessing and method selection and optimization. Moreover, our results suggested that concurrent RNA-seq and scnRNA-seq profiles can help improve the accuracy of both scnRNA-seq preprocessing and the deconvolution methods that employ them. Indeed, our proposed method, Single-cell RNA Quantity Informed Deconvolution (SQUID), which combines RNA-seq transformation and dampened weighted least-squares deconvolution approaches, consistently outperformed other methods in predicting the composition of cell mixtures and tissue samples.
CONCLUSIONS
We showed that analysis of concurrent RNA-seq and scnRNA-seq profiles with SQUID can produce accurate cell-type abundance estimates and that this accuracy improvement was necessary for identifying outcomes-predictive cancer cell subclones in pediatric acute myeloid leukemia and neuroblastoma datasets. These results suggest that deconvolution accuracy improvements are vital to enabling its applications in the life sciences.

Identifiants

pubmed: 37528411
doi: 10.1186/s13059-023-03016-6
pii: 10.1186/s13059-023-03016-6
pmc: PMC10394903
doi:

Substances chimiques

RNA, Small Interfering 0

Types de publication

Journal Article Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

177

Subventions

Organisme : NEI NIH HHS
ID : P30 EY002520
Pays : United States
Organisme : NIH HHS
ID : S10 OD025240
Pays : United States
Organisme : NIH HHS
ID : S10 OD023469
Pays : United States
Organisme : NCI NIH HHS
ID : R21 CA223140
Pays : United States
Organisme : NIH HHS
ID : S10 OD018033
Pays : United States

Informations de copyright

© 2023. The Author(s).

Références

Elife. 2017 Nov 13;6:
pubmed: 29130882
Oncotarget. 2015 Nov 10;6(35):38005-15
pubmed: 26461224
Genome Biol. 2021 Apr 12;22(1):102
pubmed: 33845875
Nat Genet. 2021 May;53(5):683-693
pubmed: 33767450
Nat Methods. 2015 May;12(5):453-7
pubmed: 25822800
Nat Commun. 2022 Aug 6;13(1):4587
pubmed: 35933466
Cell. 2020 Sep 3;182(5):1232-1251.e22
pubmed: 32822576
Nat Genet. 2021 Sep;53(9):1334-1347
pubmed: 34493872
Cell Rep. 2018 Apr 3;23(1):297-312.e12
pubmed: 29617668
Science. 2018 Dec 7;362(6419):1165-1170
pubmed: 30523111
Nat Commun. 2020 Nov 6;11(1):5650
pubmed: 33159064
Nature. 2017 Oct 18;550(7677):451-453
pubmed: 29072289
Leukemia. 2018 Oct;32(10):2167-2177
pubmed: 29550834
Nat Commun. 2022 Mar 31;13(1):1714
pubmed: 35361816
Nature. 2016 Sep 1;537(7618):102-106
pubmed: 27556950
Gene. 2019 Apr 15;692:35-43
pubmed: 30639603
Nat Commun. 2017 Dec 11;8(1):2032
pubmed: 29230012
Genome Med. 2019 May 24;11(1):34
pubmed: 31126321
Nat Genet. 2013 Mar;45(3):279-84
pubmed: 23334666
Genome Biol. 2020 Jun 2;21(1):130
pubmed: 32487174
Nat Biotechnol. 2021 Nov;39(11):1453-1465
pubmed: 34140680
Science. 2014 Jun 20;344(6190):1396-401
pubmed: 24925914
Science. 2018 May 18;360(6390):758-763
pubmed: 29622724
J Hepatol. 2021 Dec;75(6):1397-1408
pubmed: 34216724
Cancer Cell. 2017 Oct 9;32(4):427-443.e8
pubmed: 29017055
Nature. 2021 Apr;592(7854):450-456
pubmed: 33762733
Nat Commun. 2020 Apr 24;11(1):1971
pubmed: 32332754
Nat Genet. 2015 Aug;47(8):864-71
pubmed: 26121087
Nat Commun. 2019 Jan 22;10(1):380
pubmed: 30670690
Dev Cell. 2015 Oct 12;35(1):49-62
pubmed: 26460946
PLoS Comput Biol. 2019 May 6;15(5):e1006976
pubmed: 31059559
Nat Med. 2016 Aug;22(8):861-8
pubmed: 27376578
Nat Commun. 2019 Jul 5;10(1):2975
pubmed: 31278265
Proc Natl Acad Sci U S A. 2015 Jun 9;112(23):7285-90
pubmed: 26060301
Methods Mol Biol. 2018;1711:243-259
pubmed: 29344893
Nat Med. 2018 Jan;24(1):103-112
pubmed: 29227476
Nat Commun. 2022 Mar 15;13(1):1358
pubmed: 35292647
Genome Res. 2021 Oct;31(10):1913-1926
pubmed: 34548323
Mol Neurobiol. 2021 Mar;58(3):1006-1016
pubmed: 33078370
Nature. 2015 Oct 29;526(7575):715-8
pubmed: 26503053
Cell Rep. 2019 Feb 5;26(6):1627-1640.e7
pubmed: 30726743
Genome Biol. 2023 Aug 1;24(1):177
pubmed: 37528411
Methods Mol Biol. 2020;2117:135-157
pubmed: 31960376

Auteurs

Francisco Avila Cobos (FA)

Department of Biomolecular Medicine, Ghent University, Ghent, Belgium; Cancer Research Institute Ghent, Ghent, Belgium.

Mohammad Javad Najaf Panah (MJN)

Department of Pediatrics, Baylor College of Medicine, Texas Children's Hospital Cancer Center, Houston, TX, USA.

Jessica Epps (J)

Department of Pediatrics, Baylor College of Medicine, Texas Children's Hospital Cancer Center, Houston, TX, USA.

Xiaochen Long (X)

Department of Pediatrics, Baylor College of Medicine, Texas Children's Hospital Cancer Center, Houston, TX, USA.
Department of Statistics, Rice University, Houston, TX, 77251, USA.

Tsz-Kwong Man (TK)

Department of Pediatrics, Baylor College of Medicine, Texas Children's Hospital Cancer Center, Houston, TX, USA.

Hua-Sheng Chiu (HS)

Department of Pediatrics, Baylor College of Medicine, Texas Children's Hospital Cancer Center, Houston, TX, USA.

Elad Chomsky (E)

, ImmunAi, New York, NY, USA.

Evgeny Kiner (E)

, ImmunAi, New York, NY, USA.

Michael J Krueger (MJ)

Department of Pediatrics, Baylor College of Medicine, Texas Children's Hospital Cancer Center, Houston, TX, USA.

Diego di Bernardo (D)

Department Chemical, Materials and Industrial Engineering, Telethon Institute of Genetics and Medicine, University of Naples "Federico II", Via Campi Flegrei 34, 80078, Naples, Pozzuoli, Italy.

Luis Voloch (L)

, ImmunAi, New York, NY, USA.

Jan Molenaar (J)

Princess Maxima Center for Pediatric Oncology, Utrecht, The Netherlands.

Sander R van Hooff (SR)

Princess Maxima Center for Pediatric Oncology, Utrecht, The Netherlands.

Frank Westermann (F)

German Cancer Research Center, DKFZ, Heidelberg, Germany.

Selina Jansky (S)

German Cancer Research Center, DKFZ, Heidelberg, Germany.

Michele L Redell (ML)

Department of Pediatrics, Baylor College of Medicine, Texas Children's Hospital Cancer Center, Houston, TX, USA.

Pieter Mestdagh (P)

Department of Biomolecular Medicine, Ghent University, Ghent, Belgium; Cancer Research Institute Ghent, Ghent, Belgium. pieter.mestdagh@ugent.be.

Pavel Sumazin (P)

Department of Pediatrics, Baylor College of Medicine, Texas Children's Hospital Cancer Center, Houston, TX, USA. sumazin@bcm.edu.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH