Conifer: clonal tree inference for tumor heterogeneity with single-cell and bulk sequencing data.

Bayesian nonparametric model Bulk sequencing Clonal tree Heterogeneity of tumor Single-cell sequencing

Journal

BMC bioinformatics
ISSN: 1471-2105
Titre abrégé: BMC Bioinformatics
Pays: England
ID NLM: 100965194

Informations de publication

Date de publication:
30 Aug 2021
Historique:
received: 21 02 2021
accepted: 16 08 2021
entrez: 31 8 2021
pubmed: 1 9 2021
medline: 7 9 2021
Statut: epublish

Résumé

Genetic heterogeneity of a cancer tumor that develops during clonal evolution is one of the reasons for cancer treatment failure, by increasing the chance of drug resistance. Clones are cell populations with different genotypes, resulting from differences in somatic mutations that occur and accumulate during cancer development. An appropriate approach for identifying clones is determining the variant allele frequency of mutations that occurred in the tumor. Although bulk sequencing data can be used to provide that information, the frequencies are not informative enough for identifying different clones with the same prevalence and their evolutionary relationships. On the other hand, single-cell sequencing data provides valuable information about branching events in the evolution of a cancerous tumor. However, the temporal order of mutations may be determined with ambiguities using only single-cell data, while variant allele frequencies from bulk sequencing data can provide beneficial information for inferring the temporal order of mutations with fewer ambiguities. In this study, a new method called Conifer (ClONal tree Inference For hEterogeneity of tumoR) is proposed which combines aggregated variant allele frequency from bulk sequencing data with branching event information from single-cell sequencing data to more accurately identify clones and their evolutionary relationships. It is proven that the accuracy of clone identification and clonal tree inference is increased by using Conifer compared to other existing methods on various sets of simulated data. In addition, it is discussed that the evolutionary tree provided by Conifer on real cancer data sets is highly consistent with information in both bulk and single-cell data. In this study, we have provided an accurate and robust method to identify clones of tumor heterogeneity and their evolutionary history by combining single-cell and bulk sequencing data.

Sections du résumé

BACKGROUND BACKGROUND
Genetic heterogeneity of a cancer tumor that develops during clonal evolution is one of the reasons for cancer treatment failure, by increasing the chance of drug resistance. Clones are cell populations with different genotypes, resulting from differences in somatic mutations that occur and accumulate during cancer development. An appropriate approach for identifying clones is determining the variant allele frequency of mutations that occurred in the tumor. Although bulk sequencing data can be used to provide that information, the frequencies are not informative enough for identifying different clones with the same prevalence and their evolutionary relationships. On the other hand, single-cell sequencing data provides valuable information about branching events in the evolution of a cancerous tumor. However, the temporal order of mutations may be determined with ambiguities using only single-cell data, while variant allele frequencies from bulk sequencing data can provide beneficial information for inferring the temporal order of mutations with fewer ambiguities.
RESULT RESULTS
In this study, a new method called Conifer (ClONal tree Inference For hEterogeneity of tumoR) is proposed which combines aggregated variant allele frequency from bulk sequencing data with branching event information from single-cell sequencing data to more accurately identify clones and their evolutionary relationships. It is proven that the accuracy of clone identification and clonal tree inference is increased by using Conifer compared to other existing methods on various sets of simulated data. In addition, it is discussed that the evolutionary tree provided by Conifer on real cancer data sets is highly consistent with information in both bulk and single-cell data.
CONCLUSIONS CONCLUSIONS
In this study, we have provided an accurate and robust method to identify clones of tumor heterogeneity and their evolutionary history by combining single-cell and bulk sequencing data.

Identifiants

pubmed: 34461827
doi: 10.1186/s12859-021-04338-7
pii: 10.1186/s12859-021-04338-7
pmc: PMC8404257
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

416

Informations de copyright

© 2021. The Author(s).

Références

Genome Biol. 2015 May 06;16:91
pubmed: 25944252
Genome Biol. 2015 Feb 13;16:35
pubmed: 25786235
Mol Oncol. 2014 Sep 12;8(6):1095-111
pubmed: 25087573
Nature. 2013 Sep 19;501(7467):327
pubmed: 24048064
Cell Syst. 2020 Apr 22;10(4):323-332.e8
pubmed: 32864481
Nat Methods. 2014 Apr;11(4):396-8
pubmed: 24633410
Genome Res. 2017 Nov;27(11):1885-1894
pubmed: 29030470
Genome Biol. 2017 Mar 1;18(1):44
pubmed: 28249593
Nat Methods. 2016 Jul;13(7):573-6
pubmed: 27183439
Cancer Discov. 2015 Aug;5(8):806-20
pubmed: 26193902
Nat Rev Cancer. 2006 Dec;6(12):924-35
pubmed: 17109012
Genome Biol. 2014 Aug 30;15(8):452
pubmed: 25222669
PeerJ. 2015 Feb 19;3:e784
pubmed: 25737822
Bioinformatics. 2015 Jun 15;31(12):i62-70
pubmed: 26072510
BMC Bioinformatics. 2014 Feb 01;15:35
pubmed: 24484323
Biochim Biophys Acta Rev Cancer. 2017 Apr;1867(2):127-138
pubmed: 28193548
Genome Biol. 2016 May 05;17:86
pubmed: 27149953
Nat Rev Clin Oncol. 2018 Feb;15(2):81-94
pubmed: 29115304
Genome Res. 2017 Aug;27(8):1287-1299
pubmed: 28546418
Science. 1976 Oct 1;194(4260):23-8
pubmed: 959840
Mol Cell. 2015 May 21;58(4):598-609
pubmed: 26000845
Nature. 2014 Aug 14;512(7513):155-60
pubmed: 25079324
BMC Bioinformatics. 2014 Jan 24;15:27
pubmed: 24460695
Algorithms Mol Biol. 2019 Jul 27;14:17
pubmed: 31372179
Genome Biol. 2017 Sep 19;18(1):178
pubmed: 28927434
Genome Res. 2019 Nov;29(11):1847-1859
pubmed: 31628257
Nat Commun. 2019 Jun 21;10(1):2750
pubmed: 31227714
Genome Res. 2019 Nov;29(11):1860-1877
pubmed: 31628256
PLoS Comput Biol. 2014 Jul 10;10(7):e1003703
pubmed: 25010360
Nat Rev Cancer. 2012 Apr 19;12(5):323-34
pubmed: 22513401
Genome Biol. 2016 Apr 15;17:69
pubmed: 27083415
Cell Syst. 2015 Sep 23;1(3):210-223
pubmed: 26645048

Auteurs

Leila Baghaarabani (L)

Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran.

Sama Goliaei (S)

Faculty of New Sciences and Technologies, University of Tehran, Tehran, Iran.

Mohammad-Hadi Foroughmand-Araabi (MH)

Department of Mathematical Sciences, Sharif University of Technology, Tehran, Iran.

Seyed Peyman Shariatpanahi (SP)

Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran.

Bahram Goliaei (B)

Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran. goliaei@ut.ac.ir.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH