PhISCS-BnB: a fast branch and bound algorithm for the perfect tumor phylogeny reconstruction problem.


Journal

Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944

Informations de publication

Date de publication:
01 07 2020
Historique:
entrez: 14 7 2020
pubmed: 14 7 2020
medline: 9 3 2021
Statut: ppublish

Résumé

Recent advances in single-cell sequencing (SCS) offer an unprecedented insight into tumor emergence and evolution. Principled approaches to tumor phylogeny reconstruction via SCS data are typically based on general computational methods for solving an integer linear program, or a constraint satisfaction program, which, although guaranteeing convergence to the most likely solution, are very slow. Others based on Monte Carlo Markov Chain or alternative heuristics not only offer no such guarantee, but also are not faster in practice. As a result, novel methods that can scale up to handle the size and noise characteristics of emerging SCS data are highly desirable to fully utilize this technology. We introduce PhISCS-BnB (phylogeny inference using SCS via branch and bound), a branch and bound algorithm to compute the most likely perfect phylogeny on an input genotype matrix extracted from an SCS dataset. PhISCS-BnB not only offers an optimality guarantee, but is also 10-100 times faster than the best available methods on simulated tumor SCS data. We also applied PhISCS-BnB on a recently published large melanoma dataset derived from the sublineages of a cell line involving 20 clones with 2367 mutations, which returned the optimal tumor phylogeny in <4 h. The resulting phylogeny agrees with and extends the published results by providing a more detailed picture on the clonal evolution of the tumor. https://github.com/algo-cancer/PhISCS-BnB. Supplementary data are available at Bioinformatics online.

Identifiants

pubmed: 32657358
pii: 5870466
doi: 10.1093/bioinformatics/btaa464
pmc: PMC7355310
doi:

Types de publication

Journal Article Research Support, N.I.H., Intramural Research Support, Non-U.S. Gov't Research Support, U.S. Gov't, Non-P.H.S.

Langues

eng

Sous-ensembles de citation

IM

Pagination

i169-i176

Subventions

Organisme : Medical Research Council
ID : MR/P014712/1
Pays : United Kingdom
Organisme : Medical Research Council
ID : MR/P014712/2
Pays : United Kingdom

Informations de copyright

Published by Oxford University Press 2020.

Références

Cell Syst. 2016 Jul;3(1):43-53
pubmed: 27467246
Int J Cancer. 2019 Oct 1;145(7):1889-1901
pubmed: 30861105
PLoS Comput Biol. 2014 Aug 07;10(8):e1003665
pubmed: 25102416
Genome Biol. 2015 Feb 13;16:35
pubmed: 25786235
Cell. 2019 Sep 19;179(1):219-235.e21
pubmed: 31522890
Nat Commun. 2018 Dec 4;9(1):5144
pubmed: 30514897
Bioinformatics. 2017 Jul 15;33(14):i152-i160
pubmed: 28882002
Bioinformatics. 2020 Feb 1;36(3):742-750
pubmed: 31504211
Ann Oncol. 2017 Dec 1;28(12):3076-3082
pubmed: 28950321
Genome Biol. 2016 Apr 15;17:69
pubmed: 27083415
J Comput Biol. 2017 Jun;24(6):515-523
pubmed: 28056180
Genome Biol. 2015 May 06;16:91
pubmed: 25944252
Bioinformatics. 2015 Jun 15;31(12):i62-70
pubmed: 26072510
BMC Bioinformatics. 2014 Feb 01;15:35
pubmed: 24484323
Biochim Biophys Acta Rev Cancer. 2017 Apr;1867(2):127-138
pubmed: 28193548
Genome Biol. 2016 May 05;17:86
pubmed: 27149953
Cell. 2019 Nov 14;179(5):1207-1221.e22
pubmed: 31730858
IEEE/ACM Trans Comput Biol Bioinform. 2006 Apr-Jun;3(2):165-73
pubmed: 17048402
Bioinformatics. 2015 May 1;31(9):1349-56
pubmed: 25568283
Bioinformatics. 2018 Sep 1;34(17):i671-i679
pubmed: 30423070
Algorithms Mol Biol. 2019 Jul 27;14:17
pubmed: 31372179
Genome Biol. 2017 Sep 19;18(1):178
pubmed: 28927434
Nat Med. 2015 Aug;21(8):846-53
pubmed: 26248267
Genome Res. 2019 Nov;29(11):1847-1859
pubmed: 31628257
Nat Commun. 2019 Jun 21;10(1):2750
pubmed: 31227714
Genome Res. 2019 Nov;29(11):1860-1877
pubmed: 31628256
Bioinformatics. 2014 Jun 15;30(12):i78-86
pubmed: 24932008
Nucleic Acids Res. 2013 Sep;41(17):e165
pubmed: 23892400
Nat Methods. 2016 Jul;13(7):573-6
pubmed: 27183439

Auteurs

Erfan Sadeqi Azer (E)

Department of Computer Science, Indiana University, Bloomington, IN 47408, USA.

Farid Rashidi Mehrabadi (F)

Department of Computer Science, Indiana University, Bloomington, IN 47408, USA.
Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA.

Salem Malikić (S)

Department of Computer Science, Indiana University, Bloomington, IN 47408, USA.

Xuan Cindy Li (XC)

Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA.
Program in Computational Biology, Bioinformatics and Genomics, University of Maryland, College Park, MD 20742, USA.

Osnat Bartok (O)

Department of Molecular Cell Biology, Weizmann Institute of Science, Rehovot, Israel.

Kevin Litchfield (K)

Cancer Evolution and Genome Instability Laboratory, Francis Crick Institute, London NW1 1AT, UK.
Cancer Research UK Lung Cancer Centre of Excellence London, University College London Cancer Institute, London WC1E 6DD, UK.

Ronen Levy (R)

Department of Molecular Cell Biology, Weizmann Institute of Science, Rehovot, Israel.

Yardena Samuels (Y)

Department of Molecular Cell Biology, Weizmann Institute of Science, Rehovot, Israel.

Alejandro A Schäffer (AA)

Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA.

E Michael Gertz (EM)

Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA.

Chi-Ping Day (CP)

Laboratory of Cancer Biology and Genetics, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA.

Eva Pérez-Guijarro (E)

Laboratory of Cancer Biology and Genetics, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA.

Kerrie Marie (K)

Laboratory of Cancer Biology and Genetics, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA.

Maxwell P Lee (MP)

Laboratory of Cancer Biology and Genetics, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA.

Glenn Merlino (G)

Laboratory of Cancer Biology and Genetics, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA.

Funda Ergun (F)

Department of Computer Science, Indiana University, Bloomington, IN 47408, USA.

S Cenk Sahinalp (SC)

Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C

Classifications MeSH