Accurate Identification of Subclones in Tumor Genomes.
cancer evolution, genomics, statistical modeling
Journal
Molecular biology and evolution
ISSN: 1537-1719
Titre abrégé: Mol Biol Evol
Pays: United States
ID NLM: 8501455
Informations de publication
Date de publication:
02 07 2022
02 07 2022
Historique:
pubmed:
25
6
2022
medline:
16
7
2022
entrez:
24
6
2022
Statut:
ppublish
Résumé
Understanding intratumor heterogeneity is critical for studying tumorigenesis and designing personalized treatments. To decompose the mixed cell population in a tumor, subclones are inferred computationally based on variant allele frequency (VAF) from bulk sequencing data. In this study, we showed that sequencing depth, mean VAF, and variance of VAF of a subclone are confounded. Without considering this effect, current methods require deep-sequencing data (>300× depth) to reliably infer subclones. Here, we present a novel algorithm that incorporates depth-variance and mean-variance dependencies in a clustering error model and successfully identifies subclones in tumors sequenced at depths of as low as 30×. We implemented the algorithm as a model-based adaptive grouping of subclones (MAGOS) method. Analyses of computer simulated data and empirical sequencing data showed that MAGOS outperformed existing methods on minimum sequencing depth, decomposition accuracy, and computation efficiency. The most prominent improvements were observed in analyzing tumors sequenced at depths between 30× and 200×, whereas the performance was comparable between MAGOS and existing methods on deeply sequenced tumors. MAGOS supports analysis of single-nucleotide variants and copy number variants from a single sample or multiple samples of a tumor. We applied MAGOS to whole-exome data of late-stage liver cancers and discovered that high subclone count in a tumor was a significant risk factor of poor prognosis. Lastly, our analysis suggested that sequencing multiple samples of the same tumor at standard depth is more cost-effective and robust for subclone characterization than deep sequencing a single sample. MAGOS is available at github (https://github.com/liliulab/magos).
Identifiants
pubmed: 35749590
pii: 6617617
doi: 10.1093/molbev/msac136
pmc: PMC9260306
pii:
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S.
Research Support, N.I.H., Extramural
Langues
eng
Sous-ensembles de citation
IM
Subventions
Organisme : NLM NIH HHS
ID : R01 LM013438
Pays : United States
Organisme : NCI NIH HHS
ID : U54 CA217376
Pays : United States
Informations de copyright
© The Author(s) 2022. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution.
Références
N Engl J Med. 2016 Sep 22;375(12):1109-12
pubmed: 27653561
Blood. 2012 Aug 2;120(5):1060-6
pubmed: 22529291
Cell. 2017 Jun 15;169(7):1327-1341.e23
pubmed: 28622513
Nat Methods. 2014 Apr;11(4):396-8
pubmed: 24633410
Proc Natl Acad Sci U S A. 2008 Sep 2;105(35):13081-6
pubmed: 18723673
BMC Bioinformatics. 2011 Jun 28;12:263
pubmed: 21708038
Genome Res. 2010 Jan;20(1):68-80
pubmed: 19903760
PLoS Comput Biol. 2019 Jul 29;15(7):e1007243
pubmed: 31356595
Bioinformatics. 2015 May 1;31(9):1349-56
pubmed: 25568283
Science. 1976 Oct 1;194(4260):23-8
pubmed: 959840
Curr Opin Genet Dev. 2012 Feb;22(1):3-9
pubmed: 22386266
Br J Cancer. 2010 Oct 12;103(8):1139-43
pubmed: 20877357
Genome Biol. 2017 Sep 19;18(1):178
pubmed: 28927434
Bioinformatics. 2014 Jun 15;30(12):i78-86
pubmed: 24932008
Bioinformatics. 2014 Jan 1;30(1):50-60
pubmed: 24177718
Nat Methods. 2016 Jun;13(6):505-7
pubmed: 27088313
Nucleic Acids Res. 2013 Sep;41(17):e165
pubmed: 23892400
Cell. 2017 Feb 9;168(4):613-628
pubmed: 28187284
Nat Rev Cancer. 2017 Oct;17(10):605-619
pubmed: 28912577
PLoS One. 2011;6(11):e26100
pubmed: 22125594
Genome Biol. 2015 May 06;16:91
pubmed: 25944252
Nat Rev Genet. 2016 Mar;17(3):175-88
pubmed: 26806412
Nat Biotechnol. 2012 May;30(5):413-21
pubmed: 22544022
PLoS Genet. 2014 Jul 10;10(7):e1004462
pubmed: 25010716
N Engl J Med. 2017 Jun 1;376(22):2109-2121
pubmed: 28445112
PLoS Comput Biol. 2014 Aug 07;10(8):e1003665
pubmed: 25102416
Nat Rev Genet. 2017 Apr;18(4):213-229
pubmed: 28190876
Genome Biol. 2015 Feb 13;16:35
pubmed: 25786235
Cell. 2012 Jul 20;150(2):264-78
pubmed: 22817890
Nat Genet. 2020 Sep;52(9):898-907
pubmed: 32879509
PLoS Comput Biol. 2015 Oct 05;11(10):e1004416
pubmed: 26436540
Bioinformatics. 2015 Jun 15;31(12):i62-70
pubmed: 26072510
Bioinformatics. 2018 Dec 1;34(23):4017-4026
pubmed: 29931046
BMC Bioinformatics. 2014 Feb 01;15:35
pubmed: 24484323
Biochim Biophys Acta Rev Cancer. 2017 Apr;1867(2):127-138
pubmed: 28193548
Genome Biol. 2016 May 05;17:86
pubmed: 27149953
Br J Cancer. 2013 Feb 19;108(3):479-85
pubmed: 23299535
Nature. 2011 Apr 7;472(7341):90-4
pubmed: 21399628
Nat Biotechnol. 2019 Apr;37(4):367-369
pubmed: 30877282
Cell. 2013 Feb 14;152(4):714-26
pubmed: 23415222
Cell. 2012 May 25;149(5):994-1007
pubmed: 22608083
Nature. 2012 Jan 18;481(7381):306-13
pubmed: 22258609
Nature. 2012 Jan 11;481(7382):506-10
pubmed: 22237025
Cell Syst. 2015 Sep 23;1(3):210-223
pubmed: 26645048
Bioinformatics. 2018 Jun 1;34(11):1808-1816
pubmed: 29342233