A Bayesian model integration for mutation calling through data partitioning.
Journal
Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944
Informations de publication
Date de publication:
01 11 2019
01 11 2019
Historique:
received:
23
03
2018
revised:
06
09
2018
accepted:
28
03
2019
pubmed:
30
3
2019
medline:
1
7
2020
entrez:
30
3
2019
Statut:
ppublish
Résumé
Detection of somatic mutations from tumor and matched normal sequencing data has become among the most important analysis methods in cancer research. Some existing mutation callers have focused on additional information, e.g. heterozygous single-nucleotide polymorphisms (SNPs) nearby mutation candidates or overlapping paired-end read information. However, existing methods cannot take multiple information sources into account simultaneously. Existing Bayesian hierarchical model-based methods construct two generative models, the tumor model and error model, and limited information sources have been modeled. We proposed a Bayesian model integration framework named as partitioning-based model integration. In this framework, through introducing partitions for paired-end reads based on given information sources, we integrate existing generative models and utilize multiple information sources. Based on that, we constructed a novel Bayesian hierarchical model-based method named as OHVarfinDer. In both the tumor model and error model, we introduced partitions for a set of paired-end reads that cover a mutation candidate position, and applied a different generative model for each category of paired-end reads. We demonstrated that our method can utilize both heterozygous SNP information and overlapping paired-end read information effectively in simulation datasets and real datasets. https://github.com/takumorizo/OHVarfinDer. Supplementary data are available at Bioinformatics online.
Identifiants
pubmed: 30924874
pii: 5423180
doi: 10.1093/bioinformatics/btz233
pmc: PMC6821361
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
4247-4254Informations de copyright
© The Author(s) 2019. Published by Oxford University Press.
Références
Bioinformatics. 2014 Dec 1;30(23):3302-9
pubmed: 25123903
Genome Res. 2011 Jun;21(6):961-73
pubmed: 20980555
Nat Biotechnol. 2013 Mar;31(3):213-9
pubmed: 23396013
BMC Genomics. 2013 Feb 12;14:96
pubmed: 23402258
IEEE Trans Nanobioscience. 2017 Mar 01;16(2):116-122
pubmed: 28278479
Nature. 2011 Sep 11;478(7367):64-9
pubmed: 21909114
Nucleic Acids Res. 2013 Apr;41(7):e89
pubmed: 23471004
Bioinformatics. 2012 Apr 1;28(7):907-13
pubmed: 22285562
Genome Res. 2012 Mar;22(3):568-76
pubmed: 22300766
Nat Rev Genet. 2010 Oct;11(10):685-96
pubmed: 20847746