Customized de novo mutation detection for any variant calling pipeline: SynthDNM.
Journal
Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944
Informations de publication
Date de publication:
25 Oct 2021
25 Oct 2021
Historique:
received:
31
10
2020
accepted:
01
04
2021
medline:
7
4
2021
pubmed:
7
4
2021
entrez:
6
4
2021
Statut:
ppublish
Résumé
As sequencing technologies and analysis pipelines evolve, de novo mutation (DNM) calling tools must be adapted. Therefore, a flexible approach is needed that can accurately identify DNMs from genome or exome sequences from a variety of datasets and variant calling pipelines. Here, we describe SynthDNM, a random-forest based classifier that can be readily adapted to new sequencing or variant-calling pipelines by applying a flexible approach to constructing simulated training examples from real data. The optimized SynthDNM classifiers predict de novo SNPs and indels with robust accuracy across multiple methods of variant calling. SynthDNM is freely available on Github (https://github.com/james-guevara/synthdnm). Supplementary data are available at Bioinformatics online.
Identifiants
pubmed: 33821956
pii: 6209072
doi: 10.1093/bioinformatics/btab225
pmc: PMC8545295
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
3640-3641Subventions
Organisme : NIMH NIH HHS
ID : R01 MH113715
Pays : United States
Organisme : NIH HHS
ID : MH113715
Pays : United States
Organisme : Simons Foundation Autism Research Initiative
ID : 606768
Organisme : National Natural Science Foundation of China
ID : 81730036
Informations de copyright
© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.