Analyzing Low-Level mtDNA Heteroplasmy-Pitfalls and Challenges from Bench to Benchmarking.
Aging
/ genetics
Benchmarking
DNA, Mitochondrial
/ genetics
DNA-Directed DNA Polymerase
/ genetics
Genetic Predisposition to Disease
Genetic Variation
/ genetics
Genome, Mitochondrial
/ genetics
Heteroplasmy
/ genetics
High-Throughput Nucleotide Sequencing
Humans
Mitochondria
/ genetics
Mutation
/ genetics
Sequence Analysis, DNA
DNA polymerase
heteroplasmy
mitochondrial DNA (mtDNA)
next generation sequencing (NGS)
variant callers
Journal
International journal of molecular sciences
ISSN: 1422-0067
Titre abrégé: Int J Mol Sci
Pays: Switzerland
ID NLM: 101092791
Informations de publication
Date de publication:
19 Jan 2021
19 Jan 2021
Historique:
received:
09
12
2020
revised:
05
01
2021
accepted:
15
01
2021
entrez:
22
1
2021
pubmed:
23
1
2021
medline:
2
4
2021
Statut:
epublish
Résumé
Massive parallel sequencing technologies are promising a highly sensitive detection of low-level mutations, especially in mitochondrial DNA (mtDNA) studies. However, processes from DNA extraction and library construction to bioinformatic analysis include several varying tasks. Further, there is no validated recommendation for the comprehensive procedure. In this study, we examined potential pitfalls on the sequencing results based on two-person mtDNA mixtures. Therefore, we compared three DNA polymerases, six different variant callers in five mixtures between 50% and 0.5% variant allele frequencies generated with two different amplification protocols. In total, 48 samples were sequenced on Illumina MiSeq. Low-level variant calling at the 1% variant level and below was performed by comparing trimming and PCR duplicate removal as well as six different variant callers. The results indicate that sensitivity, specificity, and precision highly depend on the investigated polymerase but also vary based on the analysis tools. Our data highlight the advantage of prior standardization and validation of the individual laboratory setup with a DNA mixture model. Finally, we provide an artificial heteroplasmy benchmark dataset that can help improve somatic variant callers or pipelines, which may be of great interest for research related to cancer and aging.
Identifiants
pubmed: 33477827
pii: ijms22020935
doi: 10.3390/ijms22020935
pmc: PMC7832847
pii:
doi:
Substances chimiques
DNA, Mitochondrial
0
DNA-Directed DNA Polymerase
EC 2.7.7.7
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Références
J Korean Med Sci. 2018 Dec 11;33(52):e337
pubmed: 30584415
PLoS Comput Biol. 2012;8(10):e1002737
pubmed: 23133345
Sci Rep. 2019 Mar 5;9(1):3503
pubmed: 30837525
Hum Mol Genet. 2013 Jan 15;22(2):384-90
pubmed: 23077218
Am J Hum Genet. 2010 Aug 13;87(2):237-49
pubmed: 20696290
J Mol Evol. 2009 May;68(5):516-27
pubmed: 19407924
Appl Transl Genom. 2016 Jul 01;10:2-9
pubmed: 27668169
Sci Rep. 2014 Jul 18;4:5737
pubmed: 25034901
Mitochondrial DNA B Resour. 2019 Dec 12;5(1):108-112
pubmed: 33366444
BMC Genomics. 2009 Mar 30;10:139
pubmed: 19331681
Electrophoresis. 2005 Sep;26(18):3414-29
pubmed: 16167362
Nat Rev Genet. 2018 May;19(5):269-285
pubmed: 29576615
Nucleic Acids Res. 2012 Dec;40(22):11189-201
pubmed: 23066108
BMC Cancer. 2012 Feb 02;12:53
pubmed: 22299657
Nucleic Acids Res. 2016 Jul 8;44(W1):W64-9
pubmed: 27084948
PLoS Genet. 2016 Mar 15;12(3):e1005939
pubmed: 26978189
Hum Mutat. 2013 Oct;34(10):1432-8
pubmed: 23766071
Biotechniques. 2010 Apr;48(4):287-96
pubmed: 20569205
Proc Natl Acad Sci U S A. 2014 Oct 28;111(43):E4546-7
pubmed: 25319266
BMC Bioinformatics. 2018 Nov 19;19(1):429
pubmed: 30453880
Genome Res. 2010 Sep;20(9):1297-303
pubmed: 20644199
Proc Natl Acad Sci U S A. 2014 Oct 28;111(43):E4548-50
pubmed: 25319265
Cold Spring Harb Perspect Biol. 2013 Jul 01;5(7):
pubmed: 23818499
Forensic Sci Int Genet. 2018 Jul;35:57-64
pubmed: 29665468
Sci Rep. 2015 Jan 27;5:8056
pubmed: 25623996
Am J Hum Genet. 2010 Dec 10;87(6):802-12
pubmed: 21129724
Proc Natl Acad Sci U S A. 2019 Dec 10;116(50):25172-25178
pubmed: 31757848
Proc Natl Acad Sci U S A. 2014 Jul 22;111(29):10654-9
pubmed: 25002485
Bioinformatics. 2018 Sep 1;34(17):i884-i890
pubmed: 30423086
Nat Genet. 1996 Apr;12(4):417-20
pubmed: 8630496
Cancers (Basel). 2020 Jul 17;12(7):
pubmed: 32708892
Croat Med J. 2011 Jun;52(3):299-313
pubmed: 21674826
Forensic Sci Int Genet. 2014 Nov;13:20-9
pubmed: 25051226
Int J Legal Med. 2003 Jun;117(3):180-4
pubmed: 12799738
J Med Genet. 2019 Dec;56(12):792-800
pubmed: 31300550
Nucleic Acids Res. 2016 Jul 8;44(W1):W58-63
pubmed: 27084951
J Genet Genomics. 2013 Dec 20;40(12):607-15
pubmed: 24377867
BMC Biol. 2019 Jul 8;17(1):53
pubmed: 31286943
Int J Legal Med. 2018 Jul;132(4):1049-1056
pubmed: 29279961
Investig Genet. 2015 Mar 10;6:3
pubmed: 25798216
Nat Genet. 2020 Mar;52(3):342-352
pubmed: 32024997
Genome Res. 2012 Mar;22(3):568-76
pubmed: 22300766
PLoS One. 2015 Aug 11;10(8):e0135643
pubmed: 26262956
Bioinformatics. 2009 Aug 15;25(16):2078-9
pubmed: 19505943
Nat Commun. 2020 Mar 20;11(1):1487
pubmed: 32198407
Nat Genet. 1999 Oct;23(2):147
pubmed: 10508508
Nucleic Acids Res. 2016 Jun 20;44(11):e108
pubmed: 27060149
Forensic Sci Int Genet. 2015 Sep;18:131-9
pubmed: 26009256
Bioinformatics. 2011 Apr 15;27(8):1157-8
pubmed: 21320865
BMC Bioinformatics. 2016 Mar 11;17:125
pubmed: 26968756