Evaluation of variant calling algorithms for wastewater-based epidemiology using mixed populations of SARS-CoV-2 variants in synthetic and wastewater samples.
SARS-CoV-2
VOC
sequencing
variant callers
wastewater
Journal
Microbial genomics
ISSN: 2057-5858
Abbreviated title: Microb Genom
Country: England
NLM ID: 101671820
Publication information
Publication date:
04 2023
History:
medline: 21 04 2023
pubmed: 19 04 2023
entrez: 19 04 2023
Status: ppublish
Abstract
Wastewater-based epidemiology has been used extensively throughout the COVID-19 (coronavirus disease 19) pandemic to detect and monitor the spread and prevalence of SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2) and its variants. It has proven an excellent, complementary tool to clinical sequencing, supporting the insights gained and helping to make informed public-health decisions. Consequently, many groups globally have developed bioinformatics pipelines to analyse sequencing data from wastewater. Accurate calling of mutations is critical in this process and in the assignment of circulating variants; yet, to date, the performance of variant-calling algorithms in wastewater samples has not been investigated. To address this, we compared the performance of six variant callers (VarScan, iVar, GATK, FreeBayes, LoFreq and BCFtools), used widely in bioinformatics pipelines, on 19 synthetic samples with known ratios of three different SARS-CoV-2 variants of concern (VOCs) (Alpha, Beta and Delta), as well as 13 wastewater samples collected in London between the 15th and 18th December 2021. We used the fundamental parameters of recall (sensitivity) and precision (specificity) to confirm the presence of mutational profiles defining specific variants across the six variant callers. Our results show that BCFtools, FreeBayes and VarScan found the expected variants with higher precision and recall than GATK or iVar, although the latter identified more expected defining mutations than other callers. LoFreq gave the least reliable results due to the high number of false-positive mutations detected, resulting in lower precision. Similar results were obtained for both the synthetic and wastewater samples.
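The precision and recall comparison described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' pipeline: it assumes the standard definitions (precision = TP / (TP + FP), recall = TP / (TP + FN)) and treats each caller's output as a set of mutation labels compared against the set of mutations expected for the variants in the sample. The function name `precision_recall` and the mutation labels are hypothetical placeholders, not data from the study.

```python
def precision_recall(called, expected):
    """Return (precision, recall) for a set of called mutations.

    called   -- mutations reported by a variant caller (hypothetical input)
    expected -- mutations expected from the known variant mix
    """
    called, expected = set(called), set(expected)
    tp = len(called & expected)   # expected mutations that were called
    fp = len(called - expected)   # called mutations that were not expected
    fn = len(expected - called)   # expected mutations that were missed
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Illustrative labels only; they are not results from the paper.
expected = {"A23063T", "C23604A", "A23403G", "C3267T"}
called = {"A23063T", "C23604A", "A23403G", "G100T"}

precision, recall = precision_recall(called, expected)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

Under these definitions, a caller that reports many unexpected mutations loses precision even when its recall is high, which is the pattern the abstract reports for LoFreq.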
Identifiers
pubmed: 37074153
doi: 10.1099/mgen.0.000933
pmc: PMC10210938
Chemical substances
Wastewater
0
Publication types
Journal Article
Research Support, Non-U.S. Gov't
Languages
eng
Citation subsets
IM