Evaluation of variant calling algorithms for wastewater-based epidemiology using mixed populations of SARS-CoV-2 variants in synthetic and wastewater samples.


Journal

Microbial genomics
ISSN: 2057-5858
Abbreviated title: Microb Genom
Country: England
NLM ID: 101671820

Publication information

Publication date:
04 2023
History:
medline: 21 4 2023
pubmed: 19 4 2023
entrez: 19 04 2023
Status: ppublish

Abstract

Wastewater-based epidemiology has been used extensively throughout the COVID-19 (coronavirus disease 19) pandemic to detect and monitor the spread and prevalence of SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2) and its variants. It has proven an excellent, complementary tool to clinical sequencing, supporting the insights gained and helping to make informed public-health decisions. Consequently, many groups globally have developed bioinformatics pipelines to analyse sequencing data from wastewater. Accurate calling of mutations is critical in this process and in the assignment of circulating variants; yet, to date, the performance of variant-calling algorithms in wastewater samples has not been investigated. To address this, we compared the performance of six variant callers (VarScan, iVar, GATK, FreeBayes, LoFreq and BCFtools), used widely in bioinformatics pipelines, on 19 synthetic samples with known ratios of three different SARS-CoV-2 variants of concern (VOCs) (Alpha, Beta and Delta), as well as 13 wastewater samples collected in London between the 15th and 18th December 2021. We used the fundamental parameters of recall (sensitivity) and precision (specificity) to confirm the presence of mutational profiles defining specific variants across the six variant callers. Our results show that BCFtools, FreeBayes and VarScan found the expected variants with higher precision and recall than GATK or iVar, although the latter identified more expected defining mutations than other callers. LoFreq gave the least reliable results due to the high number of false-positive mutations detected, resulting in lower precision. Similar results were obtained for both the synthetic and wastewater samples.
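The comparison described above rests on scoring each caller's output against the mutations expected for a known variant. A minimal sketch of that scoring follows, assuming precision is computed as TP / (TP + FP) and recall as TP / (TP + FN). The function name `precision_recall` and the example call set are hypothetical illustrations, not the authors' pipeline or data.

```python
def precision_recall(called, expected):
    """Score a set of called mutations against the expected defining mutations.

    Assumed definitions (not taken from the paper's methods):
      precision = TP / (TP + FP)
      recall    = TP / (TP + FN)
    """
    called, expected = set(called), set(expected)
    tp = len(called & expected)   # expected mutations that were called
    fp = len(called - expected)   # called mutations that were not expected
    fn = len(expected - called)   # expected mutations that were missed
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Illustrative input only: four spike mutations associated with Alpha,
# and a made-up call set containing two hypothetical false positives.
expected_alpha = {"S:N501Y", "S:A570D", "S:P681H", "S:D1118H"}
hypothetical_calls = {"S:N501Y", "S:A570D", "S:P681H", "FP_1", "FP_2"}

precision, recall = precision_recall(hypothetical_calls, expected_alpha)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

With this made-up call set, three of five calls are expected (precision 0.60) and three of four expected mutations are recovered (recall 0.75), which illustrates how a caller that reports many extra mutations loses precision without necessarily losing recall.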

Identifiers

pubmed: 37074153
doi: 10.1099/mgen.0.000933
pmc: PMC10210938

Chemical substances

Wastewater 0

Publication types

Journal Article
Research Support, Non-U.S. Gov't

Languages

eng

Citation subsets

IM

Authors

Irene Bassano (I)

Analytics & Data Science Directorate, UK Health Security Agency, London SW1P 3JR, UK.
Department of Infectious Disease, Imperial College London, London SW7 2AZ, UK.

Vinoy K Ramachandran (VK)

Analytics & Data Science Directorate, UK Health Security Agency, London SW1P 3JR, UK.

Mohammad S Khalifa (MS)

Analytics & Data Science Directorate, UK Health Security Agency, London SW1P 3JR, UK.
Division of Biosciences, College of Health, Medicine and Life Sciences, Brunel University, London UB8 3PH, UK.

Chris J Lilley (CJ)

Analytics & Data Science Directorate, UK Health Security Agency, London SW1P 3JR, UK.

Mathew R Brown (MR)

School of Engineering, Newcastle University, Newcastle-upon-Tyne NE1 7RU, UK.

Ronny van Aerle (R)

Analytics & Data Science Directorate, UK Health Security Agency, London SW1P 3JR, UK.
International Centre of Excellence for Aquatic Animal Health, Centre for Environment, Fisheries and Aquaculture Science (Cefas), Clyst Honiton EX5 2FN, UK.

Hubert Denise (H)

Analytics & Data Science Directorate, UK Health Security Agency, London SW1P 3JR, UK.

William Rowe (W)

Analytics & Data Science Directorate, UK Health Security Agency, London SW1P 3JR, UK.

Airey George (A)

Centre for Genomic Research and NERC Environmental Omics Facility, Institute of Infection, Veterinary and Ecological Sciences (IVES), University of Liverpool, Liverpool L69 7ZB, UK.

Edward Cairns (E)

Centre for Genomic Research and NERC Environmental Omics Facility, Institute of Infection, Veterinary and Ecological Sciences (IVES), University of Liverpool, Liverpool L69 7ZB, UK.

Claudia Wierzbicki (C)

Centre for Genomic Research and NERC Environmental Omics Facility, Institute of Infection, Veterinary and Ecological Sciences (IVES), University of Liverpool, Liverpool L69 7ZB, UK.

Natalie D Pickwell (ND)

DeepSeq, Centre for Genetics and Genomics, University of Nottingham, Queen's Medical Centre, Nottingham NG7 2UH, UK.

Matthew Carlile (M)

DeepSeq, Centre for Genetics and Genomics, University of Nottingham, Queen's Medical Centre, Nottingham NG7 2UH, UK.

Nadine Holmes (N)

DeepSeq, Centre for Genetics and Genomics, University of Nottingham, Queen's Medical Centre, Nottingham NG7 2UH, UK.

Alexander Payne (A)

DeepSeq, Centre for Genetics and Genomics, University of Nottingham, Queen's Medical Centre, Nottingham NG7 2UH, UK.

Matthew Loose (M)

DeepSeq, Centre for Genetics and Genomics, University of Nottingham, Queen's Medical Centre, Nottingham NG7 2UH, UK.

Terry A Burke (TA)

NERC Environmental Omics Facility, Ecology and Evolutionary Biology, School of Biosciences, University of Sheffield, Sheffield S10 2TN, UK.

Steve Paterson (S)

Centre for Genomic Research and NERC Environmental Omics Facility, Institute of Infection, Veterinary and Ecological Sciences (IVES), University of Liverpool, Liverpool L69 7ZB, UK.

Matthew J Wade (MJ)

Analytics & Data Science Directorate, UK Health Security Agency, London SW1P 3JR, UK.
School of Engineering, Newcastle University, Newcastle-upon-Tyne NE1 7RU, UK.

Jasmine M S Grimsley (JMS)

Analytics & Data Science Directorate, UK Health Security Agency, London SW1P 3JR, UK.
