Assessment of SARS-CoV-2 Genome Sequencing: Quality Criteria and Low-Frequency Variants.
SARS-CoV-2
accreditation
contamination
genome sequencing
quality assessment
variants of concern
Journal
Journal of clinical microbiology
ISSN: 1098-660X
Titre abrégé: J Clin Microbiol
Pays: United States
ID NLM: 7505564
Informations de publication
Date de publication:
20 09 2021
20 09 2021
Historique:
pubmed:
29
7
2021
medline:
24
9
2021
entrez:
28
7
2021
Statut:
ppublish
Résumé
Although many laboratories worldwide have developed their sequencing capacities in response to the need for SARS-CoV-2 genome-based surveillance of variants, only a few reported some quality criteria to ensure sequence quality before lineage assignment and submission to public databases. Hence, we aimed here to provide simple quality control criteria for SARS-CoV-2 sequencing to prevent erroneous interpretation of low-quality or contaminated data. We retrospectively investigated 647 SARS-CoV-2 genomes obtained over 10 tiled amplicons sequencing runs. We extracted 26 potentially relevant metrics covering the entire workflow from sample selection to bioinformatics analysis. Based on data distribution, critical values were established for 11 selected metrics to prompt further quality investigations for problematic samples, in particular those with a low viral RNA quantity. Low-frequency variants (<70% of supporting reads) can result from PCR amplification errors, sample cross contaminations, or presence of distinct SARS-CoV2 genomes in the sample sequenced. The number and the prevalence of low-frequency variants can be used as a robust quality criterion to identify possible sequencing errors or contaminations. Overall, we propose 11 metrics with fixed cutoff values as a simple tool to evaluate the quality of SARS-CoV-2 genomes, among which are cycle thresholds, mean depth, proportion of genome covered at least 10×, and the number of low-frequency variants combined with mutation prevalence data.
Identifiants
pubmed: 34319802
doi: 10.1128/JCM.00944-21
pmc: PMC8451431
doi:
Substances chimiques
RNA, Viral
0
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
e0094421Références
Microorganisms. 2021 Mar 25;9(4):
pubmed: 33806013
Chest. 2020 Nov;158(5):1804-1805
pubmed: 33160519
Virus Evol. 2020 Oct 05;6(2):veaa075
pubmed: 33318859
Bioinformatics. 2018 Sep 1;34(17):i884-i890
pubmed: 30423086
J Virol. 2021 Mar 1;95(10):
pubmed: 33649194
Bioinformatics. 2018 Dec 1;34(23):4121-4123
pubmed: 29790939
Microorganisms. 2021 Feb 02;9(2):
pubmed: 33540596
Clin Microbiol Infect. 2021 Jul;27(7):1036.e1-1036.e8
pubmed: 33813118
Lancet Infect Dis. 2020 Nov;20(11):1263-1272
pubmed: 32679081
Virus Res. 2017 Jul 15;239:97-105
pubmed: 27993623
Sci Transl Med. 2020 Dec 9;12(573):
pubmed: 33229462
Microbes Infect. 2020 Nov - Dec;22(10):617-621
pubmed: 32911086
Nat Med. 2020 Sep;26(9):1398-1404
pubmed: 32647358
Nat Commun. 2020 Dec 11;11(1):6351
pubmed: 33311501
Bioinformatics. 2016 Jan 15;32(2):292-4
pubmed: 26428292
Am J Clin Pathol. 1950 Nov;20(11):1059-66
pubmed: 14783086
Genes (Basel). 2020 Aug 17;11(8):
pubmed: 32824573
Clin Chem. 1981 Mar;27(3):493-501
pubmed: 7471403
Nucleic Acids Res. 2015 Dec 2;43(21):e143
pubmed: 26187991
Genes (Basel). 2020 Aug 12;11(8):
pubmed: 32806776