Customization of a DADA2-based pipeline for fungal internal transcribed spacer 1 (ITS1) amplicon data sets.


Journal

JCI insight
ISSN: 2379-3708
Titre abrégé: JCI Insight
Pays: United States
ID NLM: 101676073

Informations de publication

Date de publication:
11 01 2022
Historique:
pubmed: 24 11 2021
medline: 22 3 2022
entrez: 23 11 2021
Statut: epublish

Résumé

Identification and analysis of fungal communities commonly rely on internal transcribed spacer-based (ITS-based) amplicon sequencing. There is no gold standard used to infer and classify fungal constituents since methodologies have been adapted from analyses of bacterial communities. To achieve high-resolution inference of fungal constituents, we customized a DADA2-based pipeline using a mix of 11 medically relevant fungi. While DADA2 allowed the discrimination of ITS1 sequences differing by single nucleotides, quality filtering, sequencing bias, and database selection were identified as key variables determining the accuracy of sample inference. Due to species-specific differences in sequencing quality, default filtering settings removed most reads that originated from Aspergillus species, Saccharomyces cerevisiae, and Candida glabrata. By fine-tuning the quality filtering process, we achieved an improved representation of the fungal communities. By adapting a wobble nucleotide in the ITS1 forward primer region, we further increased the yield of S. cerevisiae and C. glabrata sequences. Finally, we showed that a BLAST-based algorithm based on the UNITE+INSD or the NCBI NT database achieved a higher reliability in species-level taxonomic annotation compared with the naive Bayesian classifier implemented in DADA2. These steps optimized a robust fungal ITS1 sequencing pipeline that, in most instances, enabled species-level assignment of community members.

Identifiants

pubmed: 34813499
pii: 151663
doi: 10.1172/jci.insight.151663
pmc: PMC8765055
doi:
pii:

Substances chimiques

DNA, Fungal 0
DNA, Intergenic 0

Types de publication

Journal Article Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Subventions

Organisme : NIAID NIH HHS
ID : R01 AI139632
Pays : United States
Organisme : NIAID NIH HHS
ID : R21 AI156157
Pays : United States
Organisme : NCI NIH HHS
ID : P30 CA008748
Pays : United States
Organisme : NIAID NIH HHS
ID : R56 AI137269
Pays : United States
Organisme : NIAID NIH HHS
ID : R01 AI137269
Pays : United States
Organisme : NIAID NIH HHS
ID : R01 AI093808
Pays : United States
Organisme : NIAID NIH HHS
ID : R21 AI105617
Pays : United States

Références

mSystems. 2017 Mar 7;2(2):
pubmed: 28289731
J Infect Dis. 2013 Aug 15;208(4):690-8
pubmed: 23661798
Database (Oxford). 2020 Jan 1;2020:
pubmed: 32761142
Front Microbiol. 2017 Feb 15;8:238
pubmed: 28261190
J Clin Microbiol. 2021 Jan 21;59(2):
pubmed: 33028600
Appl Environ Microbiol. 2007 Aug;73(16):5261-7
pubmed: 17586664
Bioinformatics. 2010 Oct 1;26(19):2460-1
pubmed: 20709691
PeerJ. 2018 Apr 18;6:e4652
pubmed: 29682424
mSystems. 2016 Oct 18;1(5):
pubmed: 27822553
Nucleic Acids Res. 2016 Jan 4;44(D1):D67-72
pubmed: 26590407
BMC Microbiol. 2012 Nov 08;12:255
pubmed: 23136846
PLoS Pathog. 2014 Sep 18;10(9):e1004365
pubmed: 25233198
Cancer Cell. 2021 Sep 13;39(9):1202-1213.e6
pubmed: 34329585
N Engl J Med. 2016 Dec 15;375(24):2369-2379
pubmed: 27974040
Nat Med. 2020 Jan;26(1):59-64
pubmed: 31907459
Curr Opin Microbiol. 2020 Aug;56:1-6
pubmed: 32599521
Genome Biol. 2019 Apr 29;20(1):85
pubmed: 31036053
PLoS One. 2020 Jan 16;15(1):e0227434
pubmed: 31945086
Nat Methods. 2016 Jul;13(7):581-3
pubmed: 27214047
Mol Ecol. 2019 Feb;28(4):721-730
pubmed: 30582650
Database (Oxford). 2017 Jan 1;2017:
pubmed: 29220466
Nat Methods. 2013 Oct;10(10):996-8
pubmed: 23955772
ISME J. 2017 Dec;11(12):2639-2643
pubmed: 28731476
Nucleic Acids Res. 2019 Jan 8;47(D1):D259-D264
pubmed: 30371820
N Engl J Med. 2020 Feb 27;382(9):822-834
pubmed: 32101664
mSphere. 2020 Feb 26;5(1):
pubmed: 32102941
PLoS One. 2018 Oct 25;13(10):e0206428
pubmed: 30359454
BMC Bioinformatics. 2019 Nov 15;20(1):581
pubmed: 31730472
Genome Res. 1998 Mar;8(3):186-94
pubmed: 9521922
Proc Natl Acad Sci U S A. 2012 Apr 17;109(16):6241-6
pubmed: 22454494
PeerJ. 2016 Oct 18;4:e2584
pubmed: 27781170
Bioinformatics. 2015 Nov 1;31(21):3476-82
pubmed: 26139637
Front Physiol. 2021 Jul 19;12:699253
pubmed: 34349667
Mol Ecol Resour. 2021 May;21(4):1141-1148
pubmed: 33459491
Mol Biol Evol. 2008 Nov;25(11):2251-4
pubmed: 18728073
Int J Mol Sci. 2015 Oct 22;16(10):25067-79
pubmed: 26506340
Proc Natl Acad Sci U S A. 2007 Mar 6;104(10):3901-6
pubmed: 17360450

Auteurs

Thierry Rolling (T)

Infectious Disease Service, Department of Medicine, and.
Immunology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center (MSKCC), New York, New York, USA.
Division of Infectious Diseases, First Department of Medicine, University Medical Center Hamburg-Eppendorf, Hamburg, Germany.

Bing Zhai (B)

Infectious Disease Service, Department of Medicine, and.
Immunology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center (MSKCC), New York, New York, USA.

John Frame (J)

Infectious Disease Service, Department of Medicine, and.

Tobias M Hohl (TM)

Infectious Disease Service, Department of Medicine, and.
Immunology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center (MSKCC), New York, New York, USA.
Weill Cornell Medical College, New York, New York, USA.

Ying Taur (Y)

Infectious Disease Service, Department of Medicine, and.
Weill Cornell Medical College, New York, New York, USA.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages
Populus Soil Microbiology Soil Microbiota Fungi

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software

Classifications MeSH