Mining, analyzing, and integrating viral signals from metagenomic data.
Antibiotics
Metagenome
Phage
Phage-host interaction
Journal
Microbiome
ISSN: 2049-2618
Titre abrégé: Microbiome
Pays: England
ID NLM: 101615147
Informations de publication
Date de publication:
19 03 2019
19 03 2019
Historique:
received:
04
10
2018
accepted:
07
03
2019
entrez:
21
3
2019
pubmed:
21
3
2019
medline:
14
6
2019
Statut:
epublish
Résumé
Viruses are important components of microbial communities modulating community structure and function; however, only a couple of tools are currently available for phage identification and analysis from metagenomic sequencing data. Here we employed the random forest algorithm to develop VirMiner, a web-based phage contig prediction tool especially sensitive for high-abundances phage contigs, trained and validated by paired metagenomic and phagenomic sequencing data from the human gut flora. VirMiner achieved 41.06% ± 17.51% sensitivity and 81.91% ± 4.04% specificity in the prediction of phage contigs. In particular, for the high-abundance phage contigs, VirMiner outperformed other tools (VirFinder and VirSorter) with much higher sensitivity (65.23% ± 16.94%) than VirFinder (34.63% ± 17.96%) and VirSorter (18.75% ± 15.23%) at almost the same specificity. Moreover, VirMiner provides the most comprehensive phage analysis pipeline which is comprised of metagenomic raw reads processing, functional annotation, phage contig identification, and phage-host relationship prediction (CRISPR-spacer recognition) and supports two-group comparison when the input (metagenomic sequence data) includes different conditions (e.g., case and control). Application of VirMiner to an independent cohort of human gut metagenomes obtained from individuals treated with antibiotics revealed that 122 KEGG orthology and 118 Pfam groups had significantly differential abundance in the pre-treatment samples compared to samples at the end of antibiotic administration, including clustered regularly interspaced short palindromic repeats (CRISPR), multidrug resistance, and protein transport. The VirMiner webserver is available at http://sbb.hku.hk/VirMiner/ . We developed a comprehensive tool for phage prediction and analysis for metagenomic samples. Compared to VirSorter and VirFinder-the most widely used tools-VirMiner is able to capture more high-abundance phage contigs which could play key roles in infecting bacteria and modulating microbial community dynamics. The European Union Clinical Trials Register, EudraCT Number: 2013-003378-28 . Registered on 9 April 2014.
Sections du résumé
BACKGROUND
Viruses are important components of microbial communities modulating community structure and function; however, only a couple of tools are currently available for phage identification and analysis from metagenomic sequencing data. Here we employed the random forest algorithm to develop VirMiner, a web-based phage contig prediction tool especially sensitive for high-abundances phage contigs, trained and validated by paired metagenomic and phagenomic sequencing data from the human gut flora.
RESULTS
VirMiner achieved 41.06% ± 17.51% sensitivity and 81.91% ± 4.04% specificity in the prediction of phage contigs. In particular, for the high-abundance phage contigs, VirMiner outperformed other tools (VirFinder and VirSorter) with much higher sensitivity (65.23% ± 16.94%) than VirFinder (34.63% ± 17.96%) and VirSorter (18.75% ± 15.23%) at almost the same specificity. Moreover, VirMiner provides the most comprehensive phage analysis pipeline which is comprised of metagenomic raw reads processing, functional annotation, phage contig identification, and phage-host relationship prediction (CRISPR-spacer recognition) and supports two-group comparison when the input (metagenomic sequence data) includes different conditions (e.g., case and control). Application of VirMiner to an independent cohort of human gut metagenomes obtained from individuals treated with antibiotics revealed that 122 KEGG orthology and 118 Pfam groups had significantly differential abundance in the pre-treatment samples compared to samples at the end of antibiotic administration, including clustered regularly interspaced short palindromic repeats (CRISPR), multidrug resistance, and protein transport. The VirMiner webserver is available at http://sbb.hku.hk/VirMiner/ .
CONCLUSIONS
We developed a comprehensive tool for phage prediction and analysis for metagenomic samples. Compared to VirSorter and VirFinder-the most widely used tools-VirMiner is able to capture more high-abundance phage contigs which could play key roles in infecting bacteria and modulating microbial community dynamics.
TRIAL REGISTRATION
The European Union Clinical Trials Register, EudraCT Number: 2013-003378-28 . Registered on 9 April 2014.
Identifiants
pubmed: 30890181
doi: 10.1186/s40168-019-0657-y
pii: 10.1186/s40168-019-0657-y
pmc: PMC6425642
doi:
Substances chimiques
Anti-Bacterial Agents
0
Banques de données
EudraCT
['2013-003378-28']
Types de publication
Clinical Trial
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
42Références
Nucleic Acids Res. 2011 Jul;39(Web Server issue):W347-52
pubmed: 21672955
ISME J. 2017 Jan;11(1):237-247
pubmed: 27326545
Nucleic Acids Res. 2012 Sep;40(16):e126
pubmed: 22584627
Virus Res. 2016 Jan 4;211:199-208
pubmed: 26541317
Environ Microbiol. 2013 Feb;15(2):307-12
pubmed: 23035931
Environ Microbiol. 2016 Jul;18(7):2196-213
pubmed: 26627180
Nature. 2010 Jul 15;466(7304):334-8
pubmed: 20631792
Nat Biotechnol. 2014 Aug;32(8):822-8
pubmed: 24997787
ISME J. 2017 Jan;11(1):304-308
pubmed: 27505348
Bioinformatics. 2009 Jul 15;25(14):1754-60
pubmed: 19451168
J Med Microbiol. 2003 May;52(Pt 5):403-408
pubmed: 12721316
ISME J. 2014 Jul;8(7):1391-402
pubmed: 24621522
Proc Natl Acad Sci U S A. 2016 Mar 1;113(9):E1306-15
pubmed: 26884164
J Microbiol. 2014 Mar;52(3):235-42
pubmed: 24585054
Adv Virus Res. 2012;82:91-118
pubmed: 22420852
ISME J. 2017 Jan;11(1):7-14
pubmed: 27420028
PeerJ. 2015 May 28;3:e985
pubmed: 26038737
Environ Microbiol. 2011 Oct;13(10):2702-16
pubmed: 21883790
BMC Bioinformatics. 2007 Jun 18;8:209
pubmed: 17577412
Nucleic Acids Res. 2017 Jan 4;45(D1):D491-D498
pubmed: 27789703
Infect Immun. 2002 Aug;70(8):4669-77
pubmed: 12117981
Appl Environ Microbiol. 2007 Aug;73(16):5261-7
pubmed: 17586664
Nat Rev Microbiol. 2006 Aug;4(8):629-36
pubmed: 16845433
Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W451-4
pubmed: 15980510
Cell Microbiol. 2008 Mar;10(3):770-80
pubmed: 18005243
Nature. 2013 Jul 11;499(7457):219-22
pubmed: 23748443
Genome Biol. 2014 Dec 03;15(12):531
pubmed: 25517225
Nat Commun. 2013;4:2420
pubmed: 24036533
Nat Commun. 2014 Jul 24;5:4498
pubmed: 25058116
Nucleic Acids Res. 2002 Jan 1;30(1):281-3
pubmed: 11752315
Nucleic Acids Res. 2014 Jan;42(Database issue):D222-30
pubmed: 24288371
Sci Rep. 2016 Jun 15;6:28115
pubmed: 27301427
PLoS One. 2016 Sep 29;11(9):e0163111
pubmed: 27684958
Nucleic Acids Res. 2016 Jan 4;44(D1):D457-62
pubmed: 26476454
Nature. 2016 Aug 25;536(7617):425-30
pubmed: 27533034
Nature. 2007 Apr 12;446(7137):749-57
pubmed: 17429392
Microbiome. 2017 Jul 6;5(1):69
pubmed: 28683828
Bioinformatics. 2012 Jun 1;28(11):1420-8
pubmed: 22495754
In Silico Biol. 2006;6(3):223-7
pubmed: 16922685
Bioinformatics. 2017 Oct 01;33(19):3113-3114
pubmed: 28957499
FEMS Microbiol Lett. 2016 May;363(10):
pubmed: 27030726
Nucleic Acids Res. 2010 Jan;38(Database issue):D57-61
pubmed: 19933762
J Bacteriol. 2013 Mar;195(5):941-50
pubmed: 23222723
Nucleic Acids Res. 2006;34(20):5839-51
pubmed: 17062630
Science. 2015 May 22;348(6237):1261359
pubmed: 25999513
ISME J. 2016 Mar;10(3):707-20
pubmed: 26359913
Infect Drug Resist. 2014 Jun 20;7:167-76
pubmed: 25018641
Mob DNA. 2017 Oct 03;8:12
pubmed: 29026445
Elife. 2015 Jul 22;4:
pubmed: 26200428
BMC Bioinformatics. 2009 Dec 15;10:421
pubmed: 20003500
J Pharm Sci. 2011 Dec;100(12):5197-205
pubmed: 22020816
Genomics Proteomics Bioinformatics. 2019 Feb;17(1):39-51
pubmed: 31026582
Bioinformatics. 2008 Mar 15;24(6):863-5
pubmed: 18238785
Trends Microbiol. 2005 Jun;13(6):278-84
pubmed: 15936660
Bioinformatics. 2010 Jun 15;26(12):1481-7
pubmed: 20439257