PathoFact: a pipeline for the prediction of virulence factors and antimicrobial resistance genes in metagenomic data.
Antimicrobial resistance
Bacterial toxins
Bioinformatics
Metagenomics
Microbiome
Mobile genetic elements
Virulence factors
Journal
Microbiome
ISSN: 2049-2618
Titre abrégé: Microbiome
Pays: England
ID NLM: 101615147
Informations de publication
Date de publication:
17 02 2021
17 02 2021
Historique:
received:
21
09
2020
accepted:
29
12
2020
entrez:
18
2
2021
pubmed:
19
2
2021
medline:
24
3
2021
Statut:
epublish
Résumé
Pathogenic microorganisms cause disease by invading, colonizing, and damaging their host. Virulence factors including bacterial toxins contribute to pathogenicity. Additionally, antimicrobial resistance genes allow pathogens to evade otherwise curative treatments. To understand causal relationships between microbiome compositions, functioning, and disease, it is essential to identify virulence factors and antimicrobial resistance genes in situ. At present, there is a clear lack of computational approaches to simultaneously identify these factors in metagenomic datasets. Here, we present PathoFact, a tool for the contextualized prediction of virulence factors, bacterial toxins, and antimicrobial resistance genes with high accuracy (0.921, 0.832 and 0.979, respectively) and specificity (0.957, 0.989 and 0.994). We evaluate the performance of PathoFact on simulated metagenomic datasets and perform a comparison to two other general workflows for the analysis of metagenomic data. PathoFact outperforms all existing workflows in predicting virulence factors and toxin genes. It performs comparably to one pipeline regarding the prediction of antimicrobial resistance while outperforming the others. We further demonstrate the performance of PathoFact on three publicly available case-control metagenomic datasets representing an actual infection as well as chronic diseases in which either pathogenic potential or bacterial toxins are hypothesized to play a role. In each case, we identify virulence factors and AMR genes which differentiated between the case and control groups, thereby revealing novel gene associations with the studied diseases. PathoFact is an easy-to-use, modular, and reproducible pipeline for the identification of virulence factors, bacterial toxins, and antimicrobial resistance genes in metagenomic data. Additionally, our tool combines the prediction of these pathogenicity factors with the identification of mobile genetic elements. This provides further depth to the analysis by considering the genomic context of the pertinent genes. Furthermore, PathoFact's modules for virulence factors, toxins, and antimicrobial resistance genes can be applied independently, thereby making it a flexible and versatile tool. PathoFact, its models, and databases are freely available at https://pathofact.lcsb.uni.lu . Video abstract.
Sections du résumé
BACKGROUND
Pathogenic microorganisms cause disease by invading, colonizing, and damaging their host. Virulence factors including bacterial toxins contribute to pathogenicity. Additionally, antimicrobial resistance genes allow pathogens to evade otherwise curative treatments. To understand causal relationships between microbiome compositions, functioning, and disease, it is essential to identify virulence factors and antimicrobial resistance genes in situ. At present, there is a clear lack of computational approaches to simultaneously identify these factors in metagenomic datasets.
RESULTS
Here, we present PathoFact, a tool for the contextualized prediction of virulence factors, bacterial toxins, and antimicrobial resistance genes with high accuracy (0.921, 0.832 and 0.979, respectively) and specificity (0.957, 0.989 and 0.994). We evaluate the performance of PathoFact on simulated metagenomic datasets and perform a comparison to two other general workflows for the analysis of metagenomic data. PathoFact outperforms all existing workflows in predicting virulence factors and toxin genes. It performs comparably to one pipeline regarding the prediction of antimicrobial resistance while outperforming the others. We further demonstrate the performance of PathoFact on three publicly available case-control metagenomic datasets representing an actual infection as well as chronic diseases in which either pathogenic potential or bacterial toxins are hypothesized to play a role. In each case, we identify virulence factors and AMR genes which differentiated between the case and control groups, thereby revealing novel gene associations with the studied diseases.
CONCLUSION
PathoFact is an easy-to-use, modular, and reproducible pipeline for the identification of virulence factors, bacterial toxins, and antimicrobial resistance genes in metagenomic data. Additionally, our tool combines the prediction of these pathogenicity factors with the identification of mobile genetic elements. This provides further depth to the analysis by considering the genomic context of the pertinent genes. Furthermore, PathoFact's modules for virulence factors, toxins, and antimicrobial resistance genes can be applied independently, thereby making it a flexible and versatile tool. PathoFact, its models, and databases are freely available at https://pathofact.lcsb.uni.lu . Video abstract.
Identifiants
pubmed: 33597026
doi: 10.1186/s40168-020-00993-9
pii: 10.1186/s40168-020-00993-9
pmc: PMC7890817
doi:
Substances chimiques
Anti-Bacterial Agents
0
Anti-Infective Agents
0
Virulence Factors
0
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Video-Audio Media
Langues
eng
Sous-ensembles de citation
IM
Pagination
49Références
Nucleic Acids Res. 2012 Jan;40(Database issue):D615-20
pubmed: 22102573
Nat Methods. 2020 Mar;17(3):261-272
pubmed: 32015543
Antimicrob Agents Chemother. 2011 Apr;55(4):1485-93
pubmed: 21282452
Nucleic Acids Res. 2009 Jan;37(Database issue):D443-7
pubmed: 18832362
BMC Bioinformatics. 2008 Jan 28;9:62
pubmed: 18226234
Nucleic Acids Res. 2016 Jan 4;44(D1):D279-85
pubmed: 26673716
Bioinformatics. 2016 Aug 15;32(16):2520-3
pubmed: 27153620
NPJ Biofilms Microbiomes. 2017 Jun 22;3:14
pubmed: 28649415
Curr Opin Chem Biol. 2008 Feb;12(1):93-101
pubmed: 18284925
Sci Rep. 2016 May 11;6:25945
pubmed: 27166072
Nat Microbiol. 2016 Feb 01;1:15032
pubmed: 27572438
Bioinformatics. 2006 Jul 1;22(13):1658-9
pubmed: 16731699
Antimicrob Agents Chemother. 2013 Jul;57(7):3348-57
pubmed: 23650175
Clin Microbiol Rev. 2013 Apr;26(2):185-230
pubmed: 23554414
Front Microbiol. 2018 Jul 18;9:1632
pubmed: 30072981
mSystems. 2020 Mar 10;5(2):
pubmed: 32156798
Bioinformatics. 2014 Apr 1;30(7):923-30
pubmed: 24227677
Bioinformatics. 2018 Oct 15;34(20):3600
pubmed: 29788404
Res Microbiol. 2004 Jun;155(5):376-86
pubmed: 15207870
Global Health. 2016 Mar 22;12:8
pubmed: 27000847
PeerJ. 2015 May 28;3:e985
pubmed: 26038737
Nat Commun. 2013;4:2151
pubmed: 23877117
Nat Methods. 2017 Nov;14(11):1063-1071
pubmed: 28967888
Microbiome. 2018 Feb 01;6(1):23
pubmed: 29391044
PLoS One. 2014 Apr 15;9(4):e93907
pubmed: 24736651
Mov Disord. 2020 Mar;35(3):431-442
pubmed: 31737957
Nucleic Acids Res. 2020 Jan 8;48(D1):D517-D525
pubmed: 31665441
Genome Biol. 2014;15(12):550
pubmed: 25516281
Nucleic Acids Res. 2013 Jul;41(12):e121
pubmed: 23598997
Antimicrob Agents Chemother. 2004 Apr;48(4):1416-8
pubmed: 15047557
Bioinformatics. 2018 Jul 15;34(14):2499-2502
pubmed: 29528364
Clin Microbiol Rev. 2002 Oct;15(4):647-79
pubmed: 12364374
PLoS One. 2008;3(10):e3375
pubmed: 18846219
Nucleic Acids Res. 2018 Apr 6;46(6):e35
pubmed: 29346586
Bioinformatics. 2018 Jul 1;34(13):2263-2270
pubmed: 29408954
mBio. 2016 Aug 30;7(4):
pubmed: 27578755
Nature. 2011 Aug 31;477(7365):457-61
pubmed: 21881561
Nucleic Acids Res. 2019 Jan 8;47(D1):D506-D515
pubmed: 30395287
Genome Med. 2017 Apr 28;9(1):39
pubmed: 28449715
Expert Rev Anti Infect Ther. 2010 May;8(5):555-64
pubmed: 20455684
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D325-8
pubmed: 15608208
F1000Res. 2019 Jul 4;8:1006
pubmed: 31508216
Nat Rev Mol Cell Biol. 2001 Jul;2(7):530-7
pubmed: 11433367
Nat Biotechnol. 2019 Apr;37(4):420-423
pubmed: 30778233
Nucleic Acids Res. 2000 Jan 1;28(1):45-8
pubmed: 10592178
Microbiol Mol Biol Rev. 1997 Jun;61(2):136-69
pubmed: 9184008
Antimicrob Agents Chemother. 2016 Jan 04;60(3):1767-78
pubmed: 26729493
Bacteriophage. 2014 Jan 1;4(1):e27943
pubmed: 24575358
Bioinformatics. 2018 Nov 1;34(21):3601-3608
pubmed: 29762644
Science. 2019 Sep 13;365(6458):1082-1083
pubmed: 31515374
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D271-2
pubmed: 14681410
J Bacteriol. 2000 Jun;182(12):3467-74
pubmed: 10852879
Nat Microbiol. 2016 Oct 10;2:16180
pubmed: 27723761
Nucleic Acids Res. 2015 Jan;43(Database issue):D928-34
pubmed: 25378312
Nat Methods. 2018 Nov;15(11):962-968
pubmed: 30377376
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D438-42
pubmed: 14681452
Antimicrob Agents Chemother. 2019 Oct 22;63(11):
pubmed: 31427293
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D71-4
pubmed: 15608288
Nucleic Acids Res. 2000 Jan 1;28(1):27-30
pubmed: 10592173
Science. 2009 Aug 28;325(5944):1128-1131
pubmed: 19713526
PLoS One. 2011;6(12):e28032
pubmed: 22145021
Genome Biol. 2016 Dec 16;17(1):260
pubmed: 27986083