Predicting metabolic modules in incomplete bacterial genomes with MetaPathPredict.
bioinformatics
computational biology
machine learning
metabolism prediction
none
systems biology
Journal
eLife
ISSN: 2050-084X
Titre abrégé: Elife
Pays: England
ID NLM: 101579614
Informations de publication
Date de publication:
02 May 2024
02 May 2024
Historique:
received:
22
12
2022
accepted:
16
04
2024
medline:
2
5
2024
pubmed:
2
5
2024
entrez:
2
5
2024
Statut:
epublish
Résumé
The reconstruction of complete microbial metabolic pathways using 'omics data from environmental samples remains challenging. Computational pipelines for pathway reconstruction that utilize machine learning methods to predict the presence or absence of KEGG modules in incomplete genomes are lacking. Here, we present MetaPathPredict, a software tool that incorporates machine learning models to predict the presence of complete KEGG modules within bacterial genomic datasets. Using gene annotation data and information from the KEGG module database, MetaPathPredict employs deep learning models to predict the presence of KEGG modules in a genome. MetaPathPredict can be used as a command line tool or as a Python module, and both options are designed to be run locally or on a compute cluster. Benchmarks show that MetaPathPredict makes robust predictions of KEGG module presence within highly incomplete genomes.
Identifiants
pubmed: 38696239
doi: 10.7554/eLife.85749
pii: 85749
doi:
pii:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S.
Langues
eng
Sous-ensembles de citation
IM
Subventions
Organisme : NIH HHS
ID : NIGMS R01GM132600
Pays : United States
Déclaration de conflit d'intérêts
DG, VE, MP, JR, TW, JM No competing interests declared, KK affiliated with Luit Consulting. The author has no financial interests to declare