Predicting metabolic modules in incomplete bacterial genomes with MetaPathPredict.


Journal

eLife
ISSN: 2050-084X
Titre abrégé: Elife
Pays: England
ID NLM: 101579614

Informations de publication

Date de publication:
02 May 2024
Historique:
received: 22 12 2022
accepted: 16 04 2024
medline: 2 5 2024
pubmed: 2 5 2024
entrez: 2 5 2024
Statut: epublish

Résumé

The reconstruction of complete microbial metabolic pathways using 'omics data from environmental samples remains challenging. Computational pipelines for pathway reconstruction that utilize machine learning methods to predict the presence or absence of KEGG modules in incomplete genomes are lacking. Here, we present MetaPathPredict, a software tool that incorporates machine learning models to predict the presence of complete KEGG modules within bacterial genomic datasets. Using gene annotation data and information from the KEGG module database, MetaPathPredict employs deep learning models to predict the presence of KEGG modules in a genome. MetaPathPredict can be used as a command line tool or as a Python module, and both options are designed to be run locally or on a compute cluster. Benchmarks show that MetaPathPredict makes robust predictions of KEGG module presence within highly incomplete genomes.

Identifiants

pubmed: 38696239
doi: 10.7554/eLife.85749
pii: 85749
doi:
pii:

Types de publication

Journal Article Research Support, Non-U.S. Gov't Research Support, U.S. Gov't, Non-P.H.S.

Langues

eng

Sous-ensembles de citation

IM

Subventions

Organisme : NIH HHS
ID : NIGMS R01GM132600
Pays : United States

Déclaration de conflit d'intérêts

DG, VE, MP, JR, TW, JM No competing interests declared, KK affiliated with Luit Consulting. The author has no financial interests to declare

Auteurs

David Geller-McGrath (D)

Biology Department, Woods Hole Oceanographic Institution, Woods Hole, United States.

Kishori M Konwar (KM)

Luit Consulting, Revere, United States.

Virginia P Edgcomb (VP)

Marine Geology and Geophysics Department, Woods Hole Oceanographic Institution, Woods Hole, United States.

Maria Pachiadaki (M)

Biology Department, Woods Hole Oceanographic Institution, Woods Hole, United States.

Jack W Roddy (JW)

R. Ken Coit College of Pharmacy, University of Arizona, Tucson, United States.

Travis J Wheeler (TJ)

R. Ken Coit College of Pharmacy, University of Arizona, Tucson, United States.

Jason E McDermott (JE)

Computational Sciences Division, Pacific Northwest National Laboratory, Richland, United States.
Department of Molecular Microbiology and Immunology, Oregon Health & Science University, Portland, United States.

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages
Populus Soil Microbiology Soil Microbiota Fungi

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software
Aerosols Humans Decontamination Air Microbiology Masks

Classifications MeSH