A deep learning architecture for metabolic pathway prediction.


Journal

Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944

Informations de publication

Date de publication:
15 04 2020
Historique:
received: 09 10 2019
revised: 02 12 2019
accepted: 22 12 2019
pubmed: 28 12 2019
medline: 10 10 2020
entrez: 28 12 2019
Statut: ppublish

Résumé

Understanding the mechanisms and structural mappings between molecules and pathway classes are critical for design of reaction predictors for synthesizing new molecules. This article studies the problem of prediction of classes of metabolic pathways (series of chemical reactions occurring within a cell) in which a given biochemical compound participates. We apply a hybrid machine learning approach consisting of graph convolutional networks used to extract molecular shape features as input to a random forest classifier. In contrast to previously applied machine learning methods for this problem, our framework automatically extracts relevant shape features directly from input SMILES representations, which are atom-bond specifications of chemical structures composing the molecules. Our method is capable of correctly predicting the respective metabolic pathway class of 95.16% of tested compounds, whereas competing methods only achieve an accuracy of 84.92% or less. Furthermore, our framework extends to the task of classification of compounds having mixed membership in multiple pathway classes. Our prediction accuracy for this multi-label task is 97.61%. We analyze the relative importance of various global physicochemical features to the pathway class prediction problem and show that simple linear/logistic regression models can predict the values of these global features from the shape features extracted using our framework. https://github.com/baranwa2/MetabolicPathwayPrediction. Supplementary data are available at Bioinformatics online.

Identifiants

pubmed: 31879763
pii: 5687857
doi: 10.1093/bioinformatics/btz954
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't Research Support, U.S. Gov't, Non-P.H.S.

Langues

eng

Sous-ensembles de citation

IM

Pagination

2547-2553

Informations de copyright

© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Auteurs

Mayank Baranwal (M)

Department of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, MI 48109, USA.

Abram Magner (A)

Department of Computer Science, University at Albany, SUNY, Albany, NY 12222, USA.

Paolo Elvati (P)

Department of Mechanical Engineering.

Jacob Saldinger (J)

Department of Mechanical Engineering.

Angela Violi (A)

Department of Mechanical Engineering.
Department of Chemical Engineering and Biophysics, University of Michigan, Ann Arbor, MI 48109, USA.

Alfred O Hero (AO)

Department of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, MI 48109, USA.

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages
Databases, Protein Protein Domains Protein Folding Proteins Deep Learning

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software

Understanding the role of machine learning in predicting progression of osteoarthritis.

Simone Castagno, Benjamin Gompels, Estelle Strangmark et al.
1.00
Humans Disease Progression Machine Learning Osteoarthritis

Classifications MeSH