Graph Convolutional Networks for Improved Prediction and Interpretability of Chromatographic Retention Data.


Journal

Analytical chemistry
ISSN: 1520-6882
Titre abrégé: Anal Chem
Pays: United States
ID NLM: 0370536

Informations de publication

Date de publication:
30 11 2021
Historique:
pubmed: 16 11 2021
medline: 15 12 2021
entrez: 15 11 2021
Statut: ppublish

Résumé

Machine learning is a popular technique to predict the retention times of molecules based on descriptors. Descriptors and associated labels (e.g., retention times) of a set of molecules can be used to train a machine learning algorithm. However, descriptors are fixed molecular features which are not necessarily optimized for the given machine learning problem (e.g., to predict retention times). Recent advances in molecular machine learning make use of so-called graph convolutional networks (GCNs) to learn molecular representations from atoms and their bonds to adjacent atoms to optimize the molecular representation for the given problem. In this study, two GCNs were implemented to predict the retention times of molecules for three different chromatographic data sets and compared to seven benchmarks (including two state-of-the art machine learning models). Additionally, saliency maps were computed from trained GCNs to better interpret the importance of certain molecular sub-structures in the data sets. Based on the overall observations of this study, the GCNs performed better than all benchmarks, either significantly outperforming them (5-25% lower mean absolute error) or performing similar to them (<5% difference). Saliency maps revealed a significant difference in molecular sub-structures that are important for predictions of different chromatographic data sets (reversed-phase liquid chromatography vs hydrophilic interaction liquid chromatography).

Identifiants

pubmed: 34780168
doi: 10.1021/acs.analchem.1c02988
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

15633-15641

Auteurs

Alexander Kensert (A)

Department for Pharmaceutical and Pharmacological Sciences, University of Leuven (KU Leuven), Pharmaceutical Analysis, Herestraat 49, Leuven 3000, Belgium.
Department of Chemical Engineering, Vrije Universiteit Brussel, Pleinlaan 2, Brussel 1050, Belgium.

Robbin Bouwmeester (R)

VIB, VIB-UGent Center for Medical Biotechnology, Technologiepark-Zwijnaarde 75, Gent 9052, Belgium.
Department of Biomolecular Medicine, Ghent University, Technologiepark-Zwijnaarde 75, Gent 9052, Belgium.

Kyriakos Efthymiadis (K)

Department for Pharmaceutical and Pharmacological Sciences, University of Leuven (KU Leuven), Pharmaceutical Analysis, Herestraat 49, Leuven 3000, Belgium.
Department of Computer Science, Artificial Intelligence Lab, Vrije Universiteit Brussel, Pleinlaan 9, Brussel 1050, Belgium.

Peter Van Broeck (P)

Department of Pharmaceutical Development and Manufacturing Sciences, Janssen Pharmaceutica, Turnhoutseweg 30, Beerse 2340, Belgium.

Gert Desmet (G)

Department of Chemical Engineering, Vrije Universiteit Brussel, Pleinlaan 2, Brussel 1050, Belgium.

Deirdre Cabooter (D)

Department for Pharmaceutical and Pharmacological Sciences, University of Leuven (KU Leuven), Pharmaceutical Analysis, Herestraat 49, Leuven 3000, Belgium.

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software

Understanding the role of machine learning in predicting progression of osteoarthritis.

Simone Castagno, Benjamin Gompels, Estelle Strangmark et al.
1.00
Humans Disease Progression Machine Learning Osteoarthritis
1.00
Humans Magnetic Resonance Imaging Brain Infant, Newborn Infant, Premature

Classifications MeSH