Feature Selection Pipelines with Classification for Non-targeted Metabolomics Combining the Neural Network and Genetic Algorithm.


Journal

Analytical chemistry
ISSN: 1520-6882
Titre abrégé: Anal Chem
Pays: United States
ID NLM: 0370536

Informations de publication

Date de publication:
12 04 2022
Historique:
pubmed: 29 3 2022
medline: 14 4 2022
entrez: 28 3 2022
Statut: ppublish

Résumé

Non-targeted metabolomics via high-resolution mass spectrometry methods, such as direct infusion Fourier transform-ion cyclotron resonance mass spectrometry (DI-FT-ICR MS), produces data sets with thousands of features. By contrast, the number of samples is in general substantially lower. This disparity presents challenges when analyzing non-targeted metabolomics data sets and often requires custom methods to uncover information not always accessible via classical statistical techniques. In this work, we present a pipeline that combines a convolutional neural network with traditional statistical approaches and an adaptation of a genetic algorithm. The developed method was applied to a lifestyle intervention cohort data set, where subjects at risk of type 2 diabetes underwent an oral glucose tolerance test. Feature selection is the final result of the pipeline, achieved through classification of the data set via a neural network, with a precision-recall score of over 0.9 on the test set. The features most relevant for the described classification were then chosen via a genetic algorithm. The output of the developed pipeline encompasses approximately 200 features with high predictive scores, providing a fingerprint of the metabolic changes in the prediabetic class on the data set. Our framework presents a new approach which allows to apply complex modeling based on convolutional neural networks for the analysis of high-resolution mass spectrometric data.

Identifiants

pubmed: 35344349
doi: 10.1021/acs.analchem.1c03237
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

5474-5482

Auteurs

Anna Lisitsyna (A)

Research Unit Analytical BioGeoChemistry, Helmholtz Zentrum München, Neuherberg 85764, Germany.
German Center for Diabetes Research (DZD), Neuherberg 85764, Germany.

Franco Moritz (F)

Research Unit Analytical BioGeoChemistry, Helmholtz Zentrum München, Neuherberg 85764, Germany.

Youzhong Liu (Y)

Analytical Development, Small Molecule Development, Janssen Pharmaceutical Companies of Johnson and Johnson, Beerse 2340, Belgium.

Loubna Al Sadat (L)

Institute for Nutritional Medicine, School of Medicine, Technical University of Munich, Munich 80686, Germany.

Hans Hauner (H)

Institute for Nutritional Medicine, School of Medicine, Technical University of Munich, Munich 80686, Germany.
Else Kröner-Fresenius-Centre for Nutritional Medicine, School of Life Sciences, Technical University of Munich, Freising 85354, Germany.

Melina Claussnitzer (M)

Broad Institute of MIT and Harvard, Cambridge 02141-2023 Massachusetts, United States.
Division of Gerontology, Department of Medicine, Beth Israel Deaconess Medical Center, Boston, Massachusetts 02108, United States.
Harvard Medical School, Harvard University, Boston, Massachusetts 02108, United States.

Philippe Schmitt-Kopplin (P)

Research Unit Analytical BioGeoChemistry, Helmholtz Zentrum München, Neuherberg 85764, Germany.
Chair of Analytical Food Chemistry, TUM School of Life Sciences, Technical University Munich, Munich 80686, Germany.

Sara Forcisi (S)

Research Unit Analytical BioGeoChemistry, Helmholtz Zentrum München, Neuherberg 85764, Germany.
German Center for Diabetes Research (DZD), Neuherberg 85764, Germany.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH