Robust microarray data feature selection using a correntropy based distance metric learning approach.

Correntropy Distance metric learning Feature selection Microarray data classifications Robustness

Journal

Computers in biology and medicine
ISSN: 1879-0534
Titre abrégé: Comput Biol Med
Pays: United States
ID NLM: 1250250

Informations de publication

Date de publication:
07 2023
Historique:
received: 09 12 2022
revised: 18 04 2023
accepted: 20 05 2023
medline: 5 6 2023
pubmed: 27 5 2023
entrez: 26 5 2023
Statut: ppublish

Résumé

Classification of high-dimensional microarray data is a challenge in bioinformatics and genetic data processing. One of the challenging issues of feature selection is the presence of outliers. The Euclidean distance metric is sensitive to outliers. In this study, a distance metric learning based feature selection approach that uses the correntropy function as the discrimination metric is proposed. For this purpose, the metric learning problem is formulated as an optimization problem and solved using the Lagrange method. The output of the approach signifies the most important and robust features. After feature selection, different classification methods such as SVM, decision trees, and NN classifiers are used to investigate the classification accuracy of the proposed method as well as precision, recall, and F-measure. Experiments are carried out on 13 high-dimensional datasets and show that the proposed method outperforms the previous models in terms of accuracy and robustness.

Identifiants

pubmed: 37235945
pii: S0010-4825(23)00521-8
doi: 10.1016/j.compbiomed.2023.107056
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

107056

Informations de copyright

Copyright © 2023. Published by Elsevier Ltd.

Déclaration de conflit d'intérêts

Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Auteurs

Venus Vahabzadeh (V)

Department of Software Engineering, Mashhad Branch, Islamic Azad University, Mashhad, Iran. Electronic address: ve4788@yahoo.com.

Mohammad Hossein Moattar (MH)

Department of Software Engineering, Mashhad Branch, Islamic Azad University, Mashhad, Iran. Electronic address: moattar@mshdiau.ac.ir.

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages
1.00
Humans Magnetic Resonance Imaging Brain Infant, Newborn Infant, Premature
Humans Colorectal Neoplasms Biomarkers, Tumor Prognosis Gene Expression Regulation, Neoplastic
Humans Algorithms Software Artificial Intelligence Computer Simulation

Classifications MeSH