Robust microarray data feature selection using a correntropy based distance metric learning approach.
Correntropy
Distance metric learning
Feature selection
Microarray data classifications
Robustness
Journal
Computers in biology and medicine
ISSN: 1879-0534
Titre abrégé: Comput Biol Med
Pays: United States
ID NLM: 1250250
Informations de publication
Date de publication:
07 2023
07 2023
Historique:
received:
09
12
2022
revised:
18
04
2023
accepted:
20
05
2023
medline:
5
6
2023
pubmed:
27
5
2023
entrez:
26
5
2023
Statut:
ppublish
Résumé
Classification of high-dimensional microarray data is a challenge in bioinformatics and genetic data processing. One of the challenging issues of feature selection is the presence of outliers. The Euclidean distance metric is sensitive to outliers. In this study, a distance metric learning based feature selection approach that uses the correntropy function as the discrimination metric is proposed. For this purpose, the metric learning problem is formulated as an optimization problem and solved using the Lagrange method. The output of the approach signifies the most important and robust features. After feature selection, different classification methods such as SVM, decision trees, and NN classifiers are used to investigate the classification accuracy of the proposed method as well as precision, recall, and F-measure. Experiments are carried out on 13 high-dimensional datasets and show that the proposed method outperforms the previous models in terms of accuracy and robustness.
Identifiants
pubmed: 37235945
pii: S0010-4825(23)00521-8
doi: 10.1016/j.compbiomed.2023.107056
pii:
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
107056Informations de copyright
Copyright © 2023. Published by Elsevier Ltd.
Déclaration de conflit d'intérêts
Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.