Evaluating Classification Model Against Bayes Error Rate.

Algorithms Bayes Theorem

Journal

IEEE transactions on pattern analysis and machine intelligence

ISSN: 1939-3539

Titre abrégé: IEEE Trans Pattern Anal Mach Intell

Pays: United States

ID NLM: 9885960

Informations de publication

Date de publication:
Aug 2023

Historique:

medline: 3 7 2023

pubmed: 7 4 2023

entrez: 6 4 2023

Statut: ppublish

Résumé

For a classification task, we usually select an appropriate classifier via model selection. How to evaluate whether the chosen classifier is optimal? One can answer this question via Bayes error rate (BER). Unfortunately, estimating BER is a fundamental conundrum. Most existing BER estimators focus on giving the upper and lower bounds of the BER. However, evaluating whether the selected classifier is optimal based on these bounds is hard. In this article, we aim to learn the exact BER instead of bounds on BER. The core of our method is to transform the BER calculation problem into a noise recognition problem. Specifically, we define a type of noise called Bayes noise and prove that the proportion of Bayes noisy samples in a data set is statistically consistent with the BER of the data set. To recognize the Bayes noisy samples, we present a method consisting of two parts: selecting reliable samples based on percolation theory and then employing a label propagation algorithm to recognize the Bayes noisy samples based on the selected reliable samples. The superiority of the proposed method compared to the existing BER estimators is verified on extensive synthetic, benchmark, and image data sets.

Identifiants

DOI: 10.1109/TPAMI.2023.3240194 PMID: 37022220

pubmed: 37022220

doi: 10.1109/TPAMI.2023.3240194

doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

Pagination

9639-9653

Evaluating Classification Model Against Bayes Error Rate.

Journal

Informations de publication

Résumé

Identifiants

Types de publication

Langues

Sous-ensembles de citation

Pagination

Auteurs

Qingqiang Chen (Q)

Fuyuan Cao (F)

Ying Xing (Y)

Jiye Liang (J)

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Multilabel SegSRGAN-A framework for parcellation and morphometry of preterm brain in MRI.

An arithmetic operation P system based on symmetric ternary system.

Unsupervised learning for real-time and continuous gait phase detection.

Classifications MeSH