Improving the Accuracy of Nearest-Neighbor Classification Using Principled Construction and Stochastic Sampling of Training-Set Centroids.

image recognition nearest-neighbor classification stochastic sampling

Journal

Entropy (Basel, Switzerland)
ISSN: 1099-4300
Titre abrégé: Entropy (Basel)
Pays: Switzerland
ID NLM: 101243874

Informations de publication

Date de publication:
26 Jan 2021
Historique:
received: 03 01 2021
accepted: 21 01 2021
entrez: 3 2 2021
pubmed: 4 2 2021
medline: 4 2 2021
Statut: epublish

Résumé

A conceptually simple way to classify images is to directly compare test-set data and training-set data. The accuracy of this approach is limited by the method of comparison used, and by the extent to which the training-set data cover configuration space. Here we show that this coverage can be substantially increased using coarse-graining (replacing groups of images by their centroids) and stochastic sampling (using distinct sets of centroids in combination). We use the MNIST and Fashion-MNIST data sets to show that a principled coarse-graining algorithm can convert training images into fewer image centroids without loss of accuracy of classification of test-set images by nearest-neighbor classification. Distinct batches of centroids can be used in combination as a means of stochastically sampling configuration space, and can classify test-set data more accurately than can the unaltered training set. On the MNIST and Fashion-MNIST data sets this approach converts nearest-neighbor classification from a mid-ranking- to an upper-ranking member of the set of classical machine-learning techniques.

Identifiants

pubmed: 33530507
pii: e23020149
doi: 10.3390/e23020149
pmc: PMC7911166
pii:
doi:

Types de publication

Journal Article

Langues

eng

Subventions

Organisme : U.S. Department of Energy
ID : DE-AC02--05CH11231

Références

Neural Netw. 2002 Oct-Nov;15(8-9):1041-58
pubmed: 12416693
Phys Rev Lett. 2018 Aug 3;121(5):058301
pubmed: 30118252
J Chem Phys. 2018 Jul 28;149(4):044118
pubmed: 30068165
Nature. 2015 May 28;521(7553):436-44
pubmed: 26017442
J Phys Condens Matter. 2018 May 15;30(25):254005
pubmed: 29762140
Phys Rev E. 2020 May;101(5-1):052604
pubmed: 32575260
IEEE Trans Pattern Anal Mach Intell. 2012 Mar;34(3):417-35
pubmed: 21768651
IEEE Trans Neural Netw. 1997;8(1):65-74
pubmed: 18255611
Phys Rev Lett. 1989 Jan 23;62(4):361-364
pubmed: 10040213
J Chem Phys. 2018 Jun 28;148(24):241711
pubmed: 29960321
Phys Rev Lett. 2004 Jan 23;92(3):035504
pubmed: 14753886

Auteurs

Stephen Whitelam (S)

Molecular Foundry, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA.

Classifications MeSH