Mixture of Experts with Entropic Regularization for Data Classification.

classification entropy mixture-of-experts regularization

Journal

Entropy (Basel, Switzerland)
ISSN: 1099-4300
Titre abrégé: Entropy (Basel)
Pays: Switzerland
ID NLM: 101243874

Informations de publication

Date de publication:
18 Feb 2019
Historique:
received: 04 01 2019
revised: 04 02 2019
accepted: 15 02 2019
entrez: 3 12 2020
pubmed: 18 2 2019
medline: 18 2 2019
Statut: epublish

Résumé

Today, there is growing interest in the automatic classification of a variety of tasks, such as weather forecasting, product recommendations, intrusion detection, and people recognition. "Mixture-of-experts" is a well-known classification technique; it is a probabilistic model consisting of local expert classifiers weighted by a gate network that is typically based on softmax functions, combined with learnable complex patterns in data. In this scheme, one data point is influenced by only one expert; as a result, the training process can be misguided in real datasets for which complex data need to be explained by multiple experts. In this work, we propose a variant of the regular mixture-of-experts model. In the proposed model, the cost classification is penalized by the Shannon entropy of the gating network in order to avoid a "winner-takes-all" output for the gating network. Experiments show the advantage of our approach using several real datasets, with improvements in mean accuracy of 3-6% in some datasets. In future work, we plan to embed feature selection into this model.

Identifiants

pubmed: 33266905
pii: e21020190
doi: 10.3390/e21020190
pmc: PMC7514672
pii:
doi:

Types de publication

Journal Article

Langues

eng

Subventions

Organisme : Fondo Nacional de Desarrollo Científico y Tecnológico
ID : 11140892

Références

Proc Natl Acad Sci U S A. 2009 Feb 10;106(6):1826-31
pubmed: 19188593

Auteurs

Billy Peralta (B)

Department of Engineering Science, Andres Bello University, Santiago 7500971, Chile.

Ariel Saavedra (A)

Department of Engineering Informatics, Catholic University of Temuco, Temuco 4781312, Chile.

Luis Caro (L)

Department of Engineering Informatics, Catholic University of Temuco, Temuco 4781312, Chile.

Alvaro Soto (A)

Department of Computer Sciences, Pontifical Catholic University of Chile, Santiago 7820436, Chile.

Classifications MeSH