Partial local entropy and anisotropy in deep weight spaces.


Journal

Physical review. E
ISSN: 2470-0053
Titre abrégé: Phys Rev E
Pays: United States
ID NLM: 101676019

Informations de publication

Date de publication:
Apr 2021
Historique:
received: 25 09 2020
accepted: 09 03 2021
entrez: 19 5 2021
pubmed: 20 5 2021
medline: 20 5 2021
Statut: ppublish

Résumé

We refine a recently proposed class of local entropic loss functions by restricting the smoothening regularization to only a subset of weights. The new loss functions are referred to as partial local entropies. They can adapt to the weight-space anisotropy, thus outperforming their isotropic counterparts. We support the theoretical analysis with experiments on image classification tasks performed with multilayer, fully connected, and convolutional neural networks. The present study suggests how to better exploit the anisotropic nature of deep landscapes, and it provides direct probes of the shape of the minima encountered by stochastic gradient descent algorithms. As a byproduct, we observe an asymptotic dynamical regime at late training times where the temperature of all the layers obeys a common cooling behavior.

Identifiants

pubmed: 34005873
doi: 10.1103/PhysRevE.103.042303
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

042303

Auteurs

Daniele Musso (D)

Departamento de Física de Partículas, Universidade de Santiago de Compostela (USC), Instituto Galego de Física de Altas Enerxías (IGFAE), E-15782 Santiago de Compostela, Spain; Inovalabs Digital S.L. (TECHEYE), E-36202 Vigo, Spain; and Centro de Supercomputación de Galicia (CESGA), s/n, Avenida de Vigo, 15705, Santiago de Compostela, Spain.

Classifications MeSH