New role for circuit expansion for learning in neural networks.


Journal

Physical review. E
ISSN: 2470-0053
Titre abrégé: Phys Rev E
Pays: United States
ID NLM: 101676019

Informations de publication

Date de publication:
Feb 2021
Historique:
received: 26 08 2020
accepted: 16 12 2020
entrez: 19 3 2021
pubmed: 20 3 2021
medline: 25 2 2023
Statut: ppublish

Résumé

Many sensory pathways in the brain include sparsely active populations of neurons downstream from the input stimuli. The biological purpose of this expanded structure is unclear, but it may be beneficial due to the increased expressive power of the network. In this work, we show that certain ways of expanding a neural network can improve its generalization performance even when the expanded structure is pruned after the learning period. To study this setting, we use a teacher-student framework where a perceptron teacher network generates labels corrupted with small amounts of noise. We then train a student network structurally matched to the teacher. In this scenario, the student can achieve optimal accuracy if given the teacher's synaptic weights. We find that sparse expansion of the input layer of a student perceptron network both increases its capacity and improves the generalization performance of the network when learning a noisy rule from a teacher perceptron when the expansion is pruned after learning. We find similar behavior when the expanded units are stochastic and uncorrelated with the input and analyze this network in the mean-field limit. By solving the mean-field equations, we show that the generalization error of the stochastic expanded student network continues to drop as the size of the network increases. This improvement in generalization performance occurs despite the increased complexity of the student network relative to the teacher it is trying to learn. We show that this effect is closely related to the addition of slack variables in artificial neural networks and suggest possible implications for artificial and biological neural networks.

Identifiants

pubmed: 33736047
doi: 10.1103/PhysRevE.103.022404
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

022404

Auteurs

Julia Steinberg (J)

Center for Brain Science, Harvard University, Cambridge, Massachusetts 02138, USA.
Department of Physics, Harvard University, Cambridge, Massachusetts 02138, USA.

Madhu Advani (M)

Center for Brain Science, Harvard University, Cambridge, Massachusetts 02138, USA.

Haim Sompolinsky (H)

Center for Brain Science, Harvard University, Cambridge, Massachusetts 02138, USA.
Edmond and Lily Safra Center for Brain Sciences, Hebrew University, Jerusalem 91904, Israel.

Articles similaires

Transmission of societal stereotypes to individual-level prejudice through instrumental learning.

David T Schultner, Benjamin S Stillerman, Björn R Lindström et al.
1.00
Humans Stereotyping Prejudice Male Female

Autistic traits foster effective curiosity-driven exploration.

Francesco Poli, Maran Koolen, Carlos A Velázquez-Vargas et al.
1.00
Humans Exploratory Behavior Male Female Young Adult
Humans Linguistics Language China Semantics

Constructing stability: optimal learning in noisy ecological niches.

Edward D Lee, Jessica C Flack, David C Krakauer
1.00
Learning Ecosystem Animals Models, Biological Game Theory

Classifications MeSH