Minimum perturbation theory of deep perceptual learning.


Journal

Physical review. E
ISSN: 2470-0053
Titre abrégé: Phys Rev E
Pays: United States
ID NLM: 101676019

Informations de publication

Date de publication:
Dec 2022
Historique:
received: 03 09 2022
accepted: 22 11 2022
entrez: 21 1 2023
pubmed: 22 1 2023
medline: 25 1 2023
Statut: ppublish

Résumé

Perceptual learning (PL) involves long-lasting improvement in perceptual tasks following extensive training and is accompanied by modified neuronal responses in sensory cortical areas in the brain. Understanding the dynamics of PL and the resultant synaptic changes is important for causally connecting PL to the observed neural plasticity. This is theoretically challenging because learning-related changes are distributed across many stages of the sensory hierarchy. In this paper, we modeled the sensory hierarchy as a deep nonlinear neural network and studied PL of fine discrimination, a common and well-studied paradigm of PL. Using tools from statistical physics, we developed a mean-field theory of the network in the limit of a large number of neurons and large number of examples. Our theory suggests that, in this thermodynamic limit, the input-output function of the network can be exactly mapped to that of a deep linear network, allowing us to characterize the space of solutions for the task. Surprisingly, we found that modifying synaptic weights in the first layer of the hierarchy is both sufficient and necessary for PL. To address the degeneracy of the space of solutions, we postulate that PL dynamics are constrained by a normative minimum perturbation (MP) principle, which favors weight matrices with minimal changes relative to their prelearning values. Interestingly, MP plasticity induces changes to weights and neural representations in all layers of the network, except for the readout weight vector. While weight changes in higher layers are not necessary for learning, they help reduce overall perturbation to the network. In addition, such plasticity can be learned simply through slow learning. We further elucidate the properties of MP changes and compare them against experimental findings. Overall, our statistical mechanics theory of PL provides mechanistic and normative understanding of several important empirical findings of PL.

Identifiants

pubmed: 36671118
doi: 10.1103/PhysRevE.106.064406
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

064406

Auteurs

Haozhe Shan (H)

Center for Brain Science, Harvard University, Cambridge, Massachusetts 02138, USA and Program in Neuroscience, Harvard Medical School, Boston, Massachusetts 02115, USA.

Haim Sompolinsky (H)

Center for Brain Science, Harvard University, Cambridge, Massachusetts 02138, USA and Edmond and Lily Safra Center for Brain Sciences, Hebrew University of Jerusalem, Jerusalem 9190401, Israel.

Articles similaires

1.00
Humans Magnetic Resonance Imaging Brain Infant, Newborn Infant, Premature
alpha-Synuclein Humans Animals Mice Lewy Body Disease

Unsupervised learning for real-time and continuous gait phase detection.

Dollaporn Anopas, Yodchanan Wongsawat, Jetsada Arnin
1.00
Humans Gait Neural Networks, Computer Unsupervised Machine Learning Walking
Animals Optogenetics Visual Cortex Neurons Mice

Classifications MeSH