Minimum perturbation theory of deep perceptual learning.

Learning / physiology Brain Neural Networks, Computer Neuronal Plasticity / physiology Neurons / physiology

Journal

Physical review. E

ISSN: 2470-0053

Titre abrégé: Phys Rev E

Pays: United States

ID NLM: 101676019

Informations de publication

Date de publication:
Dec 2022

Historique:

received: 03 09 2022

accepted: 22 11 2022

entrez: 21 1 2023

pubmed: 22 1 2023

medline: 25 1 2023

Statut: ppublish

Résumé

Perceptual learning (PL) involves long-lasting improvement in perceptual tasks following extensive training and is accompanied by modified neuronal responses in sensory cortical areas in the brain. Understanding the dynamics of PL and the resultant synaptic changes is important for causally connecting PL to the observed neural plasticity. This is theoretically challenging because learning-related changes are distributed across many stages of the sensory hierarchy. In this paper, we modeled the sensory hierarchy as a deep nonlinear neural network and studied PL of fine discrimination, a common and well-studied paradigm of PL. Using tools from statistical physics, we developed a mean-field theory of the network in the limit of a large number of neurons and large number of examples. Our theory suggests that, in this thermodynamic limit, the input-output function of the network can be exactly mapped to that of a deep linear network, allowing us to characterize the space of solutions for the task. Surprisingly, we found that modifying synaptic weights in the first layer of the hierarchy is both sufficient and necessary for PL. To address the degeneracy of the space of solutions, we postulate that PL dynamics are constrained by a normative minimum perturbation (MP) principle, which favors weight matrices with minimal changes relative to their prelearning values. Interestingly, MP plasticity induces changes to weights and neural representations in all layers of the network, except for the readout weight vector. While weight changes in higher layers are not necessary for learning, they help reduce overall perturbation to the network. In addition, such plasticity can be learned simply through slow learning. We further elucidate the properties of MP changes and compare them against experimental findings. Overall, our statistical mechanics theory of PL provides mechanistic and normative understanding of several important empirical findings of PL.

Identifiants

DOI: 10.1103/PhysRevE.106.064406 PMID: 36671118

pubmed: 36671118

doi: 10.1103/PhysRevE.106.064406

doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

Pagination

064406

Minimum perturbation theory of deep perceptual learning.

Journal

Informations de publication

Résumé

Identifiants

Types de publication

Langues

Sous-ensembles de citation

Pagination

Auteurs

Haozhe Shan (H)

Haim Sompolinsky (H)

Articles similaires

Multilabel SegSRGAN-A framework for parcellation and morphometry of preterm brain in MRI.

Mouse α-synuclein fibrils are structurally and functionally distinct from human fibrils associated with Lewy body diseases.

Unsupervised learning for real-time and continuous gait phase detection.

Cellular-resolution optogenetics reveals attenuation-by-suppression in visual cortical neurons.

Classifications MeSH