HOW CONNECTIVITY STRUCTURE SHAPES RICH AND LAZY LEARNING IN NEURAL CIRCUITS.


Journal

ArXiv
ISSN: 2331-8422
Titre abrégé: ArXiv
Pays: United States
ID NLM: 101759493

Informations de publication

Date de publication:
12 Oct 2023
Historique:
pubmed: 24 10 2023
medline: 24 10 2023
entrez: 24 10 2023
Statut: epublish

Résumé

In theoretical neuroscience, recent work leverages deep learning tools to explore how some network attributes critically influence its learning dynamics. Notably, initial weight distributions with small (resp. large) variance may yield a rich (resp. lazy) regime, where significant (resp. minor) changes to network states and representation are observed over the course of learning. However, in biology, neural circuit connectivity generally has a low-rank structure and therefore differs markedly from the random initializations generally used for these studies. As such, here we investigate how the structure of the initial weights - in particular their effective rank - influences the network learning regime. Through both empirical and theoretical analyses, we discover that high-rank initializations typically yield smaller network changes indicative of lazier learning, a finding we also confirm with experimentally-driven initial connectivity in recurrent neural networks. Conversely, low-rank initialization biases learning towards richer learning. Importantly, however, as an exception to this rule, we find lazier learning can still occur with a low-rank initialization that aligns with task and data statistics. Our research highlights the pivotal role of initial weight structures in shaping learning regimes, with implications for metabolic costs of plasticity and risks of catastrophic forgetting.

Identifiants

pubmed: 37873007
pii: 2310.08513
pmc: PMC10593070
pii:

Types de publication

Preprint

Langues

eng

Subventions

Organisme : NIDA NIH HHS
ID : RF1 DA055669
Pays : United States

Auteurs

Yuhan Helena Liu (YH)

University of Washington, Seattle, WA, USA.
Allen Institute for Brain Science, Seattle WA, USA.
Mila - Quebec AI Institute, Montreal, QC, Canada.

Aristide Baratin (A)

Samsung - SAIT AI Lab, Montreal, QC, Canada.

Jonathan Cornford (J)

Mila - Quebec AI Institute, Montreal, QC, Canada.
McGill University, Montreal, QC, Canada.

Stefan Mihalas (S)

University of Washington, Seattle, WA, USA.
Allen Institute for Brain Science, Seattle WA, USA.

Eric Shea-Brown (E)

University of Washington, Seattle, WA, USA.
Allen Institute for Brain Science, Seattle WA, USA.

Guillaume Lajoie (G)

Mila - Quebec AI Institute, Montreal, QC, Canada.
Canada CIFAR AI Chair, CIFAR, Toronto, ON, Canada.
Université de Montréal, Montreal, QC, Canada.

Classifications MeSH