Representations and generalization in artificial and brain neural networks.
deep neural networks
few-shot learning
neural manifolds
representational drift
visual cortex
Journal
Proceedings of the National Academy of Sciences of the United States of America
ISSN: 1091-6490
Titre abrégé: Proc Natl Acad Sci U S A
Pays: United States
ID NLM: 7505876
Informations de publication
Date de publication:
02 Jul 2024
02 Jul 2024
Historique:
medline:
24
6
2024
pubmed:
24
6
2024
entrez:
24
6
2024
Statut:
ppublish
Résumé
Humans and animals excel at generalizing from limited data, a capability yet to be fully replicated in artificial intelligence. This perspective investigates generalization in biological and artificial deep neural networks (DNNs), in both in-distribution and out-of-distribution contexts. We introduce two hypotheses: First, the geometric properties of the neural manifolds associated with discrete cognitive entities, such as objects, words, and concepts, are powerful order parameters. They link the neural substrate to the generalization capabilities and provide a unified methodology bridging gaps between neuroscience, machine learning, and cognitive science. We overview recent progress in studying the geometry of neural manifolds, particularly in visual object recognition, and discuss theories connecting manifold dimension and radius to generalization capacity. Second, we suggest that the theory of learning in wide DNNs, especially in the thermodynamic limit, provides mechanistic insights into the learning processes generating desired neural representational geometries and generalization. This includes the role of weight norm regularization, network architecture, and hyper-parameters. We will explore recent advances in this theory and ongoing challenges. We also discuss the dynamics of learning and its relevance to the issue of representational drift in the brain.
Identifiants
pubmed: 38913896
doi: 10.1073/pnas.2311805121
doi:
Types de publication
Journal Article
Review
Langues
eng
Sous-ensembles de citation
IM
Pagination
e2311805121Subventions
Organisme : DOD | USN | Office of Naval Research (ONR)
ID : No.N0014-23-1-2051
Déclaration de conflit d'intérêts
Competing interests statement:The authors declare no competing interest.