A biologically inspired architecture with switching units can learn to generalize across backgrounds.

Bio-inspired Context Continual learning Domain adaptation Generalization Switching network

Journal

Neural networks : the official journal of the International Neural Network Society
ISSN: 1879-2782
Titre abrégé: Neural Netw
Pays: United States
ID NLM: 8805018

Informations de publication

Date de publication:
Nov 2023
Historique:
received: 06 07 2022
revised: 24 08 2023
accepted: 07 09 2023
medline: 13 11 2023
pubmed: 16 10 2023
entrez: 15 10 2023
Statut: ppublish

Résumé

Humans and other animals navigate different environments effortlessly, their brains rapidly and accurately generalizing across contexts. Despite recent progress in deep learning, this flexibility remains a challenge for many artificial systems. Here, we show how a bio-inspired network motif can explicitly address this issue. We do this using a dataset of MNIST digits of varying transparency, set on one of two backgrounds of different statistics that define two contexts: a pixel-wise noise or a more naturalistic background from the CIFAR-10 dataset. After learning digit classification when both contexts are shown sequentially, we find that both shallow and deep networks have sharply decreased performance when returning to the first background - an instance of the catastrophic forgetting phenomenon known from continual learning. To overcome this, we propose the bottleneck-switching network or switching network for short. This is a bio-inspired architecture analogous to a well-studied network motif in the visual cortex, with additional "switching" units that are activated in the presence of a new background, assuming a priori a contextual signal to turn these units on or off. Intriguingly, only a few of these switching units are sufficient to enable the network to learn the new context without catastrophic forgetting through inhibition of redundant background features. Further, the bottleneck-switching network can generalize to novel contexts similar to contexts it has learned. Importantly, we find that - again as in the underlying biological network motif, recurrently connecting the switching units to network layers is advantageous for context generalization.

Identifiants

pubmed: 37839332
pii: S0893-6080(23)00509-9
doi: 10.1016/j.neunet.2023.09.014
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

615-630

Subventions

Organisme : NIDA NIH HHS
ID : R90 DA033461
Pays : United States

Informations de copyright

Copyright © 2023 Elsevier Ltd. All rights reserved.

Déclaration de conflit d'intérêts

Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Auteurs

Doris Voina (D)

Department of Applied Mathematics, Computational Neuroscience Center, University of Washington, Seattle, WA 98195, USA. Electronic address: dvoina@uw.edu.

Eric Shea-Brown (E)

Department of Applied Mathematics, Computational Neuroscience Center, University of Washington, Seattle, WA 98195, USA; Allen Institute for Brain Science, 615 Westlake Ave N, Seattle, WA 98109, USA.

Stefan Mihalas (S)

Department of Applied Mathematics, Computational Neuroscience Center, University of Washington, Seattle, WA 98195, USA; Allen Institute for Brain Science, 615 Westlake Ave N, Seattle, WA 98109, USA.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH