A bio-inspired reinforcement learning model that accounts for fast adaptation after punishment.

Adaptation Addiction Gambling Lose-shift Punishment Reinforcement learning

Journal

Neurobiology of learning and memory
ISSN: 1095-9564
Titre abrégé: Neurobiol Learn Mem
Pays: United States
ID NLM: 9508166

Informations de publication

Date de publication:
28 Aug 2024
Historique:
received: 22 12 2023
revised: 14 08 2024
accepted: 26 08 2024
medline: 31 8 2024
pubmed: 31 8 2024
entrez: 29 8 2024
Statut: aheadofprint

Résumé

Humans and animals can quickly learn a new strategy when a previously-rewarding strategy is punished. It is difficult to model this with reinforcement learning methods, because they tend to perseverate on previously-learned strategies - a hallmark of impaired response to punishment. Past work has addressed this by augmenting conventional reinforcement learning equations with ad hoc parameters or parallel learning systems. This produces reinforcement learning models that account for reversal learning, but are more abstract, complex, and somewhat detached from neural substrates. Here we use a different approach: we generalize a recently-discovered neuron-level learning rule, on the assumption that it captures a basic principle of learning that may occur at the whole-brain-level. Surprisingly, this gives a new reinforcement learning rule that accounts for adaptation and lose-shift behavior, and uses only the same parameters as conventional reinforcement learning equations. In the new rule, the normal reward prediction errors that drive reinforcement learning are scaled by the likelihood the agent assigns to the action that triggered a reward or punishment. The new rule demonstrates quick adaptation in card sorting and variable Iowa gambling tasks, and also exhibits a human-like paradox-of-choice effect. It will be useful for experimental researchers modeling learning and behavior.

Identifiants

pubmed: 39209018
pii: S1074-7427(24)00085-6
doi: 10.1016/j.nlm.2024.107974
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

107974

Informations de copyright

Copyright © 2024 The Author(s). Published by Elsevier Inc. All rights reserved.

Déclaration de conflit d'intérêts

Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Auteurs

Eric Chalmers (E)

Department of Mathematics and Computing, Mount Royal University, 4825 Mt Royal Gate SW, Calgary, AB T3E 6K6, Canada. Electronic address: echalmers@mtroyal.ca.

Artur Luczak (A)

Canadian Center for Behavioral Neuroscience, University of Lethbridge4401 University Dr W, Lethbridge, AB T1K 3M4, Canada. Electronic address: luczak@uleth.ca.

Classifications MeSH