Confirmatory reinforcement learning changes with age during adolescence.

adolescence computational modelling confirmation bias exploration learning rates reinforcement learning

Journal

Developmental science
ISSN: 1467-7687
Titre abrégé: Dev Sci
Pays: England
ID NLM: 9814574

Informations de publication

Date de publication:
05 2023
Historique:
revised: 26 07 2022
received: 08 10 2021
accepted: 20 09 2022
medline: 6 4 2023
pubmed: 5 10 2022
entrez: 4 10 2022
Statut: ppublish

Résumé

Understanding how learning changes during human development has been one of the long-standing objectives of developmental science. Recently, advances in computational biology have demonstrated that humans display a bias when learning to navigate novel environments through rewards and punishments: they learn more from outcomes that confirm their expectations than from outcomes that disconfirm them. Here, we ask whether confirmatory learning is stable across development, or whether it might be attenuated in developmental stages in which exploration is beneficial, such as in adolescence. In a reinforcement learning (RL) task, 77 participants aged 11-32 years (four men, mean age = 16.26) attempted to maximize monetary rewards by repeatedly sampling different pairs of novel options, which varied in their reward/punishment probabilities. Mixed-effect models showed an age-related increase in accuracy as long as learning contingencies remained stable across trials, but less so when they reversed halfway through the trials. Age was also associated with a greater tendency to stay with an option that had just delivered a reward, more than to switch away from an option that had just delivered a punishment. At the computational level, a confirmation model provided increasingly better fit with age. This model showed that age differences are captured by decreases in noise or exploration, rather than in the magnitude of the confirmation bias. These findings provide new insights into how learning changes during development and could help better tailor learning environments to people of different ages. RESEARCH HIGHLIGHTS: Reinforcement learning shows age-related improvement during adolescence, but more in stable learning environments compared with volatile learning environments. People tend to stay with an option after a win more than they shift from an option after a loss, and this asymmetry increases with age during adolescence. Computationally, these changes are captured by a developing confirmatory learning style, in which people learn more from outcomes that confirm rather than disconfirm their choices. Age-related differences in confirmatory learning are explained by decreases in stochasticity, rather than changes in the magnitude of the confirmation bias.

Identifiants

pubmed: 36194156
doi: 10.1111/desc.13330
pmc: PMC7615280
mid: EMS190284
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

e13330

Subventions

Organisme : Wellcome Trust
Pays : United Kingdom
Organisme : Wellcome Trust
ID : 104908
Pays : United Kingdom
Organisme : Wellcome Trust
ID : 107496
Pays : United Kingdom

Informations de copyright

© 2021 The Authors. Developmental Science published by John Wiley & Sons Ltd.

Références

J Exp Psychol Gen. 2018 Oct;147(10):1521-1530
pubmed: 30272465
Dev Cogn Neurosci. 2020 Feb;41:100715
pubmed: 31999568
Trends Cogn Sci. 2015 Oct;19(10):558-566
pubmed: 26419496
Trends Cogn Sci. 2022 Jul;26(7):607-621
pubmed: 35662490
NPJ Sci Learn. 2020 Oct 27;5:16
pubmed: 33133638
Nat Commun. 2015 Aug 25;6:8096
pubmed: 26302782
Elife. 2019 Nov 26;8:
pubmed: 31769410
Child Dev. 2022 Sep;93(5):1601-1615
pubmed: 35596654
Elife. 2022 Jan 24;11:
pubmed: 35072624
Brain Cogn. 2015 Feb;93:42-53
pubmed: 25528435
PLoS Comput Biol. 2017 Aug 11;13(8):e1005684
pubmed: 28800597
Nat Neurosci. 2019 Dec;22(12):2066-2077
pubmed: 31659343
Dev Psychobiol. 2010 Apr;52(3):216-24
pubmed: 20213754
Nat Hum Behav. 2019 Nov;3(11):1215-1224
pubmed: 31501543
Nat Hum Behav. 2020 Oct;4(10):1067-1079
pubmed: 32747804
Dev Cogn Neurosci. 2020 Apr;42:100753
pubmed: 32072931
Psychol Med. 2011 Dec;41(12):2651-9
pubmed: 21733217
Lancet Child Adolesc Health. 2018 Apr;2(4):e7
pubmed: 30169305
Philos Trans R Soc Lond B Biol Sci. 2020 Jul 20;375(1803):20190502
pubmed: 32475327
Science. 1983 May 13;220(4598):671-80
pubmed: 17813860
Dev Cogn Neurosci. 2019 Dec;40:100733
pubmed: 31770715
Dev Sci. 2021 Jul;24(4):e13075
pubmed: 33305510
Neuroimage. 2015 Jan 1;104:347-54
pubmed: 25234119
Trends Neurosci. 2019 Sep;42(9):604-616
pubmed: 31443912
Dev Cogn Neurosci. 2022 Jun;55:101106
pubmed: 35537273
Dev Sci. 2018 Mar;21(2):
pubmed: 28150391
J Exp Psychol Gen. 2022 Aug;151(8):1843-1853
pubmed: 34968128
PLoS Comput Biol. 2021 Jul 1;17(7):e1008524
pubmed: 34197447
Neural Netw. 2021 Nov;143:218-229
pubmed: 34157646
Nat Neurosci. 2019 Jun;22(6):992-999
pubmed: 31086316
Neural Comput. 2022 Jan 14;34(2):307-337
pubmed: 34758486
Dev Cogn Neurosci. 2020 Feb;41:100732
pubmed: 31826837
Sci Rep. 2017 Jan 18;7:40962
pubmed: 28098227
Proc Natl Acad Sci U S A. 2012 Oct 16;109(42):17135-40
pubmed: 23027965
Nat Commun. 2021 Jun 22;12(1):3823
pubmed: 34158482
Science. 2011 Mar 11;331(6022):1279-85
pubmed: 21393536
Cogn Affect Behav Neurosci. 2015 Jun;15(2):310-20
pubmed: 25582607
Front Psychol. 2017 Nov 23;8:2048
pubmed: 29250006
R Soc Open Sci. 2019 Oct 23;6(10):190232
pubmed: 31824684
Cereb Cortex. 2012 Jun;22(6):1247-55
pubmed: 21817091
Nat Hum Behav. 2023 Aug 17;:
pubmed: 37591981
Neurosci Biobehav Rev. 2020 May;112:279-299
pubmed: 32018038
PLoS Comput Biol. 2016 Jun 20;12(6):e1004953
pubmed: 27322574
Psychol Rev. 2020 Jul;127(4):622-639
pubmed: 32212763

Auteurs

Gabriele Chierchia (G)

Department of Psychology, University of Cambridge, Cambridge, UK.
Institute of Cognitive Neuroscience, University College London, London, UK.

Magdaléna Soukupová (M)

Department of Cognitive Science, École Normale Supérieure, Paris, France.

Emma J Kilford (EJ)

Institute of Cognitive Neuroscience, University College London, London, UK.
Department of Clinical, Educational and Health Psychology, University College London, London, UK.

Cait Griffin (C)

Institute of Cognitive Neuroscience, University College London, London, UK.

Jovita Leung (J)

Institute of Cognitive Neuroscience, University College London, London, UK.

Stefano Palminteri (S)

Institute of Cognitive Neuroscience, University College London, London, UK.
Department of Cognitive Science, École Normale Supérieure, Paris, France.

Sarah-Jayne Blakemore (SJ)

Department of Psychology, University of Cambridge, Cambridge, UK.
Institute of Cognitive Neuroscience, University College London, London, UK.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH