Confirmatory reinforcement learning changes with age during adolescence.
adolescence
computational modelling
confirmation bias
exploration
learning rates
reinforcement learning
Journal
Developmental science
ISSN: 1467-7687
Titre abrégé: Dev Sci
Pays: England
ID NLM: 9814574
Informations de publication
Date de publication:
05 2023
05 2023
Historique:
revised:
26
07
2022
received:
08
10
2021
accepted:
20
09
2022
medline:
6
4
2023
pubmed:
5
10
2022
entrez:
4
10
2022
Statut:
ppublish
Résumé
Understanding how learning changes during human development has been one of the long-standing objectives of developmental science. Recently, advances in computational biology have demonstrated that humans display a bias when learning to navigate novel environments through rewards and punishments: they learn more from outcomes that confirm their expectations than from outcomes that disconfirm them. Here, we ask whether confirmatory learning is stable across development, or whether it might be attenuated in developmental stages in which exploration is beneficial, such as in adolescence. In a reinforcement learning (RL) task, 77 participants aged 11-32 years (four men, mean age = 16.26) attempted to maximize monetary rewards by repeatedly sampling different pairs of novel options, which varied in their reward/punishment probabilities. Mixed-effect models showed an age-related increase in accuracy as long as learning contingencies remained stable across trials, but less so when they reversed halfway through the trials. Age was also associated with a greater tendency to stay with an option that had just delivered a reward, more than to switch away from an option that had just delivered a punishment. At the computational level, a confirmation model provided increasingly better fit with age. This model showed that age differences are captured by decreases in noise or exploration, rather than in the magnitude of the confirmation bias. These findings provide new insights into how learning changes during development and could help better tailor learning environments to people of different ages. RESEARCH HIGHLIGHTS: Reinforcement learning shows age-related improvement during adolescence, but more in stable learning environments compared with volatile learning environments. People tend to stay with an option after a win more than they shift from an option after a loss, and this asymmetry increases with age during adolescence. Computationally, these changes are captured by a developing confirmatory learning style, in which people learn more from outcomes that confirm rather than disconfirm their choices. Age-related differences in confirmatory learning are explained by decreases in stochasticity, rather than changes in the magnitude of the confirmation bias.
Identifiants
pubmed: 36194156
doi: 10.1111/desc.13330
pmc: PMC7615280
mid: EMS190284
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
e13330Subventions
Organisme : Wellcome Trust
Pays : United Kingdom
Organisme : Wellcome Trust
ID : 104908
Pays : United Kingdom
Organisme : Wellcome Trust
ID : 107496
Pays : United Kingdom
Informations de copyright
© 2021 The Authors. Developmental Science published by John Wiley & Sons Ltd.
Références
J Exp Psychol Gen. 2018 Oct;147(10):1521-1530
pubmed: 30272465
Dev Cogn Neurosci. 2020 Feb;41:100715
pubmed: 31999568
Trends Cogn Sci. 2015 Oct;19(10):558-566
pubmed: 26419496
Trends Cogn Sci. 2022 Jul;26(7):607-621
pubmed: 35662490
NPJ Sci Learn. 2020 Oct 27;5:16
pubmed: 33133638
Nat Commun. 2015 Aug 25;6:8096
pubmed: 26302782
Elife. 2019 Nov 26;8:
pubmed: 31769410
Child Dev. 2022 Sep;93(5):1601-1615
pubmed: 35596654
Elife. 2022 Jan 24;11:
pubmed: 35072624
Brain Cogn. 2015 Feb;93:42-53
pubmed: 25528435
PLoS Comput Biol. 2017 Aug 11;13(8):e1005684
pubmed: 28800597
Nat Neurosci. 2019 Dec;22(12):2066-2077
pubmed: 31659343
Dev Psychobiol. 2010 Apr;52(3):216-24
pubmed: 20213754
Nat Hum Behav. 2019 Nov;3(11):1215-1224
pubmed: 31501543
Nat Hum Behav. 2020 Oct;4(10):1067-1079
pubmed: 32747804
Dev Cogn Neurosci. 2020 Apr;42:100753
pubmed: 32072931
Psychol Med. 2011 Dec;41(12):2651-9
pubmed: 21733217
Lancet Child Adolesc Health. 2018 Apr;2(4):e7
pubmed: 30169305
Philos Trans R Soc Lond B Biol Sci. 2020 Jul 20;375(1803):20190502
pubmed: 32475327
Science. 1983 May 13;220(4598):671-80
pubmed: 17813860
Dev Cogn Neurosci. 2019 Dec;40:100733
pubmed: 31770715
Dev Sci. 2021 Jul;24(4):e13075
pubmed: 33305510
Neuroimage. 2015 Jan 1;104:347-54
pubmed: 25234119
Trends Neurosci. 2019 Sep;42(9):604-616
pubmed: 31443912
Dev Cogn Neurosci. 2022 Jun;55:101106
pubmed: 35537273
Dev Sci. 2018 Mar;21(2):
pubmed: 28150391
J Exp Psychol Gen. 2022 Aug;151(8):1843-1853
pubmed: 34968128
PLoS Comput Biol. 2021 Jul 1;17(7):e1008524
pubmed: 34197447
Neural Netw. 2021 Nov;143:218-229
pubmed: 34157646
Nat Neurosci. 2019 Jun;22(6):992-999
pubmed: 31086316
Neural Comput. 2022 Jan 14;34(2):307-337
pubmed: 34758486
Dev Cogn Neurosci. 2020 Feb;41:100732
pubmed: 31826837
Sci Rep. 2017 Jan 18;7:40962
pubmed: 28098227
Proc Natl Acad Sci U S A. 2012 Oct 16;109(42):17135-40
pubmed: 23027965
Nat Commun. 2021 Jun 22;12(1):3823
pubmed: 34158482
Science. 2011 Mar 11;331(6022):1279-85
pubmed: 21393536
Cogn Affect Behav Neurosci. 2015 Jun;15(2):310-20
pubmed: 25582607
Front Psychol. 2017 Nov 23;8:2048
pubmed: 29250006
R Soc Open Sci. 2019 Oct 23;6(10):190232
pubmed: 31824684
Cereb Cortex. 2012 Jun;22(6):1247-55
pubmed: 21817091
Nat Hum Behav. 2023 Aug 17;:
pubmed: 37591981
Neurosci Biobehav Rev. 2020 May;112:279-299
pubmed: 32018038
PLoS Comput Biol. 2016 Jun 20;12(6):e1004953
pubmed: 27322574
Psychol Rev. 2020 Jul;127(4):622-639
pubmed: 32212763