Confirmatory reinforcement learning changes with age during adolescence.

Male Humans Adolescent Reinforcement, Psychology Learning Reward Punishment Problem Solving

adolescence computational modelling confirmation bias exploration learning rates reinforcement learning

Journal

Developmental science

ISSN: 1467-7687

Titre abrégé: Dev Sci

Pays: England

ID NLM: 9814574

Informations de publication

Date de publication:
05 2023

Historique:

revised: 26 07 2022

received: 08 10 2021

accepted: 20 09 2022

medline: 6 4 2023

pubmed: 5 10 2022

entrez: 4 10 2022

Statut: ppublish

Résumé

Understanding how learning changes during human development has been one of the long-standing objectives of developmental science. Recently, advances in computational biology have demonstrated that humans display a bias when learning to navigate novel environments through rewards and punishments: they learn more from outcomes that confirm their expectations than from outcomes that disconfirm them. Here, we ask whether confirmatory learning is stable across development, or whether it might be attenuated in developmental stages in which exploration is beneficial, such as in adolescence. In a reinforcement learning (RL) task, 77 participants aged 11-32 years (four men, mean age = 16.26) attempted to maximize monetary rewards by repeatedly sampling different pairs of novel options, which varied in their reward/punishment probabilities. Mixed-effect models showed an age-related increase in accuracy as long as learning contingencies remained stable across trials, but less so when they reversed halfway through the trials. Age was also associated with a greater tendency to stay with an option that had just delivered a reward, more than to switch away from an option that had just delivered a punishment. At the computational level, a confirmation model provided increasingly better fit with age. This model showed that age differences are captured by decreases in noise or exploration, rather than in the magnitude of the confirmation bias. These findings provide new insights into how learning changes during development and could help better tailor learning environments to people of different ages. RESEARCH HIGHLIGHTS: Reinforcement learning shows age-related improvement during adolescence, but more in stable learning environments compared with volatile learning environments. People tend to stay with an option after a win more than they shift from an option after a loss, and this asymmetry increases with age during adolescence. Computationally, these changes are captured by a developing confirmatory learning style, in which people learn more from outcomes that confirm rather than disconfirm their choices. Age-related differences in confirmatory learning are explained by decreases in stochasticity, rather than changes in the magnitude of the confirmation bias.

Identifiants

DOI: 10.1111/desc.13330 PMID: 36194156 PMC: PMC7615280

pubmed: 36194156

doi: 10.1111/desc.13330

pmc: PMC7615280

mid: EMS190284

doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

Pagination

e13330

Subventions

Organisme : Wellcome Trust

Pays : United Kingdom

Organisme : Wellcome Trust

ID : 104908

Pays : United Kingdom

Organisme : Wellcome Trust

ID : 107496

Pays : United Kingdom

Informations de copyright

Références

J Exp Psychol Gen. 2018 Oct;147(10):1521-1530

pubmed: 30272465

Dev Cogn Neurosci. 2020 Feb;41:100715

pubmed: 31999568

Trends Cogn Sci. 2015 Oct;19(10):558-566

pubmed: 26419496

Trends Cogn Sci. 2022 Jul;26(7):607-621

pubmed: 35662490

NPJ Sci Learn. 2020 Oct 27;5:16

pubmed: 33133638

Nat Commun. 2015 Aug 25;6:8096

pubmed: 26302782

Elife. 2019 Nov 26;8:

pubmed: 31769410

Child Dev. 2022 Sep;93(5):1601-1615

pubmed: 35596654

Elife. 2022 Jan 24;11:

pubmed: 35072624

Brain Cogn. 2015 Feb;93:42-53

pubmed: 25528435

PLoS Comput Biol. 2017 Aug 11;13(8):e1005684

pubmed: 28800597

Nat Neurosci. 2019 Dec;22(12):2066-2077

pubmed: 31659343

Dev Psychobiol. 2010 Apr;52(3):216-24

pubmed: 20213754

Nat Hum Behav. 2019 Nov;3(11):1215-1224

pubmed: 31501543

Nat Hum Behav. 2020 Oct;4(10):1067-1079

pubmed: 32747804

Dev Cogn Neurosci. 2020 Apr;42:100753

pubmed: 32072931

Psychol Med. 2011 Dec;41(12):2651-9

pubmed: 21733217

Lancet Child Adolesc Health. 2018 Apr;2(4):e7

pubmed: 30169305

Philos Trans R Soc Lond B Biol Sci. 2020 Jul 20;375(1803):20190502

pubmed: 32475327

Science. 1983 May 13;220(4598):671-80

pubmed: 17813860

Dev Cogn Neurosci. 2019 Dec;40:100733

pubmed: 31770715

Dev Sci. 2021 Jul;24(4):e13075

pubmed: 33305510

Neuroimage. 2015 Jan 1;104:347-54

pubmed: 25234119

Trends Neurosci. 2019 Sep;42(9):604-616

pubmed: 31443912

Dev Cogn Neurosci. 2022 Jun;55:101106

pubmed: 35537273

Dev Sci. 2018 Mar;21(2):

pubmed: 28150391

J Exp Psychol Gen. 2022 Aug;151(8):1843-1853

pubmed: 34968128

PLoS Comput Biol. 2021 Jul 1;17(7):e1008524

pubmed: 34197447

Neural Netw. 2021 Nov;143:218-229

pubmed: 34157646

Nat Neurosci. 2019 Jun;22(6):992-999

pubmed: 31086316

Neural Comput. 2022 Jan 14;34(2):307-337

pubmed: 34758486

Dev Cogn Neurosci. 2020 Feb;41:100732

pubmed: 31826837

Sci Rep. 2017 Jan 18;7:40962

pubmed: 28098227

Proc Natl Acad Sci U S A. 2012 Oct 16;109(42):17135-40

pubmed: 23027965

Nat Commun. 2021 Jun 22;12(1):3823

pubmed: 34158482

Science. 2011 Mar 11;331(6022):1279-85

pubmed: 21393536

Cogn Affect Behav Neurosci. 2015 Jun;15(2):310-20

pubmed: 25582607

Front Psychol. 2017 Nov 23;8:2048

pubmed: 29250006

R Soc Open Sci. 2019 Oct 23;6(10):190232

pubmed: 31824684

Cereb Cortex. 2012 Jun;22(6):1247-55

pubmed: 21817091

Nat Hum Behav. 2023 Aug 17;:

pubmed: 37591981

Neurosci Biobehav Rev. 2020 May;112:279-299

pubmed: 32018038

PLoS Comput Biol. 2016 Jun 20;12(6):e1004953

pubmed: 27322574

Psychol Rev. 2020 Jul;127(4):622-639

pubmed: 32212763

Confirmatory reinforcement learning changes with age during adolescence.

Journal

Informations de publication

Résumé

Identifiants

Types de publication

Langues

Sous-ensembles de citation

Pagination

Subventions

Informations de copyright

Références

Auteurs

Gabriele Chierchia (G)

Magdaléna Soukupová (M)

Emma J Kilford (EJ)

Cait Griffin (C)

Jovita Leung (J)

Stefano Palminteri (S)

Sarah-Jayne Blakemore (SJ)

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Smoking Cessation and Incident Cardiovascular Disease.

Evaluation of Low-Value Services Across Major Medicare Advantage Insurers and Traditional Medicare.

Effectiveness of Virtual Yoga for Chronic Low Back Pain: A Randomized Clinical Trial.

Classifications MeSH