Human Curriculum Effects Emerge with In-Context Learning in Neural Networks.

Journal

ArXiv

ISSN: 2331-8422

Titre abrégé: ArXiv

Pays: United States

ID NLM: 101759493

Informations de publication

Date de publication:
13 Feb 2024

Historique:

medline: 27 2 2024

pubmed: 27 2 2024

entrez: 27 2 2024

Statut: epublish

Résumé

Human learning is sensitive to rule-like structure and the curriculum of examples used for training. In tasks governed by succinct rules, learning is more robust when related examples are blocked across trials, but in the absence of such rules, interleaving is more effective. To date, no neural model has simultaneously captured these seemingly contradictory effects. Here we show that this same tradeoff spontaneously emerges with "in-context learning" (ICL) both in neural networks trained with metalearning and in large language models (LLMs). ICL is the ability to learn new tasks "in context" - without weight changes - via an inner-loop algorithm implemented in activation dynamics. Experiments with pretrained LLMs and metalearning transformers show that ICL exhibits the blocking advantage demonstrated in humans on a task involving rule-like structure, and conversely, that concurrent in-weight learning reproduces the interleaving advantage observed in humans on tasks lacking such structure.

Identifiants

PMID: 38410645 PMC: PMC10896373

pubmed: 38410645

pii: 2402.08674

pmc: PMC10896373

pii:

Types de publication

Preprint

Langues

eng

Human Curriculum Effects Emerge with In-Context Learning in Neural Networks.

Journal

Informations de publication

Résumé

Identifiants

Types de publication

Langues

Auteurs

Jacob Russin (J)

Ellie Pavlick (E)

Michael J Frank (MJ)

Classifications MeSH