Human Curriculum Effects Emerge with In-Context Learning in Neural Networks.


Journal

ArXiv
ISSN: 2331-8422
Titre abrégé: ArXiv
Pays: United States
ID NLM: 101759493

Informations de publication

Date de publication:
13 Feb 2024
Historique:
medline: 27 2 2024
pubmed: 27 2 2024
entrez: 27 2 2024
Statut: epublish

Résumé

Human learning is sensitive to rule-like structure and the curriculum of examples used for training. In tasks governed by succinct rules, learning is more robust when related examples are blocked across trials, but in the absence of such rules, interleaving is more effective. To date, no neural model has simultaneously captured these seemingly contradictory effects. Here we show that this same tradeoff spontaneously emerges with "in-context learning" (ICL) both in neural networks trained with metalearning and in large language models (LLMs). ICL is the ability to learn new tasks "in context" - without weight changes - via an inner-loop algorithm implemented in activation dynamics. Experiments with pretrained LLMs and metalearning transformers show that ICL exhibits the blocking advantage demonstrated in humans on a task involving rule-like structure, and conversely, that concurrent in-weight learning reproduces the interleaving advantage observed in humans on tasks lacking such structure.

Identifiants

pubmed: 38410645
pii: 2402.08674
pmc: PMC10896373
pii:

Types de publication

Preprint

Langues

eng

Auteurs

Classifications MeSH