The interactive reading task: Transformer-based automatic item generation.

automatic item generation language modeling psychometrics reading assessment transformer models

Journal

Frontiers in artificial intelligence
ISSN: 2624-8212
Titre abrégé: Front Artif Intell
Pays: Switzerland
ID NLM: 101770551

Informations de publication

Date de publication:
2022
Historique:
received: 23 03 2022
accepted: 29 06 2022
entrez: 8 8 2022
pubmed: 9 8 2022
medline: 9 8 2022
Statut: epublish

Résumé

Automatic item generation (AIG) has the potential to greatly expand the number of items for educational assessments, while simultaneously allowing for a more construct-driven approach to item development. However, the traditional item modeling approach in AIG is limited in scope to content areas that are relatively easy to model (such as math problems), and depends on highly skilled content experts to create each model. In this paper we describe the interactive reading task, a transformer-based deep language modeling approach for creating reading comprehension assessments. This approach allows a fully automated process for the creation of source passages together with a wide range of comprehension questions about the passages. The format of the questions allows automatic scoring of responses with high fidelity (e.g., selected response questions). We present the results of a large-scale pilot of the interactive reading task, with hundreds of passages and thousands of questions. These passages were administered as part of the practice test of the Duolingo English Test. Human review of the materials and psychometric analyses of test taker results demonstrate the feasibility of this approach for automatic creation of complex educational assessments.

Identifiants

pubmed: 35937141
doi: 10.3389/frai.2022.903077
pmc: PMC9354894
doi:

Types de publication

Journal Article

Langues

eng

Pagination

903077

Informations de copyright

Copyright © 2022 Attali, Runge, LaFlair, Yancey, Goodwin, Park and von Davier.

Références

Psychometrika. 2018 Dec;83(4):847-857
pubmed: 29532403
Educ Psychol Meas. 2016 Oct;76(5):787-802
pubmed: 29795888
Appl Psychol Meas. 2017 May;41(3):178-194
pubmed: 29881087
Psychometrika. 2022 Jun;87(2):749-772
pubmed: 34907497

Auteurs

Yigal Attali (Y)

Duolingo, Pittsburgh, PA, United States.

Andrew Runge (A)

Duolingo, Pittsburgh, PA, United States.

Geoffrey T LaFlair (GT)

Duolingo, Pittsburgh, PA, United States.

Kevin Yancey (K)

Duolingo, Pittsburgh, PA, United States.

Sarah Goodwin (S)

Duolingo, Pittsburgh, PA, United States.

Yena Park (Y)

Duolingo, Pittsburgh, PA, United States.

Alina A von Davier (AA)

Duolingo, Pittsburgh, PA, United States.

Classifications MeSH