A machine learning-based procedure for leveraging clickstream data to investigate early predictability of failure on interactive tasks.
Clickstreams
Early prediction
Extreme gradient boosting
Interactive tasks
PIAAC
Time-stamped action sequences
Journal
Behavior research methods
ISSN: 1554-3528
Titre abrégé: Behav Res Methods
Pays: United States
ID NLM: 101244316
Informations de publication
Date de publication:
04 2023
04 2023
Historique:
accepted:
14
03
2022
medline:
26
4
2023
pubmed:
2
6
2022
entrez:
1
6
2022
Statut:
ppublish
Résumé
Early detection of risk of failure on interactive tasks comes with great potential for better understanding how examinees differ in their initial behavior as well as for adaptively tailoring interactive tasks to examinees' competence levels. Drawing on procedures originating in shopper intent prediction on e-commerce platforms, we introduce and showcase a machine learning-based procedure that leverages early-window clickstream data for systematically investigating early predictability of behavioral outcomes on interactive tasks. We derive features related to the occurrence, frequency, sequentiality, and timing of performed actions from early-window clickstreams and use extreme gradient boosting for classification. Multiple measures are suggested to evaluate the quality and utility of early predictions. The procedure is outlined by investigating early predictability of failure on two PIAAC 2012 Problem Solving in Technology Rich Environments (PSTRE) tasks. We investigated early windows of varying size in terms of time and in terms of actions. We achieved good prediction performance at stages where examinees had, on average, at least two thirds of their solution process ahead of them, and the vast majority of examinees who failed could potentially be detected to be at risk before completing the task. In-depth analyses revealed different features to be indicative of success and failure at different stages of the solution process, thereby highlighting the potential of the applied procedure for gaining a finer-grained understanding of the trajectories of behavioral patterns on interactive tasks.
Identifiants
pubmed: 35650385
doi: 10.3758/s13428-022-01844-1
pii: 10.3758/s13428-022-01844-1
pmc: PMC10125949
doi:
Types de publication
Journal Article
Research Support, U.S. Gov't, Non-P.H.S.
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
1392-1412Informations de copyright
© 2022. The Author(s).
Références
Psychometrika. 2021 Mar;86(1):190-214
pubmed: 33544300
Front Psychol. 2019 Nov 21;10:2461
pubmed: 31824363
Child Dev. 2011 Sep-Oct;82(5):1501-17
pubmed: 21679178
Psychol Methods. 2021 Dec;26(6):743-773
pubmed: 33793268
Front Psychol. 2019 May 07;10:777
pubmed: 31133910
Front Psychol. 2019 Mar 27;10:646
pubmed: 30971986
Front Psychol. 2019 Mar 18;10:486
pubmed: 30936843
Sci Rep. 2020 Oct 12;10(1):16983
pubmed: 33046722
Br J Math Stat Psychol. 2023 Feb;76(1):211-235
pubmed: 36317951
Front Psychol. 2018 Nov 23;9:2231
pubmed: 30532716
Psychometrika. 2020 Jun;85(2):378-397
pubmed: 32572672
Br J Math Stat Psychol. 2021 Feb;74(1):1-33
pubmed: 32442346