A Gibbs sampler for the multidimensional four-parameter logistic item response model via a data augmentation scheme.

Bayes estimation Gibbs sampling data augmentation deviance information criterion multidimensional four-parameter logistic item response theory model

Journal

The British journal of mathematical and statistical psychology
ISSN: 2044-8317
Titre abrégé: Br J Math Stat Psychol
Pays: England
ID NLM: 0004047

Informations de publication

Date de publication:
11 2021
Historique:
revised: 30 12 2020
received: 23 06 2019
pubmed: 19 5 2021
medline: 5 11 2021
entrez: 18 5 2021
Statut: ppublish

Résumé

The four-parameter logistic (4PL) item response model, which includes an upper asymptote for the correct response probability, has drawn increasing interest due to its suitability for many practical scenarios. This paper proposes a new Gibbs sampling algorithm for estimation of the multidimensional 4PL model based on an efficient data augmentation scheme (DAGS). With the introduction of three continuous latent variables, the full conditional distributions are tractable, allowing easy implementation of a Gibbs sampler. Simulation studies are conducted to evaluate the proposed method and several popular alternatives. An empirical data set was analysed using the 4PL model to show its improved performance over the three-parameter and two-parameter logistic models. The proposed estimation scheme is easily accessible to practitioners through the open-source IRTlogit package.

Identifiants

pubmed: 34002857
doi: 10.1111/bmsp.12234
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

427-464

Informations de copyright

© 2021 The British Psychological Society.

Références

Baker, F. B., & Kim, S. (2004). Item response theory: Parameter estimation techniques (2nd ed.). New York, NY: Marcel Dekker.
Barton M. A., Lord F. M. (1981). AN UPPER ASYMPTOTE FOR THE THREE-PARAMETER LOGISTIC ITEM-RESPONSE MODEL*. ETS Research Report Series, 1981, (1), i-8. http://dx.doi.org/10.1002/j.2333-8504.1981.tb01255.x.
Béguin A. A., Glas C. A. W. (2001). MCMC estimation and some model-fit analysis of multidimensional IRT models. Psychometrika, 66, (4), 541-561. http://dx.doi.org/10.1007/bf02296195.
Betancourt, M. (2017). A conceptual introduction to Hamiltonian Monte Carlo. arXiv:1701.02434. Preprint
Blei D. M., Kucukelbir A., McAuliffe J. D. (2017). Variational Inference: A Review for Statisticians. Journal of the American Statistical Association, 112, (518), 859-877. http://dx.doi.org/10.1080/01621459.2017.1285773.
Bock R. D., Aitkin M. (1981). Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm. Psychometrika, 46, (4), 443-459. http://dx.doi.org/10.1007/bf02293801.
Chang H., Ying Z. (2008). To Weight or Not to Weight? Balancing Influence of Initial Items in Adaptive Testing. Psychometrika, 73, (3), 441-450. http://dx.doi.org/10.1007/s11336-007-9047-7.
Culpepper S. A. (2016). Revisiting the 4-Parameter Item Response Model: Bayesian Estimation and Application. Psychometrika, 81, (4), 1142-1163. http://dx.doi.org/10.1007/s11336-015-9477-6.
Culpepper Steven Andrew (2017). The Prevalence and Implications of Slipping on Low-Stakes, Large-Scale Assessments. Journal of Educational and Behavioral Statistics, 42, (6), 706-725. http://dx.doi.org/10.3102/1076998617705653.
Embretson, S. E., & Reise, S. P. (2000). Item response theory for psychologists. Mahwah, NJ: Erlbaum.
Ferrando P. J. (1994). Fitting Item Response Models to the Epi-A Impulsivity Sub Scale. Educational and Psychological Measurement, 54, (1), 118-127. http://dx.doi.org/10.1177/0013164494054001016.
Feuerstahler L. M., Waller N. G. (2014). Abstract: Estimation of the 4-Parameter Model with Marginal Maximum Likelihood. Multivariate Behavioral Research, 49, (3), 285-285. http://dx.doi.org/10.1080/00273171.2014.912889.
Fraley R. C., Waller N. G., Brennan K. A. (2000). An item response theory analysis of self-report measures of adult attachment. Journal of Personality and Social Psychology, 78, (2), 350-365. http://dx.doi.org/10.1037/0022-3514.78.2.350.
Fu Z., Tao J., Shi N. (2009). Bayesian estimation in the multidimensional three-parameter logistic model. Journal of Statistical Computation and Simulation, 79, (6), 819-835. http://dx.doi.org/10.1080/00949650801966876.
Gelman A., Rubin D. B. (1992). Inference from Iterative Simulation Using Multiple Sequences. Statistical Science, 7, (4), 457-472. http://dx.doi.org/10.1214/ss/1177011136.
Gupta, R. P. (Ed.) (1980). Multivariate statistical analysis. Amsterdam, The Netherlands: North-Holland.
Holman R., Glas C. A. W. (2005). Modelling non-ignorable missing-data mechanisms with item response theory models. British Journal of Mathematical and Statistical Psychology, 58, (1), 1-17. http://dx.doi.org/10.1111/j.2044-8317.2005.tb00312.x.
Jiang Z., Templin J. (2019). Gibbs Samplers for Logistic Item Response Models via the Pólya-Gamma Distribution: A Computationally Efficient Data-Augmentation Strategy. Psychometrika, 84, (2), 358-374. http://dx.doi.org/10.1007/s11336-018-9641-x.
Kern J. L., Culpepper S. A. (2020). A Restricted Four-Parameter IRT Model: The Dyad Four-Parameter Normal Ogive (Dyad-4PNO) Model. Psychometrika, 85, (3), 575-599. http://dx.doi.org/10.1007/s11336-020-09716-3.
Lanza, S. T., Foster, M., Taylor, T. K., & Burns, L. (2005). Assessing the impact of measurement specificity in a behavior problems checklist: An IRT analysis. Technical Report 05-75. The Pennsylvania State University.
Loken E., Rulison K. L. (2010). Estimation of a four-parameter item response theory model. British Journal of Mathematical and Statistical Psychology, 63, (3), 509-525. http://dx.doi.org/10.1348/000711009x474502.
Lord, F. M. (1980). Applications of item response theory to practical testing problems. Hillsdale, NJ: Erlbaum.
Miech, R. A., Johnston, L. D., Bachman, J. G., O'Malley, P. M., & Schulenberg, J. E. (2019). Monitoring the future: A continuing study of American youth (12th-grade survey), 2018. Inter-university Consortium for Political and Social Research, https://doi.org/10.3886/ICPSR37416.v1
Mislevy R. J. (1984). Estimating latent distributions. Psychometrika, 49, (3), 359-381. http://dx.doi.org/10.1007/bf02306026.
Mislevy, R. J., & Bock, R. D. (1986). PC-BILOG: Item analysis and test scoring with binary logistic models. Mooresville, IN: Scientific Software Inc.
Murphy, K., & Mahdaviani, M. (2005). MATBUGS: A MATLAB Interface to WinBUGS. [Computer software and manual]. Retrieved from http://people.cs.ubc.ca/murphyk/Software/MATBUGS/matbugs.html
Osgood D. W., McMorris B. J., Potenza M. T. (2002). Journal of Quantitative Criminology, 18, (3), 267-296. http://dx.doi.org/10.1023/a:1016008004010.
Patz R. J., Junker B. W. (1999). A Straightforward Approach to Markov Chain Monte Carlo Methods for Item Response Models. Journal of Educational and Behavioral Statistics, 24, (2), 146http://dx.doi.org/10.2307/1165199.
Patz R. J., Junker B. W. (1999). Applications and Extensions of MCMC in IRT: Multiple Item Types, Missing Data, and Rated Responses. Journal of Educational and Behavioral Statistics, 24, (4), 342http://dx.doi.org/10.2307/1165367.
Roju N. S., van der Linden W. J., Fleer P. F. (1995). IRT-Based Internal Measures of Differential Functioning of Items and Tests. Applied Psychological Measurement, 19, (4), 353-368. http://dx.doi.org/10.1177/014662169501900405.
Reise S. P., Waller N. G. (2003). How many IRT parameters does it take to model psychopathology items?. Psychological Methods, 8, (2), 164-184. http://dx.doi.org/10.1037/1082-989x.8.2.164.
Rulison K. L., Loken E. (2009). I've Fallen and I Can't Get Up: Can High-Ability Students Recover From Early Mistakes in CAT?. Applied Psychological Measurement, 33, (2), 83-101. http://dx.doi.org/10.1177/0146621608324023.
Sinharay S., Johnson M. S., Stern H. S. (2006). Posterior Predictive Assessment of Item Response Theory Models. Applied Psychological Measurement, 30, (4), 298-321. http://dx.doi.org/10.1177/0146621605285517.
Spiegelhalter, D., Thomas, A., Best, N., & Lunn, D. (2011). OpenBUGS user manual, version 3.2.1. March. Retrieved from http://www.openbugs.info/Manuals/Manual.html
Steinberg, L., & Thissen, D. (1995). Item response theory in personality research. In P. E. Shrout & S. T. Fiske (Eds.), Personality research, methods, and theory: A festschrift honoring Donald W. Fiske (pp. 161-181). Hillsdale, NJ: Erlbaum.
Swaminathan, H., Hambleton, R. K., & Rogers, H. J. (2006). Assessing the fit of item response theory models. In C. R. Rao & S. Sinharay (Eds.), Handbook of statistics, Volume 26: Psychometrics (pp. 683-715). Amsterdam: North-Holland.
Tavares H. R., de Andrade D. F., Pereira C. A. (2004). Detection of determinant genes and diagnostic via Item Response Theory. Genetics and Molecular Biology, 27, (4), 679-685. http://dx.doi.org/10.1590/s1415-47572004000400033.
The MathWorks, Inc. (2007). MATLAB - The language of technical computing, Version 7.5. Natick, MA: Author. Retrieved from http://www.mathworks.com/products/matlab/
Waller N. G., Feuerstahler L. (2017). Bayesian Modal Estimation of the Four-Parameter Item Response Model in Real, Realistic, and Idealized Data Sets. Multivariate Behavioral Research, 52, (3), 350-370. http://dx.doi.org/10.1080/00273171.2017.1292893.
Waller, N. G., & Reise, S. P. (2010). Measuring psychopathology with non-standard IRT models: Fitting the four parameter model to the MMPI. In S. Embretson & J. S. Roberts (Eds.), Measuring psychological constructs: Advances in model-based approaches (pp. 147-173). Washington, DC: American Psychological Association.
Zucchini W. (2000). An Introduction to Model Selection. Journal of Mathematical Psychology, 44, (1), 41-61. http://dx.doi.org/10.1006/jmps.1999.1276.

Auteurs

Zhihui Fu (Z)

Department of Statistics, School of Mathematics and Statistics, Minnan Normal University, Zhangzhou, Fujian, China.
Key Laboratory for Applied Statistics of MOE, School of Mathematics and Statistics, Northeast Normal University, Changchun, Jilin, China.

Susu Zhang (S)

Departments of Psychology and Statistics, University of Illinois at Urbana-Champaign, IL, USA.

Ya-Hui Su (YH)

Department of Psychology, National Chung Cheng University, Chiayi County, Taiwan.

Ningzhong Shi (N)

Key Laboratory for Applied Statistics of MOE, School of Mathematics and Statistics, Northeast Normal University, Changchun, Jilin, China.

Jian Tao (J)

Key Laboratory for Applied Statistics of MOE, School of Mathematics and Statistics, Northeast Normal University, Changchun, Jilin, China.

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages
1.00
Humans Magnetic Resonance Imaging Brain Infant, Newborn Infant, Premature
Humans Meta-Analysis as Topic Sample Size Models, Statistical Computer Simulation
Humans Algorithms Software Artificial Intelligence Computer Simulation

Classifications MeSH