A Gibbs sampler for the multidimensional four-parameter logistic item response model via a data augmentation scheme.

MeSH terms: Algorithms; Bayes Theorem; Computer Simulation; Logistic Models; Models, Statistical

Keywords: Bayes estimation; Gibbs sampling; data augmentation; deviance information criterion; multidimensional four-parameter logistic item response theory model

Journal

The British journal of mathematical and statistical psychology

ISSN: 2044-8317

Abbreviated title: Br J Math Stat Psychol

Country: England

NLM ID: 0004047

Publication information

Publication date: November 2021

History:

revised: 30 December 2020

received: 23 June 2019

pubmed: 19 May 2021

medline: 5 November 2021

entrez: 18 May 2021

Status: ppublish

Abstract

The four-parameter logistic (4PL) item response model, which includes an upper asymptote for the correct response probability, has drawn increasing interest due to its suitability for many practical scenarios. This paper proposes a new Gibbs sampling algorithm for estimation of the multidimensional 4PL model based on an efficient data augmentation scheme (DAGS). With the introduction of three continuous latent variables, the full conditional distributions are tractable, allowing easy implementation of a Gibbs sampler. Simulation studies are conducted to evaluate the proposed method and several popular alternatives. An empirical data set was analysed using the 4PL model to show its improved performance over the three-parameter and two-parameter logistic models. The proposed estimation scheme is easily accessible to practitioners through the open-source IRTlogit package.
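For readers unfamiliar with the model named in the abstract, the following is a minimal sketch of a multidimensional 4PL item response function, in which the success probability rises from a lower asymptote to an upper asymptote. It assumes a slope-intercept parameterization; the function name `m4pl_prob` and the symbols `a`, `b`, `c`, `d` are illustrative choices, not taken from the paper or from the IRTlogit package, and the sketch does not implement the Gibbs sampler itself.

```python
import numpy as np


def m4pl_prob(theta, a, b, c, d):
    """Correct-response probabilities under an assumed multidimensional 4PL form.

    theta : (N, K) array of latent traits for N respondents
    a     : (J, K) array of item slopes (discriminations)
    b     : (J,) array of item intercepts
    c     : (J,) array of lower asymptotes (guessing)
    d     : (J,) array of upper asymptotes (1 - slipping)
    Returns an (N, J) array of probabilities, each between c and d.
    """
    theta = np.asarray(theta, dtype=float)
    a = np.asarray(a, dtype=float)
    eta = theta @ a.T + np.asarray(b, dtype=float)  # linear predictor, shape (N, J)
    logistic = 1.0 / (1.0 + np.exp(-eta))
    return np.asarray(c) + (np.asarray(d) - np.asarray(c)) * logistic


# Hypothetical example: 3 respondents, 2 latent dimensions, 2 items.
rng = np.random.default_rng(0)
theta = rng.normal(size=(3, 2))
a = np.array([[1.0, 0.5], [0.8, 1.2]])
b = np.array([0.0, -0.5])
c = np.array([0.2, 0.1])
d = np.array([0.9, 0.95])
probs = m4pl_prob(theta, a, b, c, d)
responses = rng.binomial(1, probs)  # simulated binary item responses
```

Setting `d` to 1 for every item reduces this form to the three-parameter logistic model, and additionally setting `c` to 0 gives the two-parameter model, which are the two comparison models mentioned in the abstract.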

Identifiers

DOI: 10.1111/bmsp.12234

PMID: 34002857

Publication types

Journal Article; Research Support, Non-U.S. Gov't

Languages

eng

Pagination

427-464

Authors

Zhihui Fu

Susu Zhang

Ya-Hui Su

Ningzhong Shi

Jian Tao
