Missing data in longitudinal studies: Comparison of multiple imputation methods in a real clinical setting.
fully conditional specification
missing data
multivariate normal imputation
quality of life
Journal
Journal of evaluation in clinical practice
ISSN: 1365-2753
Titre abrégé: J Eval Clin Pract
Pays: England
ID NLM: 9609066
Informations de publication
Date de publication:
Feb 2021
Feb 2021
Historique:
received:
07
09
2019
revised:
30
01
2020
accepted:
02
02
2020
pubmed:
27
2
2020
medline:
29
7
2021
entrez:
27
2
2020
Statut:
ppublish
Résumé
Missing data represent a challenge in longitudinal studies. The aim of the study is to compare the performance of the multivariate normal imputation and the fully conditional specification methods, using real data set with missing data partially completed 2 years later. The data used came from an ongoing randomized controlled trial with 5-year follow-up. At a certain time, we observed a number of patients with missing data and a number of patients whose data were unobserved because they were not yet eligible for a given follow-up. Both unobserved and missing data were imputed. The imputed unobserved data were compared with the corresponding real information obtained 2 years later. Both imputation methods showed similar performance on the accuracy measures and produced minimally biased estimates. Despite the large number of repeated measures with intermittent missing data and the non-normal multivariate distribution of data, both methods performed well and was not possible to determine which was better.
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
34-41Subventions
Organisme : Piedmont Regional Cancer Network
Informations de copyright
© 2020 John Wiley & Sons, Ltd.
Références
Fairclough DL, Peterson HF, Chang V. Why are missing quality of life data a problem in clinical trials of cancer therapy? Stat Med. 1998;17(5-7):667-677.
Curran D, Molenberghs G, Fayers PM, Machin D. Incomplete quality of life data in randomized trials: missing forms. Stat Med. 1998;17(5-7):697-709.
Rubin DB. Multiple Imputation for Nonresponse in Surveys. Vol. 81. John Wiley & Sons, 2004.
White IR, Royston P, Wood AM. Multiple imputation using chained equations: issues and guidance for practice. Stat Med. 2011;30(4):377-399.
Collins LM, Schafer JL, Kam CM. A comparison of inclusive and restrictive strategies in modern missing data procedures. Psychol Methods. 2001;6(4):330-351.
Pedersen AB, Mikkelsen EM, Cronin-Fenton D, et al. Missing data and multiple imputation in clinical epidemiological research. Clin Epidemiol. 2017;9:157-166.
Nevalainen J, Kenward MG, Virtanen SM. Missing values in longitudinal dietary data: a multiple imputation approach based on a fully conditional specification. Stat Med. 2009;28(29):3657-3669.
Welch CA, Petersen I, Bartlett JW, et al. Evaluation of two-fold fully conditional specification multiple imputation for longitudinal electronic health record data. Stat Med. 2014;33(21):3725-3737.
Schafer J. Analysis of Incomplete Multivariate Data. Chapman and Hall/CRC, 1997.
Demirtas H, Freels S, Yucel R. Plausibility of multivariate normality assumption when multiply imputing non-Gaussian continuous outcomes: a simulation assessment. J Stat Comput Simul. 2008;78:69-84.
Saraceno L, Heron J, Munafo M, Craddock N, van den Bree MB. The relationship between childhood depressive symptoms and problem alcohol use in early adolescence: findings from a large longitudinal population-based study. Addiction. 2012;107(3):567-577.
Van Buuren SBJ. Fully conditional specification in multivariate imputation. J Stat Comput Simul. 2006;76:1049-1064.
Lee KJ, Carlin JB. Multiple imputation for missing data: fully conditional specification versus multivariate normal imputation. Am J Epidemiol. 2010;171(5):624-632.
Kalaycioglu OC, King A, Omar M, RZ. A comparison of multiple imputation methods for handling missing data in repeated measurements observational studies. R Stat Soc Ser A-Stat Soc. 2016;179:683-706.
De Silva AP, Moreno-Betancur M, De Livera AM, Lee KJ, Simpson JA. A comparison of multiple imputation methods for handling missing values in longitudinal data in the presence of a time-varying covariate with a non-linear association with time: a simulation study. BMC Med Res Methodol. 2017;17(1):114.
Van Buuren S. Multiple imputation of discrete and continuous data by fully conditional specification. Stat Methods Med Res. 2007;16(3):219-242.
Jordhoy MS, Fayers P, Loge JH, Saltnes T, Ahlner-Elmqvist M, Kaasa S. Quality of life in advanced cancer patients: the impact of sociodemographic and medical characteristics. Br J Cancer. 2001;85(10):1478-1485.
Ware J Jr, Kosinski M, Keller SD. A 12-item short-form health survey: construction of scales and preliminary tests of reliability and validity. Med Care. 1996;34(3):220-233.
Kodraliu G, Mosconi P, Groth N, et al. Subjective health status assessment: evaluation of the Italian version of the SF-12 health survey. results from the MiOS project. J Epidemiol Biostat. 2001;6(3):305-316.
Grossi E, Groth N, Mosconi P, et al. Development and validation of the short version of the psychological general well-being index (PGWB-S). Health Qual Life Outcomes. 2006;4-88.
Mardia K. Measures of multivariate skewness and kurtosis with applications. Biometrika. 1970;57:519-530.
Henze N, Zirkler B. A class of invariant consistent tests for multivariate normality. Commun Stat-Theor M. 1990;19:3595-3617.
Yucel RD, H. Impact of non-normal random effects on inference by multiple imputation: a simulation assessment. Comput Stat Data Anal. 2010;54:790-801.
Biering K, Frydenberg M, Hjollund NH. Self-reported health following percutaneous coronary intervention: results from a cohort followed for 3 years with multiple measurements. Clin Epidemiol. 2014;6:441-449.
Biering K, Hjollund NH, Frydenberg M. Using multiple imputation to deal with missing data and attrition in longitudinal studies with repeated measures of patient-reported outcomes. Clin Epidemiol. 2015;7:91-106.
Engels JM, Diehr P. Imputation of missing longitudinal data: a comparison of methods. J Clin Epidemiol. 2003;56(10):968-976.
Royston P, White I. Multiple imputation by chained equations (MICE): implementation in Stata. J Stat Softw. 2011;45(4):20.
Kropko J, Goodrich B, Gelman A, Hill J. Multiple imputation for continuous and categorical data: comparing joint multivariate normal and conditional approaches. Polit Anal. 2014;22:497-519.