Balancing efficacy and computational burden: weighted mean, multiple imputation, and inverse probability weighting methods for item non-response in reliable scales.

All of Us Research Program item imputation missing data multi-item questionnaire simulation

Journal

Journal of the American Medical Informatics Association : JAMIA
ISSN: 1527-974X
Titre abrégé: J Am Med Inform Assoc
Pays: England
ID NLM: 9430800

Informations de publication

Date de publication:
13 Aug 2024
Historique:
received: 04 04 2024
revised: 05 07 2024
accepted: 07 08 2024
medline: 14 8 2024
pubmed: 14 8 2024
entrez: 14 8 2024
Statut: aheadofprint

Résumé

Scales often arise from multi-item questionnaires, yet commonly face item non-response. Traditional solutions use weighted mean (WMean) from available responses, but potentially overlook missing data intricacies. Advanced methods like multiple imputation (MI) address broader missing data, but demand increased computational resources. Researchers frequently use survey data in the All of Us Research Program (All of Us), and it is imperative to determine if the increased computational burden of employing MI to handle non-response is justifiable. Using the 5-item Physical Activity Neighborhood Environment Scale (PANES) in All of Us, this study assessed the tradeoff between efficacy and computational demands of WMean, MI, and inverse probability weighting (IPW) when dealing with item non-response. Synthetic missingness, allowing 1 or more item non-response, was introduced into PANES across 3 missing mechanisms and various missing percentages (10%-50%). Each scenario compared WMean of complete questions, MI, and IPW on bias, variability, coverage probability, and computation time. All methods showed minimal biases (all <5.5%) for good internal consistency, with WMean suffered most with poor consistency. IPW showed considerable variability with increasing missing percentage. MI required significantly more computational resources, taking >8000 and >100 times longer than WMean and IPW in full data analysis, respectively. The marginal performance advantages of MI for item non-response in highly reliable scales do not warrant its escalated cloud computational burden in All of Us, particularly when coupled with computationally demanding post-imputation analyses. Researchers using survey scales with low missingness could utilize WMean to reduce computing burden.

Identifiants

pubmed: 39138951
pii: 7733273
doi: 10.1093/jamia/ocae217
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Subventions

Organisme : NIH HHS
ID : 3OT2OD035404
Pays : United States

Informations de copyright

© The Author(s) 2024. Published by Oxford University Press on behalf of the American Medical Informatics Association.

Auteurs

Andrew Guide (A)

Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN 37203-2158, United States.

Shawn Garbett (S)

Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN 37203-2158, United States.

Xiaoke Feng (X)

Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN 37203-2158, United States.

Brandy M Mapes (BM)

Vanderbilt Institute for Clinical and Translational Research, Vanderbilt University Medical Center, Nashville, TN 37203-2158, United States.

Justin Cook (J)

Vanderbilt Institute for Clinical and Translational Research, Vanderbilt University Medical Center, Nashville, TN 37203-2158, United States.

Lina Sulieman (L)

Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37203-2158, United States.

Robert M Cronin (RM)

Department of Internal Medicine, The Ohio State University, Columbus, OH 43210-1218, United States.

Qingxia Chen (Q)

Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN 37203-2158, United States.
Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37203-2158, United States.

Classifications MeSH