Balancing efficacy and computational burden: weighted mean, multiple imputation, and inverse probability weighting methods for item non-response in reliable scales.

All of Us Research Program item imputation missing data multi-item questionnaire simulation

Journal

Journal of the American Medical Informatics Association : JAMIA

ISSN: 1527-974X

Titre abrégé: J Am Med Inform Assoc

Pays: England

ID NLM: 9430800

Informations de publication

Date de publication:
13 Aug 2024

Historique:

received: 04 04 2024

revised: 05 07 2024

accepted: 07 08 2024

medline: 14 8 2024

pubmed: 14 8 2024

entrez: 14 8 2024

Statut: aheadofprint

Résumé

Scales often arise from multi-item questionnaires, yet commonly face item non-response. Traditional solutions use weighted mean (WMean) from available responses, but potentially overlook missing data intricacies. Advanced methods like multiple imputation (MI) address broader missing data, but demand increased computational resources. Researchers frequently use survey data in the All of Us Research Program (All of Us), and it is imperative to determine if the increased computational burden of employing MI to handle non-response is justifiable. Using the 5-item Physical Activity Neighborhood Environment Scale (PANES) in All of Us, this study assessed the tradeoff between efficacy and computational demands of WMean, MI, and inverse probability weighting (IPW) when dealing with item non-response. Synthetic missingness, allowing 1 or more item non-response, was introduced into PANES across 3 missing mechanisms and various missing percentages (10%-50%). Each scenario compared WMean of complete questions, MI, and IPW on bias, variability, coverage probability, and computation time. All methods showed minimal biases (all <5.5%) for good internal consistency, with WMean suffered most with poor consistency. IPW showed considerable variability with increasing missing percentage. MI required significantly more computational resources, taking >8000 and >100 times longer than WMean and IPW in full data analysis, respectively. The marginal performance advantages of MI for item non-response in highly reliable scales do not warrant its escalated cloud computational burden in All of Us, particularly when coupled with computationally demanding post-imputation analyses. Researchers using survey scales with low missingness could utilize WMean to reduce computing burden.

Identifiants

DOI: 10.1093/jamia/ocae217 PMID: 39138951

pubmed: 39138951

pii: 7733273

doi: 10.1093/jamia/ocae217

pii:

doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

Subventions

Organisme : NIH HHS

ID : 3OT2OD035404

Pays : United States

Balancing efficacy and computational burden: weighted mean, multiple imputation, and inverse probability weighting methods for item non-response in reliable scales.

Journal

Informations de publication

Résumé

Identifiants

Types de publication

Langues

Sous-ensembles de citation

Subventions

Informations de copyright

Auteurs

Andrew Guide (A)

Shawn Garbett (S)

Xiaoke Feng (X)

Brandy M Mapes (BM)

Justin Cook (J)

Lina Sulieman (L)

Robert M Cronin (RM)

Qingxia Chen (Q)

Classifications MeSH