Variable selection methods were poorly reported but rarely misused in major medical journals: Literature review.


Journal

Journal of clinical epidemiology
ISSN: 1878-5921
Titre abrégé: J Clin Epidemiol
Pays: United States
ID NLM: 8801383

Informations de publication

Date de publication:
11 2021
Historique:
received: 03 03 2021
revised: 19 06 2021
accepted: 12 07 2021
pubmed: 20 7 2021
medline: 21 12 2021
entrez: 19 7 2021
Statut: ppublish

Résumé

Objective This work presents a review of the literature on reporting, practice and misuse of knowledge-based and data-driven variable selection methods, in five highly cited medical journals, considering recoding and interaction unlike previous reviews. Study Design and Setting Original observational studies with a predictive or explicative research question with multivariable analyses published in N. Engl. J. Med., Lancet, JAMA, Br. Med. J. and Ann. Intern. Med. between 2017 and 2019 were searched. Article screening was performed by a single reader, data extraction was performed by two readers and a third reader participated in case of disagreement. The use of data-driven variable selection methods in causal explicative questions was considered as misuse. Results 488 articles were included. The variable selection method was unclear in 234 (48%) articles, data-driven in 78 (16%) articles and knowledge-based in 176 (36%) articles. The most common data-driven methods were: Univariate selection (n = 22, 4.5%) and model comparisons or testing for interaction (n = 17, 3.5%). Data-driven methods were misused in 51 (10.5%) of articles. Conclusion Overall reporting of variable selection methods is insufficient. Data-driven methods seem to be used only in a minority of articles of the big five medical journals.

Identifiants

pubmed: 34280475
pii: S0895-4356(21)00214-6
doi: 10.1016/j.jclinepi.2021.07.006
pii:
doi:

Types de publication

Journal Article Review

Langues

eng

Sous-ensembles de citation

IM

Pagination

12-19

Informations de copyright

Copyright © 2021. Published by Elsevier Inc.

Auteurs

T Pressat-Laffouilhère (T)

Department of Biostatistics, CHU Rouen, F-76000 Rouen, France; Department of Biomedical Informatics, F-76000 Rouen, France; Normandie Univ, UNIROUEN, LITIS EA 4108, F-76000 Rouen, France. Electronic address: t.pressat@chu-rouen.fr.

R Jouffroy (R)

Department of Intensive Care Unit, Ambroise Paré Hospital, Assistance Publique Hôpitaux Paris and Paris Saclay University, Boulogne Billancourt, France; Inserm U 1018, University of Rouen and University Paris-Saclay, France; Institut de Recherche bioMédicale et d'Epidémiologie du Sport - EA7329, INSEP - Paris University, France.

A Leguillou (A)

Reims University Hospital, Reims, France.

G Kerdelhue (G)

Department of Biomedical Informatics, F-76000 Rouen, France.

J Benichou (J)

Department of Biostatistics, CHU Rouen, F-76000 Rouen, France; Inserm U 1018, University of Rouen and University Paris-Saclay, France.

A Gillibert (A)

Department of Biostatistics, CHU Rouen, F-76000 Rouen, France.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH