CPMCGLM: an R package for p-value adjustment when looking for an optimal transformation of a single explanatory variable in generalized linear models.

Generalized linear model Multiple testing Optimal cutoff point determination R package Resampling Union intersection test p-value adjustment

Journal

BMC medical research methodology
ISSN: 1471-2288
Titre abrégé: BMC Med Res Methodol
Pays: England
ID NLM: 100968545

Informations de publication

Date de publication:
16 04 2019
Historique:
received: 11 03 2018
accepted: 18 03 2019
entrez: 18 4 2019
pubmed: 18 4 2019
medline: 21 4 2020
Statut: epublish

Résumé

In medical research, explanatory continuous variables are frequently transformed or converted into categorical variables. If the coding is unknown, many tests can be used to identify the "optimal" transformation. This common process, involving the problems of multiple testing, requires a correction of the significance level. Liquet and Commenges proposed an asymptotic correction of significance level in the context of generalized linear models (GLM) (Liquet and Commenges, Stat Probab Lett 71:33-38, 2005). This procedure has been developed for dichotomous and Box-Cox transformations. Furthermore, Liquet and Riou suggested the use of resampling methods to estimate the significance level for transformations into categorical variables with more than two levels (Liquet and Riou, BMC Med Res Methodol 13:75, 2013). CPMCGLM provides to users both methods of p-value adjustment. Futhermore, they are available for a large set of transformations. This paper aims to provide insight the user an overview of the methodological context, and explain in detail the use of the CPMCGLM R package through its application to a real epidemiological dataset. We present here the CPMCGLMR package providing efficient methods for the correction of type-I error rate in the context of generalized linear models. This is the first and the only available package in R providing such methods applied to this context. This package is designed to help researchers, who work principally in the field of biostatistics and epidemiology, to analyze their data in the context of optimal cutoff point determination.

Sections du résumé

BACKGROUND
In medical research, explanatory continuous variables are frequently transformed or converted into categorical variables. If the coding is unknown, many tests can be used to identify the "optimal" transformation. This common process, involving the problems of multiple testing, requires a correction of the significance level. Liquet and Commenges proposed an asymptotic correction of significance level in the context of generalized linear models (GLM) (Liquet and Commenges, Stat Probab Lett 71:33-38, 2005). This procedure has been developed for dichotomous and Box-Cox transformations. Furthermore, Liquet and Riou suggested the use of resampling methods to estimate the significance level for transformations into categorical variables with more than two levels (Liquet and Riou, BMC Med Res Methodol 13:75, 2013).
RESULTS
CPMCGLM provides to users both methods of p-value adjustment. Futhermore, they are available for a large set of transformations. This paper aims to provide insight the user an overview of the methodological context, and explain in detail the use of the CPMCGLM R package through its application to a real epidemiological dataset.
CONCLUSION
We present here the CPMCGLMR package providing efficient methods for the correction of type-I error rate in the context of generalized linear models. This is the first and the only available package in R providing such methods applied to this context. This package is designed to help researchers, who work principally in the field of biostatistics and epidemiology, to analyze their data in the context of optimal cutoff point determination.

Identifiants

pubmed: 30991962
doi: 10.1186/s12874-019-0711-2
pii: 10.1186/s12874-019-0711-2
pmc: PMC6469151
doi:

Substances chimiques

Cholesterol, HDL 0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

79

Références

Biom J. 2007 Feb;49(1):52-67
pubmed: 17342949
Stat Med. 2016 Jul 20;35(16):2687-714
pubmed: 26914402
Biometrics. 2008 Dec;64(4):1287-9; discussion 1289-2
pubmed: 19051394
Int J Epidemiol. 1999 Oct;28(5):964-74
pubmed: 10597998
Stat Med. 2006 Jan 15;25(1):127-41
pubmed: 16217841
Stat Med. 2001 Oct 15;20(19):2815-26
pubmed: 11568942
Biom J. 2008 Oct;50(5):745-55
pubmed: 18932134
Neuroepidemiology. 2000 May-Jun;19(3):141-8
pubmed: 10705232
BMC Med Res Methodol. 2013 Jun 08;13:75
pubmed: 23758852

Auteurs

Benoit Liquet (B)

Université de Pau et Pays de l'Adour, UFR Sciences et Techniques de la Cote Basque-Anglet UMR CNRS 5142, Allée du Parc Montaury, Anglet, 64600, France.
ARC Centre of Excellence for Mathematical and Statistical Frontiers and School of Mathematical Sciences at Queensland University of Technology, Brisbane, Australia.

Jérémie Riou (J)

MINT UMR INSERM 1066, CNRS 6021, Université d'Angers, UFR Santé, 16 Boulevard Davier, Angers Cedex, 49085, France. jeremie.riou@univ-angers.fr.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH