CPMCGLM: an R package for p-value adjustment when looking for an optimal transformation of a single explanatory variable in generalized linear models.
Generalized linear model
Multiple testing
Optimal cutoff point determination
R package
Resampling
Union intersection test
p-value adjustment
Journal
BMC medical research methodology
ISSN: 1471-2288
Titre abrégé: BMC Med Res Methodol
Pays: England
ID NLM: 100968545
Informations de publication
Date de publication:
16 04 2019
16 04 2019
Historique:
received:
11
03
2018
accepted:
18
03
2019
entrez:
18
4
2019
pubmed:
18
4
2019
medline:
21
4
2020
Statut:
epublish
Résumé
In medical research, explanatory continuous variables are frequently transformed or converted into categorical variables. If the coding is unknown, many tests can be used to identify the "optimal" transformation. This common process, involving the problems of multiple testing, requires a correction of the significance level. Liquet and Commenges proposed an asymptotic correction of significance level in the context of generalized linear models (GLM) (Liquet and Commenges, Stat Probab Lett 71:33-38, 2005). This procedure has been developed for dichotomous and Box-Cox transformations. Furthermore, Liquet and Riou suggested the use of resampling methods to estimate the significance level for transformations into categorical variables with more than two levels (Liquet and Riou, BMC Med Res Methodol 13:75, 2013). CPMCGLM provides to users both methods of p-value adjustment. Futhermore, they are available for a large set of transformations. This paper aims to provide insight the user an overview of the methodological context, and explain in detail the use of the CPMCGLM R package through its application to a real epidemiological dataset. We present here the CPMCGLMR package providing efficient methods for the correction of type-I error rate in the context of generalized linear models. This is the first and the only available package in R providing such methods applied to this context. This package is designed to help researchers, who work principally in the field of biostatistics and epidemiology, to analyze their data in the context of optimal cutoff point determination.
Sections du résumé
BACKGROUND
In medical research, explanatory continuous variables are frequently transformed or converted into categorical variables. If the coding is unknown, many tests can be used to identify the "optimal" transformation. This common process, involving the problems of multiple testing, requires a correction of the significance level. Liquet and Commenges proposed an asymptotic correction of significance level in the context of generalized linear models (GLM) (Liquet and Commenges, Stat Probab Lett 71:33-38, 2005). This procedure has been developed for dichotomous and Box-Cox transformations. Furthermore, Liquet and Riou suggested the use of resampling methods to estimate the significance level for transformations into categorical variables with more than two levels (Liquet and Riou, BMC Med Res Methodol 13:75, 2013).
RESULTS
CPMCGLM provides to users both methods of p-value adjustment. Futhermore, they are available for a large set of transformations. This paper aims to provide insight the user an overview of the methodological context, and explain in detail the use of the CPMCGLM R package through its application to a real epidemiological dataset.
CONCLUSION
We present here the CPMCGLMR package providing efficient methods for the correction of type-I error rate in the context of generalized linear models. This is the first and the only available package in R providing such methods applied to this context. This package is designed to help researchers, who work principally in the field of biostatistics and epidemiology, to analyze their data in the context of optimal cutoff point determination.
Identifiants
pubmed: 30991962
doi: 10.1186/s12874-019-0711-2
pii: 10.1186/s12874-019-0711-2
pmc: PMC6469151
doi:
Substances chimiques
Cholesterol, HDL
0
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
79Références
Biom J. 2007 Feb;49(1):52-67
pubmed: 17342949
Stat Med. 2016 Jul 20;35(16):2687-714
pubmed: 26914402
Biometrics. 2008 Dec;64(4):1287-9; discussion 1289-2
pubmed: 19051394
Int J Epidemiol. 1999 Oct;28(5):964-74
pubmed: 10597998
Stat Med. 2006 Jan 15;25(1):127-41
pubmed: 16217841
Stat Med. 2001 Oct 15;20(19):2815-26
pubmed: 11568942
Biom J. 2008 Oct;50(5):745-55
pubmed: 18932134
Neuroepidemiology. 2000 May-Jun;19(3):141-8
pubmed: 10705232
BMC Med Res Methodol. 2013 Jun 08;13:75
pubmed: 23758852