Sequential knockoffs for continuous and categorical predictors: With application to a large psoriatic arthritis clinical trial pool.


Journal

Statistics in medicine
ISSN: 1097-0258
Titre abrégé: Stat Med
Pays: England
ID NLM: 8215016

Informations de publication

Date de publication:
30 06 2021
Historique:
revised: 22 02 2021
received: 20 10 2020
accepted: 01 03 2021
pubmed: 27 4 2021
medline: 30 6 2021
entrez: 26 4 2021
Statut: ppublish

Résumé

Knockoffs provide a general framework for controlling the false discovery rate when performing variable selection. Much of the Knockoffs literature focuses on theoretical challenges and we recognize a need for bringing some of the current ideas into practice. In this paper we propose a sequential algorithm for generating knockoffs when underlying data consists of both continuous and categorical (factor) variables. Further, we present a heuristic multiple knockoffs approach that offers a practical assessment of how robust the knockoff selection process is for a given dataset. We conduct extensive simulations to validate performance of the proposed methodology. Finally, we demonstrate the utility of the methods on a large clinical data pool of more than 2000 patients with psoriatic arthritis evaluated in four clinical trials with an IL-17A inhibitor, secukinumab (Cosentyx), where we determine prognostic factors of a well established clinical outcome. The analyses presented in this paper could provide a wide range of applications to commonly encountered datasets in medical practice and other fields where variable selection is of particular interest.

Identifiants

pubmed: 33899260
doi: 10.1002/sim.8955
doi:

Substances chimiques

Antibodies, Monoclonal, Humanized 0
secukinumab DLG4EML025

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

3313-3328

Informations de copyright

© 2021 John Wiley & Sons Ltd.

Références

Wasserstein RL, Lazar NA. The ASA statement on p-values: context, process, and purpose. Am Stat. 2016;70(2):129-133.
Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J Royal Stat Soc Ser B. 1995;57(1):289-300.
Baldi P, Long AD. A Bayesian framework for the analysis of microarray expression data: regularized t-test and statistical inferences of gene changes. Bioinformatics. 2001;17(6):509-519.
Smyth GK. Linear models and empirical bayes methods for assessing differential expression in microarray experiments. Stat Appl Genet Mol Biol. 2004;3(3):1-25.
Efron B. Microarrays empirical bayes and the two-groups model. Stat Sci. 2008;23(1):1-22.
Benjamini Y, Yekutieli D. The control of the false discovery rate in multiple testing under dependency. Ann Stat. 2001;29(4):1165-1188.
Harrell FE. Regression Modeling Strategies: With Applications to Linear Models, Logistic Regression, and Survival Analysis. New York, NY: Springer-Verlag; 2001.
Tibshirani RJ, Taylor J, Lockhart R, Tibshirani R. Exact post-selection inference for sequential regression procedures. J Am Stat Assoc. 2016;111(514):600-620.
Hastie T, Tibshirani R, Friedman JH. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. New York, NY: Springer; 2001.
Tibshirani R. Regression shrinkage and selection via the Lasso. J Royal Stat Soc Ser B (Methodol). 1996;58(1):267-288.
Zou H, Hastie T. Regularization and variable selection via the elastic net. J Royal Stat Soc Ser B. 2005;67(2):301-320.
Su W, Bogdan M, Candès EJ. False discoveries occur early on the Lasso path. Ann Stat. 2017;45(5):2133-2150.
Lee JD, Sun DL, Sun Y, Taylor JE. Exact post-selection inference, with application to the lasso. Ann Stat. 2016;44(3):907-927.
Fan J, Song R. Sure independence screening in generalized linear models with np-dimensionality. Ann Stat. 2010;38(6):3567-3604.
Yi H, Breheny P, Imam N, Liu Y, Hoeschele I. Penalized multimarker vs. single-marker regression methods for genome-wide association studies of quantitative traits. Genetics. 2015;199(1):205-222.
Tadist K, Najah S, Nikolov NS, Mrabti F, Zahi A. Feature selection methods and genomic big data: a systematic review. J Big Data. 2019;6(79):1-24.
Ding C, Peng H. Minimum redundancy feature selection from microarray gene expression data. J Bioinform Comput Biol. 2005;3(2):185-205.
Barber RF, Candès EJ. Controlling the false discovery rate via knockoffs. Ann Stat. 2015;43(5):2055-2085.
Candès E, Fan Y, Janson L, Lv J. Panning for gold: 'model-X' knockoffs for high dimensional controlled variable selection. J Royal Stat Soc Ser B. 2018;80(3):551-577.
Romano Y, Sesia M, Candès E. Deep knockoffs. J Am Stat Assoc. 2020;115:1861-1872.
Sesia M, Sabatti C, Candès EJ. Gene hunting with hidden Markov model knockoffs. Biometrika. 2019;106(1):1-18.
Patterson E, Sesia M. knockoff: the knockoff filter for controlled variable selection. R package version 03.2; 2018.
Holden L, Hellton K. Multiple model-free knockoffs; 2018. arXiv e-prints:arXiv:1812.04928.
Gimenez JR, Zou J. Improving the stability of the knockoff procedure: multiple simultaneous knockoffs and entropy maximization. Proceedings of Machine Learning Research. Vol 89; 2019:2184-2192.
Nguyen TB, Chevalier JA, Thirion B, Arlot S. Aggregation of multiple knockoffs; 2020. arXiv e-prints:arXiv:2002.09269.
Barber RF, Candès EJ. A knockoff filter for high-dimensional selective inference. Ann Stat. 2019;47(5):2504-2537.
Barber RF, Candès EJ, Samworth RJ. Robust inference with knockoffs. Ann Stat. 2020;48(3):1409-1431.
G'Sell M, Wager S, Chouldechova A, Tibshirani R. Sequential selection procedures and false discovery rate control. J Royal Stat Soc Ser B. 2016;78:423-444.
Friedman J, Hastie R, Tibshirani R. Regularization paths for generalized linear models via coordinate descent. J Stat Softw. 2010;33(1):1-22.
Tibshirani RJ, Tibshirani R, Taylor J, Loftus J, Reid S. Selective inference: tools for post-selection inference. R package version 1.2.4; 2017.
Gottlieb AB, Mease PJ, Kirkham B, et al. Secukinumab efficacy in psoriatic arthritis: machine learning and meta-analysis of four phase 3 trials. J Clin Rheumatol. 2020. https://doi.org/10.1097/RHU.0000000000001302.
Felson DT, Anderson JJ, Boers M. The American college of rheumatology preliminary core set of disease activity measures for rheumatoid arthritis clinical trials. The committee on outcome measures in rheumatoid arthritis clinical trials. Arthritis Rheum. 1993;36(6):729-740.
McInnes IB, Mease PJ, Kirkham B, Kavanaugh A. Secukinumab, A human anti-interleukin-17A monoclonal antibody, in patients with psoriatic arthritis (FUTURE 2): a randomised, double-blind, placebo-controlled, phase 3 trial. Lancet. 2015;386(9999):1137-1146.
Nash P, Mease PJ, McInnes IB, Rahman P. Efficacy and safety of secukinumab administration by autoinjector in patients with psoriatic arthritis: results from a randomized, placebo-controlled trial (FUTURE 3). Arthritis Res Ther. 2018;20(1):47.
Kivitz AJ, Nash P, Tahir H, Everding A. Efficacy and safety of subcutaneous secukinumab 150 mg with or without loading regimen in psoriatic arthritis: results from the FUTURE 4 study. Rheumatol Ther. 2019;6(3):393-407.
Mease P, Heijde D, Landewé R. Secukinumab improves active psoriatic arthritis symptoms and inhibits radiographic progression: primary results from the randomised, double-blind, phase III FUTURE 5 study. Ann Rheum Dis. 2018;77(6):890-897.
Bühlmann P, Rütimann P, Geer S, Zhang CH. Correlated variables in regression: clustering and sparse estimation. J Stat Plann Infer. 2013;143(11):1835-1858.
Novartis Cosentyx (secukinumab)[package insert]. European Medicines Agency Website; 2019. https://www.ema.europa.eu/en/documents/product-information/cosentyx-epa%r-product-information_en.pdf.
Novartis Cosentyx (secukinumab)[package insert]. U.S. Food and Drug Administration Website; 2020. https://www.accessdata.fda.gov/drugsatfda_docs/label/2020/125504s031lbl%.pdf.
Kavanaugh A, McInnes IB, Mease PJ, et al. Efficacy of subcutaneous secukinumab in patients with active psoriatic arthritis stratified by prior tumor necrosis factor inhibitor use: results from the randomized placebo-controlled FUTURE 2 study. J Rheumatol. 2016;43(9):1713-1717.
Glintborg B, Ostergaard M, Krogh NS, et al. Clinical response, drug survival, and predictors thereof among 548 patients with psoriatic arthritis who switched tumor necrosis factor α inhibitor therapy: results from the Danish Nationwide DANBIO registry. Arthritis Rheum. 2013;65(5):1213-1223.
Anders HJ, Vielhauer V. Renal co-morbidity in patients with rheumatic diseases. Arthritis Res Ther. 2011;13:3.
Heiskell CL, Reed WB, Weimer HE. Serum protein profiles in psoriasis and arthritis. Arch Dermatol. 1962;85(6):708-715.
Sokolova MV, Simon D, Nas K, et al. A set of serum markers detecting systemic inflammation in psoriatic skin, entheseal, and joint disease in the absence of C-reactive protein and its link to clinical disease manifestations. Arthritis Res Ther. 2020;22(26):1-8.
Reid S, Tibshirani R, Friedman JA. Study of error variance estimation in lasso regression. Stat Sin. 2016;26(1):35-67.

Auteurs

Matthias Kormaksson (M)

Novartis Pharmaceuticals Corporation, East Hanover, New Jersey, USA.

Luke J Kelly (LJ)

Oxford University, Oxford, UK.

Xuan Zhu (X)

Novartis Pharmaceuticals Corporation, East Hanover, New Jersey, USA.

Sibylle Haemmerle (S)

Novartis Pharmaceuticals Corporation, Basel, Switzerland.

Luminita Pricop (L)

Novartis Pharmaceuticals Corporation, East Hanover, New Jersey, USA.

David Ohlssen (D)

Novartis Pharmaceuticals Corporation, East Hanover, New Jersey, USA.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH