Feature Selection in Predictive Modeling: A Systematic Study on Drug Response Heterogeneity for Type II Diabetic Patients.


Journal

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science
ISSN: 2153-4063
Titre abrégé: AMIA Jt Summits Transl Sci Proc
Pays: United States
ID NLM: 101539486

Informations de publication

Date de publication:
2019
Historique:
entrez: 2 7 2019
pubmed: 2 7 2019
medline: 2 7 2019
Statut: epublish

Résumé

With the rapid development of computer hardware and software technologies, more and more electronic health data from insurance claims, clinical trials and hospitals are becoming readily available. These data provide a rich resource for developing various healthcare analytics algorithms, among which predictive modeling is of key importance in many real health problems. One important issue for data-driven predictive modeling is high dimensionality, and feature selection is one effective strategy to reduce the number of independent variables and control the confounding factors. However, most of the existing studies just pick one feature selection approach without comprehensive investigations. In this paper, we investigate the issue of drug response heterogeneity for type II diabetes mellitus (T2DM) patients using a large scale clinical trial data. Our goal is to find out the important factors that may lead to the response heterogeneity for three popular T2DM drugs, Metformin, Rosiglitazone and Glimepiride. We implemented 8 different feature selection approaches and compared their performances with various measures including prediction error and the consistency of the identified important factors. Finally, we ensemble all factor lists picked by different algorithms and obtain a final set of factors that contribute to the drug response heterogeneities and verified them through existing literature.

Identifiants

pubmed: 31258982
pmc: PMC6568100

Types de publication

Journal Article

Langues

eng

Pagination

295-304

Références

Neuroimage. 2001 Dec;14(6):1370-86
pubmed: 11707093
IEEE Trans Pattern Anal Mach Intell. 2005 Aug;27(8):1226-38
pubmed: 16119262
Lancet. 2006 Sep 23;368(9541):1096-105
pubmed: 16997664
Artif Intell Med. 2007 Nov;41(3):251-62
pubmed: 17707617
Vasc Health Risk Manag. 2007;3(5):721-31
pubmed: 18078023
J Anim Ecol. 2008 Jul;77(4):802-13
pubmed: 18397250
N Engl J Med. 2008 Jun 12;358(24):2545-59
pubmed: 18539917
Diabetes Res Clin Pract. 2010 Jan;87(1):4-14
pubmed: 19896746
Int J Methods Psychiatr Res. 2011 Mar;20(1):40-9
pubmed: 21499542
Diabetol Metab Syndr. 2013 Feb 15;5(1):6
pubmed: 23415113
Diabetes Care. 2014;37(3):634-43
pubmed: 24170759
PLoS One. 2014 Aug 07;9(8):e104046
pubmed: 25101994
Diabetes Ther. 2014 Dec;5(2):471-82
pubmed: 25502227
Curr Diabetes Rev. 2016;12(1):30-41
pubmed: 25877695
Comput Biol Med. 2015 Nov 1;66:1-10
pubmed: 26327447
Diabetes Care. 2018 Mar;41(3):604-612
pubmed: 29279299
Phys Rev A Gen Phys. 1986 Feb;33(2):1134-1140
pubmed: 9896728

Auteurs

Jingyuan Chou (J)

Department of Healthcare Policy and Research. Weill Cornell Medicine.

James Flory (J)

Department of Healthcare Policy and Research. Weill Cornell Medicine.

Fei Wang (F)

Department of Healthcare Policy and Research. Weill Cornell Medicine.

Classifications MeSH