The Effectiveness of Multitask Learning for Phenotyping with Electronic Health Records Data.
Journal
Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing
ISSN: 2335-6936
Titre abrégé: Pac Symp Biocomput
Pays: United States
ID NLM: 9711271
Informations de publication
Date de publication:
2019
2019
Historique:
entrez:
14
3
2019
pubmed:
14
3
2019
medline:
24
8
2019
Statut:
ppublish
Résumé
Electronic phenotyping is the task of ascertaining whether an individual has a medical condition of interest by analyzing their medical record and is foundational in clinical informatics. Increasingly, electronic phenotyping is performed via supervised learning. We investigate the effectiveness of multitask learning for phenotyping using electronic health records (EHR) data. Multitask learning aims to improve model performance on a target task by jointly learning additional auxiliary tasks and has been used in disparate areas of machine learning. However, its utility when applied to EHR data has not been established, and prior work suggests that its benefits are inconsistent. We present experiments that elucidate when multitask learning with neural nets improves performance for phenotyping using EHR data relative to neural nets trained for a single phenotype and to well-tuned baselines. We find that multitask neural nets consistently outperform single-task neural nets for rare phenotypes but underperform for relatively more common phenotypes. The effect size increases as more auxiliary tasks are added. Moreover, multitask learning reduces the sensitivity of neural nets to hyperparameter settings for rare phenotypes. Last, we quantify phenotype complexity and find that neural nets trained with or without multitask learning do not improve on simple baselines unless the phenotypes are sufficiently complex.
Identifiants
pubmed: 30864307
pii: 9789813279827_0003
pmc: PMC6662921
mid: NIHMS1040926
Types de publication
Journal Article
Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't
Langues
eng
Pagination
18-29Subventions
Organisme : NLM NIH HHS
ID : R01 LM011369
Pays : United States
Organisme : NLM NIH HHS
ID : T32 LM012409
Pays : United States
Références
Artif Intell Med. 2007 Nov;41(3):251-62
pubmed: 17707617
Sci Transl Med. 2011 Apr 20;3(79):79re1
pubmed: 21508311
AMIA Annu Symp Proc. 2011;2011:274-83
pubmed: 22195079
Curr Oncol Rep. 2012 Dec;14(6):494-501
pubmed: 22948276
J Am Med Inform Assoc. 2013 Jun;20(e1):e147-54
pubmed: 23531748
J Am Med Inform Assoc. 2013 Dec;20(e2):e243-52
pubmed: 23837993
Nat Biotechnol. 2013 Dec;31(12):1095-7
pubmed: 24316646
Front Genet. 2014 Jun 17;5:184
pubmed: 24987407
Health Aff (Millwood). 2014 Jul;33(7):1229-35
pubmed: 25006150
PLoS One. 2015 Mar 04;10(3):e0118432
pubmed: 25738806
J Biomed Inform. 2015 Jun;55:82-93
pubmed: 25841328
Genome Med. 2015 Apr 30;7(1):41
pubmed: 25937834
AMIA Annu Symp Proc. 2014 Nov 14;2014:1180-7
pubmed: 25954429
AMIA Annu Symp Proc. 2014 Nov 14;2014:1258-67
pubmed: 25954437
J Am Med Inform Assoc. 2015 Nov;22(6):1220-30
pubmed: 26342218
J Am Med Inform Assoc. 2016 Nov;23(6):1046-1052
pubmed: 27026615
J Am Med Inform Assoc. 2016 Jul;23(4):731-40
pubmed: 27107443
J Am Med Inform Assoc. 2016 Nov;23(6):1166-1173
pubmed: 27174893
Big Data. 2015 Dec;3(4):277-87
pubmed: 27441408
PLoS One. 2017 Jul 7;12(7):e0175508
pubmed: 28686612
AMIA Jt Summits Transl Sci Proc. 2017 Jul 26;2017:48-57
pubmed: 28815104
NPJ Digit Med. 2018 May 8;1:18
pubmed: 31304302