Machine Learning Methods for Precision Medicine Research Designed to Reduce Health Disparities: A Structured Tutorial.
Deep Learning
Gradient Boosting Machines
Health Disparities
Machine Learning
Precision Medicine
Random Forest
Journal
Ethnicity & disease
ISSN: 1945-0826
Titre abrégé: Ethn Dis
Pays: United States
ID NLM: 9109034
Informations de publication
Date de publication:
2020
2020
Historique:
entrez:
10
4
2020
pubmed:
10
4
2020
medline:
2
2
2021
Statut:
epublish
Résumé
Precision medicine research designed to reduce health disparities often involves studying multi-level datasets to understand how diseases manifest disproportionately in one group over another, and how scarce health care resources can be directed precisely to those most at risk for disease. In this article, we provide a structured tutorial for medical and public health researchers on the application of machine learning methods to conduct precision medicine research designed to reduce health disparities. We review key terms and concepts for understanding machine learning papers, including supervised and unsupervised learning, regularization, cross-validation, bagging, and boosting. Metrics are reviewed for evaluating machine learners and major families of learning approaches, including tree-based learning, deep learning, and ensemble learning. We highlight the advantages and disadvantages of different learning approaches, describe strategies for interpreting "black box" models, and demonstrate the application of common methods in an example dataset with open-source statistical code in R.
Identifiants
pubmed: 32269464
doi: 10.18865/ed.30.S1.217
pii: ed.30.S1.217
pmc: PMC7138444
doi:
Types de publication
Journal Article
Research Support, N.I.H., Extramural
Review
Langues
eng
Sous-ensembles de citation
IM
Pagination
217-228Subventions
Organisme : NIMHD NIH HHS
ID : DP2 MD010478
Pays : United States
Organisme : NIMHD NIH HHS
ID : U54 MD010724
Pays : United States
Informations de copyright
Copyright © 2020, Ethnicity & Disease, Inc.
Déclaration de conflit d'intérêts
Competing Interests: None declared.
Références
J Stat Softw. 2010;33(1):1-22
pubmed: 20808728
JMLR Workshop Conf Proc. 2016 Aug;56:301-318
pubmed: 28286600
JAMA. 2017 Mar 14;317(10):1019-1020
pubmed: 28192565
Value Health. 2019 Jul;22(7):808-815
pubmed: 31277828
J Am Med Inform Assoc. 2018 Oct 1;25(10):1419-1428
pubmed: 29893864
BMC Bioinformatics. 2008 Jul 11;9:307
pubmed: 18620558
Proc Natl Acad Sci U S A. 2016 Jul 5;113(27):7353-60
pubmed: 27382149
J Med Internet Res. 2016 Dec 16;18(12):e323
pubmed: 27986644
Stat Appl Genet Mol Biol. 2007;6:Article25
pubmed: 17910531
AMIA Jt Summits Transl Sci Proc. 2016 Jul 20;2016:438-45
pubmed: 27570684
Epidemiology. 2010 Jan;21(1):128-38
pubmed: 20010215
PLoS Comput Biol. 2015 Apr 23;11(4):e1004191
pubmed: 25905639
Neural Netw. 2015 Jan;61:85-117
pubmed: 25462637
Rev Environ Health. 2012;27(2-3):133-49
pubmed: 23023922
MDM Policy Pract. 2017 Aug 17;2(2):2381468317725741
pubmed: 30288429