Extracting social determinants of health from electronic health records using natural language processing: a systematic review.

electronic health records; information extraction; machine learning; natural language processing; population health outcomes; social determinants of health

Journal

Journal of the American Medical Informatics Association : JAMIA
ISSN: 1527-974X
Abbreviated title: J Am Med Inform Assoc
Country: England
NLM ID: 9430800

Publication information

Publication date:
25 11 2021
History:
received: 29 04 2021
revised: 09 07 2021
accepted: 04 08 2021
pubmed: 7 10 2021
medline: 22 1 2022
entrez: 6 10 2021
Status: ppublish

Abstract

Social determinants of health (SDoH) are nonclinical dispositions that impact patient health risks and clinical outcomes. Leveraging SDoH in clinical decision-making can potentially improve diagnosis, treatment planning, and patient outcomes. Despite increased interest in capturing SDoH in electronic health records (EHRs), such information is typically locked in unstructured clinical notes. Natural language processing (NLP) is the key technology to extract SDoH information from clinical text and expand its utility in patient care and research. This article presents a systematic review of the state-of-the-art NLP approaches and tools that focus on identifying and extracting SDoH data from unstructured clinical text in EHRs. A broad literature search was conducted in February 2021 using 3 scholarly databases (ACL Anthology, PubMed, and Scopus) following Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. A total of 6402 publications were initially identified, and after applying the study inclusion criteria, 82 publications were selected for the final review. Smoking status (n = 27), substance use (n = 21), homelessness (n = 20), and alcohol use (n = 15) are the most frequently studied SDoH categories. Homelessness (n = 7) and other less-studied SDoH (eg, education, financial problems, social isolation and support, family problems) are mostly identified using rule-based approaches. In contrast, machine learning approaches are popular for identifying smoking status (n = 13), substance use (n = 9), and alcohol use (n = 9). NLP offers significant potential to extract SDoH data from narrative clinical notes, which in turn can aid in the development of screening tools, risk prediction models, and clinical decision support systems.

Identifiers

pubmed: 34613399
pii: 6382241
doi: 10.1093/jamia/ocab170
pmc: PMC8633615

Publication types

Journal Article; Research Support, N.I.H., Extramural; Systematic Review

Languages

eng

Citation subsets

IM

Pagination

2716-2727

Grants

Agency: NIMH NIH HHS
ID: R01 MH121922
Country: United States
Agency: NIMH NIH HHS
ID: R01 MH121924
Country: United States
Agency: NIDDK NIH HHS
ID: P30 DK092949
Country: United States
Agency: NIMH NIH HHS
ID: R01 MH121907
Country: United States
Agency: NIMH NIH HHS
ID: R41 MH124581
Country: United States
Agency: NIMHD NIH HHS
ID: R25 MD011713
Country: United States
Agency: NIMH NIH HHS
ID: R01 MH119177
Country: United States
Agency: NIGMS NIH HHS
ID: R01 GM105688
Country: United States
Agency: NIH HHS
ID: R01MH119177
Country: United States

Copyright information

© The Author(s) 2021. Published by Oxford University Press on behalf of the American Medical Informatics Association.

Authors

Braja G Patra (BG)

Department of Population Health Sciences, Weill Cornell Medicine, New York, New York, USA.

Mohit M Sharma (MM)

Department of Population Health Sciences, Weill Cornell Medicine, New York, New York, USA.

Veer Vekaria (V)

Department of Population Health Sciences, Weill Cornell Medicine, New York, New York, USA.

Prakash Adekkanattu (P)

Information Technologies and Services, Weill Cornell Medicine, New York, New York, USA.

Olga V Patterson (OV)

Department of Internal Medicine, Division of Epidemiology, University of Utah, Salt Lake City, Utah, USA.
US Department of Veterans Affairs, Salt Lake City, Utah, USA.

Benjamin Glicksberg (B)

Icahn School of Medicine at Mount Sinai, New York, New York, USA.

Lauren A Lepow (LA)

Icahn School of Medicine at Mount Sinai, New York, New York, USA.

Euijung Ryu (E)

Department of Quantitative Health Sciences, Mayo Clinic, Rochester, Minnesota, USA.

Joanna M Biernacka (JM)

Department of Quantitative Health Sciences, Mayo Clinic, Rochester, Minnesota, USA.

Al'ona Furmanchuk (A)

Northwestern University, Chicago, Illinois, USA.

Thomas J George (TJ)

Department of Health Outcomes and Biomedical Informatics, University of Florida, Gainesville, Florida, USA.

William Hogan (W)

Division of Hematology & Oncology, Department of Medicine, College of Medicine, University of Florida, Gainesville, Florida, USA.

Yonghui Wu (Y)

Department of Health Outcomes and Biomedical Informatics, University of Florida, Gainesville, Florida, USA.

Xi Yang (X)

Department of Health Outcomes and Biomedical Informatics, University of Florida, Gainesville, Florida, USA.

Jiang Bian (J)

Department of Health Outcomes and Biomedical Informatics, University of Florida, Gainesville, Florida, USA.

Myrna Weissman (M)

Vagelos College of Physicians and Surgeons, Columbia University, New York, New York, USA.

Priya Wickramaratne (P)

Vagelos College of Physicians and Surgeons, Columbia University, New York, New York, USA.

J John Mann (JJ)

Vagelos College of Physicians and Surgeons, Columbia University, New York, New York, USA.

Mark Olfson (M)

Vagelos College of Physicians and Surgeons, Columbia University, New York, New York, USA.

Thomas R Campion (TR)

Department of Population Health Sciences, Weill Cornell Medicine, New York, New York, USA.
Information Technologies and Services, Weill Cornell Medicine, New York, New York, USA.

Mark Weiner (M)

Department of Population Health Sciences, Weill Cornell Medicine, New York, New York, USA.

Jyotishman Pathak (J)

Department of Population Health Sciences, Weill Cornell Medicine, New York, New York, USA.
