Identifying and Predicting Intentional Self-Harm in Electronic Health Record Clinical Notes: Deep Learning Approach.

deep learning electronic health records machine learning natural language processing suicide suicide, attempted

Journal

JMIR medical informatics
ISSN: 2291-9694
Titre abrégé: JMIR Med Inform
Pays: Canada
ID NLM: 101645109

Informations de publication

Date de publication:
30 Jul 2020
Historique:
received: 13 01 2020
accepted: 21 05 2020
revised: 25 04 2020
entrez: 31 7 2020
pubmed: 31 7 2020
medline: 31 7 2020
Statut: epublish

Résumé

Suicide is an important public health concern in the United States and around the world. There has been significant work examining machine learning approaches to identify and predict intentional self-harm and suicide using existing data sets. With recent advances in computing, deep learning applications in health care are gaining momentum. This study aimed to leverage the information in clinical notes using deep neural networks (DNNs) to (1) improve the identification of patients treated for intentional self-harm and (2) predict future self-harm events. We extracted clinical text notes from electronic health records (EHRs) of 835 patients with International Classification of Diseases (ICD) codes for intentional self-harm and 1670 matched controls who never had any intentional self-harm ICD codes. The data were divided into training and holdout test sets. We tested a number of algorithms on clinical notes associated with the intentional self-harm codes using the training set, including several traditional bag-of-words-based models and 2 DNN models: a convolutional neural network (CNN) and a long short-term memory model. We also evaluated the predictive performance of the DNNs on a subset of patients who had clinical notes 1 to 6 months before the first intentional self-harm event. Finally, we evaluated the impact of a pretrained model using Word2vec (W2V) on performance. The area under the receiver operating characteristic curve (AUC) for the CNN on the phenotyping task, that is, the detection of intentional self-harm in clinical notes concurrent with the events was 0.999, with an F1 score of 0.985. In the predictive task, the CNN achieved the highest performance with an AUC of 0.882 and an F1 score of 0.769. Although pretraining with W2V shortened the DNN training time, it did not improve performance. The strong performance on the first task, namely, phenotyping based on clinical notes, suggests that such models could be used effectively for surveillance of intentional self-harm in clinical text in an EHR. The modest performance on the predictive task notwithstanding, the results using DNN models on clinical text alone are competitive with other reports in the literature using risk factors from structured EHR data.

Sections du résumé

BACKGROUND BACKGROUND
Suicide is an important public health concern in the United States and around the world. There has been significant work examining machine learning approaches to identify and predict intentional self-harm and suicide using existing data sets. With recent advances in computing, deep learning applications in health care are gaining momentum.
OBJECTIVE OBJECTIVE
This study aimed to leverage the information in clinical notes using deep neural networks (DNNs) to (1) improve the identification of patients treated for intentional self-harm and (2) predict future self-harm events.
METHODS METHODS
We extracted clinical text notes from electronic health records (EHRs) of 835 patients with International Classification of Diseases (ICD) codes for intentional self-harm and 1670 matched controls who never had any intentional self-harm ICD codes. The data were divided into training and holdout test sets. We tested a number of algorithms on clinical notes associated with the intentional self-harm codes using the training set, including several traditional bag-of-words-based models and 2 DNN models: a convolutional neural network (CNN) and a long short-term memory model. We also evaluated the predictive performance of the DNNs on a subset of patients who had clinical notes 1 to 6 months before the first intentional self-harm event. Finally, we evaluated the impact of a pretrained model using Word2vec (W2V) on performance.
RESULTS RESULTS
The area under the receiver operating characteristic curve (AUC) for the CNN on the phenotyping task, that is, the detection of intentional self-harm in clinical notes concurrent with the events was 0.999, with an F1 score of 0.985. In the predictive task, the CNN achieved the highest performance with an AUC of 0.882 and an F1 score of 0.769. Although pretraining with W2V shortened the DNN training time, it did not improve performance.
CONCLUSIONS CONCLUSIONS
The strong performance on the first task, namely, phenotyping based on clinical notes, suggests that such models could be used effectively for surveillance of intentional self-harm in clinical text in an EHR. The modest performance on the predictive task notwithstanding, the results using DNN models on clinical text alone are competitive with other reports in the literature using risk factors from structured EHR data.

Identifiants

pubmed: 32729840
pii: v8i7e17784
doi: 10.2196/17784
pmc: PMC7426805
doi:

Types de publication

Journal Article

Langues

eng

Pagination

e17784

Subventions

Organisme : NIDDK NIH HHS
ID : P30 DK123704
Pays : United States
Organisme : NIMH NIH HHS
ID : K23 MH118482
Pays : United States
Organisme : NCATS NIH HHS
ID : UL1 TR001450
Pays : United States
Organisme : HSRD VA
ID : I21 HX002700
Pays : United States
Organisme : NIDA NIH HHS
ID : K23 DA045766
Pays : United States

Informations de copyright

©Jihad S Obeid, Jennifer Dahne, Sean Christensen, Samuel Howard, Tami Crawford, Lewis J Frey, Tracy Stecker, Brian E Bunnell. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 30.07.2020.

Références

J Biomed Inform. 2009 Apr;42(2):377-81
pubmed: 18929686
J Affect Disord. 2016 Dec;206:204-209
pubmed: 27475891
BMC Med Inform Decis Mak. 2019 Aug 19;19(1):164
pubmed: 31426779
World J Psychiatry. 2017 Sep 22;7(3):163-176
pubmed: 29043154
Int J Methods Psychiatr Res. 2017 Jun;26(2):
pubmed: 27634457
AMIA Annu Symp Proc. 2012;2012:1244-53
pubmed: 23304402
BMJ. 2015 Nov 09;351:h4978
pubmed: 26552947
BMC Med Inform Decis Mak. 2018 Jul 23;18(Suppl 2):43
pubmed: 30066665
Br J Psychiatry. 2016 Oct;209(4):277-283
pubmed: 27340111
PLoS One. 2014 Jan 20;9(1):e84282
pubmed: 24465400
AMIA Annu Symp Proc. 2018 Apr 16;2017:411-420
pubmed: 29854105
Nature. 2015 May 28;521(7553):436-44
pubmed: 26017442
Prof Psychol Res Pr. 2010 Jun 1;41(3):221-227
pubmed: 20640243
PLoS One. 2017 Jul 19;12(7):e0180292
pubmed: 28723978
Natl Health Stat Report. 2018 Feb;(108):1-19
pubmed: 29616901
Annu Rev Clin Psychol. 2016;12:307-30
pubmed: 26772209
J Psychiatr Res. 2011 May;45(5):619-25
pubmed: 21055768
Psychol Bull. 2017 Feb;143(2):187-232
pubmed: 27841450
BMC Med Inform Decis Mak. 2018 May 29;18(1):30
pubmed: 29843698
J Clin Psychiatry. 2008 Jan;69(1):23-31
pubmed: 18312034
J Clin Transl Sci. 2017 Aug;1(4):246-252
pubmed: 29657859
J Affect Disord. 2019 Feb 15;245:869-884
pubmed: 30699872
Mol Psychiatry. 2017 Apr;22(4):544-551
pubmed: 27431294
AMIA Annu Symp Proc. 2018 Apr 16;2017:641-649
pubmed: 29854129
Clin Psychol Rev. 2015 Dec;42:156-67
pubmed: 26416295
NCHS Data Brief. 2017 Dec;(293):1-8
pubmed: 29319473
Front Psychiatry. 2018 Mar 07;9:56
pubmed: 29563886
Sci Rep. 2018 May 9;8(1):7426
pubmed: 29743531
PLoS One. 2014 Jan 28;9(1):e85733
pubmed: 24489669
Ann Intern Med. 2019 Sep 3;171(5):343-353
pubmed: 31450237
BMC Med Inform Decis Mak. 2017 Aug 22;17(1):126
pubmed: 28830409
JAMA Psychiatry. 2016 Oct 1;73(10):1064-1071
pubmed: 27626235

Auteurs

Jihad S Obeid (JS)

Medical University of South Carolina, Charleston, SC, United States.

Jennifer Dahne (J)

Medical University of South Carolina, Charleston, SC, United States.

Sean Christensen (S)

Medical University of South Carolina, Charleston, SC, United States.

Samuel Howard (S)

Medical University of South Carolina, Charleston, SC, United States.

Tami Crawford (T)

Medical University of South Carolina, Charleston, SC, United States.

Lewis J Frey (LJ)

Medical University of South Carolina, Charleston, SC, United States.

Tracy Stecker (T)

Medical University of South Carolina, Charleston, SC, United States.

Brian E Bunnell (BE)

University of South Florida, Tampa, FL, United States.

Classifications MeSH