Identifying and Predicting Intentional Self-Harm in Electronic Health Record Clinical Notes: Deep Learning Approach.

Keywords: deep learning; electronic health records; machine learning; natural language processing; suicide; suicide, attempted

Journal

JMIR medical informatics

ISSN: 2291-9694

Abbreviated title: JMIR Med Inform

Country: Canada

NLM ID: 101645109

Publication information

Publication date:
30 Jul 2020

History:

received: 13 Jan 2020

accepted: 21 May 2020

revised: 25 Apr 2020

entrez: 31 Jul 2020

pubmed: 31 Jul 2020

medline: 31 Jul 2020

Status: epublish

Abstract

BACKGROUND

Suicide is an important public health concern in the United States and around the world. There has been significant work examining machine learning approaches to identify and predict intentional self-harm and suicide using existing data sets. With recent advances in computing, deep learning applications in health care are gaining momentum.

OBJECTIVE

This study aimed to leverage the information in clinical notes using deep neural networks (DNNs) to (1) improve the identification of patients treated for intentional self-harm and (2) predict future self-harm events.

METHODS

We extracted clinical text notes from electronic health records (EHRs) of 835 patients with International Classification of Diseases (ICD) codes for intentional self-harm and 1670 matched controls who never had any intentional self-harm ICD codes. The data were divided into training and holdout test sets. We tested a number of algorithms on clinical notes associated with the intentional self-harm codes using the training set, including several traditional bag-of-words-based models and 2 DNN models: a convolutional neural network (CNN) and a long short-term memory model. We also evaluated the predictive performance of the DNNs on a subset of patients who had clinical notes 1 to 6 months before the first intentional self-harm event. Finally, we evaluated the impact of a pretrained model using Word2vec (W2V) on performance.

RESULTS

The area under the receiver operating characteristic curve (AUC) for the CNN on the phenotyping task, that is, the detection of intentional self-harm in clinical notes concurrent with the events, was 0.999, with an F1 score of 0.985. In the predictive task, the CNN achieved the highest performance with an AUC of 0.882 and an F1 score of 0.769. Although pretraining with W2V shortened the DNN training time, it did not improve performance.

CONCLUSIONS

The strong performance on the first task, namely, phenotyping based on clinical notes, suggests that such models could be used effectively for surveillance of intentional self-harm in clinical text in an EHR. The modest performance on the predictive task notwithstanding, the results using DNN models on clinical text alone are competitive with other reports in the literature using risk factors from structured EHR data.
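
For readers unfamiliar with the two metrics reported above, the following is a minimal sketch of how an AUC and an F1 score can be computed from a classifier's output scores. It is not the authors' evaluation code: the function names, the 0.5 decision threshold, and the toy labels and scores are all assumptions made for illustration, and the numbers bear no relation to the study data.

```python
from typing import Sequence


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a random positive outscores a random negative.

    A tie between a positive and a negative counts as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def f1_score(labels: Sequence[int], scores: Sequence[float],
             threshold: float = 0.5) -> float:
    """F1 (harmonic mean of precision and recall) at a fixed threshold.

    The 0.5 default is an illustrative assumption, not a value from the paper.
    """
    preds = [1 if s >= threshold else 0 for s in scores]
    tp = sum(1 for y, p in zip(labels, preds) if y == 1 and p == 1)
    fp = sum(1 for y, p in zip(labels, preds) if y == 0 and p == 1)
    fn = sum(1 for y, p in zip(labels, preds) if y == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Invented toy data: 3 cases (label 1) and 5 controls (label 0).
toy_labels = [1, 1, 1, 0, 0, 0, 0, 0]
toy_scores = [0.9, 0.8, 0.4, 0.7, 0.3, 0.2, 0.1, 0.05]

toy_auc = roc_auc(toy_labels, toy_scores)
toy_f1 = f1_score(toy_labels, toy_scores)
print(round(toy_auc, 3))  # 0.933
print(round(toy_f1, 3))   # 0.667
```

The AUC depends only on how the scores rank cases against controls, whereas the F1 score depends on the chosen threshold, which is one reason a model can show a high AUC alongside a noticeably lower F1 score.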

Identifiers

DOI: 10.2196/17784

PMID: 32729840

PMC: PMC7426805

PII: v8i7e17784

Publication types

Journal Article

Languages

eng

Pagination

e17784

Grants

Agency: NIDDK NIH HHS

ID: P30 DK123704

Country: United States

Agency: NIMH NIH HHS

ID: K23 MH118482

Country: United States

Agency: NCATS NIH HHS

ID: UL1 TR001450

Country: United States

Agency: HSRD VA

ID: I21 HX002700

Country: United States

Agency: NIDA NIH HHS

ID: K23 DA045766

Country: United States

Copyright information

©Jihad S Obeid, Jennifer Dahne, Sean Christensen, Samuel Howard, Tami Crawford, Lewis J Frey, Tracy Stecker, Brian E Bunnell. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 30.07.2020.

Authors

Jihad S Obeid (JS)

Jennifer Dahne (J)

Sean Christensen (S)

Samuel Howard (S)

Tami Crawford (T)

Lewis J Frey (LJ)

Tracy Stecker (T)

Brian E Bunnell (BE)

MeSH classifications