Identifying Patients With Delirium Based on Unstructured Clinical Notes: Observational Study.

clinical notes delirium electronic health records machine learning natural language processing

Journal

JMIR formative research
ISSN: 2561-326X
Titre abrégé: JMIR Form Res
Pays: Canada
ID NLM: 101726394

Informations de publication

Date de publication:
24 Jun 2022
Historique:
received: 27 09 2021
accepted: 10 02 2022
revised: 22 01 2022
entrez: 24 6 2022
pubmed: 25 6 2022
medline: 25 6 2022
Statut: epublish

Résumé

Delirium in hospitalized patients is a syndrome of acute brain dysfunction. Diagnostic (International Classification of Diseases [ICD]) codes are often used in studies using electronic health records (EHRs), but they are inaccurate. We sought to develop a more accurate method using natural language processing (NLP) to detect delirium episodes on the basis of unstructured clinical notes. We collected 1.5 million notes from >10,000 patients from among 9 hospitals. Seven experts iteratively labeled 200,471 sentences. Using these, we trained three NLP classifiers: Support Vector Machine, Recurrent Neural Networks, and Transformer. Testing was performed using an external data set. We also evaluated associations with delirium billing (ICD) codes, medications, orders for restraints and sitters, direct assessments (Confusion Assessment Method [CAM] scores), and in-hospital mortality. F1 scores, confusion matrices, and areas under the receiver operating characteristic curve (AUCs) were used to compare NLP models. We used the φ coefficient to measure associations with other delirium indicators. The transformer NLP performed best on the following parameters: micro F1=0.978, macro F1=0.918, positive AUC=0.984, and negative AUC=0.992. NLP detections exhibited higher correlations (φ) than ICD codes with deliriogenic medications (0.194 vs 0.073 for ICD codes), restraints and sitter orders (0.358 vs 0.177), mortality (0.216 vs 0.000), and CAM scores (0.256 vs -0.028). Clinical notes are an attractive alternative to ICD codes for EHR delirium studies but require automated methods. Our NLP model detects delirium with high accuracy, similar to manual chart review. Our NLP approach can provide more accurate determination of delirium for large-scale EHR-based studies regarding delirium, quality improvement, and clinical trails.

Sections du résumé

BACKGROUND BACKGROUND
Delirium in hospitalized patients is a syndrome of acute brain dysfunction. Diagnostic (International Classification of Diseases [ICD]) codes are often used in studies using electronic health records (EHRs), but they are inaccurate.
OBJECTIVE OBJECTIVE
We sought to develop a more accurate method using natural language processing (NLP) to detect delirium episodes on the basis of unstructured clinical notes.
METHODS METHODS
We collected 1.5 million notes from >10,000 patients from among 9 hospitals. Seven experts iteratively labeled 200,471 sentences. Using these, we trained three NLP classifiers: Support Vector Machine, Recurrent Neural Networks, and Transformer. Testing was performed using an external data set. We also evaluated associations with delirium billing (ICD) codes, medications, orders for restraints and sitters, direct assessments (Confusion Assessment Method [CAM] scores), and in-hospital mortality. F1 scores, confusion matrices, and areas under the receiver operating characteristic curve (AUCs) were used to compare NLP models. We used the φ coefficient to measure associations with other delirium indicators.
RESULTS RESULTS
The transformer NLP performed best on the following parameters: micro F1=0.978, macro F1=0.918, positive AUC=0.984, and negative AUC=0.992. NLP detections exhibited higher correlations (φ) than ICD codes with deliriogenic medications (0.194 vs 0.073 for ICD codes), restraints and sitter orders (0.358 vs 0.177), mortality (0.216 vs 0.000), and CAM scores (0.256 vs -0.028).
CONCLUSIONS CONCLUSIONS
Clinical notes are an attractive alternative to ICD codes for EHR delirium studies but require automated methods. Our NLP model detects delirium with high accuracy, similar to manual chart review. Our NLP approach can provide more accurate determination of delirium for large-scale EHR-based studies regarding delirium, quality improvement, and clinical trails.

Identifiants

pubmed: 35749214
pii: v6i6e33834
doi: 10.2196/33834
pmc: PMC9270709
doi:

Types de publication

Journal Article

Langues

eng

Pagination

e33834

Subventions

Organisme : NINDS NIH HHS
ID : R01 NS102190
Pays : United States
Organisme : NIA NIH HHS
ID : P30 AG062421
Pays : United States
Organisme : NIMH NIH HHS
ID : K23 MH115812
Pays : United States
Organisme : NIA NIH HHS
ID : RF1 AG064312
Pays : United States
Organisme : NINDS NIH HHS
ID : R01 NS102574
Pays : United States
Organisme : NINDS NIH HHS
ID : R01 NS107291
Pays : United States

Informations de copyright

©Wendong Ge, Haitham Alabsi, Aayushee Jain, Elissa Ye, Haoqi Sun, Marta Fernandes, Colin Magdamo, Ryan A Tesh, Sarah I Collens, Amy Newhouse, Lidia MVR Moura, Sahar Zafar, John Hsu, Oluwaseun Akeju, Gregory K Robbins, Shibani S Mukerji, Sudeshna Das, M Brandon Westover. Originally published in JMIR Formative Research (https://formative.jmir.org), 24.06.2022.

Références

Nat Rev Neurol. 2009 Apr;5(4):210-20
pubmed: 19347026
JAMA Psychiatry. 2016 Oct 01;73(10):1064-1071
pubmed: 27626235
J Biomed Inform. 2021 Jun;118:103783
pubmed: 33887456
J Gerontol A Biol Sci Med Sci. 2022 Mar 3;77(3):524-530
pubmed: 35239951
NPJ Digit Med. 2020 Apr 14;3:57
pubmed: 32337372
BMC Psychiatry. 2016 May 26;16:167
pubmed: 27229307
Summit Transl Bioinform. 2008 Mar 01;2008:36-40
pubmed: 21347124
J Am Med Inform Assoc. 2020 Jul 1;27(9):1383-1392
pubmed: 32968811
JMIR Med Inform. 2020 Mar 31;8(3):e17984
pubmed: 32229465
BMC Med Inform Decis Mak. 2008 Jul 24;8:32
pubmed: 18652655
Psychosomatics. 2019 Mar - Apr;60(2):105-120
pubmed: 30686485
Stud Health Technol Inform. 2015;216:629-33
pubmed: 26262127
Pharmacoepidemiol Drug Saf. 2017 Aug;26(8):945-953
pubmed: 28485014
Dement Geriatr Cogn Disord. 1999 Sep-Oct;10(5):315-8
pubmed: 10473930
J Am Med Inform Assoc. 2020 Dec 9;27(12):1935-1942
pubmed: 33120431

Auteurs

Wendong Ge (W)

Massachusetts General Hospital, Boston, MA, United States.

Haitham Alabsi (H)

Massachusetts General Hospital, Boston, MA, United States.

Aayushee Jain (A)

Massachusetts General Hospital, Boston, MA, United States.

Elissa Ye (E)

Massachusetts General Hospital, Boston, MA, United States.

Haoqi Sun (H)

Massachusetts General Hospital, Boston, MA, United States.

Marta Fernandes (M)

Massachusetts General Hospital, Boston, MA, United States.

Colin Magdamo (C)

Massachusetts General Hospital, Boston, MA, United States.

Ryan A Tesh (RA)

Massachusetts General Hospital, Boston, MA, United States.

Sarah I Collens (SI)

Massachusetts General Hospital, Boston, MA, United States.

Amy Newhouse (A)

Massachusetts General Hospital, Boston, MA, United States.

Lidia Mvr Moura (L)

Massachusetts General Hospital, Boston, MA, United States.

Sahar Zafar (S)

Massachusetts General Hospital, Boston, MA, United States.

John Hsu (J)

Massachusetts General Hospital, Boston, MA, United States.

Oluwaseun Akeju (O)

Massachusetts General Hospital, Boston, MA, United States.

Gregory K Robbins (GK)

Massachusetts General Hospital, Boston, MA, United States.

Shibani S Mukerji (SS)

Massachusetts General Hospital, Boston, MA, United States.

Sudeshna Das (S)

Massachusetts General Hospital, Boston, MA, United States.

M Brandon Westover (MB)

Massachusetts General Hospital, Boston, MA, United States.

Classifications MeSH