Differentiation of lumbar disc herniation and lumbar spinal stenosis using natural language processing-based machine learning based on positive symptoms.

Long Short-Term Memory network electronic health record lumbar disc herniation lumbar spinal stenosis machine learning natural language processing

Journal

Neurosurgical focus
ISSN: 1092-0684
Titre abrégé: Neurosurg Focus
Pays: United States
ID NLM: 100896471

Informations de publication

Date de publication:
04 2022
Historique:
received: 22 09 2021
accepted: 20 01 2022
entrez: 1 4 2022
pubmed: 2 4 2022
medline: 6 4 2022
Statut: ppublish

Résumé

The purpose of this study was to develop natural language processing (NLP)-based machine learning algorithms to automatically differentiate lumbar disc herniation (LDH) and lumbar spinal stenosis (LSS) based on positive symptoms in free-text admission notes. The secondary purpose was to compare the performance of the deep learning algorithm with the ensemble model on the current task. In total, 1921 patients whose principal diagnosis was LDH or LSS between June 2013 and June 2020 at Zhongda Hospital, affiliated with Southeast University, were retrospectively analyzed. The data set was randomly divided into a training set and testing set at a 7:3 ratio. Long Short-Term Memory (LSTM) and extreme gradient boosting (XGBoost) models were developed in this study. NLP algorithms were assessed on the testing set by the following metrics: receiver operating characteristic (ROC) curve, area under the curve (AUC), accuracy score, recall score, F1 score, and precision score. In the testing set, the LSTM model achieved an AUC of 0.8487, accuracy score of 0.7818, recall score of 0.9045, F1 score of 0.8108, and precision score of 0.7347. In comparison, the XGBoost model achieved an AUC of 0.7565, accuracy score of 0.6961, recall score of 0.7387, F1 score of 0.7153, and precision score of 0.6934. NLP-based machine learning algorithms were a promising auxiliary to the electronic health record in spine disease diagnosis. LSTM, the deep learning model, showed better capacity compared with the widely used ensemble model, XGBoost, in differentiation of LDH and LSS using positive symptoms. This study presents a proof of concept for the application of NLP in prediagnosis of spine disease.

Identifiants

pubmed: 35364584
doi: 10.3171/2022.1.FOCUS21561
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

E7

Auteurs

GuanRui Ren (G)

1Zhongda Hospital, Medical College, Southeast University, Nanjing; and.

Kun Yu (K)

2Nanjing Jiangbei Hospital, Nanjing, Jiangsu, China.

ZhiYang Xie (Z)

1Zhongda Hospital, Medical College, Southeast University, Nanjing; and.

Lei Liu (L)

1Zhongda Hospital, Medical College, Southeast University, Nanjing; and.

PeiYang Wang (P)

1Zhongda Hospital, Medical College, Southeast University, Nanjing; and.

Wei Zhang (W)

1Zhongda Hospital, Medical College, Southeast University, Nanjing; and.

YunTao Wang (Y)

1Zhongda Hospital, Medical College, Southeast University, Nanjing; and.

XiaoTao Wu (X)

1Zhongda Hospital, Medical College, Southeast University, Nanjing; and.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH