Differentiation of lumbar disc herniation and lumbar spinal stenosis using natural language processing-based machine learning based on positive symptoms.
Long Short-Term Memory network
electronic health record
lumbar disc herniation
lumbar spinal stenosis
machine learning
natural language processing
Journal
Neurosurgical focus
ISSN: 1092-0684
Titre abrégé: Neurosurg Focus
Pays: United States
ID NLM: 100896471
Informations de publication
Date de publication:
04 2022
04 2022
Historique:
received:
22
09
2021
accepted:
20
01
2022
entrez:
1
4
2022
pubmed:
2
4
2022
medline:
6
4
2022
Statut:
ppublish
Résumé
The purpose of this study was to develop natural language processing (NLP)-based machine learning algorithms to automatically differentiate lumbar disc herniation (LDH) and lumbar spinal stenosis (LSS) based on positive symptoms in free-text admission notes. The secondary purpose was to compare the performance of the deep learning algorithm with the ensemble model on the current task. In total, 1921 patients whose principal diagnosis was LDH or LSS between June 2013 and June 2020 at Zhongda Hospital, affiliated with Southeast University, were retrospectively analyzed. The data set was randomly divided into a training set and testing set at a 7:3 ratio. Long Short-Term Memory (LSTM) and extreme gradient boosting (XGBoost) models were developed in this study. NLP algorithms were assessed on the testing set by the following metrics: receiver operating characteristic (ROC) curve, area under the curve (AUC), accuracy score, recall score, F1 score, and precision score. In the testing set, the LSTM model achieved an AUC of 0.8487, accuracy score of 0.7818, recall score of 0.9045, F1 score of 0.8108, and precision score of 0.7347. In comparison, the XGBoost model achieved an AUC of 0.7565, accuracy score of 0.6961, recall score of 0.7387, F1 score of 0.7153, and precision score of 0.6934. NLP-based machine learning algorithms were a promising auxiliary to the electronic health record in spine disease diagnosis. LSTM, the deep learning model, showed better capacity compared with the widely used ensemble model, XGBoost, in differentiation of LDH and LSS using positive symptoms. This study presents a proof of concept for the application of NLP in prediagnosis of spine disease.
Identifiants
pubmed: 35364584
doi: 10.3171/2022.1.FOCUS21561
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM