A Comprehensive Natural Language Processing Pipeline for the Chronic Lupus Disease.
Artificial Intelligence (AI)
Electronic Health Record (EHR)
Information Extraction (IE)
Natural Language Processing (NLP)
Systemic Lupus Erythematosus (SLE)
Journal
Studies in health technology and informatics
ISSN: 1879-8365
Titre abrégé: Stud Health Technol Inform
Pays: Netherlands
ID NLM: 9214582
Informations de publication
Date de publication:
22 Aug 2024
22 Aug 2024
Historique:
medline:
23
8
2024
pubmed:
23
8
2024
entrez:
23
8
2024
Statut:
ppublish
Résumé
Electronic Health Records (EHRs) contain a wealth of unstructured patient data, making it challenging for physicians to do informed decisions. In this paper, we introduce a Natural Language Processing (NLP) approach for the extraction of therapies, diagnosis, and symptoms from ambulatory EHRs of patients with chronic Lupus disease. We aim to demonstrate the effort of a comprehensive pipeline where a rule-based system is combined with text segmentation, transformer-based topic analysis and clinical ontology, in order to enhance text preprocessing and automate rules' identification. Our approach is applied on a sub-cohort of 56 patients, with a total of 750 EHRs written in Italian language, achieving an Accuracy and an F-score over 97% and 90% respectively, in the three extracted domains. This work has the potential to be integrated with EHR systems to automate information extraction, minimizing the human intervention, and providing personalized digital solutions in the chronic Lupus disease domain.
Identifiants
pubmed: 39176940
pii: SHTI240559
doi: 10.3233/SHTI240559
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM