Applying and Improving a Publicly Available Medication NER Pipeline in a Clinical Cancer EMR.

Natural language processing Transformers electronic medical records medications named entity recognition

Journal

Studies in health technology and informatics
ISSN: 1879-8365
Titre abrégé: Stud Health Technol Inform
Pays: Netherlands
ID NLM: 9214582

Informations de publication

Date de publication:
25 Jan 2024
Historique:
medline: 25 1 2024
pubmed: 25 1 2024
entrez: 25 1 2024
Statut: ppublish

Résumé

Clinical NLP can be applied to extract medication information from free-text notes in EMRs, using NER pipelines. Publicly available annotated data for clinical NLP are scarce, and research annotation budgets are often low. Fine-tuning pre-trained pipelines containing a Transformer layer can produce quality results with relatively small training corpora. We examine the transferability of a publicly available, pre-trained NER pipeline with a Transformer layer for medication targets. The pipeline performs poorly when directly validated but achieves an F1-score of 92% for drug names after fine-tuning with 1,565 annotated samples from a clinical cancer EMR - highlighting the benefits of the Transformer architecture in this setting. Performance was largely influenced by inconsistent annotation - reinforcing the need for innovative annotation processes in clinical NLP applications.

Identifiants

pubmed: 38269895
pii: SHTI231051
doi: 10.3233/SHTI231051
doi:

Types de publication

Journal Article

Langues

eng

Pagination

679-684

Auteurs

Meg Stevens (M)

University of New South Wales, Sydney Australia.

Georgina Kennedy (G)

University of New South Wales, Sydney Australia.
Ingham Institute for Applied Medical Research, Sydney Australia.
Maridulu Budyari Gumal (SPHERE) Cancer Clinical Academic Group.

Timothy Churches (T)

University of New South Wales, Sydney Australia.
Ingham Institute for Applied Medical Research, Sydney Australia.

Classifications MeSH