Applying and Improving a Publicly Available Medication NER Pipeline in a Clinical Cancer EMR.

Natural language processing Transformers electronic medical records medications named entity recognition

Journal

Studies in health technology and informatics

ISSN: 1879-8365

Titre abrégé: Stud Health Technol Inform

Pays: Netherlands

ID NLM: 9214582

Informations de publication

Date de publication:
25 Jan 2024

Historique:

medline: 25 1 2024

pubmed: 25 1 2024

entrez: 25 1 2024

Statut: ppublish

Résumé

Clinical NLP can be applied to extract medication information from free-text notes in EMRs, using NER pipelines. Publicly available annotated data for clinical NLP are scarce, and research annotation budgets are often low. Fine-tuning pre-trained pipelines containing a Transformer layer can produce quality results with relatively small training corpora. We examine the transferability of a publicly available, pre-trained NER pipeline with a Transformer layer for medication targets. The pipeline performs poorly when directly validated but achieves an F1-score of 92% for drug names after fine-tuning with 1,565 annotated samples from a clinical cancer EMR - highlighting the benefits of the Transformer architecture in this setting. Performance was largely influenced by inconsistent annotation - reinforcing the need for innovative annotation processes in clinical NLP applications.

Identifiants

DOI: 10.3233/SHTI231051 PMID: 38269895

pubmed: 38269895

pii: SHTI231051

doi: 10.3233/SHTI231051

doi:

Types de publication

Journal Article

Langues

eng

Pagination

679-684

Applying and Improving a Publicly Available Medication NER Pipeline in a Clinical Cancer EMR.

Journal

Informations de publication

Résumé

Identifiants

Types de publication

Langues

Pagination

Auteurs

Meg Stevens (M)

Georgina Kennedy (G)

Timothy Churches (T)

Classifications MeSH