Algorithmic identification of treatment-emergent adverse events from clinical notes using large language models: a pilot study in inflammatory bowel disease.

Journal

medRxiv : the preprint server for health sciences

Abbreviated title: medRxiv

Country: United States

NLM ID: 101767986

Publication information

Publication date: 08 Sep 2023

History:

medline: 21 Sep 2023

pubmed: 21 Sep 2023

entrez: 21 Sep 2023

Status: epublish

Abstract

Background and Aims: Outpatient clinical notes are a rich source of information regarding drug safety. However, data in these notes are currently underutilized for pharmacovigilance due to methodological limitations in text mining. Large language models (LLMs) like BERT have shown progress in a range of natural language processing tasks but have not yet been evaluated on adverse event detection.

Methods: We adapted a new clinical LLM, UCSF BERT, to identify serious adverse events (SAEs) occurring after treatment with a non-steroid immunosuppressant for inflammatory bowel disease (IBD). We compared this model to other language models that have previously been applied to AE detection.

Results: We annotated 928 outpatient IBD notes corresponding to 928 individual IBD patients for all SAE-associated hospitalizations occurring after treatment with a non-steroid immunosuppressant. These notes contained 703 SAEs in total, the most common of which was failure of intended efficacy. Out of 8 candidate models, UCSF BERT achieved the highest numerical performance on identifying drug-SAE pairs from this corpus (accuracy 88-92%, macro F1 61-68%), with 5-10% greater accuracy than previously published models. UCSF BERT was significantly superior at identifying hospitalization events emergent to medication use (p < 0.01).

Conclusions: LLMs like UCSF BERT achieve numerically superior accuracy on the challenging task of SAE detection from clinical notes compared to prior methods. Future work is needed to adapt this methodology to improve model performance and evaluation using multi-center data and newer architectures like GPT. Our findings support the potential value of using large language models to enhance pharmacovigilance.

Identifiers

DOI: 10.1101/2023.09.06.23295149

PMID: 37732220

PMC: PMC10508809

Publication types

Preprint

Languages

eng

Comments and corrections

Type: UpdateIn

Algorithmic identification of treatment-emergent adverse events from clinical notes using large language models: a pilot study in inflammatory bowel disease.

Authors

Anna L Silverman (AL)

Madhumita Sushil (M)

Balu Bhasuran (B)

Dana Ludwig (D)

James Buchanan (J)

Rebecca Racz (R)

Mahalakshmi Parakala (M)

Samer El-Kamary (S)

Ohenewaa Ahima (O)

Artur Belov (A)

Lauren Choi (L)

Monisha Billings (M)

Yan Li (Y)

Nadia Habal (N)

Qi Liu (Q)

Jawahar Tiwari (J)

Atul J Butte (AJ)

Vivek A Rudrapatna (VA)

MeSH classifications