Text Mining of Adverse Events in Clinical Trials: Deep Learning Approach.
classification
deep learning
machine learning
natural language processing
Journal
JMIR medical informatics
ISSN: 2291-9694
Titre abrégé: JMIR Med Inform
Pays: Canada
ID NLM: 101645109
Informations de publication
Date de publication:
24 Dec 2021
24 Dec 2021
Historique:
received:
09
03
2021
accepted:
14
11
2021
revised:
01
08
2021
entrez:
24
12
2021
pubmed:
25
12
2021
medline:
25
12
2021
Statut:
epublish
Résumé
Pharmacovigilance and safety reporting, which involve processes for monitoring the use of medicines in clinical trials, play a critical role in the identification of previously unrecognized adverse events or changes in the patterns of adverse events. This study aims to demonstrate the feasibility of automating the coding of adverse events described in the narrative section of the serious adverse event report forms to enable statistical analysis of the aforementioned patterns. We used the Unified Medical Language System (UMLS) as the coding scheme, which integrates 217 source vocabularies, thus enabling coding against other relevant terminologies such as the International Classification of Diseases-10th Revision, Medical Dictionary for Regulatory Activities, and Systematized Nomenclature of Medicine). We used MetaMap, a highly configurable dictionary lookup software, to identify the mentions of the UMLS concepts. We trained a binary classifier using Bidirectional Encoder Representations from Transformers (BERT), a transformer-based language model that captures contextual relationships, to differentiate between mentions of the UMLS concepts that represented adverse events and those that did not. The model achieved a high F1 score of 0.8080, despite the class imbalance. This is 10.15 percent points lower than human-like performance but also 17.45 percent points higher than that of the baseline approach. These results confirmed that automated coding of adverse events described in the narrative section of serious adverse event reports is feasible. Once coded, adverse events can be statistically analyzed so that any correlations with the trialed medicines can be estimated in a timely fashion.
Sections du résumé
BACKGROUND
BACKGROUND
Pharmacovigilance and safety reporting, which involve processes for monitoring the use of medicines in clinical trials, play a critical role in the identification of previously unrecognized adverse events or changes in the patterns of adverse events.
OBJECTIVE
OBJECTIVE
This study aims to demonstrate the feasibility of automating the coding of adverse events described in the narrative section of the serious adverse event report forms to enable statistical analysis of the aforementioned patterns.
METHODS
METHODS
We used the Unified Medical Language System (UMLS) as the coding scheme, which integrates 217 source vocabularies, thus enabling coding against other relevant terminologies such as the International Classification of Diseases-10th Revision, Medical Dictionary for Regulatory Activities, and Systematized Nomenclature of Medicine). We used MetaMap, a highly configurable dictionary lookup software, to identify the mentions of the UMLS concepts. We trained a binary classifier using Bidirectional Encoder Representations from Transformers (BERT), a transformer-based language model that captures contextual relationships, to differentiate between mentions of the UMLS concepts that represented adverse events and those that did not.
RESULTS
RESULTS
The model achieved a high F1 score of 0.8080, despite the class imbalance. This is 10.15 percent points lower than human-like performance but also 17.45 percent points higher than that of the baseline approach.
CONCLUSIONS
CONCLUSIONS
These results confirmed that automated coding of adverse events described in the narrative section of serious adverse event reports is feasible. Once coded, adverse events can be statistically analyzed so that any correlations with the trialed medicines can be estimated in a timely fashion.
Identifiants
pubmed: 34951601
pii: v9i12e28632
doi: 10.2196/28632
pmc: PMC8742206
doi:
Types de publication
Journal Article
Langues
eng
Pagination
e28632Informations de copyright
©Daphne Chopard, Matthias S Treder, Padraig Corcoran, Nagheen Ahmed, Claire Johnson, Monica Busse, Irena Spasic. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 24.12.2021.
Références
J Am Med Inform Assoc. 2011 Sep-Oct;18(5):631-8
pubmed: 21709163
J Am Med Inform Assoc. 2021 Jul 14;28(7):1393-1400
pubmed: 33647938
J Am Med Inform Assoc. 2009 May-Jun;16(3):328-37
pubmed: 19261932
AMIA Annu Symp Proc. 2018 Apr 16;2017:679-688
pubmed: 29854133
J Am Med Inform Assoc. 2021 Mar 1;28(3):569-577
pubmed: 33150942
J Am Med Inform Assoc. 2017 Jul 1;24(4):813-821
pubmed: 28339747
IEEE J Biomed Health Inform. 2019 Jan;23(1):95-102
pubmed: 30059326
Artif Intell Med. 2016 Jun;70:62-76
pubmed: 27431037
J Am Med Inform Assoc. 2012 Nov-Dec;19(6):1011-8
pubmed: 22922172
Pharmacotherapy. 2018 Aug;38(8):822-841
pubmed: 29884988
JMIR Med Inform. 2019 Oct 31;7(4):e15980
pubmed: 31674914
J Am Med Inform Assoc. 2015 May;22(3):671-81
pubmed: 25755127
PLoS One. 2017 Nov 9;12(11):e0187121
pubmed: 29121053
J Biomed Inform. 2009 Oct;42(5):839-51
pubmed: 19435614
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D267-70
pubmed: 14681409
AMIA Annu Symp Proc. 2011;2011:1019-26
pubmed: 22195162
J Am Med Inform Assoc. 2017 Sep 1;24(5):913-920
pubmed: 28371826
Drug Saf. 2017 Nov;40(11):1075-1089
pubmed: 28643174
AMIA Annu Symp Proc. 2011;2011:217-26
pubmed: 22195073
AMIA Annu Symp Proc. 2012;2012:144-53
pubmed: 23304283
JMIR Med Inform. 2020 Mar 31;8(3):e17984
pubmed: 32229465
J Am Med Inform Assoc. 2020 Apr 1;27(4):584-591
pubmed: 32044989
J Med Internet Res. 2019 Feb 06;21(2):e11016
pubmed: 30724742
AMIA Annu Symp Proc. 2010 Nov 13;2010:177-81
pubmed: 21346964
Bioinformatics. 2020 Feb 15;36(4):1234-1240
pubmed: 31501885
AMIA Annu Symp Proc. 2011;2011:1270-9
pubmed: 22195188
J Am Med Inform Assoc. 2010 May-Jun;17(3):229-36
pubmed: 20442139
Brief Bioinform. 2014 Mar;15(2):327-40
pubmed: 23255168
Vaccine. 2009 Mar 23;27(14):2077-83
pubmed: 19428833
J Am Med Inform Assoc. 2010 Sep-Oct;17(5):532-5
pubmed: 20819858
JMIR Med Inform. 2020 Dec 3;8(12):e18953
pubmed: 33270027
J Am Med Inform Assoc. 2005 May-Jun;12(3):296-8
pubmed: 15684123
PLoS One. 2014 Nov 13;9(11):e112774
pubmed: 25393544
J Biomed Inform. 2015 Feb;53:196-207
pubmed: 25451103