Causal relationship extraction from biomedical text using deep neural models: A comprehensive survey.

Biomedical cause-effect Information extraction Natural language processing Relation extraction

Journal

Journal of biomedical informatics
ISSN: 1532-0480
Titre abrégé: J Biomed Inform
Pays: United States
ID NLM: 100970413

Informations de publication

Date de publication:
07 2021
Historique:
received: 10 06 2020
revised: 08 05 2021
accepted: 15 05 2021
pubmed: 28 5 2021
medline: 13 8 2021
entrez: 27 5 2021
Statut: ppublish

Résumé

The identification of causal relationships between events or entities within biomedical texts is of great importance for creating scientific knowledge bases and is also a fundamental natural language processing (NLP) task. A causal (cause-effect) relation is defined as an association between two events in which the first must occur before the second. Although this task is an open problem in artificial intelligence, and despite its important role in information extraction from the biomedical literature, very few works have considered this problem. However, with the advent of new techniques in machine learning, especially deep neural networks, research increasingly addresses this problem. This paper summarizes state-of-the-art research, its applications, existing datasets, and remaining challenges. For this survey we have implemented and evaluated various techniques including a Multiview CNN (MVC), attention-based BiLSTM models and state-of-the-art word embedding models, such as those obtained with bidirectional encoder representations (ELMo) and transformer architectures (BioBERT). In addition, we have evaluated a graph LSTM as well as a baseline rule based system. We have investigated the class imbalance problem as an innate property of annotated data in this type of task. The results show that a considerable improvement of the results of state-of-the-art systems can be achieved when a simple random oversampling technique for data augmentation is used in order to reduce class imbalance.

Identifiants

pubmed: 34044157
pii: S1532-0464(21)00149-0
doi: 10.1016/j.jbi.2021.103820
pii:
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

103820

Informations de copyright

Copyright © 2021 Elsevier Inc. All rights reserved.

Auteurs

Abbas Akkasi (A)

Language Intelligence and Information Retrieval Lab, KU Leuven, Belgium; Department of Computer Science, Celestijnenlaan 200 A, Leuven, Belgium. Electronic address: abbas.akkasi@gmail.com.

Mari-Francine Moens (MF)

Language Intelligence and Information Retrieval Lab, KU Leuven, Belgium; Department of Computer Science, Celestijnenlaan 200 A, Leuven, Belgium. Electronic address: sien.moens@kuleuven.be.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH