ADPG: Biomedical entity recognition based on Automatic Dependency Parsing Graph.


Journal

Journal of biomedical informatics
ISSN: 1532-0480
Titre abrégé: J Biomed Inform
Pays: United States
ID NLM: 100970413

Informations de publication

Date de publication:
04 2023
Historique:
received: 26 11 2022
revised: 19 01 2023
accepted: 08 02 2023
medline: 4 4 2023
pubmed: 23 2 2023
entrez: 22 2 2023
Statut: ppublish

Résumé

Named entity recognition is a key task in text mining. In the biomedical field, entity recognition focuses on extracting key information from large-scale biomedical texts for the downstream information extraction task. Biomedical literature contains a large amount of long-dependent text, and previous studies use external syntactic parsing tools to capture word dependencies in sentences to achieve nested biomedical entity recognition. However, the addition of external parsing tools often introduces unnecessary noise to the current auxiliary task and cannot improve the performance of entity recognition in an end-to-end way. Therefore, we propose a novel automatic dependency parsing approach, namely the ADPG model, to fuse syntactic structure information in an end-to-end way to recognize biomedical entities. Specifically, the method is based on a multilayer Tree-Transformer structure to automatically extract the semantic representation and syntactic structure in long-dependent sentences, and then combines a multilayer graph attention neural network (GAT) to extract the dependency paths between words in the syntactic structure to improve the performance of biomedical entity recognition. We evaluated our ADPG model on three biomedical domain and one news domain datasets, and the experimental results demonstrate that our model achieves state-of-the-art results on these four datasets with certain generalization performance. Our model is released on GitHub: https://github.com/Yumeng-Y/ADPG.

Identifiants

pubmed: 36804374
pii: S1532-0464(23)00038-2
doi: 10.1016/j.jbi.2023.104317
pii:
doi:

Substances chimiques

Adenosine Diphosphate Glucose 2140-58-1

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

104317

Informations de copyright

Copyright © 2023 Elsevier Inc. All rights reserved.

Déclaration de conflit d'intérêts

Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Auteurs

Yumeng Yang (Y)

School of Computer Science and Technology, Dalian University of Technology, Dalian, China. Electronic address: yumeng.yang@dlut.edu.cn.

Hongfei Lin (H)

School of Computer Science and Technology, Dalian University of Technology, Dalian, China. Electronic address: hflin@dlut.edu.cn.

Zhihao Yang (Z)

School of Computer Science and Technology, Dalian University of Technology, Dalian, China. Electronic address: yangzh@dlut.edu.cn.

Yijia Zhang (Y)

School of Information Science and Technology, Dalian Maritime University, Dalian, China. Electronic address: zhangyijia@dlmu.edu.cn.

Di Zhao (D)

School of Computer Science and Engineering, Dalian Minzu University, Dalian, China. Electronic address: zhaodi@dlnu.edu.cn.

Shuaiheng Huai (S)

School of Information Science and Technology, Dalian Maritime University, Dalian, China. Electronic address: huaishuaiheng@dlmu.edu.cn.

Articles similaires

Unsupervised learning for real-time and continuous gait phase detection.

Dollaporn Anopas, Yodchanan Wongsawat, Jetsada Arnin
1.00
Humans Gait Neural Networks, Computer Unsupervised Machine Learning Walking
Humans Shoulder Fractures Tomography, X-Ray Computed Neural Networks, Computer Female
Humans Artificial Intelligence Neoplasms Prognosis Image Processing, Computer-Assisted
Humans Deep Learning Mouth Neoplasms Drug Resistance, Neoplasm Cell Line, Tumor

Classifications MeSH