Hierarchical reinforcement learning for automatic disease diagnosis.


Journal

Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944

Informations de publication

Date de publication:
10 08 2022
Historique:
received: 21 03 2022
revised: 16 05 2022
accepted: 29 06 2022
pubmed: 2 7 2022
medline: 15 11 2022
entrez: 1 7 2022
Statut: ppublish

Résumé

Disease diagnosis-oriented dialog system models the interactive consultation procedure as the Markov decision process, and reinforcement learning algorithms are used to solve the problem. Existing approaches usually employ a flat policy structure that treat all symptoms and diseases equally for action making. This strategy works well in a simple scenario when the action space is small; however, its efficiency will be challenged in the real environment. Inspired by the offline consultation process, we propose to integrate a hierarchical policy structure of two levels into the dialog system for policy learning. The high-level policy consists of a master model that is responsible for triggering a low-level model, the low-level policy consists of several symptom checkers and a disease classifier. The proposed policy structure is capable to deal with diagnosis problem including large number of diseases and symptoms. Experimental results on three real-world datasets and a synthetic dataset demonstrate that our hierarchical framework achieves higher accuracy and symptom recall in disease diagnosis compared with existing systems. We construct a benchmark including datasets and implementation of existing algorithms to encourage follow-up researches. The code and data are available from https://github.com/FudanDISC/DISCOpen-MedBox-DialoDiagnosis. Supplementary data are available at Bioinformatics online.

Identifiants

pubmed: 35775965
pii: 6625731
doi: 10.1093/bioinformatics/btac408
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

3995-4001

Subventions

Organisme : Natural Science Foundation of China
ID : 71991471
Organisme : Science and Technology Commission of Shanghai Municipality Grant
ID : 20dz1200600
Organisme : Zhejiang Lab
ID : 2019KD0AD01

Informations de copyright

© The Author(s) 2022. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Auteurs

Cheng Zhong (C)

School of Data Science, Fudan University, 200433 Shanghai, China.

Kangenbei Liao (K)

School of Data Science, Fudan University, 200433 Shanghai, China.

Wei Chen (W)

School of Data Science, Fudan University, 200433 Shanghai, China.

Qianlong Liu (Q)

Alibaba Group, 310052 Hangzhou, China.

Baolin Peng (B)

Microsoft Research, Redmond, WA 98052, USA.

Xuanjing Huang (X)

School of Computer Science, Fudan University, 200433 Shanghai, China.

Jiajie Peng (J)

Research Institute of Intelligent Complex Symtems, Fudan University, 200433 Shanghai, China.

Zhongyu Wei (Z)

School of Data Science, Fudan University, 200433 Shanghai, China.
Research Institute of Intelligent Complex Symtems, Fudan University, 200433 Shanghai, China.

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages
Databases, Protein Protein Domains Protein Folding Proteins Deep Learning
1.00
Humans Magnetic Resonance Imaging Brain Infant, Newborn Infant, Premature
Humans Algorithms Software Artificial Intelligence Computer Simulation

Classifications MeSH