Learning the Dynamic Treatment Regimes from Medical Registry Data through Deep Q-network.
Journal
Scientific reports
ISSN: 2045-2322
Titre abrégé: Sci Rep
Pays: England
ID NLM: 101563288
Informations de publication
Date de publication:
06 02 2019
06 02 2019
Historique:
received:
11
01
2018
accepted:
06
10
2018
entrez:
8
2
2019
pubmed:
8
2
2019
medline:
20
8
2020
Statut:
epublish
Résumé
This paper presents the deep reinforcement learning (DRL) framework to estimate the optimal Dynamic Treatment Regimes from observational medical data. This framework is more flexible and adaptive for high dimensional action and state spaces than existing reinforcement learning methods to model real-life complexity in heterogeneous disease progression and treatment choices, with the goal of providing doctors and patients the data-driven personalized decision recommendations. The proposed DRL framework comprises (i) a supervised learning step to predict expert actions, and (ii) a deep reinforcement learning step to estimate the long-term value function of Dynamic Treatment Regimes. Both steps depend on deep neural networks. As a key motivational example, we have implemented the proposed framework on a data set from the Center for International Bone Marrow Transplant Research (CIBMTR) registry database, focusing on the sequence of prevention and treatments for acute and chronic graft versus host disease after transplantation. In the experimental results, we have demonstrated promising accuracy in predicting human experts' decisions, as well as the high expected reward function in the DRL-based dynamic treatment regimes.
Identifiants
pubmed: 30728403
doi: 10.1038/s41598-018-37142-0
pii: 10.1038/s41598-018-37142-0
pmc: PMC6365640
doi:
Types de publication
Journal Article
Research Support, N.I.H., Extramural
Langues
eng
Sous-ensembles de citation
IM
Pagination
1495Subventions
Organisme : NHLBI NIH HHS
ID : U10 HL069294
Pays : United States
Organisme : NCI NIH HHS
ID : U24 CA076518
Pays : United States
Références
Bone Marrow Transplant. 2014 Feb;49(2):168-73
pubmed: 23892326
J Am Stat Assoc. 2015;110(510):583-598
pubmed: 26236062
Neuropsychopharmacology. 2007 Feb;32(2):257-62
pubmed: 17091129
Med Phys. 2017 Dec;44(12):6690-6705
pubmed: 29034482
Br J Haematol. 1986 Jun;63(2):221-30
pubmed: 3521712
Nature. 2015 Feb 26;518(7540):529-33
pubmed: 25719670
Clin Trials. 2004 Feb;1(1):9-20
pubmed: 16281458
Stat Med. 2005 May 30;24(10):1455-81
pubmed: 15586395
Blood. 1991 Apr 1;77(7):1423-8
pubmed: 2009366
Am J Epidemiol. 2017 Jul 15;186(2):160-172
pubmed: 28472335
Biometrics. 2007 Jun;63(2):447-55
pubmed: 17688497
Nature. 2016 Jan 28;529(7587):484-9
pubmed: 26819042
Biometrics. 2012 Dec;68(4):1010-8
pubmed: 22550953
Stat Med. 2009 Nov 20;28(26):3294-315
pubmed: 19750510
Health Psychol. 2015 Dec;34S:1220-8
pubmed: 26651463