Off-Policy Estimation of Long-Term Average Outcomes with Applications to Mobile Health.

markov decision process policy evaluation reinforcement learning sequential decision making

Journal

Journal of the American Statistical Association
ISSN: 0162-1459
Titre abrégé: J Am Stat Assoc
Pays: United States
ID NLM: 01510020R

Informations de publication

Date de publication:
2021
Historique:
entrez: 5 4 2021
pubmed: 6 4 2021
medline: 6 4 2021
Statut: ppublish

Résumé

Due to the recent advancements in wearables and sensing technology, health scientists are increasingly developing mobile health (mHealth) interventions. In mHealth interventions, mobile devices are used to deliver treatment to individuals as they go about their daily lives. These treatments are generally designed to impact a near time, proximal outcome such as stress or physical activity. The mHealth intervention policies, often called just-in-time adaptive interventions, are decision rules that map a individual's current state (e.g., individual's past behaviors as well as current observations of time, location, social activity, stress and urges to smoke) to a particular treatment at each of many time points. The vast majority of current mHealth interventions deploy expert-derived policies. In this paper, we provide an approach for conducting inference about the performance of one or more such policies using historical data collected under a possibly different policy. Our measure of performance is the average of proximal outcomes over a long time period should the particular mHealth policy be followed. We provide an estimator as well as confidence intervals. This work is motivated by HeartSteps, an mHealth physical activity intervention.

Identifiants

pubmed: 33814653
doi: 10.1080/01621459.2020.1807993
pmc: PMC8014957
mid: NIHMS1619552
doi:

Types de publication

Journal Article

Langues

eng

Pagination

382-391

Subventions

Organisme : NIBIB NIH HHS
ID : U54 EB020404
Pays : United States
Organisme : NIDA NIH HHS
ID : P50 DA039838
Pays : United States
Organisme : NIAAA NIH HHS
ID : R01 AA023187
Pays : United States
Organisme : NHLBI NIH HHS
ID : R01 HL125440
Pays : United States
Organisme : NCI NIH HHS
ID : U01 CA229437
Pays : United States
Organisme : NIDA NIH HHS
ID : P50 DA054039
Pays : United States
Organisme : NIBIB NIH HHS
ID : P41 EB028242
Pays : United States
Organisme : NIDA NIH HHS
ID : R01 DA039901
Pays : United States

Références

BMC Med Inform Decis Mak. 2018 Feb 20;18(1):12
pubmed: 29458358
Proc ACM Interact Mob Wearable Ubiquitous Technol. 2018 Dec;2(4):
pubmed: 30801052
Ann Behav Med. 2018 May 18;52(6):446-462
pubmed: 27663578
Stat Med. 2016 May 30;35(12):1944-71
pubmed: 26707831
Health Psychol. 2015 Dec;34S:1220-8
pubmed: 26651463
JMIR Res Protoc. 2018 Jul 18;7(7):e166
pubmed: 30021714
Stud Health Technol Inform. 2014;199:35-9
pubmed: 24875686
J Am Stat Assoc. 2001 Dec 1;96(456):1410-1423
pubmed: 20019887
Ann Behav Med. 2019 May 3;53(6):573-582
pubmed: 30192907
Ann Stat. 2016 Aug;44(4):1400-1437
pubmed: 28428647
J Am Stat Assoc. 2020;115(530):692-706
pubmed: 32952236

Auteurs

Peng Liao (P)

Department of Statistics, University of Michigan.

Predrag Klasnja (P)

School of Information, University of Michigan.

Susan Murphy (S)

Department of Statistics, Harvard University.

Classifications MeSH