Literature DB >> 25541562

Interactive model building for Q-learning.

Eric B Laber1, Kristin A Linn1, Leonard A Stefanski1.   

Abstract

Evidence-based rules for optimal treatment allocation are key components in the quest for efficient, effective health care delivery. Q-learning, an approximate dynamic programming algorithm, is a popular method for estimating optimal sequential decision rules from data. Q-learning requires the modeling of nonsmooth, nonmonotone transformations of the data, complicating the search for adequately expressive, yet parsimonious, statistical models. The default Q-learning working model is multiple linear regression, which is not only provably misspecified under most data-generating models, but also results in nonregular regression estimators, complicating inference. We propose an alternative strategy for estimating optimal sequential decision rules for which the requisite statistical modeling does not depend on nonsmooth, nonmonotone transformed data, does not result in nonregular regression estimators, is consistent under a broader array of data-generation models than Q-learning, results in estimated sequential decision rules that have better sampling properties, and is amenable to established statistical approaches for exploratory data analysis, model building, and validation. We derive the new method, IQ-learning, via an interchange in the order of certain steps in Q-learning. In simulated experiments IQ-learning improves on Q-learning in terms of integrated mean squared error and power. The method is illustrated using data from a study of major depressive disorder.

Entities:  

Keywords:  Dynamic Treatment Regime; Personalized Medicine; Treatment Selection

Year:  2014        PMID: 25541562      PMCID: PMC4274394          DOI: 10.1093/biomet/asu043

Source DB:  PubMed          Journal:  Biometrika        ISSN: 0006-3444            Impact factor:   2.445


  13 in total

1.  Model checking with residuals for g-estimation of optimal dynamic treatment regimes.

Authors:  Benjamin Rich; Erica E M Moodie; David A Stephens; Robert W Platt
Journal:  Int J Biostat       Date:  2010       Impact factor: 0.968

2.  An experimental design for the development of adaptive treatment strategies.

Authors:  S A Murphy
Journal:  Stat Med       Date:  2005-05-30       Impact factor: 2.373

3.  Dynamic treatment regimes: practical design considerations.

Authors:  Philip W Lavori; Ree Dawson
Journal:  Clin Trials       Date:  2004-02       Impact factor: 2.486

4.  A Generalization Error for Q-Learning.

Authors:  Susan A Murphy
Journal:  J Mach Learn Res       Date:  2005-07       Impact factor: 3.654

5.  Regret-regression for optimal dynamic treatment regimes.

Authors:  Robin Henderson; Phil Ansell; Deyadeen Alshibani
Journal:  Biometrics       Date:  2010-12       Impact factor: 2.571

6.  Estimating Optimal Dynamic Regimes: Correcting Bias under the Null: [Optimal dynamic regimes: bias correction].

Authors:  Erica E M Moodie; Thomas S Richardson
Journal:  Scand Stat Theory Appl       Date:  2009-09-22       Impact factor: 1.396

7.  Reinforcement learning strategies for clinical trials in nonsmall cell lung cancer.

Authors:  Yufan Zhao; Donglin Zeng; Mark A Socinski; Michael R Kosorok
Journal:  Biometrics       Date:  2011-03-08       Impact factor: 2.571

Review 8.  Inference for non-regular parameters in optimal dynamic treatment regimes.

Authors:  Bibhas Chakraborty; Susan Murphy; Victor Strecher
Journal:  Stat Methods Med Res       Date:  2009-07-16       Impact factor: 3.021

9.  Sequenced treatment alternatives to relieve depression (STAR*D): rationale and design.

Authors:  A John Rush; Maurizio Fava; Stephen R Wisniewski; Philip W Lavori; Madhukar H Trivedi; Harold A Sackeim; Michael E Thase; Andrew A Nierenberg; Frederic M Quitkin; T Michael Kashner; David J Kupfer; Jerrold F Rosenbaum; Jonathan Alpert; Jonathan W Stewart; Patrick J McGrath; Melanie M Biggs; Kathy Shores-Wilson; Barry D Lebowitz; Louise Ritz; George Niederehe
Journal:  Control Clin Trials       Date:  2004-02

10.  Evaluation of Viable Dynamic Treatment Regimes in a Sequentially Randomized Trial of Advanced Prostate Cancer.

Authors:  Lu Wang; Andrea Rotnitzky; Xihong Lin; Randall E Millikan; Peter F Thall
Journal:  J Am Stat Assoc       Date:  2012-06       Impact factor: 5.033

View more
  24 in total

1.  iqLearn: Interactive Q-Learning in R.

Authors:  Kristin A Linn; Eric B Laber; Leonard A Stefanski
Journal:  J Stat Softw       Date:  2015-03-20       Impact factor: 6.440

2.  Q-learning residual analysis: application to the effectiveness of sequences of antipsychotic medications for patients with schizophrenia.

Authors:  Ashkan Ertefaie; Susan Shortreed; Bibhas Chakraborty
Journal:  Stat Med       Date:  2016-01-10       Impact factor: 2.373

3.  Comment.

Authors:  Qian Guan; Eric B Laber; Brian J Reich
Journal:  J Am Stat Assoc       Date:  2016-10-18       Impact factor: 5.033

4.  High-Dimensional Inference for Personalized Treatment Decision.

Authors:  X Jessie Jeng; Wenbin Lu; Huimin Peng
Journal:  Electron J Stat       Date:  2018-06-21       Impact factor: 1.125

5.  Identifying optimal dosage regimes under safety constraints: An application to long term opioid treatment of chronic pain.

Authors:  Eric B Laber; Fan Wu; Catherine Munera; Ilya Lipkovich; Salvatore Colucci; Steve Ripa
Journal:  Stat Med       Date:  2018-02-21       Impact factor: 2.373

6.  Precision Medicine.

Authors:  Michael R Kosorok; Eric B Laber
Journal:  Annu Rev Stat Appl       Date:  2019-03       Impact factor: 5.810

7.  Tree-based methods for individualized treatment regimes.

Authors:  E B Laber; Y Q Zhao
Journal:  Biometrika       Date:  2015-07-15       Impact factor: 2.445

8.  A cure-rate model for Q-learning: Estimating an adaptive immunosuppressant treatment strategy for allogeneic hematopoietic cell transplant patients.

Authors:  Erica E M Moodie; David A Stephens; Shomoita Alam; Mei-Jie Zhang; Brent Logan; Mukta Arora; Stephen Spellman; Elizabeth F Krakow
Journal:  Biom J       Date:  2018-05-16       Impact factor: 2.207

9.  Interactive Q-learning for Quantiles.

Authors:  Kristin A Linn; Eric B Laber; Leonard A Stefanski
Journal:  J Am Stat Assoc       Date:  2017-03-31       Impact factor: 5.033

10.  Noninferiority and equivalence tests in sequential, multiple assignment, randomized trials (SMARTs).

Authors:  Palash Ghosh; Inbal Nahum-Shani; Bonnie Spring; Bibhas Chakraborty
Journal:  Psychol Methods       Date:  2019-09-09
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.