Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Interactive model building for Q-learning.

Literature DB >> 25541562

Interactive model building for Q-learning.

Eric B Laber¹, Kristin A Linn¹, Leonard A Stefanski¹.

Abstract

Evidence-based rules for optimal treatment allocation are key components in the quest for efficient, effective health care delivery. Q-learning, an approximate dynamic programming algorithm, is a popular method for estimating optimal sequential decision rules from data. Q-learning requires the modeling of nonsmooth, nonmonotone transformations of the data, complicating the search for adequately expressive, yet parsimonious, statistical models. The default Q-learning working model is multiple linear regression, which is not only provably misspecified under most data-generating models, but also results in nonregular regression estimators, complicating inference. We propose an alternative strategy for estimating optimal sequential decision rules for which the requisite statistical modeling does not depend on nonsmooth, nonmonotone transformed data, does not result in nonregular regression estimators, is consistent under a broader array of data-generation models than Q-learning, results in estimated sequential decision rules that have better sampling properties, and is amenable to established statistical approaches for exploratory data analysis, model building, and validation. We derive the new method, IQ-learning, via an interchange in the order of certain steps in Q-learning. In simulated experiments IQ-learning improves on Q-learning in terms of integrated mean squared error and power. The method is illustrated using data from a study of major depressive disorder.

Entities: Chemical Disease Gene Species

Keywords: Dynamic Treatment Regime; Personalized Medicine; Treatment Selection

Year: 2014 PMID： 25541562 PMCID： PMC4274394 DOI： 10.1093/biomet/asu043

Source DB: PubMed Journal: Biometrika ISSN： 0006-3444 Impact factor: 2.445

13 in total

1. Model checking with residuals for g-estimation of optimal dynamic treatment regimes.

Authors: Benjamin Rich; Erica E M Moodie; David A Stephens; Robert W Platt
Journal: Int J Biostat Date: 2010 Impact factor: 0.968

2. An experimental design for the development of adaptive treatment strategies.

Authors: S A Murphy
Journal: Stat Med Date: 2005-05-30 Impact factor: 2.373

3. Dynamic treatment regimes: practical design considerations.

Authors: Philip W Lavori; Ree Dawson
Journal: Clin Trials Date: 2004-02 Impact factor: 2.486

4. A Generalization Error for Q-Learning.

Authors: Susan A Murphy
Journal: J Mach Learn Res Date: 2005-07 Impact factor: 3.654

5. Regret-regression for optimal dynamic treatment regimes.

Authors: Robin Henderson; Phil Ansell; Deyadeen Alshibani
Journal: Biometrics Date: 2010-12 Impact factor: 2.571

6. Estimating Optimal Dynamic Regimes: Correcting Bias under the Null: [Optimal dynamic regimes: bias correction].

Authors: Erica E M Moodie; Thomas S Richardson
Journal: Scand Stat Theory Appl Date: 2009-09-22 Impact factor: 1.396

7. Reinforcement learning strategies for clinical trials in nonsmall cell lung cancer.

Authors: Yufan Zhao; Donglin Zeng; Mark A Socinski; Michael R Kosorok
Journal: Biometrics Date: 2011-03-08 Impact factor: 2.571

Review 8. Inference for non-regular parameters in optimal dynamic treatment regimes.

Authors: Bibhas Chakraborty; Susan Murphy; Victor Strecher
Journal: Stat Methods Med Res Date: 2009-07-16 Impact factor: 3.021

9. Sequenced treatment alternatives to relieve depression (STAR*D): rationale and design.

Authors: A John Rush; Maurizio Fava; Stephen R Wisniewski; Philip W Lavori; Madhukar H Trivedi; Harold A Sackeim; Michael E Thase; Andrew A Nierenberg; Frederic M Quitkin; T Michael Kashner; David J Kupfer; Jerrold F Rosenbaum; Jonathan Alpert; Jonathan W Stewart; Patrick J McGrath; Melanie M Biggs; Kathy Shores-Wilson; Barry D Lebowitz; Louise Ritz; George Niederehe
Journal: Control Clin Trials Date: 2004-02

10. Evaluation of Viable Dynamic Treatment Regimes in a Sequentially Randomized Trial of Advanced Prostate Cancer.

Authors: Lu Wang; Andrea Rotnitzky; Xihong Lin; Randall E Millikan; Peter F Thall
Journal: J Am Stat Assoc Date: 2012-06 Impact factor: 5.033

24 in total

1. iqLearn: Interactive Q-Learning in R.

Authors: Kristin A Linn; Eric B Laber; Leonard A Stefanski
Journal: J Stat Softw Date: 2015-03-20 Impact factor: 6.440

2. Q-learning residual analysis: application to the effectiveness of sequences of antipsychotic medications for patients with schizophrenia.

Authors: Ashkan Ertefaie; Susan Shortreed; Bibhas Chakraborty
Journal: Stat Med Date: 2016-01-10 Impact factor: 2.373

3. Comment.

Authors: Qian Guan; Eric B Laber; Brian J Reich
Journal: J Am Stat Assoc Date: 2016-10-18 Impact factor: 5.033

4. High-Dimensional Inference for Personalized Treatment Decision.

Authors: X Jessie Jeng; Wenbin Lu; Huimin Peng
Journal: Electron J Stat Date: 2018-06-21 Impact factor: 1.125

5. Identifying optimal dosage regimes under safety constraints: An application to long term opioid treatment of chronic pain.

Authors: Eric B Laber; Fan Wu; Catherine Munera; Ilya Lipkovich; Salvatore Colucci; Steve Ripa
Journal: Stat Med Date: 2018-02-21 Impact factor: 2.373

6. Precision Medicine.

Authors: Michael R Kosorok; Eric B Laber
Journal: Annu Rev Stat Appl Date: 2019-03 Impact factor: 5.810

7. Tree-based methods for individualized treatment regimes.

Authors: E B Laber; Y Q Zhao
Journal: Biometrika Date: 2015-07-15 Impact factor: 2.445

8. A cure-rate model for Q-learning: Estimating an adaptive immunosuppressant treatment strategy for allogeneic hematopoietic cell transplant patients.

Authors: Erica E M Moodie; David A Stephens; Shomoita Alam; Mei-Jie Zhang; Brent Logan; Mukta Arora; Stephen Spellman; Elizabeth F Krakow
Journal: Biom J Date: 2018-05-16 Impact factor: 2.207

9. Interactive Q-learning for Quantiles.

Authors: Kristin A Linn; Eric B Laber; Leonard A Stefanski
Journal: J Am Stat Assoc Date: 2017-03-31 Impact factor: 5.033

10. Noninferiority and equivalence tests in sequential, multiple assignment, randomized trials (SMARTs).

Authors: Palash Ghosh; Inbal Nahum-Shani; Bonnie Spring; Bibhas Chakraborty
Journal: Psychol Methods Date: 2019-09-09