Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Penalized Q-Learning for Dynamic Treatment Regimens.

Literature DB >> 26257504

Penalized Q-Learning for Dynamic Treatment Regimens.

R Song¹, W Wang¹, D Zeng¹, M R Kosorok¹.

Abstract

A dynamic treatment regimen incorporates both accrued information and long-term effects of treatment from specially designed clinical trials. As these trials become more and more popular in conjunction with longitudinal data from clinical studies, the development of statistical inference for optimal dynamic treatment regimens is a high priority. In this paper, we propose a new machine learning framework called penalized Q-learning, under which valid statistical inference is established. We also propose a new statistical procedure: individual selection and corresponding methods for incorporating individual selection within penalized Q-learning. Extensive numerical studies are presented which compare the proposed methods with existing methods, under a variety of scenarios, and demonstrate that the proposed approach is both inferentially and computationally superior. It is illustrated with a depression clinical trial study.

Entities: Chemical Disease Gene Mutation Species

Keywords: Dynamic treatment regimen; Individual selection; Multi-stage; Penalized Q-learning; Q-learning; Shrinkage; Two-stage procedure

Year: 2015 PMID： 26257504 PMCID： PMC4526274 DOI： 10.5705/ss.2012.364

Source DB: PubMed Journal: Stat Sin ISSN： 1017-0405 Impact factor: 1.261

11 in total

1. Estimation of survival distributions of treatment policies in two-stage randomization designs in clinical trials.

Authors: Jared K Lunceford; Marie Davidian; Anastasios A Tsiatis
Journal: Biometrics Date: 2002-03 Impact factor: 2.571

2. Optimal estimator for the survival distribution and related quantities for treatment policies in two-stage randomization designs in clinical trials.

Authors: Abdus S Wahed; Anastasios A Tsiatis
Journal: Biometrics Date: 2004-03 Impact factor: 2.571

3. An experimental design for the development of adaptive treatment strategies.

Authors: S A Murphy
Journal: Stat Med Date: 2005-05-30 Impact factor: 2.373

4. Bayesian and frequentist two-stage treatment strategies based on sequential failure times subject to interval censoring.

Authors: Peter F Thall; Leiko H Wooten; Christopher J Logothetis; Randall E Millikan; Nizar M Tannir
Journal: Stat Med Date: 2007-11-20 Impact factor: 2.373

5. Reinforcement learning design for cancer clinical trials.

Authors: Yufan Zhao; Michael R Kosorok; Donglin Zeng
Journal: Stat Med Date: 2009-11-20 Impact factor: 2.373

6. Evaluating multiple treatment courses in clinical trials.

Authors: P F Thall; R E Millikan; H G Sung
Journal: Stat Med Date: 2000-04-30 Impact factor: 2.373

7. Reinforcement learning strategies for clinical trials in nonsmall cell lung cancer.

Authors: Yufan Zhao; Donglin Zeng; Mark A Socinski; Michael R Kosorok
Journal: Biometrics Date: 2011-03-08 Impact factor: 2.571

Review 8. Inference for non-regular parameters in optimal dynamic treatment regimes.

Authors: Bibhas Chakraborty; Susan Murphy; Victor Strecher
Journal: Stat Methods Med Res Date: 2009-07-16 Impact factor: 3.021

9. One-step Sparse Estimates in Nonconcave Penalized Likelihood Models.

Authors: Hui Zou; Runze Li
Journal: Ann Stat Date: 2008-08-01 Impact factor: 4.028

10. Inference for optimal dynamic treatment regimes using an adaptive m-out-of-n bootstrap scheme.

Authors: Bibhas Chakraborty; Eric B Laber; Yingqi Zhao
Journal: Biometrics Date: 2013-07-11 Impact factor: 2.571

14 in total

1. iqLearn: Interactive Q-Learning in R.

Authors: Kristin A Linn; Eric B Laber; Leonard A Stefanski
Journal: J Stat Softw Date: 2015-03-20 Impact factor: 6.440

2. Q-learning residual analysis: application to the effectiveness of sequences of antipsychotic medications for patients with schizophrenia.

Authors: Ashkan Ertefaie; Susan Shortreed; Bibhas Chakraborty
Journal: Stat Med Date: 2016-01-10 Impact factor: 2.373

3. Entropy Learning for Dynamic Treatment Regimes.

Authors: Binyan Jiang; Rui Song; Jialiang Li; Donglin Zeng
Journal: Stat Sin Date: 2019 Impact factor: 1.261

4. Quantile-Optimal Treatment Regimes.

Authors: Lan Wang; Yu Zhou; Rui Song; Ben Sherwood
Journal: J Am Stat Assoc Date: 2018-06-08 Impact factor: 5.033

5. High-Dimensional Inference for Personalized Treatment Decision.

Authors: X Jessie Jeng; Wenbin Lu; Huimin Peng
Journal: Electron J Stat Date: 2018-06-21 Impact factor: 1.125

6. Sparse concordance-assisted learning for optimal treatment decision.

Authors: Shuhan Liang; Wenbin Lu; Rui Song; Lan Wang
Journal: J Mach Learn Res Date: 2018-04 Impact factor: 3.654

7. Regularized outcome weighted subgroup identification for differential treatment effects.

Authors: Yaoyao Xu; Menggang Yu; Ying-Qi Zhao; Quefeng Li; Sijian Wang; Jun Shao
Journal: Biometrics Date: 2015-05-11 Impact factor: 2.571

8. Precision Medicine.

Authors: Michael R Kosorok; Eric B Laber
Journal: Annu Rev Stat Appl Date: 2019-03 Impact factor: 5.810

9. A cure-rate model for Q-learning: Estimating an adaptive immunosuppressant treatment strategy for allogeneic hematopoietic cell transplant patients.

Authors: Erica E M Moodie; David A Stephens; Shomoita Alam; Mei-Jie Zhang; Brent Logan; Mukta Arora; Stephen Spellman; Elizabeth F Krakow
Journal: Biom J Date: 2018-05-16 Impact factor: 2.207

10. Interactive Q-learning for Quantiles.

Authors: Kristin A Linn; Eric B Laber; Leonard A Stefanski
Journal: J Am Stat Assoc Date: 2017-03-31 Impact factor: 5.033