Literature DB >> 26257504

Penalized Q-Learning for Dynamic Treatment Regimens.

R Song1, W Wang1, D Zeng1, M R Kosorok1.   

Abstract

A dynamic treatment regimen incorporates both accrued information and long-term effects of treatment from specially designed clinical trials. As these trials become more and more popular in conjunction with longitudinal data from clinical studies, the development of statistical inference for optimal dynamic treatment regimens is a high priority. In this paper, we propose a new machine learning framework called penalized Q-learning, under which valid statistical inference is established. We also propose a new statistical procedure: individual selection and corresponding methods for incorporating individual selection within penalized Q-learning. Extensive numerical studies are presented which compare the proposed methods with existing methods, under a variety of scenarios, and demonstrate that the proposed approach is both inferentially and computationally superior. It is illustrated with a depression clinical trial study.

Entities:  

Keywords:  Dynamic treatment regimen; Individual selection; Multi-stage; Penalized Q-learning; Q-learning; Shrinkage; Two-stage procedure

Year:  2015        PMID: 26257504      PMCID: PMC4526274          DOI: 10.5705/ss.2012.364

Source DB:  PubMed          Journal:  Stat Sin        ISSN: 1017-0405            Impact factor:   1.261


  11 in total

1.  Estimation of survival distributions of treatment policies in two-stage randomization designs in clinical trials.

Authors:  Jared K Lunceford; Marie Davidian; Anastasios A Tsiatis
Journal:  Biometrics       Date:  2002-03       Impact factor: 2.571

2.  Optimal estimator for the survival distribution and related quantities for treatment policies in two-stage randomization designs in clinical trials.

Authors:  Abdus S Wahed; Anastasios A Tsiatis
Journal:  Biometrics       Date:  2004-03       Impact factor: 2.571

3.  An experimental design for the development of adaptive treatment strategies.

Authors:  S A Murphy
Journal:  Stat Med       Date:  2005-05-30       Impact factor: 2.373

4.  Bayesian and frequentist two-stage treatment strategies based on sequential failure times subject to interval censoring.

Authors:  Peter F Thall; Leiko H Wooten; Christopher J Logothetis; Randall E Millikan; Nizar M Tannir
Journal:  Stat Med       Date:  2007-11-20       Impact factor: 2.373

5.  Reinforcement learning design for cancer clinical trials.

Authors:  Yufan Zhao; Michael R Kosorok; Donglin Zeng
Journal:  Stat Med       Date:  2009-11-20       Impact factor: 2.373

6.  Evaluating multiple treatment courses in clinical trials.

Authors:  P F Thall; R E Millikan; H G Sung
Journal:  Stat Med       Date:  2000-04-30       Impact factor: 2.373

7.  Reinforcement learning strategies for clinical trials in nonsmall cell lung cancer.

Authors:  Yufan Zhao; Donglin Zeng; Mark A Socinski; Michael R Kosorok
Journal:  Biometrics       Date:  2011-03-08       Impact factor: 2.571

Review 8.  Inference for non-regular parameters in optimal dynamic treatment regimes.

Authors:  Bibhas Chakraborty; Susan Murphy; Victor Strecher
Journal:  Stat Methods Med Res       Date:  2009-07-16       Impact factor: 3.021

9.  One-step Sparse Estimates in Nonconcave Penalized Likelihood Models.

Authors:  Hui Zou; Runze Li
Journal:  Ann Stat       Date:  2008-08-01       Impact factor: 4.028

10.  Inference for optimal dynamic treatment regimes using an adaptive m-out-of-n bootstrap scheme.

Authors:  Bibhas Chakraborty; Eric B Laber; Yingqi Zhao
Journal:  Biometrics       Date:  2013-07-11       Impact factor: 2.571

View more
  14 in total

1.  iqLearn: Interactive Q-Learning in R.

Authors:  Kristin A Linn; Eric B Laber; Leonard A Stefanski
Journal:  J Stat Softw       Date:  2015-03-20       Impact factor: 6.440

2.  Q-learning residual analysis: application to the effectiveness of sequences of antipsychotic medications for patients with schizophrenia.

Authors:  Ashkan Ertefaie; Susan Shortreed; Bibhas Chakraborty
Journal:  Stat Med       Date:  2016-01-10       Impact factor: 2.373

3.  Entropy Learning for Dynamic Treatment Regimes.

Authors:  Binyan Jiang; Rui Song; Jialiang Li; Donglin Zeng
Journal:  Stat Sin       Date:  2019       Impact factor: 1.261

4.  Quantile-Optimal Treatment Regimes.

Authors:  Lan Wang; Yu Zhou; Rui Song; Ben Sherwood
Journal:  J Am Stat Assoc       Date:  2018-06-08       Impact factor: 5.033

5.  High-Dimensional Inference for Personalized Treatment Decision.

Authors:  X Jessie Jeng; Wenbin Lu; Huimin Peng
Journal:  Electron J Stat       Date:  2018-06-21       Impact factor: 1.125

6.  Sparse concordance-assisted learning for optimal treatment decision.

Authors:  Shuhan Liang; Wenbin Lu; Rui Song; Lan Wang
Journal:  J Mach Learn Res       Date:  2018-04       Impact factor: 3.654

7.  Regularized outcome weighted subgroup identification for differential treatment effects.

Authors:  Yaoyao Xu; Menggang Yu; Ying-Qi Zhao; Quefeng Li; Sijian Wang; Jun Shao
Journal:  Biometrics       Date:  2015-05-11       Impact factor: 2.571

8.  Precision Medicine.

Authors:  Michael R Kosorok; Eric B Laber
Journal:  Annu Rev Stat Appl       Date:  2019-03       Impact factor: 5.810

9.  A cure-rate model for Q-learning: Estimating an adaptive immunosuppressant treatment strategy for allogeneic hematopoietic cell transplant patients.

Authors:  Erica E M Moodie; David A Stephens; Shomoita Alam; Mei-Jie Zhang; Brent Logan; Mukta Arora; Stephen Spellman; Elizabeth F Krakow
Journal:  Biom J       Date:  2018-05-16       Impact factor: 2.207

10.  Interactive Q-learning for Quantiles.

Authors:  Kristin A Linn; Eric B Laber; Leonard A Stefanski
Journal:  J Am Stat Assoc       Date:  2017-03-31       Impact factor: 5.033

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.