Literature DB >> 22754029

Q-LEARNING WITH CENSORED DATA.

Yair Goldberg1, Michael R Kosorok.   

Abstract

We develop methodology for a multistage-decision problem with flexible number of stages in which the rewards are survival times that are subject to censoring. We present a novel Q-learning algorithm that is adjusted for censored data and allows a flexible number of stages. We provide finite sample bounds on the generalization error of the policy learned by the algorithm, and show that when the optimal Q-function belongs to the approximation space, the expected survival time for policies obtained by the algorithm converges to that of the optimal policy. We simulate a multistage clinical trial with flexible number of stages and apply the proposed censored-Q-learning algorithm to find individualized treatment regimens. The methodology presented in this paper has implications in the design of personalized medicine trials in cancer and in other life-threatening diseases.

Entities:  

Year:  2012        PMID: 22754029      PMCID: PMC3385950          DOI: 10.1214/12-AOS968

Source DB:  PubMed          Journal:  Ann Stat        ISSN: 0090-5364            Impact factor:   4.028


  18 in total

1.  Estimation of survival distributions of treatment policies in two-stage randomization designs in clinical trials.

Authors:  Jared K Lunceford; Marie Davidian; Anastasios A Tsiatis
Journal:  Biometrics       Date:  2002-03       Impact factor: 2.571

2.  Weighted Kaplan-Meier estimators for two-stage treatment regimes.

Authors:  Sachiko Miyahara; Abdus S Wahed
Journal:  Stat Med       Date:  2010-11-10       Impact factor: 2.373

3.  An experimental design for the development of adaptive treatment strategies.

Authors:  S A Murphy
Journal:  Stat Med       Date:  2005-05-30       Impact factor: 2.373

4.  Dynamic treatment regimes: practical design considerations.

Authors:  Philip W Lavori; Ree Dawson
Journal:  Clin Trials       Date:  2004-02       Impact factor: 2.486

5.  A Generalization Error for Q-Learning.

Authors:  Susan A Murphy
Journal:  J Mach Learn Res       Date:  2005-07       Impact factor: 3.654

6.  Bayesian and frequentist two-stage treatment strategies based on sequential failure times subject to interval censoring.

Authors:  Peter F Thall; Leiko H Wooten; Christopher J Logothetis; Randall E Millikan; Nizar M Tannir
Journal:  Stat Med       Date:  2007-11-20       Impact factor: 2.373

7.  Estimation and extrapolation of optimal treatment and testing strategies.

Authors:  James Robins; Liliana Orellana; Andrea Rotnitzky
Journal:  Stat Med       Date:  2008-10-15       Impact factor: 2.373

8.  Reinforcement learning design for cancer clinical trials.

Authors:  Yufan Zhao; Michael R Kosorok; Donglin Zeng
Journal:  Stat Med       Date:  2009-11-20       Impact factor: 2.373

9.  Methodological challenges in constructing effective treatment sequences for chronic psychiatric disorders.

Authors:  Susan A Murphy; David W Oslin; A John Rush; Ji Zhu
Journal:  Neuropsychopharmacology       Date:  2006-11-08       Impact factor: 7.853

Review 10.  Considerations for second-line therapy of non-small cell lung cancer.

Authors:  Thomas E Stinchcombe; Mark A Socinski
Journal:  Oncologist       Date:  2008
View more
  35 in total

1.  Doubly Robust Learning for Estimating Individualized Treatment with Censored Data.

Authors:  Y Q Zhao; D Zeng; E B Laber; R Song; M Yuan; M R Kosorok
Journal:  Biometrika       Date:  2015-03-01       Impact factor: 2.445

2.  Tree based weighted learning for estimating individualized treatment rules with censored data.

Authors:  Yifan Cui; Ruoqing Zhu; Michael Kosorok
Journal:  Electron J Stat       Date:  2017-10-18       Impact factor: 1.125

3.  DOUBLY ROBUST ESTIMATION OF OPTIMAL TREATMENT REGIMES FOR SURVIVAL DATA-WITH APPLICATION TO AN HIV/AIDS STUDY.

Authors:  Runchao Jiang; Wenbin Lu; Rui Song; Michael G Hudgens; Sonia Naprvavnik
Journal:  Ann Appl Stat       Date:  2017-10-05       Impact factor: 2.083

Review 4.  Recent development on statistical methods for personalized medicine discovery.

Authors:  Yingqi Zhao; Donglin Zeng
Journal:  Front Med       Date:  2013-02-02       Impact factor: 4.592

5.  Q-learning residual analysis: application to the effectiveness of sequences of antipsychotic medications for patients with schizophrenia.

Authors:  Ashkan Ertefaie; Susan Shortreed; Bibhas Chakraborty
Journal:  Stat Med       Date:  2016-01-10       Impact factor: 2.373

6.  Entropy Learning for Dynamic Treatment Regimes.

Authors:  Binyan Jiang; Rui Song; Jialiang Li; Donglin Zeng
Journal:  Stat Sin       Date:  2019       Impact factor: 1.261

7.  Robustifying Trial-Derived Optimal Treatment Rules for A Target Population.

Authors:  Ying-Qi Zhao; Donglin Zeng; Catherine M Tangen; Michael L LeBlanc
Journal:  Electron J Stat       Date:  2019-04-30       Impact factor: 1.125

8.  ON ESTIMATION OF THE OPTIMAL TREATMENT REGIME WITH THE ADDITIVE HAZARDS MODEL.

Authors:  Suhyun Kang; Wenbin Lu; Jiajia Zhang
Journal:  Stat Sin       Date:  2018-07       Impact factor: 1.261

9.  Comment.

Authors:  Jingxiang Chen; Yufeng Liu; Donglin Zeng; Rui Song; Yingqi Zhao; Michael R Kosorok
Journal:  J Am Stat Assoc       Date:  2016-10-18       Impact factor: 5.033

10.  Dynamic Treatment Regimes.

Authors:  Bibhas Chakraborty; Susan A Murphy
Journal:  Annu Rev Stat Appl       Date:  2014       Impact factor: 5.810

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.