Literature DB >> 26095711

Optimization of multi-stage dynamic treatment regimes utilizing accumulated data.

Xuelin Huang1, Sangbum Choi2, Lu Wang3, Peter F Thall1.   

Abstract

In medical therapies involving multiple stages, a physician's choice of a subject's treatment at each stage depends on the subject's history of previous treatments and outcomes. The sequence of decisions is known as a dynamic treatment regime or treatment policy. We consider dynamic treatment regimes in settings where each subject's final outcome can be defined as the sum of longitudinally observed values, each corresponding to a stage of the regime. Q-learning, which is a backward induction method, is used to first optimize the last stage treatment then sequentially optimize each previous stage treatment until the first stage treatment is optimized. During this process, model-based expectations of outcomes of late stages are used in the optimization of earlier stages. When the outcome models are misspecified, bias can accumulate from stage to stage and become severe, especially when the number of treatment stages is large. We demonstrate that a modification of standard Q-learning can help reduce the accumulated bias. We provide a computational algorithm, estimators, and closed-form variance formulas. Simulation studies show that the modified Q-learning method has a higher probability of identifying the optimal treatment regime even in settings with misspecified models for outcomes. It is applied to identify optimal treatment regimes in a study for advanced prostate cancer and to estimate and compare the final mean rewards of all the possible discrete two-stage treatment sequences.
Copyright © 2015 John Wiley & Sons, Ltd.

Entities:  

Keywords:  Q-learning; backward induction; multi-stage treatment; optimal treatment sequence; treatment decision-making

Mesh:

Year:  2015        PMID: 26095711      PMCID: PMC4596799          DOI: 10.1002/sim.6558

Source DB:  PubMed          Journal:  Stat Med        ISSN: 0277-6715            Impact factor:   2.373


  24 in total

1.  Estimation of survival distributions of treatment policies in two-stage randomization designs in clinical trials.

Authors:  Jared K Lunceford; Marie Davidian; Anastasios A Tsiatis
Journal:  Biometrics       Date:  2002-03       Impact factor: 2.571

2.  A Generalization Error for Q-Learning.

Authors:  Susan A Murphy
Journal:  J Mach Learn Res       Date:  2005-07       Impact factor: 3.654

3.  Doubly robust estimation in missing data and causal inference models.

Authors:  Heejung Bang; James M Robins
Journal:  Biometrics       Date:  2005-12       Impact factor: 2.571

4.  The multiphase optimization strategy (MOST) and the sequential multiple assignment randomized trial (SMART): new methods for more potent eHealth interventions.

Authors:  Linda M Collins; Susan A Murphy; Victor Strecher
Journal:  Am J Prev Med       Date:  2007-05       Impact factor: 5.043

5.  Estimation and extrapolation of optimal treatment and testing strategies.

Authors:  James Robins; Liliana Orellana; Andrea Rotnitzky
Journal:  Stat Med       Date:  2008-10-15       Impact factor: 2.373

6.  Marginal Mean Models for Dynamic Regimes.

Authors:  S A Murphy; M J van der Laan; J M Robins
Journal:  J Am Stat Assoc       Date:  2001-12-01       Impact factor: 5.033

7.  Estimating Optimal Dynamic Regimes: Correcting Bias under the Null: [Optimal dynamic regimes: bias correction].

Authors:  Erica E M Moodie; Thomas S Richardson
Journal:  Scand Stat Theory Appl       Date:  2009-09-22       Impact factor: 1.396

8.  Evaluating multiple treatment courses in clinical trials.

Authors:  P F Thall; R E Millikan; H G Sung
Journal:  Stat Med       Date:  2000-04-30       Impact factor: 2.373

9.  Q-LEARNING WITH CENSORED DATA.

Authors:  Yair Goldberg; Michael R Kosorok
Journal:  Ann Stat       Date:  2012-02-01       Impact factor: 4.028

10.  Evaluation of Viable Dynamic Treatment Regimes in a Sequentially Randomized Trial of Advanced Prostate Cancer.

Authors:  Lu Wang; Andrea Rotnitzky; Xihong Lin; Randall E Millikan; Peter F Thall
Journal:  J Am Stat Assoc       Date:  2012-06       Impact factor: 5.033

View more
  5 in total

1.  Multilevel Interventions Targeting Obesity: Research Recommendations for Vulnerable Populations.

Authors:  June Stevens; Charlotte Pratt; Josephine Boyington; Cheryl Nelson; Kimberly P Truesdale; Dianne S Ward; Leslie Lytle; Nancy E Sherwood; Thomas N Robinson; Shirley Moore; Shari Barkin; Ying Kuen Cheung; David M Murray
Journal:  Am J Prev Med       Date:  2016-10-26       Impact factor: 5.043

2.  A Bayesian Machine Learning Approach for Optimizing Dynamic Treatment Regimes.

Authors:  Thomas A Murray; Ying Yuan; Peter F Thall
Journal:  J Am Stat Assoc       Date:  2018-10-08       Impact factor: 5.033

3.  Quantile-Optimal Treatment Regimes.

Authors:  Lan Wang; Yu Zhou; Rui Song; Ben Sherwood
Journal:  J Am Stat Assoc       Date:  2018-06-08       Impact factor: 5.033

4.  Step-adjusted tree-based reinforcement learning for evaluating nested dynamic treatment regimes using test-and-treat observational data.

Authors:  Ming Tang; Lu Wang; Michael A Gorin; Jeremy M G Taylor
Journal:  Stat Med       Date:  2021-09-07       Impact factor: 2.373

5.  Identifying cost-effective dynamic policies to control epidemics.

Authors:  Reza Yaesoubi; Ted Cohen
Journal:  Stat Med       Date:  2016-07-24       Impact factor: 2.373

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.