Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Optimization of multi-stage dynamic treatment regimes utilizing accumulated data.

Literature DB >> 26095711

Optimization of multi-stage dynamic treatment regimes utilizing accumulated data.

Xuelin Huang¹, Sangbum Choi², Lu Wang³, Peter F Thall¹.

Abstract

In medical therapies involving multiple stages, a physician's choice of a subject's treatment at each stage depends on the subject's history of previous treatments and outcomes. The sequence of decisions is known as a dynamic treatment regime or treatment policy. We consider dynamic treatment regimes in settings where each subject's final outcome can be defined as the sum of longitudinally observed values, each corresponding to a stage of the regime. Q-learning, which is a backward induction method, is used to first optimize the last stage treatment then sequentially optimize each previous stage treatment until the first stage treatment is optimized. During this process, model-based expectations of outcomes of late stages are used in the optimization of earlier stages. When the outcome models are misspecified, bias can accumulate from stage to stage and become severe, especially when the number of treatment stages is large. We demonstrate that a modification of standard Q-learning can help reduce the accumulated bias. We provide a computational algorithm, estimators, and closed-form variance formulas. Simulation studies show that the modified Q-learning method has a higher probability of identifying the optimal treatment regime even in settings with misspecified models for outcomes. It is applied to identify optimal treatment regimes in a study for advanced prostate cancer and to estimate and compare the final mean rewards of all the possible discrete two-stage treatment sequences.

Entities: Chemical Disease Gene Mutation Species

Keywords: Q-learning; backward induction; multi-stage treatment; optimal treatment sequence; treatment decision-making

Mesh：

Year: 2015 PMID： 26095711 PMCID： PMC4596799 DOI： 10.1002/sim.6558

Source DB: PubMed Journal: Stat Med ISSN： 0277-6715 Impact factor: 2.373

24 in total

1. Estimation of survival distributions of treatment policies in two-stage randomization designs in clinical trials.

Authors: Jared K Lunceford; Marie Davidian; Anastasios A Tsiatis
Journal: Biometrics Date: 2002-03 Impact factor: 2.571

2. A Generalization Error for Q-Learning.

Authors: Susan A Murphy
Journal: J Mach Learn Res Date: 2005-07 Impact factor: 3.654

3. Doubly robust estimation in missing data and causal inference models.

Authors: Heejung Bang; James M Robins
Journal: Biometrics Date: 2005-12 Impact factor: 2.571

4. The multiphase optimization strategy (MOST) and the sequential multiple assignment randomized trial (SMART): new methods for more potent eHealth interventions.

Authors: Linda M Collins; Susan A Murphy; Victor Strecher
Journal: Am J Prev Med Date: 2007-05 Impact factor: 5.043

5. Estimation and extrapolation of optimal treatment and testing strategies.

Authors: James Robins; Liliana Orellana; Andrea Rotnitzky
Journal: Stat Med Date: 2008-10-15 Impact factor: 2.373

6. Marginal Mean Models for Dynamic Regimes.

Authors: S A Murphy; M J van der Laan; J M Robins
Journal: J Am Stat Assoc Date: 2001-12-01 Impact factor: 5.033

7. Estimating Optimal Dynamic Regimes: Correcting Bias under the Null: [Optimal dynamic regimes: bias correction].

Authors: Erica E M Moodie; Thomas S Richardson
Journal: Scand Stat Theory Appl Date: 2009-09-22 Impact factor: 1.396

8. Evaluating multiple treatment courses in clinical trials.

Authors: P F Thall; R E Millikan; H G Sung
Journal: Stat Med Date: 2000-04-30 Impact factor: 2.373

9. Q-LEARNING WITH CENSORED DATA.

Authors: Yair Goldberg; Michael R Kosorok
Journal: Ann Stat Date: 2012-02-01 Impact factor: 4.028

10. Evaluation of Viable Dynamic Treatment Regimes in a Sequentially Randomized Trial of Advanced Prostate Cancer.

Authors: Lu Wang; Andrea Rotnitzky; Xihong Lin; Randall E Millikan; Peter F Thall
Journal: J Am Stat Assoc Date: 2012-06 Impact factor: 5.033

5 in total

1. Multilevel Interventions Targeting Obesity: Research Recommendations for Vulnerable Populations.

Authors: June Stevens; Charlotte Pratt; Josephine Boyington; Cheryl Nelson; Kimberly P Truesdale; Dianne S Ward; Leslie Lytle; Nancy E Sherwood; Thomas N Robinson; Shirley Moore; Shari Barkin; Ying Kuen Cheung; David M Murray
Journal: Am J Prev Med Date: 2016-10-26 Impact factor: 5.043

2. A Bayesian Machine Learning Approach for Optimizing Dynamic Treatment Regimes.

Authors: Thomas A Murray; Ying Yuan; Peter F Thall
Journal: J Am Stat Assoc Date: 2018-10-08 Impact factor: 5.033

3. Quantile-Optimal Treatment Regimes.

Authors: Lan Wang; Yu Zhou; Rui Song; Ben Sherwood
Journal: J Am Stat Assoc Date: 2018-06-08 Impact factor: 5.033

4. Step-adjusted tree-based reinforcement learning for evaluating nested dynamic treatment regimes using test-and-treat observational data.

Authors: Ming Tang; Lu Wang; Michael A Gorin; Jeremy M G Taylor
Journal: Stat Med Date: 2021-09-07 Impact factor: 2.373

5. Identifying cost-effective dynamic policies to control epidemics.

Authors: Reza Yaesoubi; Ted Cohen
Journal: Stat Med Date: 2016-07-24 Impact factor: 2.373

5 in total