Literature DB >> 34490942

Step-adjusted tree-based reinforcement learning for evaluating nested dynamic treatment regimes using test-and-treat observational data.

Ming Tang1, Lu Wang1, Michael A Gorin2, Jeremy M G Taylor1.   

Abstract

Dynamic treatment regimes (DTRs) include a sequence of treatment decision rules, in which treatment is adapted over time in response to the changes in an individual's disease progression and health care history. In medical practice, nested test-and-treat strategies are common to improve cost-effectiveness. For example, for patients at risk of prostate cancer, only patients who have high prostate-specific antigen (PSA) need a biopsy, which is costly and invasive, to confirm the diagnosis and help determine the treatment if needed. A decision about treatment happens after the biopsy, and is thus nested within the decision of whether to do the test. However, current existing statistical methods are not able to accommodate such a naturally embedded property of the treatment decision within the test decision. Therefore, we developed a new statistical learning method, step-adjusted tree-based reinforcement learning, to evaluate DTRs within such a nested multistage dynamic decision framework using observational data. At each step within each stage, we combined the robust semiparametric estimation via augmented inverse probability weighting with a tree-based reinforcement learning method to deal with the counterfactual optimization. The simulation studies demonstrated robust performance of the proposed methods under different scenarios. We further applied our method to evaluate the necessity of prostate biopsy and identify the optimal test-and-treat regimes for prostate cancer patients using data from the Johns Hopkins University prostate cancer active surveillance dataset.
© 2021 John Wiley & Sons Ltd.

Entities:  

Keywords:  dynamic treatment regimes; multistage decision-making; observational data; personalized health care; test-and-treat strategy; tree-based reinforcement learning

Mesh:

Year:  2021        PMID: 34490942      PMCID: PMC8595655          DOI: 10.1002/sim.9177

Source DB:  PubMed          Journal:  Stat Med        ISSN: 0277-6715            Impact factor:   2.373


  24 in total

1.  Expected value of information and decision making in HTA.

Authors:  Simon Eckermann; Andrew R Willan
Journal:  Health Econ       Date:  2007-02       Impact factor: 3.046

2.  Marginal Mean Models for Dynamic Regimes.

Authors:  S A Murphy; M J van der Laan; J M Robins
Journal:  J Am Stat Assoc       Date:  2001-12-01       Impact factor: 5.033

Review 3.  Overdiagnosis and overtreatment of prostate cancer.

Authors:  Stacy Loeb; Marc A Bjurlin; Joseph Nicholson; Teuvo L Tammela; David F Penson; H Ballentine Carter; Peter Carroll; Ruth Etzioni
Journal:  Eur Urol       Date:  2014-01-09       Impact factor: 20.096

4.  Adaptive contrast weighted learning for multi-stage multi-treatment decision-making.

Authors:  Yebin Tao; Lu Wang
Journal:  Biometrics       Date:  2016-05-23       Impact factor: 2.571

5.  Estimation of the optimal regime in treatment of prostate cancer recurrence from observational data using flexible weighting models.

Authors:  Jincheng Shen; Lu Wang; Jeremy M G Taylor
Journal:  Biometrics       Date:  2016-11-28       Impact factor: 2.571

6.  Multi-Objective Markov Decision Processes for Data-Driven Decision Support.

Authors:  Daniel J Lizotte; Eric B Laber
Journal:  J Mach Learn Res       Date:  2016-12-01       Impact factor: 3.654

7.  Active surveillance program for prostate cancer: an update of the Johns Hopkins experience.

Authors:  Jeffrey J Tosoian; Bruce J Trock; Patricia Landis; Zhaoyong Feng; Jonathan I Epstein; Alan W Partin; Patrick C Walsh; H Ballentine Carter
Journal:  J Clin Oncol       Date:  2011-04-04       Impact factor: 44.544

8.  Optimization of multi-stage dynamic treatment regimes utilizing accumulated data.

Authors:  Xuelin Huang; Sangbum Choi; Lu Wang; Peter F Thall
Journal:  Stat Med       Date:  2015-06-21       Impact factor: 2.373

Review 9.  Prostate cancer: measuring PSA.

Authors:  C Pezaro; H H Woo; I D Davis
Journal:  Intern Med J       Date:  2014-05       Impact factor: 2.048

10.  Inference for optimal dynamic treatment regimes using an adaptive m-out-of-n bootstrap scheme.

Authors:  Bibhas Chakraborty; Eric B Laber; Yingqi Zhao
Journal:  Biometrics       Date:  2013-07-11       Impact factor: 2.571

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.