Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Step-adjusted tree-based reinforcement learning for evaluating nested dynamic treatment regimes using test-and-treat observational data.

Literature DB >> 34490942

Step-adjusted tree-based reinforcement learning for evaluating nested dynamic treatment regimes using test-and-treat observational data.

Ming Tang¹, Lu Wang¹, Michael A Gorin², Jeremy M G Taylor¹.

Abstract

Dynamic treatment regimes (DTRs) include a sequence of treatment decision rules, in which treatment is adapted over time in response to the changes in an individual's disease progression and health care history. In medical practice, nested test-and-treat strategies are common to improve cost-effectiveness. For example, for patients at risk of prostate cancer, only patients who have high prostate-specific antigen (PSA) need a biopsy, which is costly and invasive, to confirm the diagnosis and help determine the treatment if needed. A decision about treatment happens after the biopsy, and is thus nested within the decision of whether to do the test. However, current existing statistical methods are not able to accommodate such a naturally embedded property of the treatment decision within the test decision. Therefore, we developed a new statistical learning method, step-adjusted tree-based reinforcement learning, to evaluate DTRs within such a nested multistage dynamic decision framework using observational data. At each step within each stage, we combined the robust semiparametric estimation via augmented inverse probability weighting with a tree-based reinforcement learning method to deal with the counterfactual optimization. The simulation studies demonstrated robust performance of the proposed methods under different scenarios. We further applied our method to evaluate the necessity of prostate biopsy and identify the optimal test-and-treat regimes for prostate cancer patients using data from the Johns Hopkins University prostate cancer active surveillance dataset.

Entities: Chemical

Keywords: dynamic treatment regimes; multistage decision-making; observational data; personalized health care; test-and-treat strategy; tree-based reinforcement learning

Mesh：

Year: 2021 PMID： 34490942 PMCID： PMC8595655 DOI： 10.1002/sim.9177

Source DB: PubMed Journal: Stat Med ISSN： 0277-6715 Impact factor: 2.373

Keyword Cloud
References

24 in total

1. Expected value of information and decision making in HTA.

Authors: Simon Eckermann; Andrew R Willan
Journal: Health Econ Date: 2007-02 Impact factor: 3.046

2. Marginal Mean Models for Dynamic Regimes.

Authors: S A Murphy; M J van der Laan; J M Robins
Journal: J Am Stat Assoc Date: 2001-12-01 Impact factor: 5.033

Review 3. Overdiagnosis and overtreatment of prostate cancer.

Authors: Stacy Loeb; Marc A Bjurlin; Joseph Nicholson; Teuvo L Tammela; David F Penson; H Ballentine Carter; Peter Carroll; Ruth Etzioni
Journal: Eur Urol Date: 2014-01-09 Impact factor: 20.096

4. Adaptive contrast weighted learning for multi-stage multi-treatment decision-making.

Authors: Yebin Tao; Lu Wang
Journal: Biometrics Date: 2016-05-23 Impact factor: 2.571

5. Estimation of the optimal regime in treatment of prostate cancer recurrence from observational data using flexible weighting models.

Authors: Jincheng Shen; Lu Wang; Jeremy M G Taylor
Journal: Biometrics Date: 2016-11-28 Impact factor: 2.571

Review 9. Prostate cancer: measuring PSA.

Authors: C Pezaro; H H Woo; I D Davis
Journal: Intern Med J Date: 2014-05 Impact factor: 2.048

10. Inference for optimal dynamic treatment regimes using an adaptive m-out-of-n bootstrap scheme.

Authors: Bibhas Chakraborty; Eric B Laber; Yingqi Zhao
Journal: Biometrics Date: 2013-07-11 Impact factor: 2.571