Literature DB >> 26236062

New Statistical Learning Methods for Estimating Optimal Dynamic Treatment Regimes.

Ying-Qi Zhao1, Donglin Zeng2, Eric B Laber3, Michael R Kosorok4.   

Abstract

Dynamic treatment regimes (DTRs) are sequential decision rules for individual patients that can adapt over time to an evolving illness. The goal is to accommodate heterogeneity among patients and find the DTR which will produce the best long term outcome if implemented. We introduce two new statistical learning methods for estimating the optimal DTR, termed backward outcome weighted learning (BOWL), and simultaneous outcome weighted learning (SOWL). These approaches convert individualized treatment selection into an either sequential or simultaneous classification problem, and can thus be applied by modifying existing machine learning techniques. The proposed methods are based on directly maximizing over all DTRs a nonparametric estimator of the expected long-term outcome; this is fundamentally different than regression-based methods, for example Q-learning, which indirectly attempt such maximization and rely heavily on the correctness of postulated regression models. We prove that the resulting rules are consistent, and provide finite sample bounds for the errors using the estimated rules. Simulation results suggest the proposed methods produce superior DTRs compared with Q-learning especially in small samples. We illustrate the methods using data from a clinical trial for smoking cessation.

Entities:  

Keywords:  Classification; Dynamic treatment regimes; Personalized medicine; Q-learning; Reinforcement learning; Risk Bound; Support vector machine

Year:  2015        PMID: 26236062      PMCID: PMC4517946          DOI: 10.1080/01621459.2014.937488

Source DB:  PubMed          Journal:  J Am Stat Assoc        ISSN: 0162-1459            Impact factor:   5.033


  27 in total

1.  An experimental design for the development of adaptive treatment strategies.

Authors:  S A Murphy
Journal:  Stat Med       Date:  2005-05-30       Impact factor: 2.373

2.  A Generalization Error for Q-Learning.

Authors:  Susan A Murphy
Journal:  J Mach Learn Res       Date:  2005-07       Impact factor: 3.654

3.  Bayesian and frequentist two-stage treatment strategies based on sequential failure times subject to interval censoring.

Authors:  Peter F Thall; Leiko H Wooten; Christopher J Logothetis; Randall E Millikan; Nizar M Tannir
Journal:  Stat Med       Date:  2007-11-20       Impact factor: 2.373

4.  Marginal Mean Models for Dynamic Regimes.

Authors:  S A Murphy; M J van der Laan; J M Robins
Journal:  J Am Stat Assoc       Date:  2001-12-01       Impact factor: 5.033

5.  Reinforcement learning design for cancer clinical trials.

Authors:  Yufan Zhao; Michael R Kosorok; Donglin Zeng
Journal:  Stat Med       Date:  2009-11-20       Impact factor: 2.373

6.  PERFORMANCE GUARANTEES FOR INDIVIDUALIZED TREATMENT RULES.

Authors:  Min Qian; Susan A Murphy
Journal:  Ann Stat       Date:  2011-04-01       Impact factor: 4.028

7.  Evaluating multiple treatment courses in clinical trials.

Authors:  P F Thall; R E Millikan; H G Sung
Journal:  Stat Med       Date:  2000-04-30       Impact factor: 2.373

8.  Methodological challenges in constructing effective treatment sequences for chronic psychiatric disorders.

Authors:  Susan A Murphy; David W Oslin; A John Rush; Ji Zhu
Journal:  Neuropsychopharmacology       Date:  2006-11-08       Impact factor: 7.853

9.  Q-LEARNING WITH CENSORED DATA.

Authors:  Yair Goldberg; Michael R Kosorok
Journal:  Ann Stat       Date:  2012-02-01       Impact factor: 4.028

10.  Estimating Optimal Treatment Regimes from a Classification Perspective.

Authors:  Baqun Zhang; Anastasios A Tsiatis; Marie Davidian; Min Zhang; Eric Laber
Journal:  Stat       Date:  2012-01-01
View more
  68 in total

1.  Doubly Robust Learning for Estimating Individualized Treatment with Censored Data.

Authors:  Y Q Zhao; D Zeng; E B Laber; R Song; M Yuan; M R Kosorok
Journal:  Biometrika       Date:  2015-03-01       Impact factor: 2.445

2.  Tree based weighted learning for estimating individualized treatment rules with censored data.

Authors:  Yifan Cui; Ruoqing Zhu; Michael Kosorok
Journal:  Electron J Stat       Date:  2017-10-18       Impact factor: 1.125

3.  Learning Optimal Individualized Treatment Rules from Electronic Health Record Data.

Authors:  Yuanjia Wang; Peng Wu; Ying Liu; Chunhua Weng; Donglin Zeng
Journal:  IEEE Int Conf Healthc Inform       Date:  2016-12-08

4.  Entropy Learning for Dynamic Treatment Regimes.

Authors:  Binyan Jiang; Rui Song; Jialiang Li; Donglin Zeng
Journal:  Stat Sin       Date:  2019       Impact factor: 1.261

5.  A Bayesian Machine Learning Approach for Optimizing Dynamic Treatment Regimes.

Authors:  Thomas A Murray; Ying Yuan; Peter F Thall
Journal:  J Am Stat Assoc       Date:  2018-10-08       Impact factor: 5.033

6.  Generated effect modifiers (GEM's) in randomized clinical trials.

Authors:  Eva Petkova; Thaddeus Tarpey; Zhe Su; R Todd Ogden
Journal:  Biostatistics       Date:  2016-07-27       Impact factor: 5.899

7.  Comparing cluster-level dynamic treatment regimens using sequential, multiple assignment, randomized trials: Regression estimation and sample size considerations.

Authors:  Timothy NeCamp; Amy Kilbourne; Daniel Almirall
Journal:  Stat Methods Med Res       Date:  2017-06-19       Impact factor: 3.021

8.  TARGETED SEQUENTIAL DESIGN FOR TARGETED LEARNING INFERENCE OF THE OPTIMAL TREATMENT RULE AND ITS MEAN REWARD.

Authors:  Antoine Chambaz; Wenjing Zheng; Mark J van der Laan
Journal:  Ann Stat       Date:  2017-12-15       Impact factor: 4.028

9.  Deep Reinforcement Learning for Dynamic Treatment Regimes on Medical Registry Data.

Authors:  Ying Liu; Brent Logan; Ning Liu; Zhiyuan Xu; Jian Tang; Yanzhi Wang
Journal:  Healthc Inform       Date:  2017-08

10.  Identifying cost-effective dynamic policies to control epidemics.

Authors:  Reza Yaesoubi; Ted Cohen
Journal:  Stat Med       Date:  2016-07-24       Impact factor: 2.373

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.