Literature DB >> 21385164

Reinforcement learning strategies for clinical trials in nonsmall cell lung cancer.

Yufan Zhao1, Donglin Zeng, Mark A Socinski, Michael R Kosorok.   

Abstract

Typical regimens for advanced metastatic stage IIIB/IV nonsmall cell lung cancer (NSCLC) consist of multiple lines of treatment. We present an adaptive reinforcement learning approach to discover optimal individualized treatment regimens from a specially designed clinical trial (a "clinical reinforcement trial") of an experimental treatment for patients with advanced NSCLC who have not been treated previously with systemic therapy. In addition to the complexity of the problem of selecting optimal compounds for first- and second-line treatments based on prognostic factors, another primary goal is to determine the optimal time to initiate second-line therapy, either immediately or delayed after induction therapy, yielding the longest overall survival time. A reinforcement learning method called Q-learning is utilized, which involves learning an optimal regimen from patient data generated from the clinical reinforcement trial. Approximating the Q-function with time-indexed parameters can be achieved by using a modification of support vector regression that can utilize censored data. Within this framework, a simulation study shows that the procedure can extract optimal regimens for two lines of treatment directly from clinical data without prior knowledge of the treatment effect mechanism. In addition, we demonstrate that the design reliably selects the best initial time for second-line therapy while taking into account the heterogeneity of NSCLC across patients.
© 2011, The International Biometric Society.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21385164      PMCID: PMC3138840          DOI: 10.1111/j.1541-0420.2011.01572.x

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   2.571


  9 in total

1.  An experimental design for the development of adaptive treatment strategies.

Authors:  S A Murphy
Journal:  Stat Med       Date:  2005-05-30       Impact factor: 2.373

2.  A Generalization Error for Q-Learning.

Authors:  Susan A Murphy
Journal:  J Mach Learn Res       Date:  2005-07       Impact factor: 3.654

3.  Duration of first-line chemotherapy in advanced non small-cell lung cancer: less is more in the era of effective subsequent therapies.

Authors:  Mark A Socinski; Thomas E Stinchcombe
Journal:  J Clin Oncol       Date:  2007-11-20       Impact factor: 44.544

4.  Bayesian and frequentist two-stage treatment strategies based on sequential failure times subject to interval censoring.

Authors:  Peter F Thall; Leiko H Wooten; Christopher J Logothetis; Randall E Millikan; Nizar M Tannir
Journal:  Stat Med       Date:  2007-11-20       Impact factor: 2.373

5.  Reinforcement learning design for cancer clinical trials.

Authors:  Yufan Zhao; Michael R Kosorok; Donglin Zeng
Journal:  Stat Med       Date:  2009-11-20       Impact factor: 2.373

6.  Erlotinib in previously treated non-small-cell lung cancer.

Authors:  Frances A Shepherd; José Rodrigues Pereira; Tudor Ciuleanu; Eng Huat Tan; Vera Hirsh; Sumitra Thongprasert; Daniel Campos; Savitree Maoleekoonpiroj; Michael Smylie; Renato Martins; Maximiliano van Kooten; Mircea Dediu; Brian Findlay; Dongsheng Tu; Dianne Johnston; Andrea Bezjak; Gary Clark; Pedro Santabárbara; Lesley Seymour
Journal:  N Engl J Med       Date:  2005-07-14       Impact factor: 91.245

7.  Paclitaxel-carboplatin alone or with bevacizumab for non-small-cell lung cancer.

Authors:  Alan Sandler; Robert Gray; Michael C Perry; Julie Brahmer; Joan H Schiller; Afshin Dowlati; Rogerio Lilenbaum; David H Johnson
Journal:  N Engl J Med       Date:  2006-12-14       Impact factor: 91.245

8.  Treatment of non-small cell lung cancer, stage IV: ACCP evidence-based clinical practice guidelines (2nd edition).

Authors:  Mark A Socinski; Richard Crowell; Thomas E Hensing; Corey J Langer; Rogerio Lilenbaum; Alan B Sandler; David Morris
Journal:  Chest       Date:  2007-09       Impact factor: 9.410

Review 9.  Considerations for second-line therapy of non-small cell lung cancer.

Authors:  Thomas E Stinchcombe; Mark A Socinski
Journal:  Oncologist       Date:  2008
  9 in total
  56 in total

1.  A decision-theoretic phase I-II design for ordinal outcomes in two cycles.

Authors:  Juhee Lee; Peter F Thall; Yuan Ji; Peter Müller
Journal:  Biostatistics       Date:  2015-11-09       Impact factor: 5.899

2.  Doubly Robust Learning for Estimating Individualized Treatment with Censored Data.

Authors:  Y Q Zhao; D Zeng; E B Laber; R Song; M Yuan; M R Kosorok
Journal:  Biometrika       Date:  2015-03-01       Impact factor: 2.445

3.  Tree based weighted learning for estimating individualized treatment rules with censored data.

Authors:  Yifan Cui; Ruoqing Zhu; Michael Kosorok
Journal:  Electron J Stat       Date:  2017-10-18       Impact factor: 1.125

4.  Semiparametric Single-Index Model for Estimating Optimal Individualized Treatment Strategy.

Authors:  Rui Song; Shikai Luo; Donglin Zeng; Hao Helen Zhang; Wenbin Lu; Zhiguo Li
Journal:  Electron J Stat       Date:  2017       Impact factor: 1.125

Review 5.  Recent development on statistical methods for personalized medicine discovery.

Authors:  Yingqi Zhao; Donglin Zeng
Journal:  Front Med       Date:  2013-02-02       Impact factor: 4.592

6.  Sequential advantage selection for optimal treatment regime.

Authors:  Ailin Fan; Wenbin Lu; Rui Song
Journal:  Ann Appl Stat       Date:  2016-03-25       Impact factor: 2.083

7.  Residual Weighted Learning for Estimating Individualized Treatment Rules.

Authors:  Xin Zhou; Nicole Mayer-Hamblett; Umer Khan; Michael R Kosorok
Journal:  J Am Stat Assoc       Date:  2017-05-03       Impact factor: 5.033

8.  Dynamic Treatment Regimes.

Authors:  Bibhas Chakraborty; Susan A Murphy
Journal:  Annu Rev Stat Appl       Date:  2014       Impact factor: 5.810

9.  Penalized Q-Learning for Dynamic Treatment Regimens.

Authors:  R Song; W Wang; D Zeng; M R Kosorok
Journal:  Stat Sin       Date:  2015-07       Impact factor: 1.261

10.  Q-learning for estimating optimal dynamic treatment rules from observational data.

Authors:  Erica E M Moodie; Bibhas Chakraborty; Michael S Kramer
Journal:  Can J Stat       Date:  2012-11-07       Impact factor: 0.875

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.