Literature DB >> 19750510

Reinforcement learning design for cancer clinical trials.

Yufan Zhao1, Michael R Kosorok, Donglin Zeng.   

Abstract

We develop reinforcement learning trials for discovering individualized treatment regimens for life-threatening diseases such as cancer. A temporal-difference learning method called Q-learning is utilized that involves learning an optimal policy from a single training set of finite longitudinal patient trajectories. Approximating the Q-function with time-indexed parameters can be achieved by using support vector regression or extremely randomized trees. Within this framework, we demonstrate that the procedure can extract optimal strategies directly from clinical data without relying on the identification of any accurate mathematical models, unlike approaches based on adaptive design. We show that reinforcement learning has tremendous potential in clinical research because it can select actions that improve outcomes by taking into account delayed effects even when the relationship between actions and outcomes is not fully known. To support our claims, the methodology's practical utility is illustrated in a simulation analysis. In the immediate future, we will apply this general strategy to studying and identifying new treatments for advanced metastatic stage IIIB/IV non-small cell lung cancer, which usually includes multiple lines of chemotherapy treatment. Moreover, there is significant potential of the proposed methodology for developing personalized treatment strategies in other cancers, in cystic fibrosis, and in other life-threatening diseases.

Entities:  

Mesh:

Year:  2009        PMID: 19750510      PMCID: PMC2767418          DOI: 10.1002/sim.3720

Source DB:  PubMed          Journal:  Stat Med        ISSN: 0277-6715            Impact factor:   2.373


  17 in total

1.  An experimental design for the development of adaptive treatment strategies.

Authors:  S A Murphy
Journal:  Stat Med       Date:  2005-05-30       Impact factor: 2.373

2.  A Generalization Error for Q-Learning.

Authors:  Susan A Murphy
Journal:  J Mach Learn Res       Date:  2005-07       Impact factor: 3.654

3.  Bayesian and frequentist two-stage treatment strategies based on sequential failure times subject to interval censoring.

Authors:  Peter F Thall; Leiko H Wooten; Christopher J Logothetis; Randall E Millikan; Nizar M Tannir
Journal:  Stat Med       Date:  2007-11-20       Impact factor: 2.373

4.  Dynamic multidrug therapies for hiv: optimal and sti control approaches.

Authors:  B M Adams; H T Banks; Hee-Dae Kwon; Hien T Tran
Journal:  Math Biosci Eng       Date:  2004-09       Impact factor: 2.080

5.  Evaluating multiple treatment courses in clinical trials.

Authors:  P F Thall; R E Millikan; H G Sung
Journal:  Stat Med       Date:  2000-04-30       Impact factor: 2.373

6.  Prospective randomized trial of docetaxel versus best supportive care in patients with non-small-cell lung cancer previously treated with platinum-based chemotherapy.

Authors:  F A Shepherd; J Dancey; R Ramlau; K Mattson; R Gralla; M O'Rourke; N Levitan; L Gressot; M Vincent; R Burkes; S Coughlin; Y Kim; J Berille
Journal:  J Clin Oncol       Date:  2000-05       Impact factor: 44.544

7.  Methodological challenges in constructing effective treatment sequences for chronic psychiatric disorders.

Authors:  Susan A Murphy; David W Oslin; A John Rush; Ji Zhu
Journal:  Neuropsychopharmacology       Date:  2006-11-08       Impact factor: 7.853

8.  Randomized phase III trial of pemetrexed versus docetaxel in patients with non-small-cell lung cancer previously treated with chemotherapy.

Authors:  Nasser Hanna; Frances A Shepherd; Frank V Fossella; Jose R Pereira; Filippo De Marinis; Joachim von Pawel; Ulrich Gatzemeier; Thomas Chang Yao Tsao; Miklos Pless; Thomas Muller; Hong-Liang Lim; Christopher Desch; Klara Szondy; Radj Gervais; Christian Manegold; Sofia Paul; Paolo Paoletti; Lawrence Einhorn; Paul A Bunn
Journal:  J Clin Oncol       Date:  2004-05-01       Impact factor: 44.544

9.  Phase III study comparing cisplatin plus gemcitabine with cisplatin plus pemetrexed in chemotherapy-naive patients with advanced-stage non-small-cell lung cancer.

Authors:  Giorgio Vittorio Scagliotti; Purvish Parikh; Joachim von Pawel; Bonne Biesma; Johan Vansteenkiste; Christian Manegold; Piotr Serwatowski; Ulrich Gatzemeier; Raghunadharao Digumarti; Mauro Zukin; Jin S Lee; Anders Mellemgaard; Keunchil Park; Shehkar Patil; Janusz Rolski; Tuncay Goksel; Filippo de Marinis; Lorinda Simms; Katherine P Sugarman; David Gandara
Journal:  J Clin Oncol       Date:  2008-05-27       Impact factor: 44.544

10.  Application of reinforcement learning for segmentation of transrectal ultrasound images.

Authors:  Farhang Sahba; Hamid R Tizhoosh; Magdy M A Salama
Journal:  BMC Med Imaging       Date:  2008-04-22       Impact factor: 1.930

View more
  68 in total

1.  Doubly Robust Learning for Estimating Individualized Treatment with Censored Data.

Authors:  Y Q Zhao; D Zeng; E B Laber; R Song; M Yuan; M R Kosorok
Journal:  Biometrika       Date:  2015-03-01       Impact factor: 2.445

2.  DOUBLY ROBUST ESTIMATION OF OPTIMAL TREATMENT REGIMES FOR SURVIVAL DATA-WITH APPLICATION TO AN HIV/AIDS STUDY.

Authors:  Runchao Jiang; Wenbin Lu; Rui Song; Michael G Hudgens; Sonia Naprvavnik
Journal:  Ann Appl Stat       Date:  2017-10-05       Impact factor: 2.083

3.  Innovative Clinical Trial Designs: Toward a 21st-Century Health Care System.

Authors:  Tze L Lai; Philip W Lavori
Journal:  Stat Biosci       Date:  2011-12

Review 4.  Recent development on statistical methods for personalized medicine discovery.

Authors:  Yingqi Zhao; Donglin Zeng
Journal:  Front Med       Date:  2013-02-02       Impact factor: 4.592

5.  Q-learning residual analysis: application to the effectiveness of sequences of antipsychotic medications for patients with schizophrenia.

Authors:  Ashkan Ertefaie; Susan Shortreed; Bibhas Chakraborty
Journal:  Stat Med       Date:  2016-01-10       Impact factor: 2.373

6.  Preclinical to Clinical Translation of Antibody-Drug Conjugates Using PK/PD Modeling: a Retrospective Analysis of Inotuzumab Ozogamicin.

Authors:  Alison M Betts; Nahor Haddish-Berhane; John Tolsma; Paul Jasper; Lindsay E King; Yongliang Sun; Subramanyam Chakrapani; Boris Shor; Joseph Boni; Theodore R Johnson
Journal:  AAPS J       Date:  2016-05-19       Impact factor: 4.009

7.  Deep Reinforcement Learning for Dynamic Treatment Regimes on Medical Registry Data.

Authors:  Ying Liu; Brent Logan; Ning Liu; Zhiyuan Xu; Jian Tang; Yanzhi Wang
Journal:  Healthc Inform       Date:  2017-08

8.  Bench to bedside translation of antibody drug conjugates using a multiscale mechanistic PK/PD model: a case study with brentuximab-vedotin.

Authors:  Dhaval K Shah; Nahor Haddish-Berhane; Alison Betts
Journal:  J Pharmacokinet Pharmacodyn       Date:  2012-11-15       Impact factor: 2.745

9.  Penalized Q-Learning for Dynamic Treatment Regimens.

Authors:  R Song; W Wang; D Zeng; M R Kosorok
Journal:  Stat Sin       Date:  2015-07       Impact factor: 1.261

10.  Towards better combination regimens of cytarabine and FLT3 inhibitors in acute myeloid leukemia.

Authors:  Mohamed Elmeliegy; Jason Den Haese; Chetasi Talati; Meir Wetzler; William J Jusko
Journal:  Cancer Chemother Pharmacol       Date:  2020-08-03       Impact factor: 3.333

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.