Literature DB >> 23355757

Q-learning for estimating optimal dynamic treatment rules from observational data.

Erica E M Moodie1, Bibhas Chakraborty, Michael S Kramer.   

Abstract

The area of dynamic treatment regimes (DTR) aims to make inference about adaptive, multistage decision-making in clinical practice. A DTR is a set of decision rules, one per interval of treatment, where each decision is a function of treatment and covariate history that returns a recommended treatment. Q-learning is a popular method from the reinforcement learning literature that has recently been applied to estimate DTRs. While, in principle, Q-learning can be used for both randomized and observational data, the focus in the literature thus far has been exclusively on the randomized treatment setting. We extend the method to incorporate measured confounding covariates, using direct adjustment and a variety of propensity score approaches. The methods are examined under various settings including non-regular scenarios. We illustrate the methods in examining the effect of breastfeeding on vocabulary testing, based on data from the Promotion of Breastfeeding Intervention Trial.

Entities:  

Year:  2012        PMID: 23355757      PMCID: PMC3551601          DOI: 10.1002/cjs.11162

Source DB:  PubMed          Journal:  Can J Stat        ISSN: 0319-5724            Impact factor:   0.875


  23 in total

1.  Optimal dynamic regimes: presenting a case for predictive inference.

Authors:  Elja Arjas; Olli Saarela
Journal:  Int J Biostat       Date:  2010-03-03       Impact factor: 0.968

2.  A Generalization Error for Q-Learning.

Authors:  Susan A Murphy
Journal:  J Mach Learn Res       Date:  2005-07       Impact factor: 3.654

3.  Estimation and extrapolation of optimal treatment and testing strategies.

Authors:  James Robins; Liliana Orellana; Andrea Rotnitzky
Journal:  Stat Med       Date:  2008-10-15       Impact factor: 2.373

4.  Regret-regression for optimal dynamic treatment regimes.

Authors:  Robin Henderson; Phil Ansell; Deyadeen Alshibani
Journal:  Biometrics       Date:  2010-12       Impact factor: 2.571

5.  Reinforcement learning design for cancer clinical trials.

Authors:  Yufan Zhao; Michael R Kosorok; Donglin Zeng
Journal:  Stat Med       Date:  2009-11-20       Impact factor: 2.373

6.  Estimating Optimal Dynamic Regimes: Correcting Bias under the Null: [Optimal dynamic regimes: bias correction].

Authors:  Erica E M Moodie; Thomas S Richardson
Journal:  Scand Stat Theory Appl       Date:  2009-09-22       Impact factor: 1.396

7.  Promotion of Breastfeeding Intervention Trial (PROBIT): a randomized trial in the Republic of Belarus.

Authors:  M S Kramer; B Chalmers; E D Hodnett; Z Sevkovskaya; I Dzikovich; S Shapiro; J P Collet; I Vanilovich; I Mezen; T Ducruet; G Shishko; V Zubovich; D Mknuik; E Gluchanina; V Dombrovskiy; A Ustinovitch; T Kot; N Bogdanovich; L Ovchinikova; E Helsing
Journal:  JAMA       Date:  2001 Jan 24-31       Impact factor: 56.272

8.  Evaluating multiple treatment courses in clinical trials.

Authors:  P F Thall; R E Millikan; H G Sung
Journal:  Stat Med       Date:  2000-04-30       Impact factor: 2.373

9.  Infant growth and health outcomes associated with 3 compared with 6 mo of exclusive breastfeeding.

Authors:  Michael S Kramer; Tong Guo; Robert W Platt; Zinaida Sevkovskaya; Irina Dzikovich; Jean-Paul Collet; Stanley Shapiro; Beverley Chalmers; Ellen Hodnett; Irina Vanilovich; Irina Mezen; Thierry Ducruet; George Shishko; Natalia Bogdanovich
Journal:  Am J Clin Nutr       Date:  2003-08       Impact factor: 7.045

10.  Effects of prolonged and exclusive breastfeeding on child height, weight, adiposity, and blood pressure at age 6.5 y: evidence from a large randomized trial.

Authors:  Michael S Kramer; Lidia Matush; Irina Vanilovich; Robert W Platt; Natalia Bogdanovich; Zinaida Sevkovskaya; Irina Dzikovich; Gyorgy Shishko; Jean-Paul Collet; Richard M Martin; George Davey Smith; Matthew W Gillman; Beverley Chalmers; Ellen Hodnett; Stanley Shapiro
Journal:  Am J Clin Nutr       Date:  2007-12       Impact factor: 7.045

View more
  16 in total

1.  iqLearn: Interactive Q-Learning in R.

Authors:  Kristin A Linn; Eric B Laber; Leonard A Stefanski
Journal:  J Stat Softw       Date:  2015-03-20       Impact factor: 6.440

2.  Efficient augmentation and relaxation learning for individualized treatment rules using observational data.

Authors:  Ying-Qi Zhao; Eric B Laber; Yang Ning; Sumona Saha; Bruce E Sands
Journal:  J Mach Learn Res       Date:  2019       Impact factor: 3.654

3.  Dynamic Treatment Regimes.

Authors:  Bibhas Chakraborty; Susan A Murphy
Journal:  Annu Rev Stat Appl       Date:  2014       Impact factor: 5.810

4.  Developing adaptive interventions for adolescent substance use treatment settings: protocol of an observational, mixed-methods project.

Authors:  Sean Grant; Denis Agniel; Daniel Almirall; Q Burkhart; Sarah B Hunter; Daniel F McCaffrey; Eric R Pedersen; Rajeev Ramchand; Beth Ann Griffin
Journal:  Addict Sci Clin Pract       Date:  2017-12-19

5.  Identifying optimal dosage regimes under safety constraints: An application to long term opioid treatment of chronic pain.

Authors:  Eric B Laber; Fan Wu; Catherine Munera; Ilya Lipkovich; Salvatore Colucci; Steve Ripa
Journal:  Stat Med       Date:  2018-02-21       Impact factor: 2.373

6.  Inference about the expected performance of a data-driven dynamic treatment regime.

Authors:  Bibhas Chakraborty; Eric B Laber; Ying-Qi Zhao
Journal:  Clin Trials       Date:  2014-06-12       Impact factor: 2.486

7.  Tools for the Precision Medicine Era: How to Develop Highly Personalized Treatment Recommendations From Cohort and Registry Data Using Q-Learning.

Authors:  Elizabeth F Krakow; Michael Hemmer; Tao Wang; Brent Logan; Mukta Arora; Stephen Spellman; Daniel Couriel; Amin Alousi; Joseph Pidala; Michael Last; Silvy Lachance; Erica E M Moodie
Journal:  Am J Epidemiol       Date:  2017-07-15       Impact factor: 4.897

8.  Program for lung cancer screening and tobacco cessation: Study protocol of a sequential, multiple assignment, randomized trial.

Authors:  Steven S Fu; Alexander J Rothman; David M Vock; Bruce Lindgren; Daniel Almirall; Abbie Begnaud; Anne Melzer; Kelsey Schertz; Susan Glaeser; Patrick Hammett; Anne M Joseph
Journal:  Contemp Clin Trials       Date:  2017-07-04       Impact factor: 2.226

9.  Personalized Dose Finding Using Outcome Weighted Learning.

Authors:  Guanhua Chen; Donglin Zeng; Michael R Kosorok
Journal:  J Am Stat Assoc       Date:  2017-01-04       Impact factor: 5.033

10.  A cure-rate model for Q-learning: Estimating an adaptive immunosuppressant treatment strategy for allogeneic hematopoietic cell transplant patients.

Authors:  Erica E M Moodie; David A Stephens; Shomoita Alam; Mei-Jie Zhang; Brent Logan; Mukta Arora; Stephen Spellman; Elizabeth F Krakow
Journal:  Biom J       Date:  2018-05-16       Impact factor: 2.207

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.