Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Q-learning for estimating optimal dynamic treatment rules from observational data.

Literature DB >> 23355757

Q-learning for estimating optimal dynamic treatment rules from observational data.

Erica E M Moodie¹, Bibhas Chakraborty, Michael S Kramer.

Abstract

The area of dynamic treatment regimes (DTR) aims to make inference about adaptive, multistage decision-making in clinical practice. A DTR is a set of decision rules, one per interval of treatment, where each decision is a function of treatment and covariate history that returns a recommended treatment. Q-learning is a popular method from the reinforcement learning literature that has recently been applied to estimate DTRs. While, in principle, Q-learning can be used for both randomized and observational data, the focus in the literature thus far has been exclusively on the randomized treatment setting. We extend the method to incorporate measured confounding covariates, using direct adjustment and a variety of propensity score approaches. The methods are examined under various settings including non-regular scenarios. We illustrate the methods in examining the effect of breastfeeding on vocabulary testing, based on data from the Promotion of Breastfeeding Intervention Trial.

Entities: Chemical Disease Gene Species

Year: 2012 PMID： 23355757 PMCID： PMC3551601 DOI： 10.1002/cjs.11162

Source DB: PubMed Journal: Can J Stat ISSN： 0319-5724 Impact factor: 0.875

23 in total

1. Optimal dynamic regimes: presenting a case for predictive inference.

Authors: Elja Arjas; Olli Saarela
Journal: Int J Biostat Date: 2010-03-03 Impact factor: 0.968

2. A Generalization Error for Q-Learning.

Authors: Susan A Murphy
Journal: J Mach Learn Res Date: 2005-07 Impact factor: 3.654

3. Estimation and extrapolation of optimal treatment and testing strategies.

Authors: James Robins; Liliana Orellana; Andrea Rotnitzky
Journal: Stat Med Date: 2008-10-15 Impact factor: 2.373

4. Regret-regression for optimal dynamic treatment regimes.

Authors: Robin Henderson; Phil Ansell; Deyadeen Alshibani
Journal: Biometrics Date: 2010-12 Impact factor: 2.571

5. Reinforcement learning design for cancer clinical trials.

Authors: Yufan Zhao; Michael R Kosorok; Donglin Zeng
Journal: Stat Med Date: 2009-11-20 Impact factor: 2.373

6. Estimating Optimal Dynamic Regimes: Correcting Bias under the Null: [Optimal dynamic regimes: bias correction].

Authors: Erica E M Moodie; Thomas S Richardson
Journal: Scand Stat Theory Appl Date: 2009-09-22 Impact factor: 1.396

7. Promotion of Breastfeeding Intervention Trial (PROBIT): a randomized trial in the Republic of Belarus.

Authors: M S Kramer; B Chalmers; E D Hodnett; Z Sevkovskaya; I Dzikovich; S Shapiro; J P Collet; I Vanilovich; I Mezen; T Ducruet; G Shishko; V Zubovich; D Mknuik; E Gluchanina; V Dombrovskiy; A Ustinovitch; T Kot; N Bogdanovich; L Ovchinikova; E Helsing
Journal: JAMA Date: 2001 Jan 24-31 Impact factor: 56.272

8. Evaluating multiple treatment courses in clinical trials.

Authors: P F Thall; R E Millikan; H G Sung
Journal: Stat Med Date: 2000-04-30 Impact factor: 2.373

9. Infant growth and health outcomes associated with 3 compared with 6 mo of exclusive breastfeeding.

Authors: Michael S Kramer; Tong Guo; Robert W Platt; Zinaida Sevkovskaya; Irina Dzikovich; Jean-Paul Collet; Stanley Shapiro; Beverley Chalmers; Ellen Hodnett; Irina Vanilovich; Irina Mezen; Thierry Ducruet; George Shishko; Natalia Bogdanovich
Journal: Am J Clin Nutr Date: 2003-08 Impact factor: 7.045

10. Effects of prolonged and exclusive breastfeeding on child height, weight, adiposity, and blood pressure at age 6.5 y: evidence from a large randomized trial.

Authors: Michael S Kramer; Lidia Matush; Irina Vanilovich; Robert W Platt; Natalia Bogdanovich; Zinaida Sevkovskaya; Irina Dzikovich; Gyorgy Shishko; Jean-Paul Collet; Richard M Martin; George Davey Smith; Matthew W Gillman; Beverley Chalmers; Ellen Hodnett; Stanley Shapiro
Journal: Am J Clin Nutr Date: 2007-12 Impact factor: 7.045

16 in total

1. iqLearn: Interactive Q-Learning in R.

Authors: Kristin A Linn; Eric B Laber; Leonard A Stefanski
Journal: J Stat Softw Date: 2015-03-20 Impact factor: 6.440

2. Efficient augmentation and relaxation learning for individualized treatment rules using observational data.

Authors: Ying-Qi Zhao; Eric B Laber; Yang Ning; Sumona Saha; Bruce E Sands
Journal: J Mach Learn Res Date: 2019 Impact factor: 3.654

3. Dynamic Treatment Regimes.

Authors: Bibhas Chakraborty; Susan A Murphy
Journal: Annu Rev Stat Appl Date: 2014 Impact factor: 5.810

4. Developing adaptive interventions for adolescent substance use treatment settings: protocol of an observational, mixed-methods project.

Authors: Sean Grant; Denis Agniel; Daniel Almirall; Q Burkhart; Sarah B Hunter; Daniel F McCaffrey; Eric R Pedersen; Rajeev Ramchand; Beth Ann Griffin
Journal: Addict Sci Clin Pract Date: 2017-12-19

5. Identifying optimal dosage regimes under safety constraints: An application to long term opioid treatment of chronic pain.

Authors: Eric B Laber; Fan Wu; Catherine Munera; Ilya Lipkovich; Salvatore Colucci; Steve Ripa
Journal: Stat Med Date: 2018-02-21 Impact factor: 2.373

6. Inference about the expected performance of a data-driven dynamic treatment regime.

Authors: Bibhas Chakraborty; Eric B Laber; Ying-Qi Zhao
Journal: Clin Trials Date: 2014-06-12 Impact factor: 2.486

7. Tools for the Precision Medicine Era: How to Develop Highly Personalized Treatment Recommendations From Cohort and Registry Data Using Q-Learning.

Authors: Elizabeth F Krakow; Michael Hemmer; Tao Wang; Brent Logan; Mukta Arora; Stephen Spellman; Daniel Couriel; Amin Alousi; Joseph Pidala; Michael Last; Silvy Lachance; Erica E M Moodie
Journal: Am J Epidemiol Date: 2017-07-15 Impact factor: 4.897

8. Program for lung cancer screening and tobacco cessation: Study protocol of a sequential, multiple assignment, randomized trial.

Authors: Steven S Fu; Alexander J Rothman; David M Vock; Bruce Lindgren; Daniel Almirall; Abbie Begnaud; Anne Melzer; Kelsey Schertz; Susan Glaeser; Patrick Hammett; Anne M Joseph
Journal: Contemp Clin Trials Date: 2017-07-04 Impact factor: 2.226

9. Personalized Dose Finding Using Outcome Weighted Learning.

Authors: Guanhua Chen; Donglin Zeng; Michael R Kosorok
Journal: J Am Stat Assoc Date: 2017-01-04 Impact factor: 5.033

10. A cure-rate model for Q-learning: Estimating an adaptive immunosuppressant treatment strategy for allogeneic hematopoietic cell transplant patients.

Authors: Erica E M Moodie; David A Stephens; Shomoita Alam; Mei-Jie Zhang; Brent Logan; Mukta Arora; Stephen Spellman; Elizabeth F Krakow
Journal: Biom J Date: 2018-05-16 Impact factor: 2.207