Literature DB >> 30420972

Deep advantage learning for optimal dynamic treatment regime.

Shuhan Liang1, Wenbin Lu1, Rui Song1.   

Abstract

Recently deep learning has successfully achieved state-of-the-art performance on many difficult tasks. Deep neural network outperforms many existing popular methods in the field of reinforcement learning. It can also identify important covariates automatically. Parameter sharing of convolutional neural network (CNN) greatly reduces the amount of parameters in the neural network, which allows for high scalability. However few research has been done on deep advantage learning (A-learning). In this paper, we present a deep A-learning approach to estimate optimal dynamic treatment regime. A-learning models the advantage function, which is of direct relevance to the goal. We use an inverse probability weighting (IPW) method to estimate the difference between potential outcomes, which does not require to make any model assumption on the baseline mean function. We implemented different architectures of deep CNN and convexified convolutional neural networks (CCNN). The proposed deep A-learning methods are applied to a data from the STAR*D trial and are shown to have better performance compared with the penalized least square estimator using a linear decision rule.

Entities:  

Keywords:  Advantage Learning; Convexified Convolutional Neural Networks; Convolutional Neural Networks; Dynamic Treatment Regime; Inverse Probability Weighting

Year:  2018        PMID: 30420972      PMCID: PMC6226036          DOI: 10.1080/24754269.2018.1466096

Source DB:  PubMed          Journal:  Stat Theory Relat Fields


  12 in total

1.  Mastering the game of Go with deep neural networks and tree search.

Authors:  David Silver; Aja Huang; Chris J Maddison; Arthur Guez; Laurent Sifre; George van den Driessche; Julian Schrittwieser; Ioannis Antonoglou; Veda Panneershelvam; Marc Lanctot; Sander Dieleman; Dominik Grewe; John Nham; Nal Kalchbrenner; Ilya Sutskever; Timothy Lillicrap; Madeleine Leach; Koray Kavukcuoglu; Thore Graepel; Demis Hassabis
Journal:  Nature       Date:  2016-01-28       Impact factor: 49.962

2.  Demystifying optimal dynamic treatment regimes.

Authors:  Erica E M Moodie; Thomas S Richardson; David A Stephens
Journal:  Biometrics       Date:  2007-06       Impact factor: 2.571

Review 3.  Deep learning.

Authors:  Yann LeCun; Yoshua Bengio; Geoffrey Hinton
Journal:  Nature       Date:  2015-05-28       Impact factor: 49.962

4.  Q- and A-learning Methods for Estimating Optimal Dynamic Treatment Regimes.

Authors:  Phillip J Schulte; Anastasios A Tsiatis; Eric B Laber; Marie Davidian
Journal:  Stat Sci       Date:  2014-11       Impact factor: 2.901

5.  Robust learning for optimal treatment decision with NP-dimensionality.

Authors:  Chengchun Shi; Rui Song; Wenbin Lu
Journal:  Electron J Stat       Date:  2016-10-13       Impact factor: 1.125

6.  Variable selection for optimal treatment decision.

Authors:  Wenbin Lu; Hao Helen Zhang; Donglin Zeng
Journal:  Stat Methods Med Res       Date:  2011-11-23       Impact factor: 3.021

7.  PERFORMANCE GUARANTEES FOR INDIVIDUALIZED TREATMENT RULES.

Authors:  Min Qian; Susan A Murphy
Journal:  Ann Stat       Date:  2011-04-01       Impact factor: 4.028

8.  On Sparse representation for Optimal Individualized Treatment Selection with Penalized Outcome Weighted Learning.

Authors:  Rui Song; Michael Kosorok; Donglin Zeng; Yingqi Zhao; Eric Laber; Ming Yuan
Journal:  Stat       Date:  2015

Review 9.  Background and rationale for the sequenced treatment alternatives to relieve depression (STAR*D) study.

Authors:  Maurizio Fava; A John Rush; Madhukar H Trivedi; Andrew A Nierenberg; Michael E Thase; Harold A Sackeim; Frederic M Quitkin; Steven Wisniewski; Philip W Lavori; Jerrold F Rosenbaum; David J Kupfer
Journal:  Psychiatr Clin North Am       Date:  2003-06

10.  Estimating Optimal Treatment Regimes from a Classification Perspective.

Authors:  Baqun Zhang; Anastasios A Tsiatis; Marie Davidian; Min Zhang; Eric Laber
Journal:  Stat       Date:  2012-01-01
View more
  1 in total

1.  Representation Learning for Integrating Multi-domain Outcomes to Optimize Individualized Treatments.

Authors:  Yuan Chen; Donglin Zeng; Tianchen Xu; Yuanjia Wang
Journal:  Adv Neural Inf Process Syst       Date:  2020-12
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.