Literature DB >> 30984321

TREE-BASED REINFORCEMENT LEARNING FOR ESTIMATING OPTIMAL DYNAMIC TREATMENT REGIMES.

Yebin Tao1, Lu Wang1, Daniel Almirall2.   

Abstract

Dynamic treatment regimes (DTRs) are sequences of treatment decision rules, in which treatment may be adapted over time in response to the changing course of an individual. Motivated by the substance use disorder (SUD) study, we propose a tree-based reinforcement learning (T-RL) method to directly estimate optimal DTRs in a multi-stage multi-treatment setting. At each stage, T-RL builds an unsupervised decision tree that directly handles the problem of optimization with multiple treatment comparisons, through a purity measure constructed with augmented inverse probability weighted estimators. For the multiple stages, the algorithm is implemented recursively using backward induction. By combining semiparametric regression with flexible tree-based learning, T-RL is robust, efficient and easy to interpret for the identification of optimal DTRs, as shown in the simulation studies. With the proposed method, we identify dynamic SUD treatment regimes for adolescents.

Entities:  

Keywords:  Multi-stage decision-making; backward induction; classification; decision tree; personalized medicine

Year:  2018        PMID: 30984321      PMCID: PMC6457899          DOI: 10.1214/18-AOAS1137

Source DB:  PubMed          Journal:  Ann Appl Stat        ISSN: 1932-6157            Impact factor:   2.083


  4 in total

1.  Step-adjusted tree-based reinforcement learning for evaluating nested dynamic treatment regimes using test-and-treat observational data.

Authors:  Ming Tang; Lu Wang; Michael A Gorin; Jeremy M G Taylor
Journal:  Stat Med       Date:  2021-09-07       Impact factor: 2.373

2.  Optimal dynamic treatment regime estimation using information extraction from unstructured clinical text.

Authors:  Nina Zhou; Robert D Brook; Ivo D Dinov; Lu Wang
Journal:  Biom J       Date:  2022-02-03       Impact factor: 1.715

Review 3.  Artificial intelligence in reproductive medicine.

Authors:  Renjie Wang; Wei Pan; Lei Jin; Yuehan Li; Yudi Geng; Chun Gao; Gang Chen; Hui Wang; Ding Ma; Shujie Liao
Journal:  Reproduction       Date:  2019-10       Impact factor: 3.906

4.  What Are the Tradeoffs in Outcomes after Casting Versus Surgery for Closed Extraarticular Distal Radius Fractures in Older Patients? A Statistical Learning Model.

Authors:  Alfred P Yoon; Yibo Wang; Lu Wang; Kevin C Chung
Journal:  Clin Orthop Relat Res       Date:  2021-12-01       Impact factor: 4.176

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.