| Literature DB >> 25642116 |
Ying Liu1, Donglin Zeng2, Yuanjia Wang1.
Abstract
Dynamic treatment regimens (DTRs) are sequential decision rules tailored at each point where a clinical decision is made based on each patient's time-varying characteristics and intermediate outcomes observed at earlier points in time. The complexity, patient heterogeneity, and chronicity of mental disorders call for learning optimal DTRs to dynamically adapt treatment to an individual's response over time. The Sequential Multiple Assignment Randomized Trial (SMARTs) design allows for estimating causal effects of DTRs. Modern statistical tools have been developed to optimize DTRs based on personalized variables and intermediate outcomes using rich data collected from SMARTs; these statistical methods can also be used to recommend tailoring variables for designing future SMART studies. This paper introduces DTRs and SMARTs using two examples in mental health studies, discusses two machine learning methods for estimating optimal DTR from SMARTs data, and demonstrates the performance of the statistical methods using simulated data.Entities:
Keywords: O-learning; Q-learning; SMART; double robust estimation; dynamic treatment regimes; personalized medicine
Year: 2014 PMID: 25642116 PMCID: PMC4311115 DOI: 10.11919/j.issn.1002-0829.214172
Source DB: PubMed Journal: Shanghai Arch Psychiatry ISSN: 1002-0829
Primary analysis questions and example in the ADHD study
| Type of primary question | Example in the Attention Deficit/Hyperactivity Disorder(ADHD)study |
|---|---|
| Comparing first-stage | Compare the potential outcomes for patients beginning with low-intensity behavior modification(BMOD)and low-dose oral methamphetamine(MEDS) |
| Comparing | Among patients who do not respond to the first stage treatment, compare intensifying the initial intervention versus augmenting the initial intervention with the alternative intervention |
| Comparing adaptive | There are four imbedded adaptive interventions: |
Table 2. Standardized coefficients for the optimal dynamic treatment rule estimated by various methods using data from the Attention Deficit/Hyperactivity Disorder (ADHD) studya
| stage 1 | stage 2 | |||||
|---|---|---|---|---|---|---|
| Q-L | AMOLb | Q-L | AMOLb | |||
| Intercept | 3.454 | 0 | Intercept | 2.889 | 0 | |
| ODD diagnosis | -0.199 | -0.229 | ODD Diagnosis | -0.144 | 0 | |
| Baseline ADHD score | -0.357 | 0.276 | ADHD score | -0.28 | 0 | |
| Prior medication | -0.028 | -1.557 | Prior medication | 0.012 | 0 | |
| White race | 0.211 | 0.456 | White race | 0.247 | 0.088 | |
| trt1 (1 for BMOD; -1 for MED) | 0.225 | trt1 | 0.273 | -0.043 | ||
| ODD diagnosis* trt1 | -0.068 | ODD diagnosis* trt1 | -0.141 | 0 | ||
| ADHD *trt1 | 0.163 | ADHD *trt1 | 0.075 | 0 | ||
| Prior medication*trt1 | -0.348 | Prior medication *trt1 | -0.049 | 0 | ||
| race*trt1 | 0.086 | White race*trt1 | 0.11 | 0.088 | ||
| Months to non-response | -0.015 | 0 | ||||
| Adherence to trt1 | 0.003 | 0.999 | ||||
| Months to non-response*trt1 | -0.33 | 0 | ||||
| Adherence to trt1*trt1 | 0.09 | 0 | ||||
| trt2 | -0.385 | |||||
| … | ||||||
| Adherence to trt1*trt2 | 0.633 | |||||
Q-L, Q-learning O-L, O-learning AMOL, Augmented Multi-stage Outcome-weighted Learning
ODD, Oppositional Defiant Disorder BMOD, Behavioral Modification MED, Medication
trt1: first stage treatment, 1=use BMOD; -1=use MED
trt2: second stage treatment, 1=intensify current treatment; -1=add alternative treatment
aQ-learning also included other interaction terms with trt2 which are omitted in the table
bThe reported coefficients were obtained from fitting a linear prediction rule for the outcome with listed variables included as covariates in AMOL. The estimated coefficients were the numbers displayed in this column multiplied by 0.001 for the ease to show relative magnitude of each variable (e.g., the estimated coefficient for prior medication was -0.001557).