| Literature DB >> 26890628 |
Bibhas Chakraborty1, Palash Ghosh2, Erica E M Moodie3, A John Rush4.
Abstract
A dynamic treatment regimen consists of decision rules that recommend how to individualize treatment to patients based on available treatment and covariate history. In many scientific domains, these decision rules are shared across stages of intervention. As an illustrative example, we discuss STAR*D, a multistage randomized clinical trial for treating major depression. Estimating these shared decision rules often amounts to estimating parameters indexing the decision rules that are shared across stages. In this article, we propose a novel simultaneous estimation procedure for the shared parameters based on Q-learning. We provide an extensive simulation study to illustrate the merit of the proposed method over simple competitors, in terms of the treatment allocation matching of the procedure with the "oracle" procedure, defined as the one that makes treatment recommendations based on the true parameter values as opposed to their estimates. We also look at bias and mean squared error of the individual parameter-estimates as secondary metrics. Finally, we analyze the STAR*D data using the proposed method.Entities:
Keywords: Dynamic treatment regimens; Q-learning; STAR*D; Shared parameters
Mesh:
Year: 2016 PMID: 26890628 PMCID: PMC4988949 DOI: 10.1111/biom.12493
Source DB: PubMed Journal: Biometrics ISSN: 0006-341X Impact factor: 2.571