| Literature DB >> 34085036 |
Adam Kapelner1, Justin Bleich2, Alina Levine1, Zachary D Cohen3, Robert J DeRubeis3, Richard Berk2.
Abstract
We present methodological advances in understanding the effectiveness of personalized medicine models and supply easy-to-use open-source software. Personalized medicine involves the systematic use of individual patient characteristics to determine which treatment option is most likely to result in a better average outcome for the patient. Why is personalized medicine not done more in practice? One of many reasons is because practitioners do not have any easy way to holistically evaluate whether their personalization procedure does better than the standard of care, termed improvement. Our software, "Personalized Treatment Evaluator" (the R package PTE), provides inference for improvement out-of-sample in many clinical scenarios. We also extend current methodology by allowing evaluation of improvement in the case where the endpoint is binary or survival. In the software, the practitioner inputs 1) data from a single-stage randomized trial with one continuous, incidence or survival endpoint and 2) an educated guess of a functional form of a model for the endpoint constructed from domain knowledge. The bootstrap is then employed on data unseen during model fitting to provide confidence intervals for the improvement for the average future patient (assuming future patients are similar to the patients in the trial). One may also test against a null scenario where the hypothesized personalization are not more useful than a standard of care. We demonstrate our method's promise on simulated data as well as on data from a randomized comparative trial investigating two treatments for depression.Entities:
Keywords: bootstrap; inference; personalized medicine; randomized comparative trial; statistical software; treatment regimes
Year: 2021 PMID: 34085036 PMCID: PMC8167073 DOI: 10.3389/fdata.2021.572532
Source DB: PubMed Journal: Front Big Data ISSN: 2624-909X
A compendium of the main notation in our methodology by section.
| Notation | Description |
|---|---|
| Framework ( | |
| | The random variable (r.v.) for the outcomes for the subjects |
| | The r. v. for the observed measurements for the subjects, its support |
| | The r. v. for the treatment |
| | The two treatments, shorthand for their codes, zero and one |
| | The decision function; this function maps observed measurements to treatment |
| | The value of the decision function, the average outcome over all patients if this decision is used to allocate treatment |
| | The unknown optimal decision function i.e. the one with highest |
| | A naive, baseline, business-as-usual or null decision function |
| | The unknown improvement of |
| The RCT data ( | |
| | The number of subjects in the randomized comparative trial (RCT) |
| | The number of measurements assessed on each subject |
| | The vector of |
| | The jth measurement for the ith subject |
| | The |
| | The vector of treatments for all the |
| | The vector of outcomes for all the |
| The response model ( | |
| | The function that relates the |
| | The r. v. for the unknown covariates for subject |
| | The function that computes misspecification in the response |
| | The r. v. for irreducible noise for the ith subject |
| | The linear coefficient for the |
| | The additional linear coefficient for the |
| Out of sample estimation and validation ( | |
| | The subset of the data used to create the fit of |
| | The subset of the data used to validate the fit of |
| | The finite-sample estimates of |
| | The arithmetic average of |
| | The finite-sample estimates of |
| Inference ( | |
| | The number of bootstrap samples |
| | A sample of the rows of |
| | The |
| | The size of the hypothesis test |
| Personalization of future subjects’ treatments ( | |
| | A future subject (not part of the RCT) |
FIGURE 1A graphical illustration of (1) our proposed method for estimation and (2) our proposed method for inference on the population mean improvement of an allocation procedure and (3) our proposed future allocation procedure (top left of the illustration). To compute the best estimate of the improvement , the RCT data goes through the K-fold cross validation procedure of Section 3.4 (depicted in the top center). The black slices of the data frame represent the test data. To draw inference, we employ the non-parametric bootstrap procedure of Section 3.5 by sampling the RCT data with replacement and repeating the K-fold CV to produce (bottom). The gray slices of the data frame represent the duplicate rows in the original data due to sampling with replacement. The confidence interval and significance of is computed from the bootstrap distribution (middle center). Finally, the practitioner receives which is built with the complete RCT data (top left).
The elements of cross-tabulated by their administered treatment and our model’s estimate of the better treatment (x ).
|
|
| |
|---|---|---|
|
| P | Q |
|
| R | S |
FIGURE 2Histograms of the bootstrap samples of the out-of-sample improvement measures for random (left column) and best (right column) for the response model of Eq. 11 for different values of n. is illustrated with a thick black line. The computed by the percentile method is illustrated by thin black lines.
FIGURE 3Histograms of the bootstrap samples of the cross-validated improvement measures for random (left column) and best (right column) for the response model of Eq. 12 for different values of n. is illustrated with a thick black line. The computed via the percentile method is illustrated by thin black lines. The true population improvement given the optimal rule is illustrated with a dotted black line.
Baseline characteristics of the subjects in the clinical trial example for the moderating variables employed in our personalization model. These statistics differ slightly from those found in the table of DeRubeis et al. (2005, page 412) as here they are tabulated for subjects only after dropout ().
| Variable | Sample Average or Proportion |
|---|---|
| Age | 40.3 |
| Chronicity | 55.1% |
| Life stressors | 6.6 |
| Personality disorder | 48.1% |
| Unemployed | 14.9% |
| Married | 37.6% |
FIGURE 4Histograms of the bootstrap samples of i.e. for the random business-as-usual allocation procedure. The thick black line is the best estimate of , the thin black lines are the confidence interval computed via the percentile method. More negative values are “better” as improvement is defined as lowering the HSRD composite score corresponding to a patient being less depressed.