| Literature DB >> 34806001 |
Carolyn Lou1,2, Mohamad Habes3, Nicholas A Illenberger2, Ali Ezzati4, Richard B Lipton4, Pamela A Shaw2, Alisa J Stephens-Shields2, Hamed Akbari3, Jimit Doshi3, Christos Davatzikos3, Russell T Shinohara1,2,3.
Abstract
A key factor in designing randomized clinical trials is the sample size required to achieve a particular level of power to detect the benefit of a treatment. Sample size calculations depend upon the expected benefits of a treatment (effect size), the accuracy of measurement of the primary outcome, and the level of power specified by the investigators. In this study, we show that radiomic models, which leverage complex brain MRI patterns and machine learning, can be utilized in clinical trials with protocols that incorporate baseline MR imaging to significantly increase statistical power to detect treatment effects. Akin to the historical control paradigm, we propose to utilize a radiomic prediction model to generate a pseudo-control sample for each individual in the trial of interest. Because the variability of expected outcome across patients can mask our ability to detect treatment effects, we can increase the power to detect a treatment effect in a clinical trial by reducing that variability through using radiomic predictors as surrogates. We illustrate this method with simulations based on data from two cohorts in different neurologic diseases, Alzheimer's disease and glioblastoma multiforme. We present sample size requirements across a range of effect sizes using conventional analysis and models that include a radiomic predictor. For our Alzheimer's disease cohort, at an effect size of 0.35, total sample size requirements for 80% power declined from 246 to 212 for the endpoint cognitive decline. For our glioblastoma multiforme cohort, at an effect size of 1.65 with the endpoint survival time, total sample size requirements declined from 128 to 74. This methodology can decrease the required sample sizes by as much as 50%, depending on the strength of the radiomic predictor. The power of this method grows with increased accuracy of radiomic prediction, and furthermore, this method is most helpful when treatment effect sizes are small. Neuroimaging biomarkers are a powerful and increasingly common suite of tools that are, in many cases, highly predictive of disease outcomes. Here, we explore the possibility of using MRI-based radiomic biomarkers for the purpose of improving statistical power in clinical trials in the contexts of brain cancer and prodromal Alzheimer's disease. These methods can be applied to a broad range of neurologic diseases using a broad range of predictors of outcome to make clinical trials more efficient.Entities:
Keywords: clinical trials; machine learning; neuroimaging
Year: 2021 PMID: 34806001 PMCID: PMC8600962 DOI: 10.1093/braincomms/fcab264
Source DB: PubMed Journal: Brain Commun ISSN: 2632-1297
Figure 1Method visualization and description. (A) Workflow for implementing the proposed method in a new clinical trial. B (continuous) and C (survival outcome): Schematic diagram for individualized predictions that are generated for each person in the current trial, where the solid red lines indicate observed outcome for the participants of the current trial and the dashed blue lines indicate predicted outcome for those participants had they not been treated. (B) illustrates the method for continuous outcomes, where the left side represents the outcomes of those randomized to the control arm and the right side represents the outcomes of treated participants. The predicted outcome values (dashed blue lines) for the control units had they not been treated would be exactly what they are observed to be (solid red lines), while the predicted outcome values (dashed blue lines) for the treated units had they not been treated are different from the observed outcome (solid red lines). (C) illustrates the analogous mechanism for survival outcomes, where the predicted survival times for the control units (dashed blue lines) are the same as the observed survival times (solid red lines), whereas the predicted survival for the treated individuals are lower than the observed survival times. Our method capitalizes on these differences to augment statistical power.
Figure 2Plasmode simulation results. Results from simulated studies under two scenarios. With the addition of historical controls, the minimum required sample size for 80% power is markedly lower than using classical two-sample clinical trial analysis. These figures show minimum sample size (vertical axes) required to achieve 80% power for a range of effect sizes (horizontal axes) based on observed outcome and radiomic predictions. (A) shows the results from simulations for continuous outcome measures of cognition in our Alzheimer’s cohort from ADNI, analysed using a linear regression model with and without incorporation of the radiomic predictor (left). (B) shows the results from simulations for survival in our glioblastoma cohort, comprised of 134 patients who were treated for newly diagnosed GBM at the Hospital of the University of Pennsylvania between 2006 and 2013 and analysed with an accelerated failure time model with and without incorporation of the radiomic predictor. Note that the proposed method that leverages historical controls to build radiomic predictions (red) requires lower samples sizes than the classical approach (blue). Minimum required sample size was calculated as the smallest sample size that achieved 80% power as calculated by the percentage of Monte Carlo simulations with a non-zero treatment effect that were significant at the = 0.05 level.
Minimum required sample size for different powers and effect sizes
| Cohort-outcome | Power | Effect size | Minimum sample size | |
|---|---|---|---|---|
| With historical controls | Without historical controls | |||
| ADNI-continuous | 0.8 | 0.40 | 174 | 200 |
| 0.46 | 146 | 170 | ||
| 0.61 | 74 | 88 | ||
| 0.9 | 0.40 | 228 | 263 | |
| 0.46 | 172 | 204 | ||
| 0.61 | 97 | 112 | ||
| ADNI-survival | 0.8 | 1.7 | 242 | 287 |
| 1.8 | 206 | 254 | ||
| 1.9 | 180 | 219 | ||
| 0.9 | 1.7 | — | — | |
| 1.8 | 271 | 332 | ||
| 1.9 | 238 | 292 | ||
| GBM-survival | 0.8 | 1.8 | 56 | 98 |
| 1.9 | 44 | 82 | ||
| 2 | 38 | 70 | ||
| 0.9 | 1.8 | 74 | 130 | |
| 1.9 | 60 | 108 | ||
| 2 | 50 | 90 | ||
In this table, we provide the minimum required sample size for both 80% and 90% power across a range of effect sizes in both our ADNI cohort and our cohort of patients with glioblastoma multiforme (GBM). For our ADNI dataset, because the radiomic predictor of interest is known to be an accurate predictor of MCI to Alzheimer’s disease conversion time, we also explored the utility of incorporating this method into a survival analysis.
Figure 3Synthetic data simulation results. Results from simulated studies with synthetic data generated to be homogenous across all cohorts except for random error and treatment status. These figures show the minimum sample size required to achieve 80% power for a range of effect sizes, where minimum sample size was calculated as the smallest sample size for which at least 80% of Monte Carlo simulations with a non-zero treatment effect were significant at the = 0.05 level. (A) shows the results for simulations with a continuous outcome, analysed using linear regression with and without incorporation of the radiomic predictor, and (B) shows the results for simulations with a survival outcome, analysed using an accelerated failure time model with and without incorporation of the radiomic predictor. In both cases, the proposed method that leverages historical controls in the form of radiomic predictions (red) requires lower sample sizes than the classical approach (blue).