| Literature DB >> 16613605 |
Rodney A Hayward1, David M Kent, Sandeep Vijan, Timothy P Hofer.
Abstract
BACKGROUND: When subgroup analyses of a positive clinical trial are unrevealing, such findings are commonly used to argue that the treatment's benefits apply to the entire study population; however, such analyses are often limited by poor statistical power. Multivariable risk-stratified analysis has been proposed as an important advance in investigating heterogeneity in treatment benefits, yet no one has conducted a systematic statistical examination of circumstances influencing the relative merits of this approach vs. conventional subgroup analysis.Entities:
Mesh:
Year: 2006 PMID: 16613605 PMCID: PMC1523355 DOI: 10.1186/1471-2288-6-18
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Figure 1Panel A shows how overall treatment benefit (net RRR is a function of [treatment benefit] – [treatment harm]) varies as a function of pretreatment risk (as estimated by the control event rate [CER] of an RCT) when a treatment decreases pre-treatment risk by 50% but at a cost of 0.003 treatment-related adverse events per treatment-year. As a result, lower risk patients are harmed by treatment and higher risk patients benefit from treatment. Panel B demonstrates two statistical phenomena: (1) that statistical power can be greatly enhanced by combining risk factors (RF's) into a risk index, and (2) statistical power is greatest when the study population includes more low risk patients
Statistical power when subgroup analysis is done for each risk factor one-at-a-time
* Risk Factor is one of six independent risk factors, with the other five risk factors having a prevalence of 25% and an odds ratio of 2 when the 5-year CER for those without any risk factors is .75% (see Table 1 and Figure).
† Statistical subgroup comparisons tests the power to detect whether those with the risk factor had a greater relative benefit from treatment than those without the risk factor.
Table 3. Statistical power when risk factors are combined into a risk score
* In each instance all risk factor have a prevalence of 25% and the 5-year CER for those without any risk factors is .75% (see Table 1 and Figure).
† Statistical subgroup comparisons tests the power to detect whether the treatment's relative benefit varies as a function of the risk score. The Area Under the Receiver Operator Characteristic (AUROC) curve is a measure of the overall predictiveness of a model for predicting a dichotomous outcome (i.e, event occurred vs. event did not occur).
The importance of the accuracy of the weighting of the risk prediction tool
* Each of the 6 risk factors (RF's) has a prevalence of 25% and the 5-year CER for those without any risk factors is .75% (see Table 1 and Figure). The true predictiveness (relative risk) of the risk factor is shown as well as the relative weight used in the risk index.
† The Area Under the Receiver Operator Characteristic (AUROC) curve is a measure of the overall predictiveness of a model for predicting a dichotomous outcome (i.e, event occurred vs. event did not occur).
‡ Statistical subgroup comparisons tests the power to detect whether the treatment's relative benefit varies as a function of the risk score.
How does the degree of treatment-related risk influence the statistical power of the subgroup analysis? *
| 1 | 6% | 39% |
| 2 | 11% | 59% |
| 3 | 21% | 83% |
| 4 | 28% | 92% |
* There are 6 risk factors (RF's) that each have a prevalence of 25% and a relative risk of 2, and the 5-year CER for those without any risk factors is .75% (see Table 1 and Figure).
† Statistical subgroup comparisons tests the power to detect whether the treatment's relative benefit varies as a function of the presence of the RF (for conventional subgroup analysis) or the risk score (for the multivariable risk-stratified analysis)
Results of conventional vs. risk-stratified analyses when treatment decreases pre-treatment risk by 50% but at a cost of 3 serious adverse events per year of treatment (6 independent risk factors (RF's) exist, each with a prevalence of 25%)
| True Control Event Rate (CER) | True Relative Risk Reduction (RRR) | True Number Needed to Treat (NNT) | Statistical Power of Subgroup Comparison* | |
| N = 8,800 (% of study population) | For 5-Year Follow-up | P < 0.05 | ||
| Risk factor absent (75%) | 2.2 | -.19† | -239† | .23 |
| Risk factor present (25%] | 4.2 | .13 | 183 | |
| 0–1 Risk factors (53.4%) | 1.4 | -.57† | -125† | .72 |
| ≥ 2 Risk factors (46.6%) | 4.4 | .16 | 143 | |
| 0 Risk factors (17.8%) | 0.75 | -1.59† | -88† | .83 |
| 1 Risk factors (35.6%) | 1.5 | -.51† | -132† | |
| 2 Risk factors (29.7%) | 3.0 | -.02 | -1936 | |
| 3 Risk factors (13.2%) | 6.0 | .21 | 83 | |
| ≥ 4 Risk factors (3.7%) | 12.8 | .35 | 24 | |
* For the subgroup comparisons, the statistical comparison tests whether the subgroup with the risk factor receives more or less benefit (two-tailed testing) than the subgroup without the risk factor (testing for an interaction between the risk factor and intervention [treatment vs. control] in a logistic regression model. 21 For example, the conventional subgroup comparison had a statistical power of 23% for detecting that those with the risk factor had a greater relative benefit from treatment than those without the risk factor.
† The minus sign denotes that treatment had net harm, rather than benefit.
‡ Area Under the Receiver Operator Characteristic (AUROC) curve for the Risk Index was 0.65.
Glossary
Risk Prediction Model – Predicting overall risk based upon combining information from multiple risk factors. Prediction models can be presented as full regression prediction model (such as predicted probability of death using APACHE), as a simple risk index (such as predicting birth outcomes using a 10 point APGAR score) or condensed into risk categories (low, medium, high peri-operative risk).