| Literature DB >> 21283698 |
Edward J Mills1, Isabella Ghement, Christopher O'Regan, Kristian Thorlund.
Abstract
BACKGROUND: Indirect comparisons are becoming increasingly popular for evaluating medical treatments that have not been compared head-to-head in randomized clinical trials (RCTs). While indirect methods have grown in popularity and acceptance, little is known about the fragility of confidence interval estimations and hypothesis testing relying on this method.Entities:
Mesh:
Year: 2011 PMID: 21283698 PMCID: PMC3025012 DOI: 10.1371/journal.pone.0016237
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Combination of values for three of the parameters included in the simulation study, namely , and , along with corresponding values of and .
|
|
|
|
|
|
| 1.4 | 1.4 | 10% | 13% | 13% |
| 1.4 | 1.4 | 30% | 37% | 37% |
| 1.2 | 1.4 | 10% | 12% | 13% |
| 1.2 | 1.4 | 30% | 34% | 38% |
| 0.65 | 0.75 | 40% | 30% | 33% |
Percentage of simulations producing indirect estimates of exceeding a given threshold corresponding to the simulation settings where .
|
|
| |||||||
| Threshold for judging over-estimation of |
|
| ||||||
|
|
|
|
|
|
|
|
| |
| 5 | 1 | 1.40 | 36.98 | 35.98 | 38.34 | 28.66 | 32.70 | 36.72 |
| 1.52 | 31.00 | 30.86 | 34.04 | 21.04 | 25.96 | 31.34 | ||
| 1.75 | 21.90 | 22.08 | 27.00 | 11.44 | 16.44 | 23.14 | ||
| 2.05 | 14.38 | 14.56 | 19.98 | 6.04 | 9.36 | 15.40 | ||
| 10 | 1 | 1.40 | 35.30 | 36.56 | 37.58 | 27.92 | 31.58 | 34.88 |
| 1.52 | 29.58 | 30.30 | 32.90 | 19.84 | 24.22 | 29.88 | ||
| 1.75 | 21.54 | 22.66 | 26.22 | 10.58 | 14.74 | 22.20 | ||
| 2.05 | 14.52 | 15.14 | 18.80 | 5.72 | 7.96 | 15.32 | ||
| 25 | 1 | 1.40 | 34.38 | 36.22 | 37.92 | 26.06 | 30.86 | 36.18 |
| 1.52 | 27.72 | 30.44 | 33.18 | 18.18 | 23.94 | 30.84 | ||
| 1.75 | 19.26 | 21.70 | 25.28 | 10.60 | 14.40 | 21.78 | ||
| 2.05 | 12.56 | 14.46 | 18.10 | 5.40 | 7.58 | 13.96 | ||
| 100 | 1 | 1.40 | 34.12 | 35.16 | 37.18 | 26.02 | 31.24 | 36.70 |
| 1.52 | 27.90 | 29.62 | 32.50 | 19.22 | 23.82 | 31.06 | ||
| 1.75 | 19.94 | 21.12 | 25.12 | 10.16 | 14.30 | 22.76 | ||
| 2.05 | 12.90 | 13.74 | 18.1 | 5.18 | 8.04 | 14.42 | ||
| 5 | 5 | 1.40 | 24.68 | 26.18 | 30.20 | 15.30 | 19.74 | 28.20 |
| 1.52 | 16.34 | 18.08 | 23.24 | 7.16 | 11.26 | 20.52 | ||
| 1.75 | 6.26 | 8.54 | 13.92 | 1.44 | 3.58 | 10.32 | ||
| 2.05 | 2.12 | 3.04 | 7.14 | 0.18 | 0.68 | 4.34 | ||
| 10 | 5 | 1.40 | 22.00 | 22.88 | 28.78 | 11.36 | 16.56 | 24.98 |
| 1.52 | 12.38 | 14.54 | 20.66 | 4.22 | 8.22 | 16.86 | ||
| 1.75 | 4.24 | 5.66 | 10.28 | 0.54 | 2.16 | 6.78 | ||
| 2.05 | 0.88 | 1.18 | 3.84 | 0.00 | 0.26 | 1.90 | ||
| 25 | 5 | 1.40 | 18.52 | 19.56 | 25.16 | 13.46 | 13.46 | 22.88 |
| 1.52 | 9.58 | 11.24 | 17.46 | 5.78 | 5.78 | 14.28 | ||
| 1.75 | 2.34 | 3.50 | 7.98 | 0.84 | 0.84 | 4.96 | ||
| 2.05 | 0.30 | 0.62 | 2.8 | 0.08 | 0.08 | 1.40 | ||
| 100 | 5 | 1.40 | 17.16 | 18.40 | 23.58 | 6.62 | 8.76 | 20.66 |
| 1.52 | 8.90 | 9.84 | 14.78 | 1.80 | 2.56 | 11.92 | ||
| 1.75 | 2.22 | 2.36 | 5.92 | 0.12 | 0.06 | 4.20 | ||
| 2.05 | 0.34 | 0.36 | 1.62 | 0.00 | 0.06 | 0.66 | ||
Four different thresholds were considered for each simulation setting: 1.40, 1.52, 1.75 and 2.05. These thresholds were chosen to represent an approximate increase of 20%, 30%, 50% and 75% in the value of . Reported percentages quantify the degree to which Bucher's method over-estimates. (Note: The true average event rate in group A was either 10% or 30%).
Percentage of simulations producing indirect estimates of exceeding a given threshold corresponding to the simulation settings where (or, equivalently, ).
|
| |||||
| Threshold for judging over-estimation of |
| ||||
|
|
|
|
|
| |
| 5 | 1 | 1.38 | 28.48 | 30.98 | 37.4 |
| 1.49 | 21.12 | 24.92 | 32.04 | ||
| 1.72 | 11.54 | 15.68 | 23.78 | ||
| 2.01 | 5.54 | 8.78 | 15.92 | ||
| 10 | 1 | 1.38 | 26.46 | 31.56 | 34.72 |
| 1.49 | 19.24 | 25.12 | 29.64 | ||
| 1.72 | 10.26 | 14.52 | 21.86 | ||
| 2.01 | 5.44 | 7.46 | 14.38 | ||
| 25 | 1 | 1.38 | 27.26 | 30.00 | 35.78 |
| 1.49 | 20.12 | 23.58 | 30.52 | ||
| 1.72 | 9.98 | 13.66 | 22.20 | ||
| 2.01 | 5.36 | 7.10 | 14.34 | ||
| 100 | 1 | 1.38 | 26.12 | 30.00 | 35.76 |
| 1.49 | 19.46 | 23.84 | 30.18 | ||
| 1.72 | 9.52 | 14.00 | 21.18 | ||
| 2.01 | 4.92 | 8.22 | 13.46 | ||
| 5 | 5 | 1.38 | 15.3 | 20.72 | 28.64 |
| 1.49 | 7.88 | 12.50 | 21.06 | ||
| 1.72 | 1.24 | 3.24 | 11.18 | ||
| 2.01 | 0.10 | 0.66 | 4.64 | ||
| 10 | 5 | 1.38 | 11.46 | 17.42 | 25.04 |
| 1.49 | 4.62 | 9.28 | 17.46 | ||
| 1.72 | 0.44 | 2.20 | 7.70 | ||
| 2.01 | 0.04 | 0.32 | 2.16 | ||
| 25 | 5 | 1.38 | 8.68 | 14.64 | 22.76 |
| 1.49 | 2.58 | 7.04 | 14.44 | ||
| 1.72 | 0.10 | 1.18 | 4.82 | ||
| 2.01 | 0.00 | 0.06 | 1.04 | ||
| 100 | 5 | 1.38 | 16.84 | 12.14 | 21.16 |
| 1.49 | 2.26 | 5.02 | 12.94 | ||
| 1.72 | 0.02 | 0.84 | 4.42 | ||
| 2.01 | 0.00 | 0.04 | 0.88 | ||
Four different thresholds were considered for each simulation setting: 1.38, 1.49, 1.72 and 2.01. These thresholds were chosen to represent an approximate increase of 20%, 30%, 50% and 75% in the value of . Reported percentages quantify the degree to which Bucher's method over-estimates. (Note: The true average event rate in group A was 40%).
Coverage of the 95% confidence interval estimation method of Bucher for .
|
|
|
|
| ||||
|
|
| ||||||
|
|
|
|
|
|
| ||
| 5 | 1 | 96.02 | 94.28 | 87.90 | 95.44 | 89.94 | 81.10 |
| 10 | 1 | 96.06 | 94.00 | 87.78 | 95.68 | 90.38 | 79.52 |
| 25 | 1 | 96.20 | 93.70 | 86.98 | 95.42 | 89.70 | 77.04 |
| 100 | 1 | 96.18 | 93.40 | 85.84 | 95.12 | 88.88 | 76.24 |
| 5 | 5 | 95.24 | 93.20 | 91.70 | 95.02 | 92.86 | 91.22 |
| 10 | 5 | 96.00 | 95.22 | 92.54 | 95.90 | 93.96 | 92.26 |
| 25 | 5 | 96.72 | 95.22 | 92.50 | 96.90 | 93.28 | 91.32 |
| 100 | 5 | 96.68 | 94.80 | 92.50 | 96.88 | 93.44 | 90.44 |
For each simulation setting, coverage was assessed by tracking the percentage of simulations producing confidence intervals for that captured the true value of . For settings where , the true value of was . (Note: The true average event rate in group A was either 10% or 30%).
Coverage of the 95% confidence interval estimation method of Bucher for .
|
|
|
|
| ||||
|
|
| ||||||
|
|
|
|
|
|
| ||
| 5 | 1 | 95.86 | 92.44 | 88.12 | 95.34 | 90.74 | 81.80 |
| 10 | 1 | 96.20 | 93.82 | 86.80 | 95.42 | 90.14 | 78.44 |
| 25 | 1 | 96.68 | 93.26 | 86.12 | 94.92 | 90.32 | 77.20 |
| 100 | 1 | 95.80 | 92.84 | 86.06 | 95.50 | 88.58 | 74.16 |
| 5 | 5 | 95.08 | 93.74 | 91.74 | 95.08 | 92.58 | 91.36 |
| 10 | 5 | 96.28 | 95.04 | 92.70 | 96.00 | 93.76 | 92.12 |
| 25 | 5 | 96.80 | 94.60 | 92.24 | 96.06 | 93.62 | 90.66 |
| 100 | 5 | 97.04 | 95.28 | 90.88 | 97.30 | 93.00 | 89.50 |
For each simulation setting, coverage was assessed by tracking the percentage of simulations producing confidence intervals for that captured the true value of . For settings where , the true value of was . (Note: The true average event rate in group A was either 10% or 30%).
Coverage of the 95% confidence interval estimation method of Bucher for .
|
|
|
| ||
|
| ||||
|
|
|
| ||
| 5 | 1 | 94.94 | 89.12 | 80.34 |
| 10 | 1 | 94.70 | 89.12 | 78.58 |
| 25 | 1 | 95.72 | 89.14 | 75.82 |
| 100 | 1 | 95.30 | 87.78 | 75.66 |
| 5 | 5 | 95.28 | 93.00 | 91.10 |
| 10 | 5 | 95.92 | 93.22 | 91.64 |
| 25 | 5 | 96.70 | 93.02 | 90.72 |
| 100 | 5 | 96.72 | 93.00 | 89.70 |
For each simulation setting, coverage was assessed by tracking the percentage of simulations producing confidence intervals for that captured the true value of . For settings where , the true value of was 1.15. (Note: The true average event rate for group A was 40%).
Type I error associated with the test of the hypotheses versus .
|
|
| ||||||
|
|
| ||||||
|
|
|
|
|
|
|
|
|
| 5 | 1 | 3.98 | 5.72 | 12.10 | 4.56 | 10.06 | 18.90 |
| 10 | 1 | 3.94 | 6.00 | 12.22 | 4.32 | 9.62 | 20.48 |
| 25 | 1 | 3.80 | 6.30 | 13.02 | 4.58 | 10.30 | 22.96 |
| 100 | 1 | 3.82 | 6.60 | 14.16 | 4.88 | 11.12 | 23.76 |
| 5 | 5 | 4.76 | 6.80 | 8.30 | 4.98 | 7.14 | 8.78 |
| 10 | 5 | 4.00 | 7.78 | 7.46 | 4.10 | 6.04 | 7.74 |
| 25 | 5 | 3.28 | 4.78 | 7.50 | 3.10 | 6.72 | 8.68 |
| 100 | 5 | 3.32 | 5.20 | 7.50 | 3.12 | 6.06 | 9.56 |
For each simulation setting where (or, equivalently, ), Type I error was assessed by tracking the percentage of simulations that produced 95% confidence intervals that excluded the value . (Note: The true average event rate in group A was either 10% or 30%).
Power associated with the test of the hypotheses versus .
|
|
| ||||||
|
|
| ||||||
|
|
|
|
|
|
|
|
|
| 5 | 1 | 6.06 | 7.56 | 13.04 | 7.06 | 12.16 | 19.60 |
| 10 | 1 | 5.60 | 7.58 | 13.70 | 8.18 | 12.70 | 22.38 |
| 25 | 1 | 4.88 | 8.12 | 14.94 | 8.50 | 13.46 | 24.62 |
| 100 | 1 | 5.60 | 8.18 | 15.54 | 7.94 | 14.42 | 27.14 |
| 5 | 5 | 8.38 | 9.54 | 9.76 | 13.04 | 12.32 | 11.20 |
| 10 | 5 | 8.76 | 9.04 | 10.22 | 14.08 | 12.94 | 11.12 |
| 25 | 5 | 9.38 | 9.82 | 11.54 | 15.58 | 15.36 | 14.64 |
| 100 | 5 | 10.42 | 10.76 | 12.98 | 16.60 | 17.74 | 14.84 |
For each simulation setting where (or, equivalently, ), power was assessed by tracking the percentage of simulations that produced 95% confidence intervals for that excluded the value . (Note: The true average event rate in group A was either 10% or 30%).
Power associated with the test of the hypotheses versus .
|
| ||||
|
| ||||
|
|
|
|
|
|
| 5 | 1 | 7.80 | 12.96 | 21.16 |
| 10 | 1 | 7.42 | 13.00 | 22.92 |
| 25 | 1 | 8.08 | 13.06 | 24.76 |
| 100 | 1 | 8.36 | 15.02 | 26.18 |
| 5 | 5 | 12.42 | 12.12 | 11.40 |
| 10 | 5 | 13.14 | 12.98 | 12.40 |
| 25 | 5 | 14.18 | 14.86 | 13.68 |
| 100 | 5 | 16.84 | 15.96 | 15.04 |
For each simulation setting where (or, equivalently, ), power was assessed by tracking the percentage of simulations that produced 95% confidence intervals for that excluded the value . (Note: The true average event rate in group A was 40%).