| Literature DB >> 31478240 |
L Nab1, R H H Groenwold1, P M J Welsing2, M van Smeden1.
Abstract
In randomised trials, continuous endpoints are often measured with some degree of error. This study explores the impact of ignoring measurement error and proposes methods to improve statistical inference in the presence of measurement error. Three main types of measurement error in continuous endpoints are considered: classical, systematic, and differential. For each measurement error type, a corrected effect estimator is proposed. The corrected estimators and several methods for confidence interval estimation are tested in a simulation study. These methods combine information about error-prone and error-free measurements of the endpoint in individuals not included in the trial (external calibration sample). We show that, if measurement error in continuous endpoints is ignored, the treatment effect estimator is unbiased when measurement error is classical, while Type-II error is increased at a given sample size. Conversely, the estimator can be substantially biased when measurement error is systematic or differential. In those cases, bias can largely be prevented and inferences improved upon using information from an external calibration sample, of which the required sample size increases as the strength of the association between the error-prone and error-free endpoint decreases. Measurement error correction using already a small (external) calibration sample is shown to improve inferences and should be considered in trials with error-prone endpoints. Implementation of the proposed correction methods is accommodated by a new software package for R.Entities:
Keywords: bias; clinical trials; continuous endpoints; correction methods; measurement error
Mesh:
Substances:
Year: 2019 PMID: 31478240 PMCID: PMC6900013 DOI: 10.1002/sim.8359
Source DB: PubMed Journal: Stat Med ISSN: 0277-6715 Impact factor: 2.373
Figure 1Illustration of impact of hypothetical measurement error in the example trial16: (A) No measurement error; (B) Classical measurement error; (C) Systematic measurement error; (D) Differential measurement error. The left plots depict every thousandth estimated ordinary least squares regression line (grey lines), the average estimated treatment effect (dashed line), and the true effect (black line). The right plots depict the density distribution of the Wald test‐statistic of the slope of the regression line [Colour figure can be viewed at http://wileyonlinelibrary.com]
Percentage bias, empirical standard error (EmpSE), square root of mean squared error (SqrtMSE), coverage, and average width of CIs of the naive estimator and the corrected estimator for systematic measurement error (θ 0=0 and θ 1=1.05 or θ 1=1.25) for different values of R‐squared and different sample sizes of the calibration data set. Each scenario is based on 10 000 replicates, the value of the estimand is 6.9, based on example trial 1 by Makrides et al16
|
|
|
|
| |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Naive | 5 | 7 | 10 | 15 | 20 | 30 | 40 | 50 | ||||
| Percentage bias (%) | 0.8 | 1.25 | 24.9 | 88.9 | 29 | 3.7 | 2 | 1.6 | 0.9 | 0.7 | 0.4 | |
| 0.8 | 1.05 | 4.9 | 88.9 | 29 | 3.7 | 2 | 1.6 | 0.9 | 0.7 | 0.4 | ||
| 0.5 | 4.9 | 55.3 | 57.5 | −2.4 | 7.6 | 5.8 | 4.3 | 3 | 2 | |||
| 0.2 | 4.9 | 168.2 | −62.6 | 98.8 | 33.4 | −142.2 | −28.3 | 23.9 | 14.6 | |||
| EmpSE | 0.8 | 1.25 | 1.8 | 524.8 | 139.1 | 3 | 1.9 | 1.7 | 1.6 | 1.5 | 1.5 | |
| 0.8 | 1.05 | 1.5 | 524.8 | 139.1 | 3 | 1.9 | 1.7 | 1.6 | 1.5 | 1.5 | ||
| 0.5 | 1.9 | 267 | 329.1 | 83.7 | 14.4 | 11 | 2.5 | 2.3 | 2.1 | |||
| 0.2 | 3 | 1131.2 | 210.8 | 723.2 | 462.2 | 1044.4 | 225.5 | 70.5 | 24.8 | |||
| SqrtMSE | 0.8 | 1.25 | 2.5 | 524.8 | 139.1 | 3.1 | 1.9 | 1.7 | 1.6 | 1.5 | 1.5 | |
| 0.8 | 1.05 | 1.5 | 524.8 | 139.1 | 3.1 | 1.9 | 1.7 | 1.6 | 1.5 | 1.5 | ||
| 0.5 | 1.9 | 267 | 329.1 | 83.7 | 14.4 | 11 | 2.5 | 2.3 | 2.1 | |||
| 0.2 | 3 | 1131.2 | 210.8 | 723.1 | 462.2 | 1044.4 | 225.5 | 70.5 | 24.8 | |||
| Coverage (%) | 0.8 | 1.25 | 83.5 | Zero‐Variance | 70.3 | 74 | 77.4 | 80.3 | 82.8 | 84.4 | 85.3 | 86.3 |
| Delta | 93.8 | 95.3 | 95.7 | 95.9 | 96 | 96 | 95.9 | 95.7 | ||||
| Fieller | ‐ | ‐ | 94.5 | 94.7 | 95 | 95.3 | 95.2 | 95 | ||||
| Bootstrap | 95.9 | 96.1 | 95.5 | 94.9 | 94.8 | 95 | 95.1 | 94.8 | ||||
| 0.8 | 1.05 | 94.6 | Zero‐Variance | 77.8 | 81.3 | 84.4 | 87.1 | 89.2 | 90.9 | 92 | 92.2 | |
| Delta | 92.1 | 93.9 | 94.3 | 94.8 | 95.1 | 95.3 | 95.4 | 95.2 | ||||
| Fieller | ‐ | ‐ | 94.5 | 94.7 | 95 | 95.3 | 95.2 | 95 | ||||
| Bootstrap | 95.9 | 96.1 | 95.5 | 94.9 | 94.8 | 95 | 95.1 | 94.8 | ||||
| 0.5 | 94.8 | Zero‐Variance | 69.1 | 73.5 | 78.1 | 81.7 | 84.5 | 87.5 | 88.7 | 89.9 | ||
| Delta | 89.7 | 92 | 92.9 | 93.9 | 94.3 | 95.2 | 95.4 | 95.3 | ||||
| Fieller | ‐ | ‐ | 94.5 | 95.2 | 95.2 | 95 | 94.8 | 94.9 | ||||
| Bootstrap | 93.9 | 95.9 | 96.3 | 95.8 | 95.4 | 94.8 | 94.8 | 94.8 | ||||
| 0.2 | 95.1 | Zero‐Variance | 57.1 | 64.5 | 71 | 76.8 | 80.3 | 84.3 | 86 | 87.6 | ||
| Delta | 86.8 | 89.7 | 90.9 | 92.2 | 93.5 | 94.4 | 94.6 | 94.9 | ||||
| Fieller | ‐ | ‐ | 89.8 | 93.2 | 94.9 | 95.8 | 95.8 | 95.7 | ||||
| Bootstrap | 88.9 | 93.8 | 95.5 | 96.4 | 96.7 | 96.8 | 96.8 | 96.1 | ||||
| Av. CI width | 0.8 | 1.25 | 6.9 | Zero‐Variance | 30333 | 1141.5 | 5.5 | 4.7 | 4.7 | 4.6 | 4.5 | 4.5 |
| Delta | 40.7 | 13.6 | 8.7 | 7.5 | 7 | 6.5 | 6.3 | 6.1 | ||||
| Fieller | ‐ | ‐ | 11.8 | 8.3 | 7 | 6.4 | 6.1 | 6 | ||||
| Bootstrap | 86.9 | 29.3 | 14.1 | 8.3 | 7.1 | 6.4 | 6.1 | 6 | ||||
| 0.8 | 1.05 | 5.8 | Zero‐Variance | 36110.7 | 1359 | 6.5 | 5.6 | 5.5 | 5.4 | 5.4 | 5.4 | |
| Delta | 35 | 12.2 | 8 | 7 | 6.7 | 6.3 | 6.1 | 6 | ||||
| Fieller | ‐ | ‐ | 11.8 | 8.3 | 7 | 6.4 | 6.1 | 6 | ||||
| Bootstrap | 86.9 | 29.3 | 14.1 | 8.3 | 7.1 | 6.4 | 6.1 | 6 | ||||
| 0.5 | 7.4 | Zero‐Variance | 7228.9 | 9759.5 | 763.1 | 37.5 | 17.8 | 7.7 | 7.3 | 7.1 | ||
| Delta | 58.1 | 43.2 | 21.2 | 12.6 | 11 | 9.3 | 8.7 | 8.4 | ||||
| Fieller | ‐ | ‐ | 67.9 | 63.2 | 25 | 12.4 | 9.8 | 9 | ||||
| Bootstrap | 146.8 | 87.4 | 65.2 | 34.7 | 22.8 | 12.4 | 9.9 | 9 | ||||
| 0.2 | 11.6 | Zero‐Variance | 126830.3 | 11677.5 | 87123.4 | 30709.4 | 324870.7 | 12430.8 | 774.6 | 126.8 | ||
| Delta | 179.3 | 102.5 | 112.7 | 69.9 | 65.7 | 34.1 | 19.7 | 16.6 | ||||
| Fieller | ‐ | ‐ | 92.6 | 95.1 | 72.1 | 82.2 | 60.6 | 59.2 | ||||
| Bootstrap | 176 | 121.9 | 126.2 | 118.7 | 107.7 | 77.6 | 54.8 | 39.7 | ||||
Monte Carlo standard errors of Bias, EmpSE, MSE, and Coverage are subsequently EmpSE , EmpSE ), , and .26
Results of the Fieller method are shown if less than 5% of cases resulted in undefined confidence intervals (see last paragraph of Section 5.2).
Coverage of the true intervention effect and average confidence interval width using regular Wald confidence intervals of the naive effect estimator.
Type‐II error using the naive effect estimator is 0.2%, 2.9%, and 31.6% for (for both θ 1=1.05 and θ 1=1.25), and , respectively. Type‐II error using the corrected effect estimator using the Zero‐Variance, Delta, and Bootstrap method was 0% in all scenarios. For the considered cases, Type‐II error of the corrected effect estimator using the Fieller method was 0.2% and 2.9% for (for both θ 1=1.05 and θ 1=1.25) and , respectively.
Figure 2Estimates of the treatment effect using the naive estimator and corrected estimator for different values of R‐squared (row grids) and different sample sizes of the external calibration set (column grids) under systematic measurement error (θ 1=1.05 (0.2; 0.5; 0.8a) or θ 1=1.25 (0.8b)). Each grid is based on every 10th estimate of a simulation of 10 000 replicates, using an estimand of 6.9 (indicated by the red line), based on the example trial 1 by Makrides et al16 [Colour figure can be viewed at http://wileyonlinelibrary.com]
Figure 3Estimates of θ 1 (ie, slope of the systematic measurement error model) for different values of R‐squared (row grids) and different sample sizes of the external calibration set (column grids). Each grid is based on every 10th estimate of a simulation of 10 000 replicates, using an estimand of 1.05 (indicated by the red line) [Colour figure can be viewed at http://wileyonlinelibrary.com]
Percentage bias, empirical standard error (EmpSE), mean squared error (MSE), square root of mean squared error (SqrtMSE), coverage, and average width of CIs of the corrected estimator for differential measurement error (θ 00=0, θ 10=1, θ 01=0, θ 11=1.05) for different values of R‐squared and different sample sizes of the calibration data set. Each scenario is based on 10 000 replicates, the value of the estimand is 6.9, based on example trial 1 by Makrides et al16
|
|
|
| ||||||
|---|---|---|---|---|---|---|---|---|
| Naive | 10 | 20 | 30 | 40 | 50 | |||
| Percentage bias (%) | 0.8 | 91.8 | 5.2 | 1.2 | −0.4 | −0.2 | −0.1 | |
| 0.5 | 91.8 | −9.7 | 33 | 154.2 | −21.4 | −0.1 | ||
| 0.2 | 91.9 | −319.4 | 152.9 | 193.1 | −21.5 | 2.2 | ||
| EmpSE | 0.8 | 1.4 | 52 | 6.8 | 2.9 | 2.6 | 2.3 | |
| 0.5 | 1.8 | 949.1 | 369.1 | 1080.4 | 142.1 | 4.5 | ||
| 0.2 | 2.9 | 2658 | 8425.8 | 1569.7 | 443.7 | 92.1 | ||
| SqrtMSE | 0.8 | 6.5 | 52 | 6.8 | 2.9 | 2.6 | 2.3 | |
| 0.5 | 6.6 | 949.1 | 369.1 | 1080.4 | 142.1 | 4.5 | ||
| 0.2 | 7 | 2658 | 8425.4 | 1569.7 | 443.7 | 92.1 | ||
| Coverage (%) | 0.8 | 0.7 | Zero‐Variance | 43.8 | 59.9 | 67.9 | 72.7 | 76.8 |
| Delta | 97.1 | 96.6 | 96 | 95.7 | 95.9 | |||
| Bootstrap | 97.9 | 95.7 | 94.7 | 94.5 | 95 | |||
| 0.5 | 6.7 | Zero‐Variance | 30.3 | 43.3 | 50.2 | 55.5 | 61 | |
| Delta | 97.6 | 97.6 | 97.3 | 96.9 | 97 | |||
| Bootstrap | 98.4 | 98 | 96.6 | 95.8 | 95.5 | |||
| 0.2 | 41.1 | Zero‐Variance | 25.7 | 35 | 41.9 | 46.6 | 52.2 | |
| Delta | 98.4 | 99 | 98.9 | 98.9 | 98.9 | |||
| Bootstrap | 99 | 99.6 | 99.2 | 99 | 98.7 | |||
| Av. CI width | 0.8 | 5.7 | Zero‐Variance | 8.2 | 5.9 | 5.7 | 5.7 | 5.6 |
| Delta | 2688.7 | 18.3 | 12.1 | 10.5 | 9.5 | |||
| Bootstrap | 142.6 | 24.3 | 13.1 | 10.7 | 9.5 | |||
| 0.5 | 7.2 | Zero‐Variance | 33 | 17.9 | 30.3 | 10.6 | 7.5 | |
| Delta | 463975.1 | 49493.3 | 660587.5 | 13238 | 18.5 | |||
| Bootstrap | 303.5 | 118.8 | 58.4 | 34.2 | 24 | |||
| 0.2 | 11.4 | Zero‐Variance | 64.6 | 150.5 | 53.1 | 43.1 | 26.8 | |
| Delta | 1219162.5 | 26998502.1 | 486295.4 | 85139.8 | 3407.5 | |||
| Bootstrap | 562.9 | 353.8 | 283.3 | 221.4 | 170.2 | |||
Monte Carlo standard errors of Bias, EmpSE, MSE, and Coverage are subsequently EmpSE , EmpSE/(2 ), , and .26
Coverage of the true intervention effect and average confidence interval width using regular Wald confidence intervals of the naive effect estimator.
Type‐II error of the naive effect estimator was 0%, 0%, and 0.4% for , , and , respectively. Type‐II error using the Zero‐variance, Delta, and Bootstrap method was 0%.
Figure 4Estimates of the treatment effect using the naive estimator and corrected estimator for different values of R‐squared (row grids) and different sample sizes of the external calibration set (column grids) under differential measurement error (θ 00=0, θ 10=1, θ 01=0, θ 11=%1.05). Each grid is based on every 10th estimate of a simulation of 10 000 replicates, using an estimand of 6.9 (indicated by the red line), based on example trial 1 by Makrides et al16 [Colour figure can be viewed at http://wileyonlinelibrary.com]