| Literature DB >> 29736434 |
Ming Zhu1, Yijian Huang2.
Abstract
In many medical research studies, survival time is typically the primary outcome of interest. The Cox proportional hazards model is the most popular method to investigate the relationship between covariates and possibly right-censored survival time. However, in many clinical trials, the true covariates may not always be accurately measured due to natural biological fluctuation or instrument error. It is well know that for regression analysis in general, naively using mismeasured covariates in conventional inference procedures may incur substantial estimation bias. In the presence of covariate measurement error, several functional modeling methods have been proposed under the situation where the distribution of the measurement error is known. Among them are parametric corrected score and conditional score. Although both methods are consistent, each suffers from severe problem of multiple roots or absence of appropriate root when the measurement error is substantial. The problem persists even when the sample size is practically large. We conduct a detailed investigation on the pathological behaviors of parametric corrected score and propose an approach of incorporating additional estimating functions to remedy these pathological behaviors. The estimation and inference are then accomplished by means of quadratic inference function. Extensive simulation studies are conducted to evaluate the performance of proposed method.Entities:
Keywords: Cox proportional hazards model; Estimating equation; Functional modeling; Measurement error; Survival analysis
Year: 2015 PMID: 29736434 PMCID: PMC5935832 DOI: 10.1016/j.conctc.2015.08.001
Source DB: PubMed Journal: Contemp Clin Trials Commun ISSN: 2451-8654
Fig. 1Observed root patterns of the parametric corrected score η(b). The true β is −1 and these two corrected curves correspond to the same profile score (with true covariates). Portion of a corrected curve is thickened to indicate negative derivative.
Prevalence (%) of single-root pattern for the parametric corrected score with E(X) = 0, Var(X) = 1, β = −1, and ε ∼ Normal(0,1).
| Censoring rate | Distribution of X | Size | |||
|---|---|---|---|---|---|
| 100 | 200 | 400 | 800 | ||
| 20% | Normal | 68.0 | 52.1 | 38.0 | 20.3 |
| Modified chi-square | 65.8 | 57.9 | 52.3 | 40.1 | |
| Uniform | 64.5 | 51.9 | 37.6 | 19.6 | |
| 40% | Normal | 62.4 | 49.0 | 36.3 | 17.8 |
| Modified chi-square | 64.4 | 58.0 | 50.9 | 43.1 | |
| Uniform | 60.4 | 49.5 | 37.1 | 19.3 | |
| 60% | Normal | 61.0 | 46.5 | 32.1 | 16.0 |
| Modified chi-square | 62.7 | 57.3 | 50.9 | 42.6 | |
| Uniform | 58.5 | 47.9 | 36.8 | 21.2 | |
Fig. 2Observed root patterns of the conditional score estimating function. The true β is −1.
Prevalence (%) of root-finding failure for the conditional score with E(X) = 0, Var(X) = 1, β = −1, and ε∼ Normal(0,1).
| Censoring rate | Distribution of X | Size | |||
|---|---|---|---|---|---|
| 100 | 200 | 400 | 800 | ||
| 20% | Normal | 2.7 | 1.5 | .6 | .3 |
| Modified chi-square | 14.4 | 12.2 | 15.1 | 13.9 | |
| Uniform | 3.1 | 1.6 | .6 | .1 | |
| 40% | Normal | 4.6 | 2.3 | .8 | .2 |
| Modified chi-square | 20.0 | 16.7 | 20.9 | 18.7 | |
| Uniform | 3.1 | 2.1 | .3 | .5 | |
| 60% | Normal | 4.7 | 2.5 | .7 | .3 |
| Modified chi-square | 24.6 | 21.7 | 25.7 | 25.0 | |
| Uniform | 5.7 | 2.7 | 1.4 | 1.0 | |
Fig. 3Parametric corrected score and additional estimation function based on Equation (7) for a single-covariate model with β = −1.
Fig. 4Quadratic inference functions for the two datasets in Fig. 1.
Simulation summary statistics for the single-covariate models with 20% censoring rate: Ideal, naive(NV), regression calibration (RC), re-defined conditional score (ConS), re-defined parametric corrected score (CS), and first k augmented parametric corrected score (ACS: k), k = 2, 3, 4.
| Size | Ideal | NV | RC | ConS | CS | ACS: 2 | ACS: 3 | ACS: 4 | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| B | SD | B | SD | B | SD | F | B | SD | F | B | SD | B | SD | B | SD | B | SD | |
| Normal covariate | ||||||||||||||||||
| 100 | −15 | 154 | 593 | 99 | 148 | 254 | 2.7 | −376 | 920 | 68.0 | −11 | 287 | 46 | 431 | −43 | 414 | −103 | 381 |
| 200 | −8 | 104 | 602 | 65 | 189 | 163 | 1.5 | −251 | 719 | 52.1 | −89 | 273 | −60 | 330 | −130 | 348 | −172 | 351 |
| 400 | −5 | 73 | 601 | 48 | 195 | 113 | .6 | −202 | 575 | 38.0 | −146 | 282 | −99 | 287 | −153 | 313 | −184 | 327 |
| 800 | 1 | 49 | 604 | 32 | 206 | 76 | .3 | −99 | 351 | 20.0 | −129 | 267 | −87 | 246 | −83 | 226 | −123 | 256 |
| 1600 | 1 | 36 | 605 | 23 | 208 | 53 | 0 | −43 | 200 | 5.6 | −75 | 206 | −45 | 163 | −60 | 178 | −62 | 166 |
| 100 | −25 | 204 | 639 | 91 | 242 | 247 | 14.4 | −485 | 1221 | 65.8 | 145 | 241 | 110 | 387 | 41 | 409 | 9 | 409 |
| 200 | −5 | 136 | 640 | 63 | 267 | 155 | 12.2 | −396 | 1022 | 57.9 | 47 | 241 | −10 | 351 | −81 | 381 | −128 | 381 |
| 400 | −6 | 98 | 642 | 43 | 280 | 101 | 15.1 | −329 | 895 | 52.3 | −21 | 237 | −10 | 285 | −76 | 306 | −122 | 323 |
| 800 | −3 | 70 | 641 | 32 | 281 | 72 | 13.9 | −249 | 740 | 40.1 | −70 | 245 | −34 | 241 | −81 | 237 | −95 | 241 |
| 1600 | 0 | 47 | 644 | 21 | 289 | 47 | 12.5 | −139 | 486 | 30.3 | −82 | 242 | −9 | 152 | −29 | 133 | −42 | 139 |
Note: F: root-finding failure (%); B: mean bias (×103); SD: standard deviation(×103).
Simulation summary statistics for the single-covariate models with 60% censoring rate: Ideal, naive(NV), regression calibration (RC), re-defined conditional score (ConS), re-defined parametric corrected score (CS), and first k augmented parametric corrected score (ACS: k), k = 2, 3, 4.
| Size | Ideal | NV | RC | ConS | CS | ACS: 2 | ACS: 3 | ACS: 4 | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| B | SD | B | SD | B | SD | F | B | SD | F | B | SD | B | SD | B | SD | B | SD | |
| Normal covariate | ||||||||||||||||||
| 100 | −24 | 206 | 550 | 134 | 57 | 335 | 4.7 | −429 | 1014 | 61.0 | −27 | 301 | 94 | 478 | −24 | 502 | −90 | 551 |
| 200 | −15 | 142 | 562 | 88 | 109 | 206 | 2.5 | −272 | 738 | 46.5 | −101 | 303 | 10 | 298 | −99 | 347 | −188 | 395 |
| 400 | −8 | 97 | 563 | 63 | 119 | 142 | .7 | −197 | 578 | 32.1 | −139 | 305 | −27 | 235 | −96 | 257 | −152 | 294 |
| 800 | −2 | 65 | 567 | 42 | 132 | 96 | .3 | −100 | 379 | 16.0 | −113 | 271 | −33 | 189 | −72 | 199 | −103 | 223 |
| 1600 | 1 | 46 | 567 | 30 | 133 | 69 | 0 | −51 | 238 | 4.9 | −68 | 205 | −17 | 128 | −38 | 125 | −57 | 148 |
| 100 | −66 | 351 | 687 | 124 | 351 | 288 | 24.6 | −259 | 1154 | 62.7 | 247 | 297 | 196 | 459 | 219 | 531 | 260 | 568 |
| 200 | −24 | 239 | 691 | 84 | 370 | 190 | 21.7 | −291 | 1042 | 57.3 | 133 | 287 | 81 | 359 | 44 | 429 | 54 | 496 |
| 400 | −16 | 159 | 693 | 57 | 382 | 124 | 25.7 | −202 | 841 | 50.9 | 57 | 268 | 37 | 322 | −22 | 368 | −52 | 423 |
| 800 | −9 | 112 | 693 | 40 | 385 | 86 | 25.0 | −172 | 684 | 42.6 | −8 | 263 | 1 | 280 | −55 | 276 | −94 | 306 |
| 1600 | 0 | 78 | 697 | 28 | 394 | 60 | 21.5 | −77 | 412 | 34.0 | −29 | 258 | −3 | 232 | −29 | 194 | −57 | 207 |
Note: Same as in that of Table 3.
Fig. 5Quantile–quantile plots for β in the single-covariate models with 20% censoring rate, where β = −1. Red, yellow, green, blue, and black correspond to sample sizes 100, 200, 400, 800, and 1600.
Fig. 6Quantile–quantile plots for β in the single-covariate models with 60% censoring rate, where β = −1. Red, yellow, green, blue, and black correspond to sample sizes 100, 200, 400, 800, and 1600.
Simulation summary statistics for the double-covariate models with 20% censoring rate: Ideal, naive(NV), regression calibration (RC), re-defined conditional score (ConS), re-defined parametric corrected score (CS), and augmented parametric corrected score (ACS: 1/3, ACS: 2/3, ACS: 2).
| Size | Ideal | NV | RC | ConS | CS | ACS: 1/3 | ACS: 2/3 | ACS: 2 | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| B | SD | B | SD | B | SD | F | B | SD | F | B | SD | B | SD | B | SD | B | SD | ||
| 100 | −23 | 175 | 637 | 101 | 66 | 381 | 57.5 | 215 | 348 | 72.0 | 267 | 322 | 170 | 532 | 245 | 514 | 308 | 538 | |
| 23 | 171 | −400 | 142 | −113 | 246 | −67 | 336 | −100 | 327 | −60 | 456 | −97 | 404 | −142 | 399 | ||||
| 200 | −13 | 114 | 648 | 66 | 136 | 224 | 46.9 | 155 | 315 | 62.8 | 186 | 294 | −18 | 382 | 42 | 375 | 92 | 339 | |
| 16 | 117 | −412 | 101 | −154 | 161 | −35 | 310 | −47 | 286 | 54 | 349 | 18 | 327 | −20 | 276 | ||||
| 400 | −5 | 82 | 649 | 48 | 168 | 144 | 33.4 | 65 | 297 | 46.2 | 71 | 280 | −89 | 301 | −56 | 277 | −28 | 255 | |
| 4 | 80 | −416 | 68 | −177 | 103 | 5 | 257 | 15 | 243 | 85 | 269 | 60 | 233 | 40 | 210 | ||||
| 800 | −2 | 55 | 651 | 33 | 181 | 93 | 20.0 | 22 | 224 | 30.5 | −3 | 252 | −99 | 282 | −80 | 262 | −56 | 220 | |
| 1 | 56 | −420 | 52 | −185 | 74 | 16 | 186 | 44 | 203 | 88 | 255 | 72 | 230 | 52 | 186 | ||||
| 1600 | −2 | 40 | 652 | 23 | 185 | 66 | 8.0 | −13 | 178 | 10.5 | −43 | 207 | −71 | 219 | −67 | 204 | −61 | 190 | |
| 1 | 39 | −419 | 34 | −186 | 49 | 21 | 134 | 45 | 155 | 59 | 180 | 57 | 163 | 51 | 145 | ||||
F: root-finding failure (%); B: mean bias (×103); SD: standard deviation(×103).
ACS: 1/3 and ACS: 2/3 correspond to the additional estimating functions containing the (1,1) element and the first row elements of matrix (7), respectively. ACS: 2 is the second order augmented parametric corrected score.
Simulation summary statistics for the double-covariate models with 60% censoring rate: Ideal, naive(NV), regression calibration (RC), re-defined conditional score (ConS), re-defined parametric corrected score (CS), and augmented parametric corrected score (ACS: 1/3, ACS: 2/3, ACS: 2).
| Size | Ideal | NV | RC | ConS | CS | ACS: 1/3 | ACS: 2/3 | ACS: 2 | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| B | SD | B | SD | B | SD | F | B | SD | F | B | SD | B | SD | B | SD | B | SD | ||
| 100 | −32 | 242 | 610 | 140 | −13 | 513 | 43.3 | 96 | 428 | 64.8 | 239 | 320 | 225 | 668 | 326 | 666 | 392 | 660 | |
| 33 | 245 | −352 | 206 | −40 | 336 | −27 | 408 | −72 | 362 | −50 | 536 | −104 | 500 | 147 | 465 | ||||
| 200 | −20 | 160 | 617 | 95 | 59 | 286 | 36.1 | 29 | 393 | 57.8 | 145 | 300 | 14 | 387 | 93 | 360 | 140 | 344 | |
| 21 | 161 | −362 | 138 | −81 | 209 | 24 | 344 | −8 | 301 | 48 | 369 | −7 | 310 | −41 | 264 | ||||
| 400 | −6 | 109 | 619 | 65 | 97 | 182 | 21.5 | −2 | 293 | 40.4 | 45 | 279 | −65 | 321 | −9 | 273 | 30 | 241 | |
| 6 | 111 | −366 | 97 | −106 | 137 | 31 | 225 | 36 | 254 | 79 | 301 | 35 | 246 | 6 | 197 | ||||
| 800 | −1 | 72 | 621 | 45 | 112 | 120 | 10.8 | −23 | 249 | 25.7 | −16 | 257 | −73 | 267 | −32 | 213 | −9 | 192 | |
| 2 | 76 | −369 | 69 | −115 | 94 | 34 | 198 | 55 | 214 | 72 | 236 | 40 | 189 | 23 | 160 | ||||
| 1600 | −2 | 53 | 622 | 33 | 114 | 87 | 7.6 | −24 | 186 | 11.3 | −46 | 212 | −55 | 193 | −40 | 171 | −29 | 162 | |
| 0 | 53 | −371 | 46 | −117 | 65 | 28 | 142 | 47 | 161 | 46 | 162 | 36 | 140 | 26 | 124 | ||||
Note: Same as in that of Table 5.
Coverage of 95% confidence interval for the second order augmented parametric corrected score with 20% censoring rate. C (chi-square distribution), BC (bootstrap calibration) and W (Wald-type) indicate the type of confidence interval.
| Size | 100 | 200 | 400 | 800 | 1600 | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| C | BC | W | C | BC | W | C | BC | W | C | BC | W | C | BC | W | |
| Single-covariate: normal covariate | |||||||||||||||
| 90.2 | 96.8 | 87.0 | 91.7 | 96.6 | 91.1 | 87.1 | 96.6 | 93.8 | 87.6 | 97.1 | 94.8 | 91.8 | 95.6 | 94.9 | |
| Single-covariate: | |||||||||||||||
| 82.0 | 92.9 | 76.8 | 88.8 | 92.2 | 84.3 | 87.6 | 94.3 | 85.6 | 90.0 | 94.6 | 89.2 | 91.6 | 94.8 | 91.8 | |
| Double-covariate | |||||||||||||||
| 75.8 | 94.5 | 78.4 | 86.3 | 95.4 | 87.3 | 88.3 | 96.3 | 94.0 | 89.7 | 96.7 | 95.8 | 89.4 | 96.2 | 97.2 | |
| 79.2 | 93.0 | 83.9 | 86.8 | 93.8 | 92.3 | 88.1 | 92.3 | 95.6 | 86.7 | 93.5 | 96.2 | 89.4 | 95.5 | 97.6 | |
Coverage of 95% confidence interval for the second order augmented parametric corrected score with 60% censoring rate. C (chi-square distribution), BC (bootstrap calibration) and W (Wald-type) indicate the type of confidence interval.
| Size | 100 | 200 | 400 | 800 | 1600 | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| C | BC | W | C | BC | W | C | BC | W | C | BC | W | C | BC | W | |
| Single-covariate: normal covariate | |||||||||||||||
| 89.5 | 97.6 | 86.8 | 94.2 | 96.7 | 92.0 | 91.5 | 96.8 | 94.7 | 93.6 | 96.2 | 96.3 | 98.5 | 95.5 | 97.5 | |
| Single-covariate: | |||||||||||||||
| 87.5 | 93.7 | 73.9 | 89.7 | 92.4 | 79.1 | 88.4 | 93.2 | 80.4 | 90.4 | 93.9 | 85.8 | 91.8 | 94.3 | 88.6 | |
| Double-covariate | |||||||||||||||
| 92.0 | 92.6 | 80.7 | 93.8 | 96.6 | 89.6 | 92.6 | 94.9 | 94.1 | 93.2 | 96.9 | 95.8 | 92.6 | 96.2 | 98.0 | |
| 92.0 | 93.9 | 87.3 | 93.5 | 94.5 | 93.7 | 92.3 | 92.7 | 96.0 | 91.5 | 93.4 | 96.3 | 92.7 | 95.8 | 97.1 | |
Comparison of regression coefficient estimators in the ACTG 175 data.
| log(CD4) | ZDV + ddl | ZDV + ddC | ddl | |||||
|---|---|---|---|---|---|---|---|---|
| Est | Var | Est | Var | Est | Var | Est | Var | |
| NV | −1.838 | .1183 | −.652 | .0881 | −.895 | .1006 | −.598 | .0802 |
| ConS | −2.172 | .1698 | −.659 | .0916 | −.892 | .1024 | −.604 | .0835 |
| CS | −2.177 | .1678 | −.659 | .0900 | −.892 | .1012 | −.604 | .0819 |
| ACS | −2.177 | .1422 | −.657 | .0991 | −.876 | .1028 | −.596 | .0827 |
| .1678 | .0905 | .0964 | .0811 | |||||
Note: For proposed estimator, the first row of variance estimator is obtained by inverting hypothese testing statistics with bootstrap critical value; the second row of values is from sandwich variance estimator. Est: Estimated coefficient; Var: Variance.