| Literature DB >> 25473827 |
Yong Ma1, Yinglei Lai2, John M Lachin1.
Abstract
Isotonic regression is a useful tool to investigate the relationship between a quantitative covariate and a time-to-event outcome. The resulting non-parametric model is a monotonic step function of a covariate X and the steps can be viewed as change points in the underlying hazard function. However, when there are too many steps, over-fitting can occur and further reduction is desirable. We propose a reduced isotonic regression approach to allow combination of small neighboring steps that are not statistically significantly different. In this approach, a second stage, the reduction stage, is integrated into the usual monotonic step building algorithm by comparing the adjacent steps using appropriate statistical testing. This is achieved through a modified dynamic programming algorithm. We implemented the approach with the simple exponential distribution and then its extension, the Weibull distribution. Simulation studies are used to investigate the properties of the resulting isotonic functions. We apply this methodology to the Diabetes Control and Complication Trial (DCCT) data set to identify potential change points in the association between HbA1c and the risk of severe hypoglycemia.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25473827 PMCID: PMC4256386 DOI: 10.1371/journal.pone.0113948
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Underlying true model and time-to-event data.
(A) Weibull scale parameter is an isotonic function of . (B) Event or censored time follows the Weibull distribution with a fixed shape parameter () and scale parameter shown in (A). The green open circles represent event times and the blue open diamonds represent censored times.
Sample data following Weibull distributions with and as a step function of .
| True Parameters | |||||||
| Steps | 1 | 2 | 3 | 4 | |||
|
| 0 | 1/3 | 2/3 | 1 | 4/3 | 5/3 | 2 |
|
| 94 | 159 | 166 | 170 | 160 | 184 | 67 |
|
| 1 | 1 | 1 | 2 | 2 | 4 | 5 |
|
| |||||||
|
| 1.09 | 1.04 | 0.95 | 2.02 | 1.91 | 3.87 | 5.31 |
|
| 0.07 | 0.05 | 0.05 | 0.09 | 0.08 | 0.15 | 0.33 |
: covariate with 7 distinct values; : number of observations at each value; : step function of with 4 distinct values; : estimated at each value before the implementation of the reduced isotonic regression algorithm; : standard error of the .
Figure 2Simulation results from 1000 repetitions.
(A) Regular isotonic regression without testing between steps. (B) Reduced isotonic regression with nominal The dark green lines represent the underlying true model.
Number of steps identified with various event numbers and percent censored.
| Event N | Sample N | % Censored mean(range) | Number of Steps | ||||
| 2 | 3 | 4 | 5 | 6 | |||
| ∼500 | 1000 | 50.5(37.4–60.1) | 0 | 86 | 785 | 126 | 3 |
| 800 | 37.9(26.4–47.4) | 0 | 126 | 745 | 122 | 7 | |
| 500 | 0 | 0 | 223 | 680 | 96 | 1 | |
| ∼200 | 400 | 50.7(34.3–62.5) | 0 | 376 | 527 | 95 | 2 |
| 320 | 37.5(24.1–49.1) | 0 | 440 | 485 | 72 | 3 | |
| 200 | 0 | 0 | 509 | 429 | 59 | 3 | |
| ∼100 | 200 | 50.8(30.5–65.5) | 0 | 550 | 384 | 62 | 4 |
| 160 | 37.4(23.1–53.1) | 0 | 577 | 372 | 49 | 2 | |
| 100 | 0 | 0 | 621 | 326 | 51 | 2 | |
| ∼50 | 100 | 50.5(31.0–70.0) | 20 | 694 | 266 | 20 | 0 |
| 80 | 37.4(20.0–60.0) | 8 | 699 | 268 | 25 | 0 | |
| 50 | 0 | 19 | 737 | 229 | 18 | 0 | |
The combination of “Sample N” and “%censore” is used to yield the targeted number of events in the “Event N” column, repeated 1000 times.
Precision of parameter estimates with various event numbers and percent censored.
| Event N | Sample N |
| x = | 0 | 1/3 | 2/3 | 1 | 4/3 | 5/3 | 2 |
|
| 1 | 1 | 1 | 2 | 2 | 4 | 5 | |||
| mean and mean squared error for the estimates of | ||||||||||
| ∼500 | 1000 | 2.01 | 0.99 | 1.00 | 1.01 | 1.99 | 2.01 | 4.02 | 5.00 | |
| 0.0051 | 0.0043 | 0.0027 | 0.0030 | 0.0061 | 0.0065 | 0.040 | 0.13 | |||
| 800 | 2.01 | 0.99 | 1.00 | 1.01 | 1.99 | 2.01 | 4.02 | 4.99 | ||
| 0.0052 | 0.0037 | 0.0022 | 0.0025 | 0.0072 | 0.0073 | 0.044 | 0.16 | |||
| 500 | 2.01 | 0.99 | 1.00 | 1.01 | 1.99 | 2.01 | 4.06 | 4.95 | ||
| 0.0050 | 0.0021 | 0.0014 | 0.0017 | 0.0079 | 0.0085 | 0.072 | 0.22 | |||
| ∼200 | 400 | 2.01 | 0.97 | 1.00 | 1.01 | 2.00 | 2.01 | 4.10 | 4.90 | |
| 0.014 | 0.013 | 0.0081 | 0.0093 | 0.015 | 0.016 | 0.11 | 0.39 | |||
| 320 | 2.02 | 0.99 | 1.00 | 1.01 | 1.99 | 2.02 | 4.10 | 4.85 | ||
| 0.013 | 0.0078 | 0.0053 | 0.0062 | 0.016 | 0.018 | 0.12 | 0.47 | |||
| 200 | 2.03 | 0.99 | 1.00 | 1.01 | 2.00 | 2.02 | 4.14 | 4.86 | ||
| 0.015 | 0.0047 | 0.0035 | 0.0055 | 0.018 | 0.022 | 0.18 | 0.59 | |||
| ∼100 | 200 | 2.04 | 0.97 | 1.01 | 1.03 | 1.99 | 2.04 | 4.16 | 4.83 | |
| 0.030 | 0.023 | 0.016 | 0.022 | 0.039 | 0.043 | 0.22 | 0.66 | |||
| 160 | 2.05 | 0.98 | 1.01 | 1.03 | 1.99 | 2.04 | 4.20 | 4.82 | ||
| 0.030 | 0.019 | 0.012 | 0.016 | 0.034 | 0.042 | 0.27 | 0.65 | |||
| 100 | 2.06 | 0.98 | 1.00 | 1.03 | 2.01 | 2.05 | 4.18 | 4.89 | ||
| 0.032 | 0.0088 | 0.0070 | 0.014 | 0.041 | 0.055 | 0.32 | 1.28 | |||
| ∼50 | 100 | 2.07 | 0.97 | 1.00 | 1.08 | 1.96 | 2.07 | 4.17 | 4.82 | |
| 0.069 | 0.043 | 0.044 | 0.085 | 0.11 | 0.13 | 0.35 | 1.36 | |||
| 80 | 2.08 | 0.97 | 1.02 | 1.07 | 1.98 | 2.10 | 4.23 | 4.90 | ||
| 0.072 | 0.032 | 0.027 | 0.057 | 0.11 | 0.14 | 0.53 | 1.65 | |||
| 50 | 2.14 | 0.99 | 1.02 | 1.07 | 1.98 | 2.13 | 4.24 | 4.99 | ||
| 0.091 | 0.021 | 0.020 | 0.050 | 0.11 | 0.24 | 0.79 | 2.60 | |||
For the mean and mean squared error columns, the first row is the mean and the second row is the mean squared error. Percent and range of censoring is the same as shown in Table 2.
Figure 3Modeling HbA1c and risk of severe hypoglycemia.
(A) Regular isotonic regression without testing between steps. (B) Reduced isotonic regression with nominal (C) Cox-Snell residual plot of Model B. Dotted lines in (A) and (B) represent 95% Confidence Intervals of .