| Literature DB >> 30730899 |
Katya L Masconi1,2, Tandi E Matsha3, Rajiv T Erasmus1, Andre P Kengne2,4.
Abstract
BACKGROUND: Prediction model updating methods are aimed at improving the prediction performance of a model in a new setting. This study sought to critically assess the impact of updating techniques when applying existent prevalent diabetes prediction models to a population different to the one in which they were developed, evaluating the performance in the mixed-ancestry population of South Africa.Entities:
Mesh:
Year: 2019 PMID: 30730899 PMCID: PMC6366743 DOI: 10.1371/journal.pone.0211528
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Demographic characteristics of the Bellville South cohort, by the training and test datasets.
| Variables | Overall (1083) | Training dataset | Test dataset | P-value |
|---|---|---|---|---|
| Prevalent undiagnosed diabetes (Yes, %) | 162 (15.0) | 118 (15.6) | 44 (13.5) | 0.354 |
| Sex (Male, %) | 251 (23.2) | 169 (22.3) | 82 (25.2) | 0.174 |
| Age (mean years, SD) | 51.9 (14.9) | 52.0 (14.9) | 51.6 (15.1) | 0.691 |
| Body mass index (mean kg/m2, SD) | 29.7 (7.0) | 29.6 (6.9) | 30.0 (7.4) | 0.503 |
| Waist circumference (mean cm, SD) | 95.8 (15.3) | 95.8 (15.2) | 95.9 (15.6) | 0.919 |
| Systolic blood pressure (mean mmHg, SD) | 124.3 (20.0) | 123.6 (20.2) | 126.1 (19.4) | 0.056 |
| Diastolic blood pressure (mean mmHg, SD) | 76.0 (12.7) | 75.4 (12.8) | 77.3 (12.4) | 0.026 |
| Use of hypertensive medication (Yes, %) | 374 (34.5) | 268 (35.4) | 106 (32.6) | 0.142 |
| Hypertensive status (Yes, %) | 817 (75.4) | 570 (75.2) | 247 (76.0) | 0.755 |
| Use of lipid-lowering medication (Yes, %) | 40 (3.7) | 30 (4.0) | 10 (3.1) | 0.579 |
| Use of corticosteroids (Yes, %) | 12 (1.1) | 5 (0.7) | 7 (2.2) | 0.062 |
| Mother having diabetes (Yes, %) | 124 (11.5) | 87 (11.5) | 37 (11.4) | 0.999 |
| Father having diabetes (Yes, %) | 61 (5.6) | 38 (5.0) | 23 (7.1) | 0.195 |
| Sister having diabetes (Yes, %) | 103 (9.5) | 71 (9.4) | 32 (9.9) | 0.873 |
| Brother having diabetes (Yes, %) | 67 (6.2) | 49 (6.5) | 18 (5.5) | 0.631 |
| Alcohol use (Current, %) | 272 (25.1) | 186 (24.5) | 86 (26.5) | 0.384 |
| Smoking status (Current, %) | 433 (40.0) | 304 (40.1) | 129 (39.7) | 0.847 |
| Education (High School, %) | 131 (12.1) | 94 (12.4) | 37 (11.4) | 0.663 |
SD, standard deviation
Estimated parameters of the updating methods 1–3.
| Method 1 | Method 2 | Method 3 | ||
|---|---|---|---|---|
| Cambridge Diabetes Risk model | Correction factor (1) / Calibration intercept (2–3) | -0.883 | - 1.617 | - |
| Calibration slope used for linear predictor correction | - | 0.263 | - | |
| Kuwaiti Risk model | Correction factor (1) / Calibration intercept (2–3) | 0.304 | -1.008 | - |
| Calibration slope used for linear predictor correction | - | 0.342 | - | |
| Omani Diabetes Risk model | Correction factor (1) / Calibration intercept (2–3) | -0.383 | -1.264 | -0.837 |
| Calibration slope used for linear predictor correction | - | 0.402 | 0.950 | |
| Deviation from recalibration regression coefficient: WC ≥ 94cm in men and ≥ 80cm in women | - | - | 0.890 | |
| Parent or sibling history of diabetes | - | - | -0.253 | |
| BMI ≥ 30 kg/m2 | - | - | 0.115 | |
| Rotterdam Predictive model | Correction factor (1) / Calibration intercept (2–3) | 0.593 | -0.595 | 0.391 |
| Calibration slope used for linear predictor correction | - | 0.541 | 1.134 | |
| Deviation from recalibration regression coefficient: Male gender | - | - | -0.783 | |
| Simplified Finnish Diabetes Risk model | Correction factor (1) / Calibration intercept (2–3) | 1.212 | -0.639 | -0.256 |
| Calibration slope used for linear predictor correction | - | 0.388 | 0.874 | |
| Deviation from recalibration regression coefficient: 45 years ≤ age ≤ 54 years | - | - | -0.390 | |
| Prescribed antihypertensive medication | - | - | 0.330 |
Method 1: correction factor updated intercept; Method 2: both the intercept and the regression coefficients of the variables using the intercept and calibration slope from Method 1; Method 3: Extra adjustment of predictors with a different effect in the updating set compared to the derivation set, after recalibration by Method 2
Intercept and regression coefficients of the updated models per existing model updated.
| Method 0 | Method 4 | Method 5 | Method 0 | Method 4 | Method 5 | ||||
|---|---|---|---|---|---|---|---|---|---|
| Cambridge | Intercept | -6.322 | -3.419 | -0.905 | Rotterdam | Intercept | -3.020 | -2.653 | -1.435 |
| Diabetes | Female gender | -0.879 | -0.540 | - | Predictive | Age per 5 years: 55 years to >75 | 0.190 | 0.236 | - |
| Risk model | Use of antihypertensive medication | 1.222 | 0.587 | - | model | Male gender | 0.460 | 0.184 | - |
| Prescribed steroids | 2.191 | -0.657 | - | Use of antihypertensive medication | 0.420 | 0.664 | - | ||
| Age | 0.063 | 0.030 | - | BMI ≥ 30 kg/m2 | 0.510 | 0.767 | - | ||
| 25 kg/m2 ≤ BMI ≥ 27.49 kg/m2 | 0.699 | 0.291 | - | Linear predictor | - | - | 0.857 | ||
| 27.5 kg/m2 ≤ BMI ≤ 29.99 kg/m2 | 1.970 | 0.002 | - | Parent and sibling has diabetes | - | - | 1.136 | ||
| BMI ≥ 30 kg/m2 | 2.518 | 0.695 | - | WC ≥ 94cm (M) and ≥ 80cm (W) | - | - | 1.290 | ||
| Parent or sibling has diabetes | 0.728 | -0.142 | - | Simplified | Intercept | -5.514 | -4.080 | -1.627 | |
| Parent and sibling has diabetes | 0.753 | 0.989 | - | Finnish | 45 years ≤ age ≤ 54 years | 0.628 | 0.108 | - | |
| Ex-smoker | -0.218 | -11.312 | - | Diabetes | 55 years ≤ age ≤ 64 years | 0.892 | 0.692 | - | |
| Current smoker | 0.855 | -0.054 | - | Risk model | 25 kg/m2 ≤ BMI < 30 kg/m2 | 0.165 | 2.569 | - | |
| Linear predictor | - | - | 0.865 | BMI > 30 kg/m2 | 1.096 | 3.075 | - | ||
| WC ≥ 94cm (M) and ≥ 80cm (W) | - | - | 0.789 | 94cm ≤ WC < 102cm in men | 0.857 | -0.337 | - | ||
| Kuwaiti | Intercept | -5.018 | -3.576 | -1.204 | WC ≥ 102cm (M) and ≥ 88cm (W) | 1.350 | -1.053 | - | |
| Risk model | Sibling history of diabetes | 0.979 | 0.680 | - | Use of antihypertensive medication | 0.711 | 1.077 | - | |
| Use of antihypertensive medication | 0.978 | 0.755 | - | History of high blood glucose | - | - | - | ||
| Age ≥ 35 years | 1.315 | 1.029 | - | Linear predictor | - | - | 0.885 | ||
| Waist circumference > 100 cm | 1.930 | 0.992 | - | SBP | - | - | 0.011 | ||
| Linear predictor | - | - | 0.955 | Parent and sibling has diabetes | - | - | 0.956 | ||
| SBP | - | - | 0.009 | ||||||
| Omani | Intercept | -4.700 | -4.716 | - | Developed | Intercept | -5.136 | - | - |
| Diabetes | 40 years ≤ age ≤ 59 years | 1.800 | 0.941 | - | model | Age in years | 0.028 | - | - |
| Risk model | Age ≥ 60 years | 2.300 | 1.544 | - | Parent and sibling has diabetes | 1.058 | - | - | |
| WC ≥ 94cm (M) and ≥ 80cm (W) | 0.380 | 0.088 | - | WC ≥ 94cm (M) and ≥ 80cm (W) | 1.154 | - | - | ||
| 25 kg/m2 ≤ BMI < 30 kg/m2 | 0.540 | 1.880 | - | Use of antihypertensive medication | 0.516 | - | - | ||
| BMI ≥ 30 kg/m2 | 0.690 | 1.967 | - | SBP | 0.004 | - | - | ||
| Parental or sibling history of diabetes | 1.900 | 0.447 | - | Use of lipid lowering medication | -0.146 | - | - | ||
| SBP≥140 and/or DBP≥90 | 0.730 | 0.529 | - | BMI > 30 kg/m2 | 0.281 | - | - |
Method 0: original risk model; Method 1: correction factor updated intercept; Method 2: both the intercept and the regression coefficients of the variables using the intercept and calibration slope from Method 1; Method 3: Extra adjustment of predictors with a different effect in the updating set compared to the derivation set, after recalibration by Method 2; Method 4: complete re-estimation of the intercept and the regression coefficients, fitting the variables from the original models in the validation dataset; Developed model: stepwise regression with shrinkage to develop to new model.
P-values
* <0.05
** <0.01
*** <0.001.
† assumed to be 0 for all participants due to the nature of this study.
Performance of the prevalent diabetes risk prediction models across updating methods 0–3.
| Models | Reference method | Intercept adjustment (Method 1) | Logistic calibration (Method 2) | Revision (Method 3) | ||||
|---|---|---|---|---|---|---|---|---|
| Train | Test | Train | Test | Train | Test | |||
| Cambridge | E/O (95% CI) | 0.48 (0.41; 0.56) | 0.79 (0.66; 0.95) | 0.67 (0.50–0.90) | 1.00 (0.84; 1.20) | 0.85 (0.63–1.14) | - | - |
| Diabetes | Brier score | 0.181 | 0.139 | 0.147 | 0.122 | 0.118 | - | - |
| Risk | Yates slope | 0.174 | 0.153 | 0.100 | 0.079 | 0.049 | - | - |
| Model | C-statistic (95% CI) | 0.69 (0.65–0.73) | 0.72 (0.67–0.76) | 0.62 (0.53–0.71) | 0.72 (0.67–0.76) | 0.62 (0.53–0.71) | - | - |
| Kuwaiti Risk | E/O (95% CI) | 1.27 (1.09; 1.48) | 1.05 (0.88; 1.26) | 0.97 (0.72–1.30) | 1.00 (0.84; 1.20) | 0.91 (0.67–1.22) | - | - |
| model | Brier score | 0.122 | 0.123 | 0.123 | 0.120 | 0.117 | - | - |
| Yates slope | 0.097 | 0.127 | 0.078 | 0.093 | 0.055 | - | - | |
| C-statistic (95% CI) | 0.70 (0.66–0.74) | 0.73 (0.68–0.78) | 0.62 (0.53–0.71) | 0.73 (0.68–0.78) | 0.62 (0.53–0.71) | - | - | |
| Omani | E/O (95% CI) | 0.70 (0.60; 0.82) | 0.93 (0.78; 1.12) | 0.80 (0.59–1.07) | 1.00 (0.84; 1.20) | 0.86 (0.64–1.16) | 1.00 (0.84; 1.20) | 1.01 (0.75–1.36) |
| Diabetes | Brier score | 0.142 | 0.135 | 0.127 | 0.125 | 0.114 | 0.123 | 0.114 |
| Risk model | Yates slope | 0.110 | 0.093 | 0.092 | 0.052 | 0.049 | 0.063 | 0.036 |
| C-statistic (95% CI) | 0.67 (0.63–0.71) | 0.68 (0.63–0.73) | 0.65 (0.57–0.73) | 0.68 (0.63–0.73) | 0.65 (0.57–0.73) | 0.70 (0.66–0.70) | 0.63 (0.55–0.72) | |
| Rotterdam | E/O (95% CI) | 1.62 (1.38; 1.88) | 1.01 (0.85; 1.22) | 0.88 (0.65–1.18) | 1.00 (0.84; 1.20) | 0.86 (0.64–1.16) | 1.00 (0.84; 1.20) | 1.03 (0.76–1.38) |
| Predictive | Brier score | 0.126 | 0.125 | 0.118 | 0.125 | 0.119 | 0.122 | 0.114 |
| model | Yates slope | 0.024 | 0.043 | 0.017 | 0.055 | 0.022 | 0.071 | 0.020 |
| C-statistic (95% CI) | 0.66 (0.61–0.70) | 0.68 (0.63–0.74) | 0.58 (0.49–0.67) | 0.68 (0.63–0.74) | 0.58 (0.49–0.67) | 0.69 (0.64–0.75) | 0.62 (0.53–0.72) | |
| Simplified | E/O (95% CI) | 2.92 (2.51; 3.41) | 1.09 (0.91; 1.31) | 1.00 (0.74–1.34) | 1.00 (0.84; 1.20) | 0.90 (0.67–1.21) | 1.00 (0.84; 1.20) | 0.97 (0.72–1.31) |
| Finnish | Brier score | 0.133 | 0.128 | 0.119 | 0.125 | 0.116 | 0.124 | 0.115 |
| Diabetes | Yates slope | 0.026 | 0.069 | 0.050 | 0.051 | 0.035 | 0.061 | 0.023 |
| Risk model | C-statistic (95% CI) | 0.66 (0.62–0.70) | 0.68 (0.64–0.73) | 0.61 (0.52–0.70) | 0.68 (0.64–0.73) | 0.61 (0.52–0.70) | 0.79 (0.66–0.75) | 0.59 (0.50–0.68) |
Performance of the prevalent diabetes risk prediction models across updating methods 4 and 5.
| Models | Re-estimation (Method 4) | Addition of new variables (Method 5) | |||
|---|---|---|---|---|---|
| Train | Test | Train | Test | ||
| Cambridge | E/O (95% CI) | 1.00 (0.84; 1.20) | 0.89 (0.67–1.20) | 1.00 (0.84; 1.20) | 1.00 (0.75–1.35) |
| Diabetes | Brier score | 0.127 | 0.123 | 0.121 | 0.114 |
| Risk | Yates slope | 0.070 | 0.030 | 0.081 | 0.029 |
| Model | C-statistic (95% CI) | 0.66 (0.61–0.72) | 0.56 (0.45–0.66) | 0.72 (0.68–0.77) | 0.63 (0.54–0.72) |
| Kuwaiti Risk | E/O (95% CI) | 1.00 (0.84; 1.20) | 0.91 (0.67–1.22) | 1.00 (0.84; 1.20) | 0.98 (0.73–1.31) |
| model | Brier score | 0.119 | 0.116 | 0.118 | 0.114 |
| Yates slope | 0.090 | 0.054 | 0.098 | 0.031 | |
| C-statistic (95% CI) | 0.73 (0.68–0.73) | 0.62 (0.53–0.70) | 0.74 (0.69–0.74) | 0.61 (0.52–0.70) | |
| Omani | E/O (95% CI) | 1.00 (0.84; 1.20) | 0.88 (0.66–1.19) | - | - |
| Diabetes | Brier score | 0.126 | 0.119 | - | - |
| Risk model | Yates slope | 0.070 | 0.045 | - | - |
| C-statistic (95% CI) | 0.69 (0.64–0.74) | 0.61 (0.52–0.70) | - | - | |
| Rotterdam | E/O (95% CI) | 1.00 (0.84; 1.20) | 0.87 (0.65–1.17) | 1.00 (0.84; 1.20) | 0.99 (0.74–1.33) |
| Predictive | Brier score | 0.122 | 0.118 | 0.120 | 0.115 |
| model | Yates slope | 0.067 | 0.033 | 0.087 | 0.034 |
| C-statistic (95% CI) | 0.70 (0.65–0.70) | 0.60 (0.51–0.69) | 0.73 (0.68–0.78) | 0.62 (0.53–0.72) | |
| Simplified | E/O (95% CI) | 1.00 (0.84; 1.20) | 0.88 (0.66–1.18) | 1.00 (0.84; 1.20) | 0.95 (0.71–1.28) |
| Finnish | Brier score | 0.132 | 0.125 | 0.122 | 0.114 |
| Diabetes | Yates slope | 0.058 | 0.036 | 0.071 | 0.033 |
| Risk model | C-statistic (95% CI) | 0.64 (0.59–0.69) | 0.57 (0.48–0.67) | 0.71 (0.66–0.76) | 0.63 (0.54–0.72) |
| Developed | E/O (95% CI) | 1.00 (0.84; 1.20) | 0.89 (0.67; 1.20) | ||
| Model | Brier score | 0.118 | 0.123 | ||
| Yates slope | 0.094 | 0.031 | |||
| C-statistic (95% CI) | 0.74 (0.70–0.79) | 0.56 (0.45–0.66) | |||
Fig 1Calibration curves for the risk prediction models across the updating methods.