| Literature DB >> 35321465 |
Xi Shi1,2, Tingyu Qu3, Gijs Van Pottelbergh4, Marjan van den Akker4,5, Bart De Moor1.
Abstract
Background: Prognostic models can help to identify patients at risk for end-stage kidney disease (ESKD) at an earlier stage to provide preventive medical interventions. Previous studies mostly applied the Cox proportional hazards model. The aim of this study is to present a resampling method, which can deal with imbalanced data structure for the prognostic model and help to improve predictive performance.Entities:
Keywords: chronic disease; logistic regression; machine learning; predictive performance; resampling method
Year: 2022 PMID: 35321465 PMCID: PMC8935060 DOI: 10.3389/fmed.2022.730748
Source DB: PubMed Journal: Front Med (Lausanne) ISSN: 2296-858X
Baseline characteristics of patients with CKD over 50-years old in Intego (2005–2015) by future progression or no progression to ESKD.
|
|
|
| |
|---|---|---|---|
| No. of observation (No. of patients) | 58,529 (11,424) | 2,257 (221) | |
| Hemoglobin (g/dL) (mean ± SD) | 13.2 ± 1.68 | 11.9 ± 1.60 | <0.001 |
| Uric Acid (mg/dL) (mean ± SD) | 6.5 ± 1.81 | 7.2 ± 2.18 | <0.001 |
| Age (year) (mean ± SD) | 75.3 ± 9.66 | 75.5 ± 9.59 | 0.46 |
| Type 2 diabetes mellitus (%) | 14.3 | 13.0 | 0.09 |
| Hypercholesterolaemia (%) | 10.4 | 6.0 | <0.001 |
| Use of antihypertensives drugs (%) | 3.9 | 14.9 | <0.001 |
| Use of diuretics (%) | 30.1 | 35.7 | <0.001 |
| Use of beta blocking agents (%) | 41.6 | 41.4 | 0.88 |
| Use of calcium channel blockers (%) | 17.9 | 31.7 | <0.001 |
| Use of agents acting on the renin-angiotensin system (%) | 41.8 | 41.9 | 0.98 |
| Use of lipid modifying agents (%) | 42.4 | 38.0 | <0.001 |
| Obesity (%) | 1.4 | 0.6 | 0.001 |
| Hypertension (%) | 15.2 | 11.9 | <0.001 |
| Malignancy (%) | 17.3 | 24.0 | <0.001 |
| Male gender (%) | 60.0 | 53.2 | <0.001 |
CKD, chronic kidney disease; ESKD, end-stage kidney disease.
Parameter estimates of the logistic regression analysis and the Cox proportional hazards model to predict the risk of future progression to ESKD.
|
|
|
| ||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |
| Hemoglobin (g/dL) | 0.501 | 0.495–0.507 | 0.587 | 0.561–0.616 | 0.640 | 0.612–0.670 |
| Uric acid (mg/dL) | 1.189 | 1.178–1.199 | 1.088 | 1.040–1.137 | 1.055 | 1.010–1.102 |
| Age (year) | 0.978 | 0.976–0.979 | 0.976 | 0.967–0.986 | 0.967 | 0.958–0.976 |
| Type 2 diabetes mellitus | 0.777 | 0.740–0.817 | 0.849 | 0.637–1.114 | 0.711 | 0.541–0.933 |
| Hypercholesterolaemia | 0.624 | 0.585–0.666 | 0.661 | 0.431–0.971 | 0.590 | 0.396–0.877 |
| Male gender | 0.550 | 0.531–0.569 | 1.341 | 1.110–1.619 | 1.211 | 1.007–1.456 |
| Antihypertensives | 3.095 | 2.884–3.323 | 2.684 | 1.963–3.618 | 2.697 | 2.005–3.628 |
| Diuretics | 0.862 | 0.830–0.897 | 1.121 | 0.907–1.381 | 1.268 | 1.034–1.555 |
| Beta blocking agents | 0.962 | 0.927–0.998 | 0.890 | 0.719–1.101 | 0.847 | 0.687–1.045 |
| Calcium channel blockers | 1.844 | 1.768–1.924 | 1.990 | 1.581–2.493 | 1.935 | 1.551–2.415 |
| Agents acting on the renin-angiotensin system | 0.885 | 0.853–0.918 | 0.429 | 0.341–0.538 | 0.443 | 0.355–0.553 |
| Lipid modifying agents | 0.761 | 0.733–0.791 | 0.865 | 0.693–1.077 | 0.722 | 0.581–0.897 |
| Obesity | 0.142 | 0.103–0.192 | 0.215 | 0.012–0.969 | 0.150 | 0.021–1.070 |
| Hypertension | 0.668 | 0.636–0.702 | 0.749 | 0.548–1.003 | 0.598 | 0.444–0.805 |
| Malignancy | 1.385 | 1.330–1.442 | 1.015 | 0.803–1.272 | 0.875 | 0.699–1.096 |
OR, odds ratio; HR, hazard ratio.
The participants are patients with chronic kidney disease (CKD) older than 50 years old with at least 2 records from 2005 to 2015. The start of the follow-up is the first observation after 2005 and the end of follow-up is the last observation.
Prediction performance of the logistic regression analysis and the Cox proportional hazards model.
|
|
|
| ||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |
| Predicted Positive | TP = 44 | FP = 1291 | TP = 2 | FP = 6 | TP = 1 | FP = 2 |
| Predicted Negative | FN = 7 | TN = 1567 | FN = 49 | TN = 2852 | FN = 50 | TN = 2856 |
| Accuracy | 0.554 | 0.981 | 0.982 | |||
| AUC | 0.687 | 0.700 | 0.693 | |||
| Recall | 0.863 | 0.039 | 0.020 | |||
| Precision | 0.033 | 0.250 | 0.333 | |||
| F3 | 0.245 | 0.043 | 0.023 | |||
TP, true positive; FP, false positive; TN, true negative; FN, false negative.
The performance of all the three models was evaluated on the original test set.
Prediction performance of the Cox proportional hazards model with different thresholds.
|
|
| |||||||
|---|---|---|---|---|---|---|---|---|
|
|
|
|
| |||||
|
|
|
|
|
|
|
|
| |
| Predicted Positive | TP = 36 | FP = 705 | TP = 41 | FP = 987 | TP = 26 | FP = 773 | TP = 43 | FP = 2378 |
| Predicted Negative | FN = 15 | TN = 2153 | FN = 10 | TN = 1871 | FN = 25 | TN = 2085 | FN = 8 | TN = 480 |
| Accuracy | 0.752 | 0.657 | 0.726 | 0.180 | ||||
| Recall | 0.706 | 0.804 | 0.510 | 0.843 | ||||
| Precision | 0.049 | 0.040 | 0.033 | 0.018 | ||||
| F3 | 0.300 | 0.276 | 0.207 | 0.149 | ||||
RP, risk probability; SP, survival probability; TP, true positive; FP, false positive; TN, true negative; FN, false negative.
The performance of all the three models was evaluated on the original test set.