| Literature DB >> 33988129 |
Qiongxiu Zhou1, Xiaohan You2, Haiyan Dong3, Zhe Lin1, Yanling Shi1, Zhen Su1, Rongrong Shao1, Chaosheng Chen1, Ji Zhang1.
Abstract
Premature all-cause mortality is high in patients receiving peritoneal dialysis (PD). The accurate and early prediction of mortality is critical and difficult. Three prediction models, the logistic regression (LR) model, artificial neural network (ANN) classic model and a new structured ANN model (ANN mixed model), were constructed and evaluated using a receiver operating characteristic (ROC) curve analysis. The permutation feature importance was used to interpret the important features in the ANN models. Eight hundred fifty-nine patients were enrolled in the study. The LR model performed slightly better than the other two ANN models on the test dataset; however, in the total dataset, the ANN models fit much better. The ANN mixed model showed the best prediction performance, with area under the ROC curves (AUROCs) of 0.8 and 0.79 for the 6-month and 12-month datasets. Our study showed that age, diastolic blood pressure (DBP), and low-density lipoprotein cholesterol (LDL-c) levels were common risk factors for premature mortality in patients receiving PD. Our ANN mixed model had incomparable advantages in fitting the overall data characteristics, and age is a steady risk factor for premature mortality in patients undergoing PD. Otherwise, DBP and LDL-c levels should receive more attention for all-cause mortality during follow-up.Entities:
Keywords: age; all-cause mortality; artificial neural networks; peritoneal dialysis; risk factors
Mesh:
Year: 2021 PMID: 33988129 PMCID: PMC8202888 DOI: 10.18632/aging.203033
Source DB: PubMed Journal: Aging (Albany NY) ISSN: 1945-4589 Impact factor: 5.682
Baseline characteristics of the included patients with CAPD.
| Case (n) | 777 | 82 | |
| Age (years, median [IQR]) | 48.0 [38.0, 58.0] | 63.0 [54.0, 70.0] | <0.001 |
| Male (n, %) | 432 (55.6) | 51 (62.2) | 0.3 |
| SBP (mmHg, median [IQR]) | 145.0 [132.0, 159.0] | 148.0 [133.2, 164.0] | 0.2 |
| DBP (mmHg, median [IQR]) | 88.0 [78.0, 97.0] | 79.0 [71.0, 91.5] | <0.001 |
| Tg (mmol/L, median [IQR]) | 1.6 [1.2, 2.1] | 1.6 [1.2, 2.1] | 0.9 |
| Tc (mmol/L, median [IQR]) | 4.6 [3.9, 5.4] | 4.8 [4.1, 5.5] | 0.3 |
| LDL-c (mmol/L, median [IQR]) | 2.5 [2.0, 3.1] | 2.8 [2.2, 3.4] | 0.01 |
| HDL-c (mmol/L, median [IQR]) | 1.0 [0.9, 1.2] | 1.0 [0.8, 1.2] | 0.05 |
| Serum albumin (g/L, median [IQR]) | 37.1 [33.5, 40.4] | 35.1 [32.0, 39.0] | 0.004 |
| Hemoglobin (g/L, median [IQR]) | 95.0 [82.0, 108.0] | 93.0 [81.2, 104.0] | 0.2 |
| BUN (mmol/L, median [IQR]) | 19.1 [13.9, 24.6] | 18.8 [12.9, 24.8] | 0.9 |
| SCR (μmol/L, median [IQR]) | 646.0 [326.0, 888.0] | 526.5 [306.2, 773.5] | 0.05 |
| Serum calcium (mmol/L, median [IQR]) | 2.2 [2.0, 2.3] | 2.1 [2.0, 2.2] | 0.3 |
| Serum phosphorus (mmol/L, median [IQR]) | 1.6 [1.3, 1.8] | 1.5 [1.3, 1.8] | 0.3 |
| iPTH (pg/mL, median [IQR]) | 212.6 [108.4, 371.9] | 181.6 [82.0, 309.4] | 0.07 |
| Kt/V (median [IQR]) | 1.9 [1.7, 2.2] | 1.8 [1.6, 2.1] | 0.1 |
| Diabetes (n, %) | 176 (22.7) | 37 (45.1) | <0.001 |
| Hypertension (n, %) | 619 (94.8) | 78 (98.7) | 0.2 |
| Chronic heart disease (n, %) | 178 (22.9) | 38 (46.3) | <0.001 |
| Malignancy (n, %) | 53 (6.8) | 10 (12.2) | 0.1 |
| Follow-up time (month, median [IQR]) | 38.0 [16.0, 66.0] | 40.5 [18.2, 59.8] | 0.9 |
SBP: systolic blood pressure; DBP: diastolic blood pressure; Tg: triglycerides; Tc: total cholesterol; LDL-c: low-density lipoprotein cholesterol; HDL-c: high-density lipoprotein cholesterol; BUN: urea nitrogen; SCR: serum creatinine; iPTH: intact parathyroid hormone.
Multivariable logistic regression models for the three full datasets.
| Age | 0.071 (0.013) | <0.001* | 0.065 (0.013) | <0.001* | 0.068 (0.013) | <0.001* |
| CHD | 0.578 (0.272) | 0.03* | 0.479 (0.276) | 0.08 | 0.462 (0.274) | 0.09 |
| DBP | -0.014 (0.012) | 0.2 | -0.028 (0.015) | 0.07 | -0.026 (0.016) | 0.1 |
| Diabetes | 0.355 (0.294) | 0.2 | 0.051 (0.305) | 0.9 | -0.023 (0.315) | 0.9 |
| Malignancy | 0.342 (0.416) | 0.4 | 0.272 (0.416) | 0.5 | 0.308 (0.411) | 0.5 |
| Albumin | -0.041 (0.029) | 0.2 | -0.1 (0.033) | 0.003* | -0.117 (0.034) | 0.001* |
| BUN | 0 (0.024) | 1 | 0.006 (0.028) | 0.8 | -0.015 (0.031) | 0.6 |
| Ca | 0.854 (0.682) | 0.2 | 1.469 (0.876) | 0.09 | 1.379 (0.87) | 0.1 |
| SCR | -0.001 (0.001) | 0.2 | -0.001 (0.001) | 0.2 | 0 (0.001) | 0.4 |
| Hb | -0.014 (0.008) | 0.09 | -0.015 (0.009) | 0.09 | -0.013 (0.009) | 0.2 |
| HDL-c | -0.457 (0.562) | 0.4 | -0.162 (0.632) | 0.8 | 0.345 (0.264) | 0.2 |
| LDL-c | 0.514 (0.309) | 0.1 | 0.87 (0.412) | 0.04* | 0.791 (0.36) | 0.03* |
| P | 0.478 (0.423) | 0.3 | 0.343 (0.49) | 0.5 | 0.488 (0.51) | 0.3 |
| iPTH | 0.054 (0.158) | 0.7 | 0.061 (0.172) | 0.7 | 0.12 (0.178) | 0.5 |
| Tc | -0.134 (0.272) | 0.6 | -0.532 (0.364) | 0.1 | -0.554 (0.289) | 0.06 |
| Tg | -0.177 (0.192) | 0.4 | 0.2 (0.216) | 0.4 | 0.282 (0.173) | 0.1 |
| SBP | 0.012 (0.007) | 0.1 | 0.014 (0.009) | 0.1 | 0.015 (0.009) | 0.1 |
| Sex | 0.293 (0.291) | 0.3 | 0.421 (0.318) | 0.2 | 0.56 (0.321) | 0.08 |
| Kt/V | -0.259 (0.313) | 0.4 | 0 (0.376) | 1 | 0.124 (0.395) | 0.8 |
Model 0: 0-month datasets; Model 1: 6-month datasets; Model 2: 12-month datasets; iPTH: intact parathyroid hormone; SBP: systolic blood pressure; DBP: diastolic blood pressure; MAP: mean arterial pressure; BMI: body mass index; RAAS: renin–angiotensin–aldosterone system agents; CCBs: calcium channel blockers.
Figure 1ROC curves of selected models for predicting the primary outcome in different datasets. The dark solid lines indicate the median curve of the three types of models (ANN mixed model, ANN classic model, and logistic model). (A) Performance of selected models in the test dataset, (B) Performance of selected models in the total dataset, (C) Performance of selected models in the 6-month dataset, (D) Performance of selected models in the 12-month dataset.
Figure 2Post hoc test of performance. (A) Performance of the models for the negative prediction in the test dataset; (B) performance of the models for the positive prediction in the test dataset; (C) performance of the models for the negative prediction in the total dataset; and (D) performance of the models for the positive prediction in the total dataset. The short bar indicates the difference in the mean value with a 95% confidence interval.
Figure 3Distribution of the performance outcomes of the models for the 6-month and 12-month datasets. (A) Performance of the models for the negative prediction in the 6-month dataset, (B) Performance of the models for the positive prediction in the 6-month dataset, (C) Performance of the models for the negative prediction in the 12-month dataset, (D) Performance of the models for the positive prediction in the 12-month dataset.
Performance of the models in the follow-up datasets.
| Accuracy | 0.89 (0.07) | 0.85 (0.03) | 0.80 (0.02) | <0.001 | 0.89 (0.07) | 0.85 (0.03) | 0.80 (0.02) | <0.001 |
| F1 score | 0.93 (0.08) | 0.92 (0.02) | 0.89 (0.01) | <0.001 | 0.43 (0.07) | 0.05 (0.04) | 0.09 (0.02) | <0.001 |
| Precision | 0.93 (0.08) | 0.90 (0.00) | 0.90 (0.00) | <0.001 | 0.44 (0.09) | 0.06 (0.04) | 0.08 (0.02) | <0.001 |
| Recall | 0.93 (0.08) | 0.93 (0.03) | 0.87 (0.02) | <0.001 | 0.44 (0.10) | 0.05 (0.04) | 0.11 (0.03) | <0.001 |
| Accuracy | 0.88 (0.07) | 0.85 (0.03) | 0.80 (0.02) | <0.001 | 0.88 (0.07) | 0.85 (0.03) | 0.80 (0.02) | <0.001 |
| F1 score | 0.93 (0.08) | 0.92 (0.02) | 0.89 (0.01) | <0.001 | 0.39 (0.07) | 0.05 (0.03) | 0.09 (0.03) | <0.001 |
| Precision | 0.93 (0.08) | 0.90 (0.00) | 0.90 (0.00) | <0.001 | 0.40 (0.07) | 0.07 (0.04) | 0.08 (0.02) | <0.001 |
| Recall | 0.93 (0.08) | 0.93 (0.03) | 0.88 (0.02) | <0.001 | 0.39 (0.10) | 0.05 (0.04) | 0.10 (0.04) | <0.001 |
Values are presented as the means (SDs).
Figure 4Permutation feature importance for the ANN models in the total dataset (0-month), 6-month dataset and 12-month dataset. Higher positive values indicated greater importance of the model, and negative values may indicate that the feature is worse than noise.