| Literature DB >> 18441002 |
Navdeep Tangri1, David Ansell, David Naimark.
Abstract
BACKGROUND: Early technique failure has been a major limitation on the wider adoption of peritoneal dialysis (PD). The objectives of this study were to use data from a large, multi-centre, prospective database, the United Kingdom Renal Registry (UKRR), in order to determine the ability of an artificial neural network (ANN) model to predict early PD technique failure and to compare its performance with a logistic regression (LR)-based approach.Entities:
Mesh:
Year: 2008 PMID: 18441002 PMCID: PMC2517147 DOI: 10.1093/ndt/gfn187
Source DB: PubMed Journal: Nephrol Dial Transplant ISSN: 0931-0509 Impact factor: 5.992
Fig. 1Artificial neural network (ANN) architecture. ANNs consist of artificial neurons. Each artificial neuron has a processing node (‘body’) represented by circles in the figure as well as connections from (‘dendrites’) and connections to (‘axons’) other neurons which are represented as arrows in the figure. In a commonly used ANN architecture, the multilayer perceptron, the neurons are arranged in layers. An ordered set (a vector) of predictor variables is presented to the input layer. Each neuron of the input layer distributes its value to all of the neurons in the middle layer. Along each connection between input and middle neurons there is a connection weight so that the middle neuron receives the product of the value from the input neuron and the connection weight. Each neuron in the middle layer takes the sum of its weighted inputs and then applies a non-linear (usually logistic) function to the sum. The result of the function then becomes the output from that particular middle neuron. Each middle neuron is connected to the output neuron. Along each connection between a middle neuron and the output neuron there is a connection weight. In the final step, the output neuron takes the weighted sum of its inputs and applies the non-linear function to the weighted sum. The result of this function becomes the output for the entire ANN. More details are provided in the appendix.
Summary of the predictor variables included in the artificial neural network and logistic regression analyses for subjects who did not and who did suffer peritoneal dialysis technique failure.
| Technique survival | Technique failure | ||
|---|---|---|---|
| Variable | Mean ± SD or % (N) | Mean ± SD or % ( | |
| Observation time (days) | 617 ± 494 (1811) | 299 ± 344 (1458) | <0.001* |
| Age (years) | 60.9 ± 15.8 (1811) | 55.8 ± 16 (1458) | <0.001* |
| Female sex | 38.5 (1811) | 38 (1458) | 0.775 |
| Caucasian race | 91.1 (1383) | 90.6 (1242) | 0.641 |
| Diabetes | 22.1 (1811) | 18.2 (1458) | 0.005 |
| Glomerulopathy | 12.1 (1811) | 15.9 (1458) | 0.002 |
| Renovascular disease | 12.9 (1811) | 11.8 (1458) | 0.355 |
| PCKD | 5.6 (1811) | 7.6 (1458) | 0.021 |
| Pyelonephritis | 6.5 (1811) | 8 (1458) | 0.088 |
| Other | 12.2 (1811) | 12.1 (1458) | 0.956 |
| Unknown | 28.7 (1811) | 26.3 (1458) | 0.139 |
| Sympt. CVD | 11.5 (615) | 8.3 (504) | 0.072 |
| Angina | 24.1 (615) | 15.5 (502) | <0.001* |
| Past MI | 14.5 (615) | 11.3 (504) | 0.114 |
| Past CABG | 7.8 (614) | 5.4 (504) | 0.096 |
| Angioplasty | 4.9 (613) | 2.8 (498) | 0.069 |
| PVD | 3.1 (613) | 2.2 (501) | 0.346 |
| Leg ulcer | 4.9 (610) | 3.6 (501) | 0.272 |
| Claudication | 15.7 (613) | 9.6 (500) | 0.002 |
| Smoking | 20.5 (599) | 18.8 (490) | 0.467 |
| COPD | 7 (613) | 5.1 (505) | 0.190 |
| Diabetes | 10.3 (610) | 10.6 (502) | 0.901 |
| Malignancy | 9.5 (612) | 9.5 (504) | 0.979 |
| Liver disease | 1.6 (615) | 2.2 (502) | 0.495 |
| Weight (kg) | 69.8 ± 14.3 (497) | 71.4 ± 14.6 (438) | 0.091 |
| Height (cm) | 168 ± 10 (431) | 169 ± 10 (386) | 0.207 |
| SysBP (mmHg) | 141 ± 26 (819) | 144 ± 25 (701) | 0.051 |
| DiasBP (mmHg) | 79 ± 14.5 (819) | 82.1 ± 14.1 (701) | <0.001* |
| Calcc (mmol/L) | 2.43 ± 0.21 (1685) | 2.43 ± 0.21 (1380) | 0.827 |
| Phos (mmol/L) | 1.59 ± 0.49 (1667) | 1.61 ± 0.46 (1357) | 0.159 |
| Alb (g/L) | 32 ± 6 (1701) | 33.5 ± 5.1 (1376) | <0.001* |
| IPTH (pmol/L) | 26 ± 28.3 (909) | 25.6 ± 29.4 (789) | 0.756 |
| Creat (μmol/L) | 608 ± 211 (1737) | 664 ± 226 (1405) | <0.001* |
| Urea (mmol/L) | 18.7 ± 7.2 (1727) | 18.9 ± 6.4 (1397) | 0.459 |
| Haem (g/dL) | 11.1 ± 1.7 (1715) | 11 ± 1.7 (1377) | 0.425 |
| Ferritin (μg/L) | 323 ± 497 (1387) | 292 ± 377 (1182) | 0.082 |
| Cholesterol (mmol/L) | 5.22 ± 1.28 (787) | 5.3 ± 1.45 (629) | 0.330 |
| Bicarbonate (mmol/L) | 26.3 ± 4.2 (1534) | 26.1 ± 3.8 (1212) | 0.112 |
| HbA1c (%) | 7.48 ± 1.86 (176) | 7.51 ± 1.91 (164) | 0.893 |
| Aluminium (μmol/L) | 0.318 ± 0.634 (305) | 0.307 ± 0.47 (307) | 0.804 |
Continuous variables are shown as means ± standard deviations (SD) while categorical variables are shown as percentages (%). The number of subjects on which a mean value or percentage was calculated is given in the parentheses. Statistical significance was computed with independent t-tests for continuous variables and chi-square tests for categorical variables. The nominal significance level of 0.05 was Bonferroni-adjusted for the number of tests such that a P-value of 0.00125 or less (*) was considered to be significant. For each subject, blood pressure values, weight and laboratory values represent measurements closest to the date of peritoneal dialysis initiation while height values are the average of all available measurements.
ESRD = end-stage renal disease, CVD = cardiovascular disease, MI = myocardial infarction, CABG = coronary artery bypass grafting, PVD = peripheral vascular disease, COPD = chronic obstructive pulmonary disease, SysBP = systolic blood pressure, DiasBP = diastolic blood pressure, Calcc = calcium concentration adjusted for albumin, Phos = phosphate concentration, Alb = albumin concentration, iPTH = intact parathyroid hormone concentration, Creat = creatinine concentration, Haem = haemoglobin concentration and HbA1c = haemoglobin A1c percentage.
Fig. 2The Kaplan–Meier curve for probability of peritoneal dialysis (PD) technique failure after the initiation of PD. Survival until 31 December 2004, death, transplantation or loss to follow-up with functioning PD were considered to be censored observations.
Mean predictive performance for 20 artificial neural network (ANN) models and logistic regression models created using a bootstrap approach.
| Model | Artificial neural | Logistic |
|---|---|---|
| network | regression | |
| AUROC | 0.7602 | 0.7090* |
| Standard error | 0.0167 | 0.0208 |
| Optimal threshold | 1.4627 | 0.4016 |
| Sensitivity | 0.7043 | 0.6021 |
| Specificity | 0.6818 | 0.6856 |
| Positive predictive value | 0.6392 | 0.5469 |
| Negative predictive value | 0.7421 | 0.7320 |
| IABC | 1.3655 | 1.2411 |
*P = 0.016.
AUroc = area under the receiver operating characteristic curve (a value of 1.0 implies perfect discrimination between PD technique failure and success whereas 0.5 implies no discrimination); Optimal threshold. = the optimal threshold ANN output value that maximizes sensitivity and specificity; IABC = improvement in accuracy beyond chance (the ratio of the observed number of true positive plus true negative cases at the optimal threshold to the number expected by chance).
Fig. 3Receiver-operating characteristic (ROC) curves for the artificial neural network (ANN) and logistic regression bootstrap analyses (see the text for a description of the bootstrap procedure). The curves represent the average curves for the 20 ANN and 20 logistic models. The area under the ROC curve (AUROC) is an index of predictive performance: an AUROC of 1.0 represents perfect discrimination while a value of 0.5 indicates no discrimination between subjects with and without PD technique failure. The average AUROC values for the ANN and logistic regression models were 0.760 and 0.709, respectively (P = 0.0164).
Fig. 4Histogram of the artificial neural network (ANN) model output when applied to the validation set for subjects who did and did not suffer PD technique failure. For each of 20 bootstrap samples, the data were randomly divided into a training set from which an ANN model was derived, and a validation set on which the ANN was validated. The histogram data represent one of the 20 sets of validation set predictions selected at random.
Fig. 5Histogram of the logistic regression model output when applied to the validation set for subjects who did and did not suffer PD technique failure. For each of 20 bootstrap samples, the data were randomly divided into a ‘training set’ from which a regression model was derived, and a validation set on which the regression model was validated. The histogram data represent one of the 20 sets of validation set predictions selected at random.