| Literature DB >> 22563399 |
Hon-Yi Shi1, King-Teh Lee, Hao-Hsien Lee, Wen-Hsien Ho, Ding-Ping Sun, Jhi-Joung Wang, Chong-Chi Chiu.
Abstract
BACKGROUND: Since most published articles comparing the performance of artificial neural network (ANN) models and logistic regression (LR) models for predicting hepatocellular carcinoma (HCC) outcomes used only a single dataset, the essential issue of internal validity (reproducibility) of the models has not been addressed. The study purposes to validate the use of ANN model for predicting in-hospital mortality in HCC surgery patients in Taiwan and to compare the predictive accuracy of ANN with that of LR model. METHODOLOGY/PRINCIPALEntities:
Mesh:
Year: 2012 PMID: 22563399 PMCID: PMC3338531 DOI: 10.1371/journal.pone.0035781
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Patient characteristics and hospital characteristics (N = 22,926).
| Variable | No. of patients (%) |
| Operation years | |
| 1998–2001 | 6,857 (29.9) |
| 2002–2005 | 7,245 (31.6) |
| 2006–2009 | 8,824 (38.5) |
| Age [mean ± standard deviation], years | 58.6±12.7 |
| Gender | |
| Female | 6,028 (26.3) |
| Male | 16,898 (73.7) |
| Charlson co-morbidity index [mean ± standard deviation], scores | 3.6±1.6 |
| Hospital volume | |
| Low | 4,218 (18.4) |
| Medium | 6,145 (26.8) |
| High | 6,086 (26.6) |
| Very high | 6,477 (28.2) |
| Surgeon volume | |
| Low | 5,250 (22.9) |
| Medium | 5,860 (25.6) |
| High | 5,806 (25.3) |
| Very high | 6,010 (26.2) |
| Length of stay [mean ± standard deviation], days | 17.8±9.7 |
| In-hospital mortality | |
| Survival | 22,307 (97.3) |
| Death | 619 (2.7) |
The LR model using selected variables related to in-hospital mortality.
| Variable | Un-standardized coefficient | Standardized error | Odds ratio (OR) | P value |
| Age | −0.042 | 0.005 | 1.04 | <0.001 |
| Gender | ||||
| Male | −0.213 | 0.054 | 1.24 | 0.002 |
| Charlson co-morbidity index | −0.208 | 0.027 | 1.23 | <0.001 |
| Hospital volume | ||||
| Medium | 0.284 | 0.131 | 1.13 | <0.001 |
| High | 0.660 | 0.265 | 1.52 | <0.001 |
| Very High | 0.719 | 0.273 | 1.84 | <0.001 |
| Surgeon volume | ||||
| Medium | 0.659 | 0.143 | 1.22 | <0.001 |
| High | 0.937 | 0.155 | 1.79 | <0.001 |
| Very High | 1.549 | 0.215 | 2.41 | <0.001 |
| Length of stay | −0.039 | 0.004 | 1.04 | <0.001 |
| Constant | 7.267 | 0.355 | 2.01 | <0.001 |
LR = logistic regression.
Reference variables are female gender, low hospital volume, low surgeon volume.
Figure 1Schematic representation of artificial neural network model with 6 input nodes, 3 nodes in a single hidden layer, and a single output node representing in-hospital mortality.
X1, age; X2, gender; X3, Charlson co-morbidity index; X4, hospital volume; X5, surgeon volume; X6, length of stay; IB, input layer bias; HB, hidden layer bias.
Comparison of 1000 pairs of ANN and LR models for predicting in-hospital mortality.
| Performance indices | ANN (95% C.I.) | LR (95% C.I.) | P value |
| Accuracy rate | 97.28 (95.88, 98.68) | 88.29 (86.49, 90.09) | <0.001 |
| H-L statistics | 41.18 (34.67, 47.68) | 54.53 (49.53, 59.52) | <0.001 |
| AUROC | 0.84 (0.88, 0.80) | 0.76 (0.71, 0.81) | <0.001 |
ANN = artificial neural network; LR = logistic regression; Hosmer-Lemeshow statistics = H-L statistics; AUROC = area under the receiver operating characteristic.
Global sensitivity analysis of the ANN model in predicting in-hospital mortality.
| Rank | |||
| First | Second | Third | |
| Variable | Surgeon volume | Age | Lengths of stay |
| VSR | 1.22 | 1.10 | 1.09 |
ANN = artificial neural network; VSR = variable sensitivity ratio.
Comparative performance indices of ANN and LR models when using 100 new data sets to predict in-hospital mortality.
| Model | Sensitivity (%) | Specificity (%) | PPV (%) | NPV (%) | Accuracy rate (%) | AUROC |
| ANN | 78.40 | 94.57 | 84.22 | 96.91 | 95.93 | 0.82 |
| LR | 62.64 | 91.92 | 76.65 | 87.18 | 84.47 | 0.73 |
ANN = artificial neural network; LR = logistic regression; PPV = positive predictive value; NPV = negative predictive value; AUROC = area under the receiver operating characteristic.