| Literature DB >> 32221367 |
Chien-Liang Liu1, Ruey-Shyang Soong2,3, Wei-Chen Lee4,5, Guo-Wei Jiang6, Yun-Chun Lin6.
Abstract
Liver transplantation is one of the most effective treatments for end-stage liver disease, but the demand for livers is much higher than the available donor livers. Model for End-stage Liver Disease (MELD) score is a commonly used approach to prioritize patients, but previous studies have indicated that MELD score may fail to predict well for the postoperative patients. This work proposes to use data-driven approach to devise a predictive model to predict postoperative survival within 30 days based on patient's preoperative physiological measurement values. We use random forest (RF) to select important features, including clinically used features and new features discovered from physiological measurement values. Moreover, we propose a new imputation method to deal with the problem of missing values and the results show that it outperforms the other alternatives. In the predictive model, we use patients' blood test data within 1-9 days before surgery to construct the model to predict postoperative patients' survival. The experimental results on a real data set indicate that RF outperforms the other alternatives. The experimental results on the temporal validation set show that our proposed model achieves area under the curve (AUC) of 0.771 and specificity of 0.815, showing superior discrimination power in predicting postoperative survival.Entities:
Mesh:
Year: 2020 PMID: 32221367 PMCID: PMC7101323 DOI: 10.1038/s41598-020-62387-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Model performances with different feature combinations. We use RF to identify the top eight features, and conduct experiments to evaluate model performances with different feature combinations.
| Model | Feature | AUC |
|---|---|---|
| 1 | BMI, age | |
| 2 | BMI, age, Na | |
| 3 | BMI, age, Na, Lymphocyte | |
| 4 | BMI, age, Na, Lymphocyte, INR | |
| 5 | BMI, age, Na, Lymphocyte, INR, WBC | |
| 6 | BMI, age, Na, Lymphocyte, INR, WBC, Platelets | |
| 7 | BMI, age, Na, Lymphocyte, INR, WBC, Platelets, Mg |
Performance comparisons for different learning algorithms on derivation and temporal sets. The values in the cell of the top table are the mean and 1.96 standard deviations of 10-fold cross-validation. Random forest and XGBoost both work well, and RF outperforms XGBoost in AUC.
| Performance on Derivation Set | |||
|---|---|---|---|
| Method | AUC | Specificity | Sensitivity |
| RF | 0.787 ± 0.185 | 0.955 ± 0.187 | 0.653 ± 0.334 |
| XGBoost | 0.782 ± 0.268 | 0.905 ± 0.335 | 0.729 ± 0.329 |
| Decision Tree | 0.576 ± 0.229 | 0.698 ± 0.349 | 0.517 ± 0.335 |
| Logistic Regression | 0.538 ± 0.273 | 0.717 ± 0.230 | 0.695 ± 0.250 |
| RF | 0.771 | 0.815 | 0.5 |
| XGBoost | 0.759 | 0.796 | 0.5 |
| Decision Tree | 0.632 | 0.888 | 0.25 |
| Logistic Regression | 0.671 | 0.870 | 0.5 |
Figure 1The top important features selected from RF with step-wise selection. These variables are verified by the physician. (DX1: reason for liver transplantation, type: type of hepatitis, HCC: Hepatocellular carcinoma).
Performance comparison of different imputation methods. The values in the cell are the mean and 1.96 standard deviations of 10-fold cross-validation on derivation set. The proposed method considers the characteristics of features, giving a base to outperform other alternatives.
| Imputation Methods | AUC | Specificity | Sensitivity |
|---|---|---|---|
| Proposed Method | |||
| Minimum | |||
| Maximum | |||
| Median | |||
| Average | |||
| Classification and Regression Tree |
Figure 2Experimental results with different range of days as the data source. The AUC increases as more training data are used in the model, and the data of day 1 to day 9 is the most important one.
Performance comparison of RF and survival analysis with different combinations of features. The experiments were conducted with 10-fold cross-validation. The features used in MELD model comprise MELD score, hepatitis, HCC, DX1, age, gender, and BMI, whereas the features selected by RF are the top eight features identified by RF.
| RF Model on Derivation Set | |||
|---|---|---|---|
| Features | AUC | Specificity | Sensitivity |
| Features selected by RF | 0.787 ± 0.185 | 0.955 ± 0.187 | 0.653 ± 0.334 |
| Features used by MELD | 0.596 ± 0.315 | 0.707 ± 0.302 | 0.720 ± 0.395 |
| Features selected by RF | 0.85 | p-value = 6e-08 | p-value = 2e-05 |
| Features used by MELD | 0.695 | p-value = 4e-04 | p-value = 8e-05 |
The effect of features on model performance in temporal validation data set. We applied RF with two combinations of features to the temporal validation set, and the experimental results point out that the model could benefit from the features selected by RF.
| RF Model on Derivation Set | |||
|---|---|---|---|
| Features | AUC | Specificity | Sensitivity |
| Features selected by RF | 0.771 | 0.815 | 0.5 |
| Features selected by MELD | 0.681 | 0.703 | 0.5 |
Figure 3Hazard ratios (HR) from Cox proportional hazards model with the data of day 9. In the results, HR >1indicates an increased risk of death, and HR < 1 represents a decreased risk. The p-values of the variables show that INR, Platelets and age are significant features.
Figure 4Experimental flow. The experimental flow comprises data pre-processing, feature selection, imputation of missing values, model training and evaluation. The purpose of training data is for model training, whereas testing data is used for model evaluation.
Patients’ characteristics at 9 days before the surgery in derivation and temporal validation sets. The characteristics comprise basic information, blood test, preoperative status and other details. Moreover, statistical tests are applied to the data. *Mann-Whitney U Test were performed for continuous data, and Pearson Chi-Squared Tests for categorical data between groups comparison.
| Predictor candidates | Derivation set (year = 2004–2012) ( | Temporal validation set (year = 2013) ( | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Survival | Death | p-value* | Survival | Death | p-value* | ||||
| 1 | Gender, n(%) | 1.0 | 1.0 | ||||||
| Male | 358 | 326 (74.60) | 32 (74.42) | 43 | 40 (74.07) | 3 (0.75) | |||
| Female | 122 | 111 (25.40) | 11 (25.58) | 15 | 14 (25.93) | 1 (0.25) | |||
| 2 | Age | 480 | 52.69 (9.18) | 47.93 (15.29) | 0.0982 | 58 | 52.63 (10.48) | 55.25 (7.72) | 0.6896 |
| 3 | BMI | 470 | 24.52 (3.90) | 87.00 (39.65) | 0.3626 | 57 | 26.25 (4.26) | 24.05 (2.30) | 0.2918 |
| 4 | Survival days | 480 | 1411.25 (1014.81) | 12.14 (7.04) | <2.2E-16 | 58 | 172.70 (76.51) | 11.75 (11.95) | 0.00097 |
| 5 | INR | 470 | 1.73 (0.85) | 2.51 (2.74) | 4.724E-06 | 56 | 1.58 (2.04) | 0.56 (0.56) | 0.07164 |
| 6 | Lymphocyte | 467 | 22.33 (12.31) | 18.55 (13.33) | 0.02734 | 57 | 22.14 (11.40) | 18.23 (8.87) | 0.5219 |
| 7 | Mg | 415 | 1.68 (0.24) | 1.65 (0.33) | 0.1257 | 50 | 1.69 (0.26) | 1.55 (0.23) | 0.3133 |
| 8 | Na | 268 | 137.06 (5.82) | 138.17 (10.12) | 0.9906 | 8 | 123.28 (5.91) | 122.14 (2.32) | 0.9908 |
| 9 | Platelets | 476 | 79.21 (49.16) | 63.39 (35.46) | 0.0814 | 57 | 81.64 (68.08) | 85.08 (18.66) | 0.2811 |
| 10 | WBC | 476 | 4.86 (3.26) | 5.99 (3.79) | 0.03277 | 57 | 4.83 (5.16) | 6.67 (3.56) | 0.1378 |
| 11 | MELD score | 480 | 17.84 (8.90) | 24.21 (9.36) | 1.617E-05 | 58 | 16.69 (10.04) | 29.75 (11.90) | 0.021 |
| 12 | Hepatitis, n(%) | 0.1996 | 0.2852 | ||||||
| Nil | 83 | 71 (16.25) | 12 (27.91) | 19 | 16 (29.63) | 3 (0.75) | |||
| Hepatitis B Virus (HBV) | 277 | 253 (57.89) | 24 (55.81) | 22 | 21 (38.89) | 1 (0.25) | |||
| Hepatitis C Virus (HCV) | 96 | 90 (20.59) | 6 (13.95) | 16 | 16 (29.63) | 0 (0) | |||
| Dual | 24 | 23 (5.27) | 1 (2.33) | 11 | 1 (1.85) | 0 (0) | |||
| 13 | HCC, n(%) | 0.004467 | 0.2002 | ||||||
| No | 264 | 231 (52.86) | 33 (76.74) | 33 | 29 (53.70) | 4 (1.0) | |||
| Yes | 216 | 206 (47.14) | 10 (23.26) | 25 | 25 (46.30) | 0 (0) | |||
| 14 | DX1, n(%) | 0.1942 | — | ||||||
| Virtual hepatitis | 396 | 363 (83.26) | 33 (76.74) | 43 (79.63) | 2 (0.50) | ||||
| Alcoholic cirrhosis | 35 | 32 (7.34) | 3 (6.98) | 6 (11.11) | 1 (0.25) | ||||
| Wilson’s disease | 6 | 4 (0.92) | 2 (4.65) | 1 (1.85) | 0 (0) | ||||
| Primary Biliary cirrhosis | 7 | 7 (1.61) | 0 (0) | 0 (0) | 0 (0) | ||||
| Biliary atesia | 2 | 1 (0.23) | 1 (2.33) | 0 (0) | 0 (0) | ||||
| Fulminiant hepatitis | 4 | 3 (0.69) | 1 (2.33) | 0 (0) | 0 (0) | ||||
| Secondary Biliary cirrhosis | 1 | 1 (0.23) | 0 (0) | 0 (0) | 0 (0) | ||||
| Other malignancy | 2 | 2 (0.46) | 0 (0) | 0 (0) | 0 (0) | ||||
| Others | 26 | 23 (5.26) | 3 (6.98) | 4 (7.41) | 1 (0.25) | ||||
| 15 | Graft-recipient weight ratio (GRWR, %) | 387 | 1.01 (0.24) | 1.03 (0.48) | 0.4077 | 53 | 0.94 (0.2) | 1.02 (0.23) | 0.5004 |
| 16 | Liver weight | 413 | 633.15 (146.82) | 520.61 (134.30) | 50 | 644.68 (129.73) | 620 (256.60) | 0.8381 | |
| 17 | Types of liver transplant, n(%) | 0.03325 | 0.7745 | ||||||
| Living donor | 414 | 382 (87.41) | 32 (74.42) | 54 | 51 (94.44) | 3 (0.75) | |||
| Deceased donor | 66 | 55 (12.59) | 11 (25.58) | 4 | 3 (5.56) | 1 (0.25) | |||
The causes of death for the short-term survival patients in derivation and temporal validation sets.
| Cause of death | Derivation set (n = 43) | Temporal validation set (n = 4) |
|---|---|---|
| Acute Cellular Rejection | 9 | — |
| Acute Humeral Rejection | 5 | 1 |
| Primary non function | 3 | — |
| Sepsis | 13 | 2 |
| Cardiopulmonary | 7 | 1 |
| Complication | — | — |
| Small for size graft | 3 | — |
| Others | 3 | — |