| Literature DB >> 33198650 |
Georgios Kantidakis1,2,3, Hein Putter4, Carlo Lancia5, Jacob de Boer6, Andries E Braat6, Marta Fiocco5,4,7.
Abstract
BACKGROUND: Predicting survival of recipients after liver transplantation is regarded as one of the most important challenges in contemporary medicine. Hence, improving on current prediction models is of great interest.Nowadays, there is a strong discussion in the medical field about machine learning (ML) and whether it has greater potential than traditional regression models when dealing with complex data. Criticism to ML is related to unsuitable performance measures and lack of interpretability which is important for clinicians.Entities:
Keywords: Neural networks; Post-transplantation; Predictive performance; Random survival forest; Risk factors; Survival analysis
Mesh:
Year: 2020 PMID: 33198650 PMCID: PMC7667810 DOI: 10.1186/s12874-020-01153-1
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Hazard ratios along with their 95% confidence intervals for the 12 most influential variables for the Cox models. Variables are presented in decreasing order according to the absolute z-score values (12.90 to 5.16) for the Cox model with all variables. Predictors shown are the most prognostic as their z-scores values correspond to low and very significant p-values. These variables were also selected by both Cox backward and Cox LASSO model which verifies their prognostic ability for Cox models
| Cox all variables | Cox backward | Cox LASSO | |
|---|---|---|---|
| HR (95% CI) | HR (95% CI) | HR | |
| Re-transplantation | 1.602 (1.491-1.721) | 1.608 (1.501-1.722) | 1.558 |
| Donor age | 1.010 (1.008-1.011) | 1.011 (1.009-1.012) | 1.009 |
| Donor type DCD ( | 1.483 (1.362-1.616) | 1.443 (1.338-1.556) | 1.298 |
| log(Total cold ischemic time) | 1.258 (1.192-1.327) | 1.285 (1.221-1.353) | 1.191 |
| Diabetes | 1.173 (1.125-1.225) | 1.176 (1.128-1.226) | 1.136 |
| Race Black ( | 1.240 (1.171, 1.314) | 1.261 (1.193-1.332) | 1.186 |
| Life support | 1.343 (1.240-1.454) | 1.375 (1.272-1.487) | 1.304 |
| Recipient age | 1.007 (1.005-1.009) | 1.008 (1.006-1.010) | 1.006 |
| Incidental tumour | 1.314 (1.202, 1.437) | 1.315 (1.203-1.437) | 1.203 |
| Hypertensive bleeding | 1.296 (1.185, 1.418) | 1.301 (1.190-1.423) | 1.214 |
| HCV ( | 1.147 (1.091-1.206) | 1.148 (1.094-1.205) | 1.166 |
| Pre-treatment status ICU ( | 1.240 (1.143, 1.346) | 1.253 (1.160-1.354) | 1.164 |
(a): Donor type DCD (Donor Circulatory Dead) vs DBD (Donor after Brain-Dead), (b): Race Black vs White, (c): Chronic hepatitis C virus, (d): Intense Care Unit vs Non-hospitalised/Hospitalised
Integrated Brier Score (IBS) and C-index on the test data. Neural network 1h and 2h refer to a neural network with one and two hidden layers respectively
| IBS | C-index | |
|---|---|---|
| Cox all variables | 0.183 | 0.620 |
| Cox backward | 0.183 | 0.615 |
| Cox LASSO | 0.183 | 0.614 |
| RSF | 0.182 | |
| Neural Network 1h | - | |
| Neural Network 2h | - |
Fig. 1Prediction error curves for all models
The 12 most prognostic factors for the neural networks with 1 and 2 hidden layers (Rel-Imp: relative importance) and for the Random Survival Forest (VIMP: variable importance). Note that the NN utilises time intervals as one of the input variables (check the contribution of time intervals in Table 1 of Additional file 1). For RSF importance is measured for each variable without distinction for each level
| Neural network 1h | Rel-Imp | Neural network 2h | Rel-Imp | RSF | VIMP |
|---|---|---|---|---|---|
| Re-transplantation | 0.035 | Re-transplantation | 0.028 | Donor age | 0.010 |
| Life-support | 0.025 | HCV ( | 0.025 | Re-transplantation | 0.009 |
| Pre-treatment status ICU ( | 0.023 | Life-support | 0.024 | Life support | 0.007 |
| Donor type DCD ( | 0.023 | Donor age | 0.023 | HCV ( | 0.007 |
| Race Black ( | 0.022 | Diabetes | 0.021 | Pre-treatment status | 0.006 |
| HCV ( | 0.022 | Pre-treatment status ICU ( | 0.020 | Recipient age | 0.004 |
| Diabetes | 0.020 | Working income | 0.020 | Aetiology | 0.003 |
| Donor age | 0.020 | Race Black ( | 0.019 | log(Last serum creatinine) | 0.003 |
| Working income | 0.018 | Previous abdominal surgery | 0.015 | Functional status | 0.002 |
| Functional status Total assistance ( | 0.017 | Donor pre-recovery diuretics | 0.015 | log(Total cold ischemic time) | 0.002 |
| Aetiology HCV | 0.017 | Aetiology Cholestatic | 0.011 | Race | 0.002 |
| Hypertensive bleeding | 0.017 | Functional status Total assistance ( | 0.015 | Diabetes | 0.002 |
(a): Intense Care Unit vs Non-hospitalised/Hospitalised (b): Donor type DCD (Donor Circulatory Dead) vs DBD (Donor after Brain-Dead), (c): Race Black vs White, (d): Chronic hepatitis C virus, (e): Total assistance vs No assistance
Fig. 2a Predicted survival probabilities for 3 new hypothetical patients using the Cox model with all variables (solid lines), the tuned RSF (short dashed lines) and the tuned NN with 1 hidden layer (long dashed lines). The green lines correspond to a reference patient with the median values for the continuous and the mode value for categorical variables. The patient in the orange line has diabetes (the other covariates as in reference patient). The patient in the red line has been transplanted before and has diabetes simultaneously (the other covariates as in reference patient). Values for 10 prognostic variables for the reference patient are provided in Table 2 of Additional file 1. b Predicted survival probabilities for 3 patients selected from the test data based on the Cox model with all variables (solid lines), the tuned RSF (short dashed lines) and the tuned NN with 1 hidden layer (long dashed lines). Green lines correspond to a patient censored at 1.12 years. Patient in the orange line was censored at 6.86 years. Patient in the red line died at 0.12 years. Values for 10 prognostic variables for the patients are provided in Tables 3-5 of Additional file 1
Fig. 3Calibration plots at 2 years on the test data: a Cox model with all variables, b Random Survival Forest, c Partial Logistic Artificial Neural Network with 1 hidden layer, d Partial Logistic Artificial Neural Network with 2 hidden layers