| Literature DB >> 28827646 |
Kyung Don Yoo1, Junhyug Noh2, Hajeong Lee3, Dong Ki Kim3, Chun Soo Lim4, Young Hoon Kim5, Jung Pyo Lee4, Gunhee Kim6, Yon Su Kim7.
Abstract
Accurate prediction of graft survival after kidney transplant is limited by the complexity and heterogeneity of risk factors influencing allograft survival. In this study, we applied machine learning methods, in combination with survival statistics, to build new prediction models of graft survival that included immunological factors, as well as known recipient and donor variables. Graft survival was estimated from a retrospective analysis of the data from a multicenter cohort of 3,117 kidney transplant recipients. We evaluated the predictive power of ensemble learning algorithms (survival decision tree, bagging, random forest, and ridge and lasso) and compared outcomes to those of conventional models (decision tree and Cox regression). Using a conventional decision tree model, the 3-month serum creatinine level post-transplant (cut-off, 1.65 mg/dl) predicted a graft failure rate of 77.8% (index of concordance, 0.71). Using a survival decision tree model increased the index of concordance to 0.80, with the episode of acute rejection during the first year post-transplant being associated with a 4.27-fold increase in the risk of graft failure. Our study revealed that early acute rejection in the first year is associated with a substantially increased risk of graft failure. Machine learning methods may provide versatile and feasible tools for forecasting graft survival.Entities:
Mesh:
Year: 2017 PMID: 28827646 PMCID: PMC5567098 DOI: 10.1038/s41598-017-08008-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Baseline demographic, immunologic characteristics and treatment- associated factors according to graft survival in kidney transplants recipients (KTRs).
| Graft failure (n = 304, 9.8%) | No graft failure (n = 2813, 90.2%) |
| Number of missing value (%) | |
|---|---|---|---|---|
| Demographics – Recipients | ||||
| Age (years) | 41.4 ± 12.1 | 42.1 ± 11.5 | 0.332 | 0 (0.0) |
| Male gender | 201 (66.1) | 1655 (58.8) | 0.014 | 0 (0.0) |
| Body mass index (kg/m2) | 22.6 ± 3.2 | 22.3 ± 3.1 | 0.172 | 132 (4.2) |
| Cause of ESRD | <0.001 | 168 (5.3) | ||
| Diabetes | 40 (13.5) | 355 (13.4) | ||
| Hypertension | 13 (4.4) | 202 (7.6) | ||
| GN | 54 (18.2) | 631 (23.8) | ||
| Other | 38 (12.8) | 472 (17.8) | ||
| Unknown | 151 (51.0) | 993 (37.4) | ||
| Pre-transplant RRT modality | 0.055 | 1233 (39.5) | ||
| Preemptive transplant | 17 (6.7) | 195 (12.0) | ||
| Hemodialysis | 186 (73.2) | 1117 (68.6) | ||
| Peritoneal dialysis | 39 (15.4) | 264 (16.2) | ||
| ABO type | 0.206 | 40 (1.2) | ||
| A | 133 (43.8) | 1056 (37.5) | ||
| B | 65 (21.4) | 678 (24.1) | ||
| O | 35 (11.5) | 341 (12.1) | ||
| AB | 71 (23.4) | 472 (24.7) | ||
| ABOi transplant | 25 (8.4) | 199 (7.2) | 0.476 | 67 (2.1) |
| Pre-transplant comorbities | ||||
| Smoking status | ||||
| Never | 234 (77.0) | 2205 (78.4) | 0.001 | 1 (0.03) |
| Former | 19 (6.2) | 305 (10.8) | ||
| Current | 51 (16.8) | 302 (10.7) | ||
| Diabetes mellitus | 53 (17.4) | 480 (17.1) | 0.873 | 1 (0.03) |
| Hypertension | 256 (84.2) | 2369 (84.3) | 0.976 | 2 (0.06) |
| HBsAg positive | 25 (8.2) | 162 (5.8) | 0.003 | 0 (0.0) |
| HCV Ab positive | 23 (7.5) | 89 (3.3) | <0.001 | 0 (0.0) |
| Ischemic heart disease | 19 (6.2) | 108 (3.8) | 0.044 | 1 (0.03) |
| Cerebral vascular disease | 7 (2.3) | 77 (2.7) | 0.807 | 1 (0.03) |
| Peripheral vascular disease | 2 (0.7) | 12 (0.4) | 0.567 | 1 (0.03) |
| Immunologic factors | ||||
| Donor specific antibody | 1 (0.2) | 22 (0.7) | 0.884 | 2526 (81.0) |
| High PRA I titer (>50%) | 1 (0.2) | 66 (2.3) | 0.021 | 525 (16.8) |
| High PRA II titer (>50%) | 2 (0.2) | 38 (1.3) | 0.308 | 635 (20.3) |
| HLA cross mismatch (+) | 6 (1.9) | 94 (3.3) | 0.509 | 2 (0.06) |
| HLA-A mismatch | 0.072 | 105 (3.3) | ||
| 0 | 68 (23.1) | 796 (29.3) | ||
| 1 | 178 (60.3) | 1483 (54.6) | ||
| 2 | 49 (16.6) | 438 (16.1) | ||
| HLA-B mismatch | 0.001 | 104 (3.3) | ||
| 0 | 34 (11.5) | 448 (16.5) | ||
| 1 | 130 (44.1) | 1349 (49.6) | ||
| 2 | 131 (44.4) | 921 (33.9) | ||
| HLA-DR mismatch | 0.024 | 105 (3.3) | ||
| 0 | 45 (15.5) | 606 (22.3) | ||
| 1 | 173 (59.7) | 1452 (53.4) | ||
| 2 | 72 (24.8) | 662 (24.3) | ||
| Treatment factors | ||||
| CNIs | <0.001 | 1 (0.03) | ||
| Cyclosporine A | 191 (62.8) | 1312 (46.7) | ||
| Tacrolimus | 112 (36.8) | 1497 (53.2) | ||
Abbreviations: ESRD, end-stage renal disease; GN, glomerular nephritis; RRT, renal replacement therapy; ABOi, ABO incompatible; HBsAg, Hepatitis B virus surface antigen; HCV Ab, hepatitis C virus antibody; CMV IgG, cytomegalovirus immunoglobulin G; PRA, panel reactive antibody; HLA, human leukocyte antigen.
Baseline characteristics of kidney donors and post-transplant findings according to graft survival in kidney transplants recipients (KTRs).
| Graft failure (n = 304, 9.8%) | No graft failure (n = 2813, 90.2%) |
| Number of missing value (%) | |
|---|---|---|---|---|
| Demographics – Donors | ||||
| Age (years) | 38.4 ± 12.2 | 39.4 ± 12.1 | 0.194 | 96 (3.0) |
| Male gender | 170 (56.9) | 1559 (56.9) | 0.983 | 79 (2.5) |
| Serum creatinine (mg/dL) | 1.0 ± 0.6 | 0.9 ± 0.5 | 0.039 | 740 (23.7) |
| Body mass index (kg/m2) | 23.3 ± 3.4 | 23.6 ± 3.3 | 0.271 | 402 (12.9) |
| Donor relation | 0.001 | 38 (1.2) | ||
| Living related | 128 (42.4) | 1475 (53.1) | ||
| Living unrelated | 82 (27.2) | 660 (23.8) | ||
| Deceased | 92 (30.5) | 642 (23.1) | ||
| Donor CMV IgG | 154 (51.3) | 1820 (66.7) | 0.001 | 87 (2.7) |
| Post-transplant laboratory values at 3 month | ||||
| Serum creatinine (mg/dL) | 1.7 ± 0.7 | 1.2 ± 0.5 | <0.001 | 187 (6.0) |
| Serum Na (mg/dL) | 138.8 ± 3.7 | 140.3 ± 2.8 | <0.001 | 725 (23.2) |
| Serum total CO2 (mg/dL) | 23.9 ± 3.4 | 24.9 ± 3.1 | 0.001 | 759 (24.3) |
| Serum anion gap (mg/dL) | 9.5 ± 3.5 | 9.5 ± 2.6 | 0.932 | 766 (24.5) |
| Post-transplant observational findings | ||||
| Infection episode within 1year | 54 (17.8) | 373 (13.3) | 0.031 | 6 (0.19) |
| Rejection episode within 1year | 73 (24.0) | 332 (11.8) | <0.001 | 0 (0.0) |
| Total observation period (month) | 67.4 ± 51.9 | 87.9 ± 54.9 | <0.001 | |
Abbreviations: ESRD, end-stage renal disease; GN, glomerular nephritis; RRT, renal replacement therapy; ABOi, ABO incompatible; HBsAg, Hepatitis B virus surface antigen; HCV Ab, hepatitis C virus antibody; CMV IgG, cytomegalovirus immunoglobulin G; PRA, panel reactive antibody; HLA, human leukocyte antigen.
Figure 1Model structure.
Performance of the prediction model by conventional decision tree using different model setting.
| Setting | Imputation method | Use One | Weight method | Validation method | Validation ratio | N folds | Train set size | Test set size | Parameters | Train Performance | Test Performance |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | MICE/CART | T | Zupan | Cross-validation | 5 | 993 | 331 | cp = −1/maxdepth = 14 | 0.62 | 0.64 | |
| 1 | MICE/CART | T | nothing | Cross-validation | 5 | 993 | 331 | cp = 0.002/maxdepth = 6 | 0.78 | 0.62 | |
| 1 | MICE/CART | F | nothing | Cross-validation | 5 | 855 | 285 | cp = −1/maxdepth = 2 | 0.70 | 0.61 | |
| 1 | MICE/CART | T | Zupan | One validation | 0.285 | 993 | 331 | cp = −1/maxdepth = 14 | 0.62 | 0.59 | |
| 1 | MICE/CART | T | nothing | One validation | 0.285 | 993 | 331 | cp = −1/maxdepth = 2 | 0.72 | 0.59 | |
| 1 | MICE/CART | F | Zupan | One validation | 0.285 | 855 | 285 | cp = −1/maxdepth = 12 | 0.61 | 0.52 | |
| 1 | MICE/CART | F | Zupan | Cross-validation | 5 | 855 | 285 | cp = −1/maxdepth = 20 | 0.62 | 0.48 | |
| 1 | MICE/CART | F | nothing | One validation | 0.285 | 855 | 285 | cp = −1/maxdepth = 8 | 0.91 | 0.46 | |
| 2 | nothing | T | nothing | One validation | 0.285 | 288 | 96 | cp = 0.002/maxdepth = 6 | 0.83 | 0.71 | |
| 2 | nothing | T | Zupan | Cross-validation | 5 | 288 | 96 | cp = −1/maxdepth = 20 | 0.63 | 0.71 | |
| 2 | nothing | F | Zupan | Cross-validation | 5 | 231 | 77 | cp = −1/maxdepth = 6 | 0.60 | 0.70 | |
| 2 | MICE/CART | T | Zupan | Cross-validation | 5 | 993 | 331 | cp = −1/maxdepth = 24 | 0.64 | 0.67 | |
| 2 | nothing | T | nothing | Cross-validation | 5 | 288 | 96 | cp = −1/maxdepth = 8 | 0.92 | 0.66 | |
| 2 | nothing | T | Zupan | One validation | 0.285 | 288 | 96 | cp = −1/maxdepth = 10 | 0.61 | 0.65 | |
| 2 | MICE/CART | T | nothing | Cross-validation | 5 | 993 | 331 | cp = 0.008/maxdepth = 10 | 0.73 | 0.64 | |
| 2 | MICE/CART | T | nothing | One validation | 0.285 | 993 | 331 | cp = −1/maxdepth = 6 | 0.83 | 0.62 | |
| 2 | nothing | F | nothing | One validation | 0.285 | 231 | 77 | cp = 0.002/maxdepth = 4 | 0.88 | 0.62 | |
| 2 | MICE/CART | F | Zupan | One validation | 0.285 | 855 | 285 | cp = −1/maxdepth = 16 | 0.62 | 0.55 | |
| 2 | MICE/CART | F | Zupan | Cross-validation | — | 5 | 855 | 285 | cp = −1/maxdepth = 18 | 0.61 | 0.55 |
| 2 | MICE/CART | T | Zupan | One validation | 0.285 | 993 | 331 | cp = −1/maxdepth = 16 | 0.63 | 0.54 | |
| 2 | nothing | F | nothing | Cross-validation | 5 | 231 | 77 | cp = −1/maxdepth = 2 | 0.79 | 0.53 | |
| 2 | nothing | F | Zupan | One validation | 0.285 | 231 | 77 | cp = −1/maxdepth = 6 | 0.61 | 0.53 | |
| 2 | MICE/CART | F | nothing | Cross-validation | 5 | 855 | 285 | cp = −1/maxdepth = 4 | 0.82 | 0.49 | |
| 2 | MICE/CART | F | nothing | One validation | 0.285 | 855 | 285 | cp = −1/maxdepth = 6 | 0.92 | 0.47 | |
| 3 | MICE/CART | T | nothing | Cross-validation | 5 | 993 | 331 | cp = −1/maxdepth = 6 | 0.86 | 0.62 | |
| 3 | MICE/CART | T | Zupan | Cross-validation | 5 | 993 | 331 | cp = −1/maxdepth = 16 | 0.63 | 0.61 | |
| 3 | MICE/CART | T | nothing | One validation | 0.285 | 993 | 331 | cp = −1/maxdepth = 6 | 0.83 | 0.61 | |
| 3 | MICE/CART | T | Zupan | One validation | 0.285 | 993 | 331 | cp = −1/maxdepth = 14 | 0.64 | 0.60 | |
| 3 | MICE/CART | F | Zupan | One validation | 0.285 | 855 | 285 | cp = −1/maxdepth = 16 | 0.61 | 0.58 | |
| 3 | MICE/CART | F | nothing | One validation | 0.285 | 855 | 285 | cp = −1/maxdepth = 4 | 0.76 | 0.53 | |
| 3 | MICE/CART | F | nothing | Cross-validation | 5 | 855 | 285 | cp = −1/maxdepth = 6 | 0.83 | 0.51 | |
| 3 | MICE/CART | F | Zupan | Cross-validation | 5 | 855 | 285 | cp = −1/maxdepth = 26 | 0.62 | 0.48 |
*Test performance were presented as concordance index for time to graft failure data. Model setting 1: Use overall attributes with multiple imputation. Model setting 2: Use attributes except DSA, with/without imputation. Model setting 3: Use attributes except 3 month follow up data, with imputation. Test ratio fixed at 0.3. Use One False (F) is used if the follow-up period is shorter than the period to be predicted in the classification; these cases were excluded from the training process. Use One True (T) is used as a positive example if the patient has experienced a graft failure even though the follow-up period is short. Weight method by Zupan et al. is used when the follow-up period is short, with both positive and negative examples used, but with different weights[6].
Figure 2The 10-year graft failure prediction using a decision tree model. Decision tree for the training, test and validation data set, after stratified sampling, with ‘Y’ indicating a positive conclusion and ‘N’ a negative conclusion. The 10-year graft failure rate is reported as a percentage. (A) Model setting 1: Use overall attributes with multiple imputation. (B) Model setting 2: Use attributes except DSA, with/without imputation
Comparison of AUC values between baseline model (only with uncensored data) and weighting model in decision-tree modeling for classification problem.
| Period after KT for prediction (year) | 1 yr | 2 yr | 3 yr | 4 yr | 5 yr | 6 yr | 7 yr | 8 yr | 9 yr | 10 yr |
|---|---|---|---|---|---|---|---|---|---|---|
| Baseline model AUCa,b (complete case analysis) | 0.9728 | 0.8856 | 0.7916 | 0.7523 | 0.7121 | 0.7164 | 0.6707 | 0.6859 | 0.6714 | 0.6583 |
| Weighting model AUCa,b | 0.9754 | 0.8845 | 0.7958 | 0.7389 | 0.7013 | 0.720 | 0.7107 | 0.7150 | 0.7013 | 0.7066 |
aSetting 1: Use overall attributes with multiple imputation, bInstance weighting methods by Zupan et al.[6]; yr, year.
Performance of the prediction model by survival hazard ratio decision tree using different model setting.
| Setting | Imputation method | Use One | Weight method | Validation method | Validation ratio | N folds | Train set size | Test set size | Parameters | Training Performance | Test Performance |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | MICE/CART | F | nothing | Cross-validation | 5 | 2796 | 932 | cp = 0.01/maxdepth = 6 | 0.73 | 0.71 | |
| 1 | MICE/CART | T | nothing | Cross-validation | 5 | 2796 | 932 | cp = 0.01/maxdepth = 6 | 0.71 | 0.68 | |
| 1 | MICE/CART | T | nothing | One validation | 0.285 | 2796 | 932 | cp = −1/maxdepth = 2 | 0.72 | 0.67 | |
| 1 | MICE/CART | F | nothing | One validation | 0.285 | 2796 | 932 | cp = 0.004/maxdepth = 14 | 0.92 | 0.59 | |
| 2 | nothing | F | nothing | One validation | 0.285 | 930 | 310 | cp = 0.028/maxdepth = 4 | 0.78 | 0.80 | |
| 2 | MICE/CART | F | nothing | Cross-validation | 5 | 2796 | 932 | cp = 0.008/maxdepth = 4 | 0.74 | 0.69 | |
| 2 | MICE/CART | F | nothing | One validation | 0.285 | 2796 | 932 | cp = 0.01/maxdepth = 6 | 0.77 | 0.68 | |
| 2 | MICE/CART | T | nothing | Cross-validation | 5 | 2796 | 932 | cp = 0.008/maxdepth = 6 | 0.71 | 0.68 | |
| 2 | MICE/CART | T | nothing | One validation | 0.285 | 2796 | 932 | cp = −1/maxdepth = 2 | 0.72 | 0.67 | |
| 2 | nothing | F | nothing | Cross-validation | 5 | 930 | 310 | cp = −1/maxdepth = 12 | 0.94 | 0.62 | |
| 2 | nothing | T | nothing | Cross-validation | 5 | 930 | 310 | cp = 0.002/maxdepth = 12 | 0.94 | 0.61 | |
| 2 | nothing | T | nothing | One validation | 0.285 | 930 | 310 | cp = 0.02/maxdepth = 8 | 0.91 | 0.60 | |
| 3 | MICE/CART | F | nothing | Cross-validation | 5 | 2796 | 932 | cp = 0.012/maxdepth = 4 | 0.72 | 0.68 | |
| 3 | MICE/CART | T | nothing | Cross-validation | 5 | 2796 | 932 | cp = 0.014/maxdepth = 4 | 0.70 | 0.65 | |
| 3 | MICE/CART | F | nothing | One validation | 0.285 | 2796 | 932 | cp = 0.01/maxdepth = 6 | 0.74 | 0.61 | |
| 3 | MICE/CART | T | nothing | One validation | 0.285 | 2796 | 932 | cp = 0.014/maxdepth = 4 | 0.72 | 0.58 |
*Test performance were presented as concordance index for time to graft failure data. Model setting 1: Use overall attributes with multiple imputation. Model setting 2: Use attributes except DSA, with/without imputation. Model setting 3: Use attributes except 3 month follow up data, with imputation. Test ratio fix 0.3. Use One False (F) is used if the follow-up period is shorter than the period to be predicted in the classification; these cases were excluded from the training process. Use One True (T) is used as a positive example if the patient has experienced a graft failure even though the follow-up period is short. Weight method by Zupan et al. is used when the follow-up period is short, with both positive and negative examples used, but with different weights[6].
Figure 3The graft survival prediction tree using survival hazard ratio modelling. Decision tree using the training, test and validation data set, after stratified sampling using survival statics, with ‘Y’ indicating a positive conclusion and ‘N’ a negative conclusion. The graft failure risk is presented as a survival hazard ratio (HR). (A) Model setting 1: Use overall attributes with multiple imputation. (B) Model setting 2: Use attributes except DSA, with/without imputation
Performance of the prediction model by survival cox regression analysis using different model setting.
| Setting | Imputation method | Use One | Weight method | Validation method | Validation ratio | Train set size | Test set size | Parameters | Training Performance | Test Performance |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | MICE/CART | F | nothing | One validation | 0.285 | 2796 | 932 | Nothing | 0.78 | 0.62 |
| 1 | MICE/CART | F | nothing | One validation | 0.285 | 2796 | 932 | Nothing | 0.78 | 0.62 |
| 1 | MICE/CART | T | nothing | One validation | 0.285 | 2796 | 932 | Nothing | 0.77 | 0.60 |
| 1 | MICE/CART | T | nothing | One validation | 0.285 | 2796 | 932 | Nothing | 0.77 | 0.60 |
| 2 | Nothing | F | nothing | One validation | 0.285 | 930 | 310 | Nothing | 0.82 | 0.65 |
| 2 | Nothing | F | nothing | One validation | 0.285 | 930 | 310 | Nothing | 0.82 | 0.65 |
| 2 | Nothing | T | nothing | One validation | 0.285 | 930 | 310 | Nothing | 0.81 | 0.63 |
| 2 | Nothing | T | nothing | One validation | 0.285 | 930 | 310 | Nothing | 0.81 | 0.63 |
| 2 | MICE/CART | F | nothing | One validation | 0.285 | 2796 | 932 | Nothing | 0.78 | 0.62 |
| 2 | MICE/CART | F | nothing | One validation | 0.285 | 2796 | 932 | Nothing | 0.78 | 0.62 |
| 2 | MICE/CART | T | nothing | One validation | 0.285 | 2796 | 932 | Nothing | 0.77 | 0.60 |
| 2 | MICE/CART | T | nothing | One validation | 0.285 | 2796 | 932 | Nothing | 0.77 | 0.60 |
| 3 | MICE/CART | F | nothing | One validation | 0.285 | 2796 | 932 | Nothing | 0.79 | 0.62 |
| 3 | MICE/CART | F | nothing | One validation | 0.285 | 2796 | 932 | Nothing | 0.79 | 0.62 |
| 3 | MICE/CART | T | nothing | One validation | 0.285 | 2796 | 932 | Nothing | 0.78 | 0.61 |
| 3 | MICE/CART | T | nothing | One validation | 0.285 | 2796 | 932 | Nothing | 0.78 | 0.61 |
*Test performance were presented as concordance index for time to graft failure data. Model setting 1: Use overall attributes with multiple imputation. Model setting 2: Use attributes except DSA, with/without imputation. Model setting 3: Use attributes except 3 month follow up data, with imputation. Test ratio fix 0.3. Use One False (F) is used if the follow-up period is shorter than the period to be predicted in the classification; these cases were excluded from the training process. Use One True (T) is used as a positive example if the patient has experienced a graft failure even though the follow-up period is short. Weight method by Zupan et al. is used when the follow-up period is short, with both positive and negative examples used, but with different weights[6].
Figure 4Limitation of conventional prediction model using censored data. The blue line indicates the time point ‘t’, from the starting point of observation, for each case.