Literature DB >> 27057518

Identifying Important Risk Factors for Survival in Kidney Graft Failure Patients Using Random Survival Forests.

Omid Hamidi1, Jalal Poorolajal2, Maryam Farhadian3, Leili Tapak4.   

Abstract

BACKGROUND: Kidney transplantation is the best alternative treatment for end-stage renal disease. Several studies have been devoted to investigate predisposing factors of graft rejection. However, there is inconsistency between the results. The objective of the present study was to utilize an intuitive and robust approach for variable selection, random survival forests (RSF), and to identify important risk factors in kidney transplantation patients.
METHODS: The data set included 378 patients with kidney transplantation obtained through a historical cohort study in Hamadan, western Iran, from 1994 to 2011. The event of interest was chronic nonreversible graft rejection and the duration between kidney transplantation and rejection was considered as the survival time. RSF method was used to identify important risk factors for survival of the patients among the potential predictors of graft rejection.
RESULTS: The mean survival time was 7.35±4.62 yr. Thirty-seven episodes of rejection were occurred. The most important predictors of survival were cold ischemic time, recipient's age, creatinine level at discharge, donors' age and duration of hospitalization. RSF method predicted survival better than the conventional Cox-proportional hazards model (out-of-bag C-index of 0.965 for RSF vs. 0.766 for Cox model and integrated Brier score of 0.081 for RSF vs. 0.088 for Cox model).
CONCLUSION: A RSF model in the kidney transplantation patients outperformed traditional Cox-proportional hazard model. RSF is a promising method that may serve as a more intuitive approach to identify important risk factors for graft rejection.

Entities:  

Keywords:  Cox proportional hazards; Kidney transplantation; Random survival forest

Year:  2016        PMID: 27057518      PMCID: PMC4822390     

Source DB:  PubMed          Journal:  Iran J Public Health        ISSN: 2251-6085            Impact factor:   1.429


Introduction

The frequency of chronic kidney disease (CKD) has been progressively increasing over the last two decades (1) and has become a worldwide public health problem. The prevalence of CKD is estimated to be 8–16% worldwide (2). Kidney transplantation is the best alternative treatment for end-stage renal disease and health-related quality of life and survival of the patients are improved compared with dialysis (3, 4). Worldwide, more than 1.4 million patients with CKD receive renal replacement therapy with incidence growing by approximately 8% annually (5). Unfortunately, despite significant improvement in graft function, kidney transplants can still fail due to acute rejection and chronic allograft nephropathy (1, 3) that can lead to three fold greater risk of death compared to patients with functioning grafts (1, 6). Due to the increasing demand for renal transplants, identifying potential risk factors implicated in graft failure is essential to improve patient survival and quality of life (1). To achieve this purpose, traditional statistical techniques such as Cox proportional hazards (PH) model has been widely used to analyze survival data and to determine potential risk factors. However, it relies on restrictive assumptions such as proportionality of hazards and linearity of effects on log hazard function (linearity assumption) (7). Besides, the performance of traditional methods like Cox regression is not reliable in the presence of high rate of censoring (8). Potential prognostic factors affecting renal graft have also been investigated by several studies with Cox PH model (3, 9, 10). However, there were inconsistencies among the results. Ideally, it would be important to improve the predictive performance of the models identifying potential prognostic factors affecting renal graft via learning theory and data mining techniques for survival time that require no assumptions. Machine learning methods such as tree-based approaches have recently been developed to handle right censored survival data and their effective performance has been confirmed in different areas (11). Random survival forests (RSF), is a non-parametric tree-based ensemble learning method that can automatically handle the difficulties of Cox model and can also be used to select and rank variables (7, 11). Due to the limitations of the Cox model, using RSF to identify effective risk factors for survival has been suggested (7). Although, several studies have confirmed the promising performance of RSF compared to traditional Cox model (8, 12–14) in different disease, there is no attempt to use RSF in renal transplantation and compare its performance with Cox model. This study aimed to identify prognostic factors affecting renal graft by RSF and compare its performance with Cox proportional hazard model.

Material and Methods

The present study utilized a data set corresponds to a retrospective cohort study which was conducted in Hamadan, western Iran, from 1994 to 2011. The number of 475 patients underwent kidney transplantation in Ekbatan or Besaat hospitals and was eligible to enroll the study. To identify important risk factors, the patients who did not have any information about risk factors were eliminated from the analysis. In this regard, only 378 out of 475 patients were considered in the present study because the information about potential risk factors was not observed for the rest of the patients. The risk factors were age, sex of donors and recipients, type of donor (living-donor or deceased donor), familial relationship, hemoglobin level, blood groups of donors and recipients, duration of dialysis before transplantation (year), cold ischemic time (min), creatinine level at discharge, body mass index (BMI) of donor (kg/m2), left or right kidney, type of immunosuppressive drugs used (Imuran, prednisolone, cyclosporine vs. cellcept, prednisolone, cyclosporine), duration of hospitalization (day), volume of urine excretion during the first 24 h after transplantation (ml/24 h), and occurrence of acute or hyperacute rejection. In this regard, acute rejection is related to formation of cellular immunity, which usually occurs to some degree in all grafts, except between identical twins and hyperacute rejection is initiated by preexisting humoral immunity and usually manifests within minutes after transplantation (3). The event of interest was chronic nonreversible graft rejection. The survival time was the time between kidney transplantation and episode of rejection (3). RSF is an extension of random forest RF to right-censored survival data with the same principles underlying RF, which enjoys all its important properties (7, 15). Random forests consist of several trees based on a random sample with replacement. Each tree consists of nodes (variables) in which classification or split was implemented. In survival settings, tree node splits according to maximizing survival differences between daughter nodes (new nodes). In this regard, in each tree, survival time and status of the patients were considered as response variables. Then, the ensemble estimate for the cumulative hazard function (CHF) is drawn by calculating the CHF for each sample in a data set, and summing this ensemble over the observed survival times yields the predicted outcome referred to as ensemble mortality (a measure of mortality for a patient that has been shown to be an effective predictor of survival) (15). Each run of RSF was performed for the kidney transplant data set based on 1000 trees under log-rank splitting rule. The importance of each model covariate was also determined by a rapidly computable internal measure of variable importance (VIMP) that can be used to rank variables. The larger VIMP, the more predictive the variable (the threshold value is 0.002) (11). Moreover, multiple imputation strategy based on RF was utilized for treating missing data (7). Five imputed data set were provided and then combining rules (16, 17) were applied to calculate evaluation criteria and VIMP. In order to compare the performance of RSF and traditional Cox PH, two criteria were used including integrated Brier score (18) and C-index (19) using out-of-bag (OOB) data. A perfect prediction rule would have a concordance of 1 (20). Analyses were performed by using “random-ForestSRC”, a freely available package from the Comprehensive R Archive Network (CRAN).

Results

The mean survival time for 378 patients was 7.35±4.62 yr, the median survival time was 6.81 yr. Out of 378 transplantations, 37 (10%) episodes of rejection occurred, and the remaining 341 patients (90%) were censored. Table 1 shows the VIMP of the variables obtained from RSF.
Table 1:

Mean and standard deviation of variable importance (VIMP) for kidney transplant data over five imputed data set. Each run based on 1000 trees under log-rank splitting

VariablesVIMPSD
Cold ischemic time (min)0.01530.0003*
Recipient’s age (yr)0.01390.0023*
Creatinine level at discharge (mg/dl)0.01220.0018*
Donors’ age (yr)0.01130.0002*
Duration of hospitalization (day)0.00360.0003**
Immunosuppressive drug usage0.00160.0013
Imuran, Prednisolone, Cyclosporinev.s.
CellCept, Prednisolone, Cyclosporine
Hemoglobin level (mg/dl)0.00150.0004
Type of donors0.00080.0001
Living-donor
Deceased-donor
Post-transplantation condition−0.00030.0003
No complication,
Acute rejection,
Hyperacute rejection
Recipient sex−0.00030.0003
Familial relationship−0.00040.0002
Donor sex−0.00050.0000
Urine volume (ml/24 h)−0.00050.0006
Donor blood group−0.00060.0000
Side of the kidney−0.00070.0001
Recipient blood group−0.00210.0001
Duration of dialysis (yr)−0.00470.0014
Mean and standard deviation of variable importance (VIMP) for kidney transplant data over five imputed data set. Each run based on 1000 trees under log-rank splitting The cold ischemic time, recipient’s age, creatinine level at discharge and donors’ age are highly predictive, and duration of hospitalization is moderately predictive. However, type of donors, hemoglobin level, donor’s sex, immunosuppressive drug usage, post-transplantation condition, recipient sex, familial relationship, donor and recipient blood group, side of the kidney, duration of dialysis and urine volume are unlikely to be predictive. According to Cox PH model, three variables of recipient age, type of donor (living vs. deceased), and episode of post-transplantation acute and hyperacute rejection were identified as most important variables. Two criteria of evaluation were also computed for Cox PH model. RSF had lower prediction error based on integrated Brier score (0.081) compared to Cox model (0.088). In addition, the C-index of RSF was considerably higher (0.965) than that of the Cox model (0.766). The effect on survival of the most five influential covariates found in the RSF analysis was displayed with 5-yr partial survival plots in Fig. 1. The estimated partial survival for a covariate indicates estimated survival for different levels of the covariate when the effects of all other covariates are justified. It can be seen from figure that, as cold ischemic time increases up to about 35 minute, the five-year predicted survival increases as well and it tends to decline after 35 minute.
Fig.1:

Partial 5-year predicted survival for five most influential covariates on survival in kidney transplant data. Dashed red lines are ± 2 standard error bars

Partial 5-year predicted survival for five most influential covariates on survival in kidney transplant data. Dashed red lines are ± 2 standard error bars

Discussion

RSF identified cold ischemic time, recipient’s age, creatinine level at discharge, donor’s age and duration of hospitalization as the top five most important predictors of survival for graft failure patients in the present study. Several authors estimated the survival rate of kidney transplantation and detected the risk factors of graft rejection (21–24). Our results showed that the cold ischemic time variable was the most important factor in the risk of graft rejection, which is consistent with the results of some other studies (25, 26). Cold ischemic time is one of the risk factors that is involved in immediate anemia in renal transplant recipients (27). The second top risk factor was recipient’s age. Based on the results, as recipient’s age increases predicted five-year survival time increases as well. This may be a result of stronger and more efficient immune system in younger recipients (3). Previous studies confirm this finding (28–31). The third important variable was creatinine level at discharge. Previous studies have reported creatinine level at discharge as a risk factor in rejection of kidney transplantation (9, 10, 32). Donor age was the fourth top risk factor, which had a negative correlation with graft rejection, i.e. kidney rejection is more likely among those recipients who receive kidney from older donors. This result is also similar to the result of other studies (3, 25, 26, 33, 34). The fifth top most important variable was duration of hospitalization, confirmed eralier (35, 36). This study focused on the performance of RSF method in identifying potential risk factors for survival of kidney graft failure patients compared to traditional Cox model. The results demonstrated that the RSF model performed significantly better than the conventional Cox-proportional hazard model. Several studies also confirmed the promising performance of RSF compared to Cox PH model in real data sets (8, 12, 14). RSF had better performance compared to Cox PH model based on prediction error criterion (13). Therefore, it can be applied successfully for identifying risk factors of the kidney transplantation survival. RSF deals with the traditional Cox model issues such as proportionality assumption coherently and automatically (37) and analysts do not require knowing in advance the relationship (i.e. linear, nonlinear) of a variable over time (8). Besides, the performance of Cox regression is not reliable in the presence of high rate of censoring which was the case in the present study (about 90% censor rate). While, RSF is a robust extension of random forest a highly used machine learning method that has gained much interest in a variety of fields of application and generated a vast amount of computational literature in the last decade (8, 38). However, the performance of different methods is data dependent and conducting additional studies is needed to compare RSF to Cox regression to document further its performance in clinical settings (8). There were some limitations in the present study. Reliable sources of data obtained from prospective design were required for estimation of survival rate and associated prognostic factors, but the present study used a data set of a retrospective cohort study and medical records. Quality and accuracy of estimates depends primarily on the quality of recorded data, but verifying the accuracy of data was not possible in the present study. Besides, quality of the services and technology may vary over time, but we have no document to justify this issue. These issues might bias results. In addition, long-term follow-up duration results in losing some patients, which in turn may lead to biased results (3).

Conclusion

RSF identified a different subset of risk factors in chronic nonreversible renal graft rejection than the Cox PH model. Moreover, RSF model outperformed traditional Cox PH model. The RSF is a promising method for intuitive variable selection and is a way to eliminate the doubt in the “black box” approach to statistical analysis that should be further investigated in survival analysis of other diseases (8).

Ethical considerations

Ethical issues (Including plagiarism, informed consent, misconduct, data fabrication and/or falsification, double publication and/or submission, redundancy, etc.) have been completely observed by the authors.
  32 in total

1.  Overall C as a measure of discrimination in survival analysis: model specific population value and confidence interval estimation.

Authors:  Michael J Pencina; Ralph B D'Agostino
Journal:  Stat Med       Date:  2004-07-15       Impact factor: 2.373

2.  Consistent estimation of the expected Brier score in general survival models with right-censored event times.

Authors:  Thomas A Gerds; Martin Schumacher
Journal:  Biom J       Date:  2006-12       Impact factor: 2.207

3.  Identifying important risk factors for survival in patient with systolic heart failure using random survival forests.

Authors:  Eileen Hsich; Eiran Z Gorodeski; Eugene H Blackstone; Hemant Ishwaran; Michael S Lauer
Journal:  Circ Cardiovasc Qual Outcomes       Date:  2010-11-23

4.  Predicting potential survival benefit of renal transplantation in patients with chronic kidney disease.

Authors:  Carl van Walraven; Peter C Austin; Greg Knoll
Journal:  CMAJ       Date:  2010-03-29       Impact factor: 8.262

5.  The PlA2 polymorphism of the platelet glycoprotein IIIA gene as a risk factor for acute renal allograft rejection.

Authors:  E Salido; B Martín; Y Barrios; J D Linares; D Hernández; M Cobos; M D Checa; L Hortal; A Fernández; J J García; A Torres
Journal:  J Am Soc Nephrol       Date:  1999-12       Impact factor: 10.121

6.  The impact of gender and age matching for long-term graft survival in living donor renal transplantation.

Authors:  O J Kwon; J Y Kwak; C M Kang
Journal:  Transplant Proc       Date:  2005-03       Impact factor: 1.066

7.  Novel head and neck cancer survival analysis approach: random survival forests versus Cox proportional hazards regression.

Authors:  Frank R Datema; Ana Moya; Peter Krause; Thomas Bäck; Lars Willmes; Ton Langeveld; Robert J Baatenburg de Jong; Henk M Blom
Journal:  Head Neck       Date:  2011-02-14       Impact factor: 3.147

8.  How can we achieve global equity in provision of renal replacement therapy?

Authors:  Sarah L White; Steven J Chadban; Stephen Jan; Jeremy R Chapman; Alan Cass
Journal:  Bull World Health Organ       Date:  2008-03       Impact factor: 9.408

9.  Racial/ethnic disparity in kidney transplantation outcomes: influence of donor and recipient characteristics.

Authors:  Eyob Feyssa; Charlotte Jones-Burton; Gary Ellison; Benjamin Philosophe; Charles Howell
Journal:  J Natl Med Assoc       Date:  2009-02       Impact factor: 1.798

10.  Iranian experience with the non-related renal transplantation.

Authors:  Behzad Einollahi
Journal:  Saudi J Kidney Dis Transpl       Date:  2004 Oct-Dec
View more
  17 in total

1.  Post Genome-Wide Gene-Environment Interaction Study Using Random Survival Forest: Insulin Resistance, Lifestyle Factors, and Colorectal Cancer Risk.

Authors:  Su Yon Jung; Jeanette C Papp; Eric M Sobel; Zuo-Feng Zhang
Journal:  Cancer Prev Res (Phila)       Date:  2019-09-25

2.  The effects of genetic variants related to insulin metabolism pathways and the interactions with lifestyles on colorectal cancer risk.

Authors:  Su Yon Jung; Zuo-Feng Zhang
Journal:  Menopause       Date:  2019-07       Impact factor: 2.953

3.  Pro-inflammatory cytokine polymorphisms in ONECUT2 and HNF4A and primary colorectal carcinoma: a post genome-wide gene-lifestyle interaction study.

Authors:  Su Yon Jung; Jeanette C Papp; Eric M Sobel; Matteo Pellegrini; Herbert Yu; Zuo-Feng Zhang
Journal:  Am J Cancer Res       Date:  2020-09-01       Impact factor: 6.166

4.  Genetic Variants in Metabolic Signaling Pathways and Their Interaction with Lifestyle Factors on Breast Cancer Risk: A Random Survival Forest Analysis.

Authors:  Su Yon Jung; Jeanette C Papp; Eric M Sobel; Zuo-Feng Zhang
Journal:  Cancer Prev Res (Phila)       Date:  2017-10-26

5.  Risk Prediction of Dyslipidemia for Chinese Han Adults Using Random Forest Survival Model.

Authors:  Xiaoshuai Zhang; Fang Tang; Jiadong Ji; Wenting Han; Peng Lu
Journal:  Clin Epidemiol       Date:  2019-12-10       Impact factor: 4.790

6.  Potential Prognostic Immune Biomarkers of Overall Survival in Ovarian Cancer Through Comprehensive Bioinformatics Analysis: A Novel Artificial Intelligence Survival Prediction System.

Authors:  Tingshan He; Liwen Huang; Jing Li; Peng Wang; Zhiqiao Zhang
Journal:  Front Med (Lausanne)       Date:  2021-05-24

7.  Development and validation of a multivariate predictive model for rheumatoid arthritis mortality using a machine learning approach.

Authors:  José M Lezcano-Valverde; Fernando Salazar; Leticia León; Esther Toledano; Juan A Jover; Benjamín Fernandez-Gutierrez; Eduardo Soudah; Isidoro González-Álvaro; Lydia Abasolo; Luis Rodriguez-Rodriguez
Journal:  Sci Rep       Date:  2017-08-31       Impact factor: 4.379

8.  Prediction of Kidney Graft Rejection Using Artificial Neural Network.

Authors:  Leili Tapak; Omid Hamidi; Payam Amini; Jalal Poorolajal
Journal:  Healthc Inform Res       Date:  2017-10-31

9.  Construction data mining methods in the prediction of death in hemodialysis patients using support vector machine, neural network, logistic regression and decision tree.

Authors:  Salman Khazaei; Somayeh Najafi-GhOBADI; Vajihe Ramezani-Doroh
Journal:  J Prev Med Hyg       Date:  2021-04-29

10.  Application of random survival forests in understanding the determinants of under-five child mortality in Uganda in the presence of covariates that satisfy the proportional and non-proportional hazards assumption.

Authors:  Justine B Nasejje; Henry Mwambi
Journal:  BMC Res Notes       Date:  2017-09-07
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.