| Literature DB >> 35268350 |
Moongi Simon Hong1,2, Yu-Ho Lee3, Jin-Min Kong4, Oh-Jung Kwon5, Cheol-Woong Jung6, Jaeseok Yang7, Myoung-Soo Kim8, Hyun-Wook Han1,2, Sang-Min Nam1,2,9.
Abstract
We developed a machine-learning-based model that could predict a decrease in one-year graft function after kidney transplantation, and investigated the risk factors of the decreased function. A total of 4317 cases were included from the Korean Organ Transplant Registry (2014-2019). An XGBoost model was trained to predict the recipient's one-year estimated glomerular filtration rate (eGFR) below 45 mL/min/1.73 m2 using 112 pre- and peri-transplantation variables. The network of model factors was drawn using inter-factor partial correlations and the statistical significance of each factor. The model with seven features achieved an area under the curve of 0.82, sensitivity of 0.73, and specificity of 0.79. The model prediction was associated with five-year graft and rejection-free survival. Post-transplantation hospitalization >25 days and eGFR ≥ 88.0 were the prominent risk and preventive factors, respectively. Donor age and post-transplantation eGFR < 59.8 were connected to multiple risk factors on the network. Therefore, careful donor-recipient matching in older donors, and avoiding pre-transplantation risk factors, would reduce the risk of graft dysfunction. The model might improve long-term graft outcomes by supporting early detection of graft dysfunction, and proactive risk factor control.Entities:
Keywords: graft survival; kidney transplantation; machine learning; risk factors
Year: 2022 PMID: 35268350 PMCID: PMC8911006 DOI: 10.3390/jcm11051259
Source DB: PubMed Journal: J Clin Med ISSN: 2077-0383 Impact factor: 4.241
Figure 1Flow diagram showing data processing and study methods. Abbreviations: KOTRY, Korean Organ Transplant Registry; eGFR, estimated glomerular filtration rate (mL/min/1.73 m2); XGBoost, extreme gradient boosting; TPOT, tree-based pipeline optimization tool; AUC, area under the curve; SHAP, shapley additive explanation.
Study variables.
| Recipient Variables |
|---|
|
|
|
|
| age difference, sex match, height difference, weight difference, BSA ratio, viral serostatus g |
a Aspirin, statin, or vitamin D analog. b Corrected calcium = total calcium + 0.8 × (4 − albumin), where calcium in mg/dL, and albumin in g/dL. c Discretized to low, mid, high categories. d Basiliximab, anti-thymocyte globulin, or both. e Steroid, tacrolimus, cyclosporine, mycophenolate mofetil. f Deceased or living. g For hepatitis B virus, positive if donor hepatitis B surface antigen (HBsAg) is positive and recipient hepatitis B surface antibody (HbsAb) is negative. For hepatitis C virus, Epstein–Barr virus, and cytomegalovirus, positive if donor IgG serostatus is positive and recipient IgG serostatus is negative. Abbreviations: BMI, body mass index; WBC, white blood cell; HLA, human leukocyte antigen; BUN, blood urea nitrogen; eGFR, estimated glomerular filtration rate; hsCRP, high-sensitivity c-reactive protein; PTH, parathyroid hormone; LDL, low-density lipoprotein; HDL, high-density lipoprotein; BSA, body surface area.
Figure 2Performance of the XGBoost model to predict a decline in eGFR at one year after kidney transplantation. (A) Receiver operating characteristic curve; (B) summary plot of Shapley additive explanations (SHAP). Model features are sorted along the y axis of the summary plot by the sum of SHAP value (probability) magnitudes over all cases in the test data, and the distribution of each feature’s impacts is plotted using SHAP values of individual cases. Binary features have 1 if present, or 0 if absent. Abbreviations: AUC, area under the curve; eGFR, estimated glomerular filtration rate (mL/min/1.73 m2); sCr, serum creatinine (mg/dL); D, donor; R, recipient.
Figure 3Kidney transplant outcomes according to predicted one-year eGFR levels of the recipient. Predicted eGFR decline was associated with a significant decrease in graft survival (A) and increase in cumulative incidence of biopsy-proven rejection (B) (adjusted hazard ratios of 1.9 and 1.6, p = 0.02 and p = 0.007, respectively, Cox proportional hazard regression with age and sex). Abbreviations: eGFR, estimated glomerular filtration rate (mL/min/1.73 m2).
Factor analysis for one-year renal allograft dysfunction (multiple logistic regression).
| eGFR ≥ 45 | eGFR < 45 | OR | ||
|---|---|---|---|---|
|
| n (%) | n (%) | ||
| eGFR (mL/min/1.73 m2) a | ||||
| <59.8 | 871 (23%) | 315 (67%) | 2.1 (1.5–2.9) | <0.001 |
| 59.8–88.0 | 1725 (45%) | 133 (28%) | NA | NA |
| ≥88.0 | 1248 (32%) | 25 (5.3%) | 0.4 (0.2–0.6) | <0.001 |
| Recipient age (year) b | ||||
| <29 | 214 (5.6%) | 29 (6.1%) | 2.3 (1.5–3.7) | <0.001 |
| 29–57 | 2755 (72%) | 263 (56%) | NA | NA |
| >57 | 875 (23%) | 181 (38%) | 1.5 (1.2–1.9) | <0.001 |
| Post-transplantation stay (day) a | ||||
| ≤25 | 3501 (91%) | 354 (75%) | NA | NA |
| >25 | 343 (8.9%) | 119 (25%) | 2.4 (1.8–3.2) | <0.001 |
| Serum creatinine (mg/dL) a | ||||
| ≤1.24 | 2787 (73%) | 151 (32%) | NA | NA |
| >1.24 | 1057 (27%) | 322 (68%) | 2.0 (1.4–2.8) | <0.001 |
| Female recipient c | 1602 (42%) | 162 (34%) | 1.6 (1.2–2.2) | 0.002 |
| Deceased donor d | 1331 (35%) | 253 (53%) | 1.2 (1.0–1.6) | 0.08 |
| Male donor c | 2064 (54%) | 238 (50%) | 1.1 (0.8–1.5) | 0.49 |
| Delayed graft function d | 116 (3.0%) | 43 (9.1%) | 1.1 (0.7–1.7) | 0.68 |
|
| mean (SE) | mean (SE) | ||
| Donor Age (year) | 46 (0.2) | 56 (0.5) | 1.9 (1.7–2.2) e | <0.001 |
| Height (R)−Height (D) (cm) | −1 (0.2) | 3 (0.6) | 1.3 (1.1–1.5) e | 0.007 |
| HLA mismatch numbers d (range 0–6) | 3.2 (0.03) | 3.6 (0.07) | 1.1 (1.0–1.3) e | 0.04 |
a Measured at discharge in recipients. b Not significant (p = 0.39) when a continuous factor. c Not model factors, but included to control sex effect. d Not model factors, but reported risk factors [10]. e Standardized odds ratio. Abbreviations: eGFR, estimated glomerular filtration rate; OR, odds ratio; CI, confidence intervals; SE, standard error; R, recipient; D, donor; HLA, human leukocyte antigen; NA, not available for a reference category.
Figure 4Partial correlation network of factors associated with one-year renal allograft dysfunction using graphical lasso regularization. Node size is proportional to the effective size of the odds ratio. The nodes are positively (risk) or negatively (protective) related to graft function decline. Green and red edges represent positive and negative correlations between the nodes, respectively. The edge with the highest absolute weight has full-color saturation and the widest width. Abbreviations: R, recipient; D, donor; eGFR, estimated glomerular filtration rate (mL/min/1.73 m2); sCr, serum creatinine (mg/dL); HLA, human leukocyte antigen.