| Literature DB >> 35915457 |
Nuo Lei1, Xianlong Zhang2, Mengting Wei1, Beini Lao1, Xueyi Xu1, Min Zhang1, Huifen Chen1, Yanmin Xu1, Bingqing Xia1, Dingjun Zhang1, Chendi Dong1, Lizhe Fu3, Fang Tang3, Yifan Wu4.
Abstract
BACKGROUND: Kidney disease progression rates vary among patients. Rapid and accurate prediction of kidney disease outcomes is crucial for disease management. In recent years, various prediction models using Machine Learning (ML) algorithms have been established in nephrology. However, their accuracy have been inconsistent. Therefore, we conducted a systematic review and meta-analysis to investigate the diagnostic accuracy of ML algorithms for kidney disease progression.Entities:
Keywords: Artificial intelligence; CKD progression; Chronic kidney disease; Immunoglobulin A nephropathy; Machine learning algorithm; Prediction models
Mesh:
Year: 2022 PMID: 35915457 PMCID: PMC9341041 DOI: 10.1186/s12911-022-01951-1
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 3.298
Fig. 1PRISMA flow diagram of study selection process
Clinical characteristics of the included studies
| Studies | Country | N | Ages (years) | Men (%) | KD/Outcome | Follow up times(Y) | Reporting dataset | ML algorithm | Optimal model | AUC | Accuracy | C statistic | Analysis | TP | FP | FN | TN |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Goto [ | Japan | 790 | 28.14 | 41.8 | IgAN/doubling of SCR | 10.0 | Training | DTs | DTs | 0.830 | N/A | N/A | N/A | N/A | N/A | N/A | N/A |
| Diciolla [ | Italy Norway Japan | 1040 | 34.56 | 69.13 | IgAN/ESRD | N/A | Training | ANN NFS SVM DTs | ANN | N/A | 90.1 | N/A | (1) ANN (2) NFS (3) SVM (4) DTs | 177 175 180 178 | 35 63 97 67 | 64 66 61 63 | 764 736 702 732 |
| Pesce [ | Italy | 546 | 32.5 | 68.13 | IgAN/ESRD | 5.0* | Testing | ANN | ANN | 89.9 | 91.8 | N/A | (1) ANN | 14 | 2 | 7 | 87 |
| Norway | 441 | 38.2 | 72.56 | IgAN/ESRD | 6.0* | Testing | ANN | ANN | 93.3 | 92.1 | N/A | (2) ANN | 23 | 3 | 4 | 59 | |
| Japan | 53 | 32.7 | 50.94 | IgAN/ESRD | 3.0* | Testing | ANN | ANN | 100 | 90.9 | N/A | (3) ANN | 1 | 1 | 0 | 9 | |
| Cheng [ | Taiwan, China | 463 | 65 | N/A | CKD/ESRD | 1.5 | Training | C4.5 CART SVM Adaboost | CART + Adaboost | 0.715 | 0.662 | N/A | (1) C4.5 (2) CART (3) SVM | 73 76 80 | 57 48 45 | 59 56 52 | 75 84 87 |
| Feng [ | China | 119 | N/A | N/A | CKD/CKD STAGE | N/A | Training | DTs | DTs | N/A | 0.9827 | N/A | N/A | N/A | N/A | N/A | N/A |
| Liu [ | China | 262 | 32.71 | 47.3 | IgAN/ESRD | 3.0 | Training | RF | RF | 0.9729 | N/A | N/A | N/A | N/A | N/A | N/A | N/A |
| Helena [ | U.S | 4640 | 60.19 | N/A | CKD/ESRD | 3.7 | Testing | Lasso | Lasso | N/A | N/A | 0.863 | N/A | N/A | N/A | N/A | N/A |
| Xiao [ | China | 551 | 58.15 | N/A | CKD/PRO | 3.0 | Testing | LR ElasticN Lasso Ridge SVM RF KNN ANN XGBoost | LR | 0.873 | 0.82 | N/A | (1) LR (2) ElasticN (3) Lasso (4) Ridge (5) SVM (6) RF (7) KNN (8) ANN’ (9)XGBoost | 54 52 52 51 53 52 49 52 55 | 8 7 7 8 8 8 12 8 8 | 12 14 14 14 13 14 17 14 11 | 37 38 38 37 37 37 33 37 37 |
| Chen [ | China | 2047 | 34.80 | 49.25 | IgAN/ ESRD | 5.01 | Training + Testing | LR SVM DTs ANN RF XGBoost | XGBoost | N/A | N/A | 0.89 | N/A | N/A | N/A | N/A | N/A |
| Masaki [ | Japan | 30,810 | N/A | N/A | DKD/stage of DKD | 0..5 | Training | LR CNN | LR | 0.722 | N/A | N/A | N/A | N/A | N/A | N/A | N/A |
| Dovgan [ | Taiwan, China | 8492 | N/A | N/A | CKD/ RRT | 1.0 | Training | LR XGBoost SGD SVM BDTs ANN RF Bayes DTs NN | LR | 0.778 | N/A | N/A | (1) LR (2)XGBoost (3) SGD | 651 60 152 | 1628 52 164 | 394 985 893 | 5819 7395 7283 |
| Nagaraj [ | Netherlands | 11,789 | 62.75 | N/A | DKD/ESRD | 2.7 | Testing | LR SVM RF FNN | FNN | 0.84 | N/A | N/A | N/A | N/A | N/A | N/A | N/A |
| Schena [ | Italy | 948 | 40.6 | 72.2 | IgAN/ESRD | 7.4 | Training | ANN | ANN | 0.89 | 0.83 | N/A | (1) ANN | 36 | 8 | 6 | 117 |
| Yuan [ | China | 1090 | 50.01 | 56.3 | CKD/ESRD | 4.0 | Training | LR RF SVM NNET | RF | 0.878 | N/A | N/A | N/A | N/A | N/A | N/A | N/A |
| Zhou ( | China | 2507 | 70.6 | N/A | CKD/ESRD | 3.0 | Training | LR | LR | N/A | N/A | 0.69 | N/A | N/A | N/A | N/A | N/A |
CKD chronic kidney disease, DKD diabetes kidney disease, IgAN Immunoglobulin A Nephropathy, ESRD end stage renal disease, RRT renal replace therapy, PRO proteinuria, TP true positive, FP false positive, FN false negative, TN true negative, AUC area under the curve, DTs Decision Trees, ANN artificial neural network, NFS neural fuzzy systems, SVM support vector machine, RF random forest, LR logistic regression, ElasticN Elastic Net, KNN K Nearest Neighbors, NN Nearest Neighbors, SGD Stochastic Gradient Descent, BDTs Bagging Decision Trees, CNN convolutional neural network, CART classification and regression, Lasso Lasso regression, Ridge Ridge regression, NNET neural network, FNN Feed-forward neural network
*Median times
Fig. 2Risk of bias and application concerns graph for the included studies.Red, yellow and green frames correspond to high, unclear and low risk of bias, respectively
Fig. 3Risk of bias and application concerns summary for the included studies. (+) indicates low risk of bias, (?) indicates unclear risk of bias, (−) indicates high risk of bias
Fig. 4Coupled forest plots for sensitivity and specificity. A All single-unit ML algorithms. B CKD subgroup. C IgAN subgroup. D Sensitivity analysis after eliminating outliers and data with small sample sizes. The gray square with a black point in the center showed study specific estimates of sensitivity and specificity. The width of solid black line showed their 95% confidence intervals. The diamond at the bottom of the figure was a combination of single-unit ML algorithm.The center of diamond represented the point estimates, and the width of diamond represented 95% confidence intervals
Fig. 5HSROC curve with 95% confidence region and prediction region. A All single-unit ML algorithms with AUC of 0.87. B CKD subgroup with AUC of 0.82. C IgAN subgroup with AUC of 0.78. D Sensitivity analysis after eliminating outliers and data with small sample sizes with AUC of 0.83. Each circle represents a single-unit ML algorithm. The curve represents the summary receiver operating characteristic curve for all single-unit ML algorithm. The red square represents the summary estimate of test performance. The zone outlines represent the 95% confidence and 95% prediction regions of the summary estimate, respectively
Fig. 6Univariate meta-regression plot of all single-unit ML algorithms. The red point represents the result of the individual combination of the subgroup into which each independent variable is divided. The width of solid black line showed their 95% confidence intervals. “*” means that the effects of independent variables on the pool sensitivity and specificity were statistically significant
Summary of meta-analysis and subgroup analysis
| Subgroup | Number of ML algorithms | Sensitivity (95% CI) | Specificity (95% CI) | AUC (95% CI) | Correlation coefficient | β | DOR |
|---|---|---|---|---|---|---|---|
| Total DTA | 23 | 0.68 (0.58–0.77) | 0.88 (0.83–0.92) | 0.87 (0.84–0.90) | − 0.53 | 0.015 | 16.34 |
| CKD | 15 | 0.64 (0.49–0.77) | 0.84 (0.74–0.91) | 0.82 (0.79–0.85) | − 0.77 | − 0.036 | 9.31 |
| IgAN | 8 | 0.74 (0.71–0.77) | 0.93 (0.91–0.95) | 0.78 (0.74–0.81) | − 1.0 | 3.781 | 39.27 |
| Classification | 16 | 0.64 (0.50–0.76) | 0.87 (0.79–0.92) | 0.84 (0.81–0.87) | − 0.66 | 0.021 | 11.75 |
| Regression | 7 | 0.80 (0.74–0.84) | 0.91 (0.86–0.95) | N/A | 1.0 | 6.044 | 41.09 |
| Training set | 11 | 0.56 (0.37–0.73) | 0.90 (0.80–0.95) | 0.83 (0.80–0.86) | − 0.57 | 0.074 | 11.40 |
| Testing set | 12 | 0.79 (0.76–0.82) | 0.86 (0.81–0.90) | 0.81 (0.77–0.84) | − 1.0 | 3.693 | 23.33 |
| Y | 11 | 0.71 (0.66–0.76) | 0.89 (0.80–0.94) | N/A | 1 | 1.086a | 19.46 |
| N | 12 | 0.65 (0.46–0.81) | 0.87 (0.78–0.93) | 0.86 (0.83–0.89) | − 0.53 | − 0.172 | 12.92 |
| Asian | 16 | 0.64 (0.49–0.77) | 0.84 (0.75–0.91) | 0.82 (0.79–0.86) | − 0.76 | − 0.042 | 9.53 |
| Not Asian | 7 | 0.74 (0.71–0.77) | 0.93 (0.91–0.95) | 0.78 (0.74–0.81) | − 1 | 3.806 | 10.95 |
DTA diagnostic test accuracy, KD kidney disease, CKD chronic kidney disease, IgAN Immunoglobulin A Nephropathy, ML machine learning, Y Yes, N No
aP < 0.01
Fig. 7Deek’s funnel plot of all single-unit ML algorithms. Each circle represents a single-unit ML algorithm