| Literature DB >> 35808481 |
Roberto Magherini1, Elisa Mussi1, Yary Volpe1, Rocco Furferi1, Francesco Buonamici1, Michaela Servi1.
Abstract
Within the literature concerning modern machine learning techniques applied to the medical field, there is a growing interest in the application of these technologies to the nephrological area, especially regarding the study of renal pathologies, because they are very common and widespread in our society, afflicting a high percentage of the population and leading to various complications, up to death in some cases. For these reasons, the authors have considered it appropriate to collect, using one of the major bibliographic databases available, and analyze the studies carried out until February 2022 on the use of machine learning techniques in the nephrological field, grouping them according to the addressed pathologies: renal masses, acute kidney injury, chronic kidney disease, kidney stone, glomerular disease, kidney transplant, and others less widespread. Of a total of 224 studies, 59 were analyzed according to inclusion and exclusion criteria in this review, considering the method used and the type of data available. Based on the study conducted, it is possible to see a growing trend and interest in the use of machine learning applications in nephrology, becoming an additional tool for physicians, which can enable them to make more accurate and faster diagnoses, although there remains a major limitation given the difficulty in creating public databases that can be used by the scientific community to corroborate and eventually make a positive contribution in this area.Entities:
Keywords: artificial intelligence; deep learning; machine learning; renal pathology
Mesh:
Year: 2022 PMID: 35808481 PMCID: PMC9269842 DOI: 10.3390/s22134989
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Figure 1Trend of documents per year.
Figure 2Scheme of pathologies with ML techniques applied according to the type of dataset available. Kidney disease addressed in red; type of available data in green; ML technique used in blue.
Renal mass research.
| Paper | Objective | Method | Database | Results | Year |
|---|---|---|---|---|---|
| [ | Malignant renal cyst prediction | Bayesian classifier | [ | AUC 0.96 | 2009 |
| [ | Identify malignant renal masses | Statistical relational learning—RFGB: relational functional gradient boosting | [ | Accuracy 82% | 2018 |
| [ | Differentiate between malignant and benign masses | CT texture analysis with random forest | [ | Accuracy 90.5% | 2020 |
| [ | Diagnose ccRCC | WEKA with and without SMOTE | [ | AUC contour-focused 0.865–0.984 | 2019 |
| [ | Diagnose ccRCC | Pyradiomics and random forest | [ | Accuracy 84.6% | 2020 |
| [ | Diagnose ccRCC | Radiomics and CatBoost | [ | MR accuracy 73% internal 74% external | 2020 |
| [ | Diagnose ccRCC | MaZda and WEKA toolkit | [ | Accuracy 85.1% | 2018 |
| [ | Diagnose ccRCC | Proteomics-based | [ | Proteomics accuracy 98% image accuracy 83% validation, 95% testing set | 2019 |
| [ | Differentiate between kidney chromophobe renal cell carcinoma and oncocytoma | Linear SVM | [ | Accuracy 80% | 2016 |
| [ | Classify papillary renal cell carcinoma stages | Feature extraction and random forest | [ | Accuracy 88.5% | 2018 |
| [ | Kidney and tumor segmentation | 3D U-Net | [ | Mean Kidney Tumor Dice 0.9168 | 2019 |
| [ | Kidney and tumor segmentation | Cascade 3D U-Net | [ | Mean Kidney Tumor Dice 0.9064 | 2019 |
| [ | Kidney and tumor segmentation | Multi-resolution 3D V-Net | [ | Mean Kidney Tumor Dice 0.8815 | 2019 |
AKI research.
| Paper | Objective | Method | Database | Results | Year |
|---|---|---|---|---|---|
| [ | Predict AKI in | Boruta [ | [ | AUC 0.796 | 2018 |
| [ | Predict AKI in | Gradient boosted machine | [ | AUC 0.85 | 2021 |
| [ | Predict AKI | Gradient boosted machine | [ | AUC 0.76 | 2021 |
| [ | Predict AKI in burn patients | K-NN | [ | Accuracy 97% | 2019 |
| [ | Predict AKI | Lasso + logistic regression | [ | AUC 0.79 | 2019 |
| [ | Predict AKI | RF + XGboost | [ | AUC 0.843 | 2020 |
| [ | Predict AKI | Streams | [ | Accuracy 56% in 48 h | 2019 |
| [ | Prediction of AKI | Feature selection + random forest | [ | AUC 0.881 in 30 d | 2021 |
| [ | Predict AKI | Gradient boosting tree-based machines | [ | AUC 76% in 48 h | 2020 |
CKD research.
| Paper | Objective | Method | Database | Results | Year |
|---|---|---|---|---|---|
| [ | Diagnose CKD | Support vector machine—SVM | [ | Accuracy 82% on 2 stages | 2014 |
| [ | CKD diagnosis | Random forest | [ | Accuracy 99.3% | 2016 |
| [ | CKD diagnosis | Decision tree C4.5 | [ | Accuracy 63% | 2016 |
| [ | CKD diagnosis | SVM | [ | Accuracy 98.3% | 2016 |
| [ | CKD diagnosis | k-NN with CFS and AdaBoost | [ | Accuracy 98.1% | 2017 |
| [ | CKD diagnosis | Random forest | [ | Accuracy 100% | 2017 |
| [ | CKD diagnosis | RPART | [ | AUC 0.995 | 2018 |
| [ | CKD diagnosis | PSODP + DL-RNN | [ | Accuracy 99.5% | 2018 |
| [ | CKD diagnosis | PNN [ | [ | Accuracy 96.7% | 2019 |
| [ | CKD diagnosis | RFE and Random Forest | [ | F1 score 100% | 2021 |
| [ | Predict diet plan for | Multiclass | [ | Accuracy 99.17% | 2017 |
| [ | Predict hemoglobin levels | Extraction rule—Re-RX + J48graft | [ | Accuracy 95.18% | 2019 |
Kidney stone research.
| Paper | Objective | Method | Database | Results | Year |
|---|---|---|---|---|---|
| [ | Renal stone detection | Segmentation + ANN | [ | Accuracy 86% | 2019 |
| [ | Renal stones vs. phleboliths | Radiomics + | [ | Accuracy 85.1% | 2019 |
| [ | Kidney stone removal, | ANN | [ | Accuracy 81–98.2% | 2017 |
| [ | Predict stone-free status | Feature extraction + | [ | Accuracy 60% | 2019 |
| [ | Stone-free prediction | Light gradient boosting method | [ | Accuracy 87.9% | 2020 |
Glomerular disease research.
| Paper | Objective | Method | Database | Results | Year |
|---|---|---|---|---|---|
| [ | Predict diabetic kidney disease | SVM radial | [ | Accuracy 94% | 2013 |
| [ | Predict diabetic kidney disease | Unbalanced random forest | [ | Accuracy 83.8% | 2018 |
| [ | Predict diabetic kidney disease | Knime + WEKA | [ | Accuracy 83.5% | 2019 |
| [ | Resolution image-based | Convolutional neural network | [ | Accuracy > 80% | 2021 |
| [ | Predict ESKD in patients with IgAN | ANN | [ | AUC 0.82 with | 2021 |
| [ | Predict deterioration of | SVM | [ | Accuracy 79.8% | 2021 |
| [ | Diagnose glomerular disease | Disjunctive least | [ | Accuracy 81.26–96.5% | 1992 |
| [ | Detect pathogenic and | RatSnake—ML automatic | [ | Accuracy 94.7% | 2014 |
| [ | Diagnose glomerular disease | Decision tree with | [ | Accuracy 89.47% | 2021 |
| [ | Predict weight of | ANN | [ | Mean difference 0.497 | 2018 |
Kidney transplant research.
| Paper | Objective | Method | Database | Results | Year |
|---|---|---|---|---|---|
| [ | Predict | Decision tree | [ | Specificity 73.8% | 2010 |
| [ | Predict post- | Bayesian belief network | [ | Accuracy 52% after 1 year | 2012 |
| [ | Classify risk levels for | ElasticNet + Bayesian belief network | [ | Accuracy 68.4% | 2018 |
| [ | Predict early transplant rejection | Decision tree and random forest | [ | Accuracy 85% | 2019 |
| [ | Predict kidney transplantation | Elderly KTbot | [ | Precision 90% | 2020 |
Other renal diseases research.
| Paper | Objective | Method | Database | Results | Year |
|---|---|---|---|---|---|
| [ | Predict hemoglobin in patients | Data merging + clustering + ensemble of classifiers | [ | Mean absolute error 0.662—Italy, | 2014 |
| [ | Recommend renal biopsy | Tokenization + NLP machine learning classifier | [ | Accuracy 83.5% | 2019 |
| [ | Prediction of 1-year survival | Ensemble artificial intelligence model | [ | Accuracy 94.8% | 2020 |
| [ | Detect kidney and liver tissue | Homodyne-K feature extraction + random forest | [ | Accuracy 94% | 2015 |
| [ | Predict risk stratification | Multitask temporal-based classifier | [ | Specificity 0.828 with 10% threshold | 2015 |
Diagnostic image databases.
| Database | Number of | Description | Year (First Use/Published) | Open Access |
|---|---|---|---|---|
| [ | 93 | Patients’ MDCT. Patients with complicated cysts: cyst with at least one focus of septa, a solid nodule, and any calcification or wall thickening on MDCT | 2009 | No |
| [ | 150 | Patients’ CT. | 2018 | No |
| [ | 79 | 84 renal masses: 63 malignant (25 clear cell RCC, 23 papillary cell RCC, 15 chromophobe RCC), 21 benign (10 oncocytomas, 11 fat-poor angiomyolipomas) | 2020 | No |
| [ | 440 | 440 MRI and CT of patients with ccRCC | 2020 | No |
| [ | 54 | Patients’ CT. All patients have ccRCC. | 2019 | No |
| [ | 216 | 216 proteomics data and 783 slide images (524 tumoral) | 2018 | Yes |
| [ | 300 | CT of patients with one or more kidney tumors. Segmentation of kidneys and tumors. | 2019 | Yes |
| [ | 188 | The database is composed of 40, 16, 38, 60, 28, and 6 entries for healthy, stage 1, 2, 3, 4, 5, respectively. These images are obtained from 35 observers taken at different times. The kidney ultrasonic images are segmented and annotated into three regions of interest (ROIs) | 2014 | No |
| [ | 200 | 200 kidney stones harvested from nondestructive stone extraction at three different sites. Stone size was measured using a digital caliper | 2020 | No |
| [ | 412 | LDCT of 235 kidney stones and 224 phleboliths | 2019 | No |
| [ | 254 | Preoperative abdominopelvic ultrasound and intravenous urography or CT scan of PCNL patients. | 2017 | No |
| [ | 9 | This dataset contains the 3D US abdominal images from 9 pediatric patients with hydronephrosis | 2015 | No |
| [ | 1321 | Biopsy images of pathogenic (338) and nonpathogenic (396) glomerulus and some of pathogenic (338) and nonpathogenic (248) tubulus | 2014 | No |
| [ | 584 | Renal biopsy reports, each of 4 or 5 slides with different stains, for each case: clinical and laboratory data, diagnostic hypothesis, histological biopsy study, histological report of glomerular disease | 2021 | No |
| [ | 422 | Renal immunofluorescent images obtained by fluorescence microscopes relative to a renal biopsy of 162 patients with IgAN and 260 without | 2021 | No |
| [ | 24 | 24 unstained deparaffinized formalin-fixed kidney tissue sections of chRCC and oncocytoma, 12 of each type | 2016 | No |
Numerical databases.
| Database | Number of | Description | Year (First Use/Published) | Open Access |
|---|---|---|---|---|
| [ | 260 | Tumor RNASeq and pathological stage (I, II, III, and IV): Stage I—172, Stage II—22, Stage III—51, and Stage IV—15. | 2010 | Yes |
| [ | 34 | This dataset was obtained using Affymetrix HGU133 Plus 2.0 array platform and includes 19 and 15 samples in early (excellent survival) and late (poor survival) stages of PRCC. | 2005 | Yes |
| [ | 269,999 | 6.1% of patients in the dataset had a clinical deterioration event: 424 cardiac arrests, 13,188 intensive care unit (ICU) transfers, and 2840 deaths on the wards. For each patient, there are a total of 29 features. | 2014 | No |
| [ | 108,441 | Australian and New Zealand Society of Cardiac and Thoracic Surgeons Database registry recorded 110,342 cardiac surgery events in 108,441 unique patients. | 2018 | No |
| [ | 780 | Medical data collected by natural language process module from EMRs including demographic data, daily documentation, laboratory and imaging results, anesthesia records, medications, interventions, and diagnosis. TRIPOD guidelines were followed. | 2021 | No |
| [ | 50 | Serial creatinine testing of patients with ≥20% total body surface area (TBSA) burns at risk for AKI. AKI was defined using the Kidney Disease: Improving Global Outcomes (KDIGO) criteria. | 2019 | No |
| [ | 153,821 | 153,821 patients from 6 different sites. Each patient had a mean of 67 (SD = 46) clinical facts per day. | 2020 | No (only in USA) |
| [ | 671 | Information related to demographic characteristics, clinical condition, preoperative biochemistry data, preoperative medication, and intraoperative time-series hemodynamic features (systolic blood pressure (SBP), diastolic blood pressure (DBP), mean arterial blood pressure (MAP), and heart rate (HR)) from electronic medical records and records on intraoperative variables. | 2020 | No |
| [ | 51,869 | 618,719 blood test occurrences for 51,869 distinct patients. | 2021 | No |
| [ | 400 | The CKD dataset was collected from 400 patients from the University of California, Irvine Machine Learning Repository. | 2015 | Yes |
| [ | 254 | This dataset includes information on preoperative, intraoperative, and postoperative parameters from 254 patients who underwent kidney surgery. | 2019 | No |
| [ | 1015 | The variables contained per IgAN patient are age, sex, hypertension, serum creatinine, daily proteinuria, kidney biopsy, therapy—RASBs or corticosteroids. The primary outcome is ESRD, dialysis, or transplantation. | 2020 | No |
| [ | 80 | Features of 80 IgAN patients: secondary IgA deposition, eGFR, MEST-C scores. | 2021 | No |
| [ | 284 | 38 features for each patient and biopsy diagnosis. | 1992 | No |
| [ | 194 | Features for each patient: age, sex, time in dialysis, donor type, donor age, HLA mismatches, delayed graft function, acute rejection episode, and chronic allograft nephropathy. | 2010 | No |
| [ | 7348 | A total of 793 pre- and post-transplant variables per patient. | 2004 | No (only in USA) |
| [ | 6564 | First 12 h of 6564 HER from critically ill children admitted to a pediatric ICU without evidence of AKI; 4% of the patients developed AKI by 72 h. | 2016 | No |
| [ | 2642 | The dataset contains the data relative to 1781 patients pre-implementation and 861 patients post-implementation of a digital intervention system, with the relative alert severity. | 2019 | No |
| [ | 358 | This dataset includes 42 features including the two target variables, stone-free and one-session success, for all 358 cases. The number of cases with stone-free and one-session success was 253 (70.7%) and 154 (43.0%). | 2020 | No |
| [ | 1250 | Several serum markers per patient undergoing angiography as clinical standard care. | 2015 | No (only in USA) |
| [ | 80 | 80 patients who received HLA-incompatible renal allografts; | 2019 | No |
| [ | 118 | Medical records of 18 elderly and 100 younger patients. | 2020 | No |
| [ | 1386 | Anthropometric measurements and blood pressure (BP), drug use and past medical history, physical assessment for retinopathy, sensory neuropathy, and peripheral arterial disease. eGFR calculated using the Chinese-modified Modification of Diet in Renal Disease equation. | 2013 | No |
| [ | 1000 | 1000 T2DM patients’ data collected by the IRCCS (Istituto di Ricovero e Cura a Carattere Scientifico) of the Hospital of Pavia. | 2018 | No |
| [ | ~32,000 | Diabetes of type 2 patients with a 24-month analysis window. | 2019 | No |
| [ | 3149 | This dataset contains a total of 3149 admission notes from the nephrology department. For the ground truth, there are recommendations given by physicians in first-day progress notes. | 2019 | No |
| [ | 6435 | Electronic health records of patients with hypertension, diabetes, or both. | 2015 | No |
| [ | ~31,000 | United Network for Organ Sharing, a private, non-profit (UNOS) dataset including information on all kidney waiting-list registrations and transplants that had been recorded in the U.S. | 2014 | No (only in USA) |
| [ | 13,011 | 125 features from dialysis clinical practice of 13,011 patients. | 2014 | No |
| [ | 14 | ESRD patients on chronic hemodialysis or hemodiafiltration weighing 20 kg or more. | 2018 | No |
| [ | 79,860 | Various features for each patient are presented with the relative risk score based on mass, serum albumin level, cholesterol level, and creatinine. | 2020 | No |
Searches grouped by type of ML algorithm applied.
| Method—ML Algorithm (Based) | Authors | Year |
|---|---|---|
| Bayesian classifier | [ | 2009 |
| [ | 2012 | |
| [ | 2018 | |
| Logistic regression | [ | 2019 |
| [ | 2020 | |
| Decision tree | [ | 2016 |
| [ | 2021 | |
| [ | 2010 | |
| Random forest | [ | 2015 |
| [ | 2016 | |
| [ | 2017 | |
| [ | 2018 | |
| [ | 2018 | |
| [ | 2019 | |
| [ | 2019 | |
| [ | 2020 | |
| [ | 2020 | |
| [ | 2021 | |
| [ | 2021 | |
| [ | 2018 | |
| SVM | [ | 2013 |
| [ | 2014 | |
| [ | 2016 | |
| [ | 2016 | |
| [ | 2021 | |
| ANN | [ | 2019 |
| [ | 2017 | |
| [ | 2018 | |
| [ | 2021 | |
| Ensemble of classifiers | [ | 2019 |
| [ | 2020 | |
| [ | 2014 | |
| [ | 2017 | |
| [ | 2019 | |
| [ | 2019 | |
| [ | 2019 | |
| [ | 2018 | |
| [ | 2019 | |
| [ | 2019 | |
| [ | 2018 | |
| [ | 2021 | |
| [ | 2021 | |
| [ | 2020 | |
| [ | 2020 | |
| [ | 2020 | |
| [ | 2020 | |
| DNN | [ | 2014 |
| [ | 2015 | |
| [ | 2019 | |
| [ | 2020 | |
| [ | 1992 | |
| [ | 2018 | |
| [ | 2018 | |
| [ | 2019 | |
| [ | 2019 | |
| [ | 2021 | |
| [ | 2019 | |
| [ | 2019 | |
| [ | 2019 |