| Literature DB >> 32733929 |
Timur Köse1, Su Özgür1, Erdal Coşgun2, Ahmet Keskinoğlu3, Pembe Keskinoğlu4.
Abstract
Missing observations are always a challenging problem that we have to deal with in diseases that require follow-up. In hospital records for vesicoureteral reflux (VUR) and recurrent urinary tract infection (rUTI), the number of complete cases is very low on demographic and clinical characteristics, laboratory findings, and imaging data. On the other hand, deep learning (DL) approaches can be used for highly missing observation scenarios with its own missing ratio algorithm. In this study, the effects of multiple imputation techniques MICE and FAMD on the performance of DL in the differential diagnosis were compared. The data of a retrospective cross-sectional study including 611 pediatric patients were evaluated (425 with VUR, 186 with rUTI, 26.65% missing ratio) in this research. CNTK and R 3.6.3 have been used for evaluating different models for 34 features (physical, laboratory, and imaging findings). In the differential diagnosis of VUR and rUTI, the best performance was obtained by deep learning with MICE algorithm with its values, respectively, 64.05% accuracy, 64.59% sensitivity, and 62.62% specificity. FAMD algorithm performed with accuracy = 61.52, sensitivity = 60.20, and specificity was found out to be 61.00 with 3 principal components on missing imputation phase. DL-based approaches can evaluate datasets without doing preomit/impute missing values from datasets. Once DL method is used together with appropriate missing imputation techniques, it shows higher predictive performance.Entities:
Mesh:
Year: 2020 PMID: 32733929 PMCID: PMC7378600 DOI: 10.1155/2020/1895076
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
The variables used in deep learning and multiple imputation techniques.
| Clinical variables | Laboratory variables | USG variables |
|---|---|---|
| Diagnosis(VUR/rUTI) | ud-density(c) | USG-R-grade (ordinal:0,1,2) |
| Sex(cat: male/female) | b-leukocyte(c) | USG-L-grade(ordinal: 0,1,2) |
| Age(c) | ud-nitrite(cat:Y/N) | USG-R/L hydronephrosis (cat: Y/N) |
| Fever(cat: Y/N) | ud-l.esterase(cat: Y/N) | USG-bladder wall thickening(cat: Y/N) |
| Emesis(catty/N) | ud-protein(cat: Y/N) | USG-bladder diverticulum(cat: Y/N) |
| Incontinence(cat: Y/N) | us-erythrocyte(cat: Y/N) | USG-ureter dilatation R/L (cat: Y/N) |
| Stomachache(cat: Y/N) | us-leukocyte(cat: Y/N) | |
| Urgency(cat: Y/N) | ud-leukocyte(cat: Y/N) | |
| Frequent urination(cat: Y/N) | us-bacteria (cat: Y/N) | |
| Dysuria(cat: Y/N) | ud-erythrocyte(cat) | |
| Restlessness(cat: Y/N) | b-thrombocyte(c) | |
| Anorexia(cat: Y/N) | b-urea(c) | |
| UTI in history(cat: Y/N) | b-creatinine(c) | |
| Prolonged neonatal jaundice(cat: Y/N) | ||
| Scar(cat: Y/N) |
All categorical variables are defined as binary (cat: Y/N, yes/no, and sex, cat: male/female). c: continuous variable; cat: categorical variable; rUTI: recurrent urinary tract infection; ud: urine dipstick; us: urine sediment; USG: ultrasonography; b: blood; R: right; L: left; u-le: urine-leukocyte esterase.
Figure 1Multiple imputation via MICE package.
Figure 2FAMD factor map. Analyzing mixed data [36].
Figure 3Deep learning. Deep autoencoder [40].
Figure 4Hyperparameters of deep learning algorithm [41].
Confusion matrix.
| Diagnostic test | Gold standard | |||
|---|---|---|---|---|
| Positive | Negative | Row total | ||
| Positive | TP | FP | TP+FP (total number of subjects with positive test) | Positive predictive value TP/(TP+FP) |
| Negative | FN | TN | FN+TN (total number of subjects with negative test) | Negative predictive value TN/(FN+TN) |
| Column total | TP+FN (total number of subjects with given condition) | FP+TN (total number of subjects without given condition) | N=TP+TN+FP+FN (total number of subjects in the study) | |
| Sensitivity TP/(TP+FN) | Specificity TN/(FP+TN) | |||
TP: true positive; TN: true negative; FN: false negative; FP: false positive.
10-fold cross-validation results on CNTK.
| Accuracy | SD | Sensitivity | SD | Specificity | SD | |
|---|---|---|---|---|---|---|
| Deep learning-original dataset |
|
|
|
|
|
|
| FAMD-ncomp = 2 | 58.55 | 3.58 | 58.99 | 3.99 | 59.52 | 5.62 |
| FAMD-ncomp = 3 |
|
|
|
|
|
|
| FAMD-ncomp = 6 | 58.85 | 0.10 | 55.89 | 0.16 | 65.92 | 0.26 |
| FAMD-ncomp = 10 | 57.5 | 0.14 | 54.4 | 0.20 | 63.90 | 0.29 |
| MICE |
|
|
|
|
|
|
ncomp: number of component; SD: standard deviation.
Figure 5Missing ratio of variables in original (not imputed) dataset. UTI: urinary tract infection; ud: urine dipstick; us: urine sediment; USG: ultrasonography; b: blood; R: right; L: left; AP: anterior-posterior; u-le: urine-leukocyte esterase.
Correlations between continuous variables.
| Age | b-leukocyte | b-thrombocyte | b-urea | b-creatinine | ud-density | |
|---|---|---|---|---|---|---|
| Age | 1 | |||||
| b-leukocyte | -0.176985 | 1 | ||||
| b-thrombocyte | -0.154943 | 0.086716 | 1 | |||
| b-urea | 0.303343 | -0.067059 | -0.035233 | 1 | ||
| b-creatinine | -0.001593 | -0.039199 | -0.033116 | 0.050520 | 1 | |
| ud-density | 0.278367 | -0.075667 | -0.158854 | 0.195563 | 0.064533 | 1 |
b: blood; ud: urine dipstick.
Descriptive statistics of deep learning and multiple imputation techniques for the dataset.
| Variables | Deep learning (original dataset) | MICE | FAMD (ncomp = 2) | FAMD (ncomp = 3) | FAMD (ncomp = 6) | FAMD (ncomp = 10) | |
|---|---|---|---|---|---|---|---|
| Age |
| 16 [0-196] | 16 [0-196] | 16 [0-196] | 16 [0-196] | 16 [0-196] | 16 [0-196] |
|
| 36.6 (4.0-60.0) | 36.6 (4.0-60.0) | 36.6 (4.0-60.0) | 36.6 (4.0-60.0) | 36.6 (4.0-60.0) | 36.6 (4.0-60.0) | |
| b-leukocyte |
| 10000 [950-110000] | 10100 [271.5-110000] | 10465 [950-110000] | 10400 [950-110000] | 10400 [950-110000] | 10160 [950-110000] |
|
| 12205.4 (7500-10000) | 12061.6 (7500-13651.1) | 12175 (8100-13300) | 12117 (8010-13021) | 12155 (8000-13211) | 12040 (7725-13100) | |
| b-thrombocyte |
| 341000 [17200-849000] | 345000 [17200-849000] | 345000 [17200-849000] | 345000 [17200-849000] | 346000 [17200-849000] | 345100 [17200-849000] |
|
| 358345.9 (17200-849000) | 357035 (277274-849000) | 358355 (296000-849000) | 358223 (296000-396000) | 358412 (296000-397087) | 356165 (288000-400352) | |
| b-urea |
| 19.0 [0.40-112.0] | 20.0 [0.4-112.0] | 19.4 [0.4-112.0] | 19.0 [0.4-112.0] | 19.0 [0.4-112.0] | 19.0 [0.4-112.0] |
|
| 20.3 (13.0-25.0) | 20.4 (14.2-25.0) | 20.1 (15.0-24.0) | 20.1 (15.0-24.0) | 20.2 (15.0-24.0) | 20.2 (14.0-24.3) | |
| b-creatinine |
| 0.5 [0.10-143.9] | 0.50 [0.02-143.0] | 0.50 [0.1-143.0] | 0.50 [0.1-143.0] | 0.50 [0.014-143.0] | 0.5 [0.013-143.0] |
|
| 1.24 (0.10-143.0) | 2.16 (0.40-0.90) | 1.27 (0.40- 0.98) | 1.29 (0.40-1.00) | 1.50 (0.40-0.90) | 2.01 (0.40-1.46) | |
| ud-density |
| 1013.0 [1000-1035] | 1013 [1000-1035] | 1013 [1000-1035] | 1013 [1000-1035] | 1013 [1000-1035] | 1013 [1000-1035] |
|
| 1013.77 (1006.0-1020.0) | 1014 (1006-1020) | 1013 (1006-1020) | 1014 (1006-1020) | 1014 (1006-1020) | 1014 (1006-1020) | |
| Stomachache | No | 280 | 283 | 288 | 286 | 284 | 284 |
| Yes | 96 | 98 | 96 | 96 | 97 | 99 | |
| Not assessed | 195 | 230 | 227 | 229 | 230 | 228 | |
| Urgency | No | 307 | 313 | 322 | 321 | 317 | 316 |
| Yes | 98 | 100 | 98 | 98 | 99 | 316 | |
| Not assessed | 178 | 198 | 191 | 192 | 195 | 195 | |
| Frequent urination | No | 250 | 255 | 259 | 258 | 258 | 256 |
| Yes | 94 | 99 | 95 | 96 | 95 | 98 | |
| Not assessed | 230 | 257 | 257 | 257 | 258 | 257 | |
| Dysuria | No | 264 | 268 | 267 | 268 | 268 | 267 |
| Yes | 92 | 92 | 92 | 92 | 92 | 92 | |
| Not assessed | 247 | 251 | 252 | 251 | 251 | 252 | |
| Restlessness | No | 114 | 115 | 114 | 114 | 114 | 115 |
| Yes | 476 | 477 | 478 | 478 | 478 | 477 | |
| Not assessed | 19 | 19 | 19 | 19 | 19 | 19 | |
| Anorexia | No | 461 | 474 | 486 | 485 | 486 | 479 |
| Yes | 72 | 79 | 72 | 72 | 72 | 77 | |
| Not assessed | 53 | 58 | 53 | 54 | 53 | 55 | |
| UTI history | No | 136 | 141 | 136 | 137 | 136 | 141 |
| Yes | 470 | 470 | 475 | 474 | 475 | 470 | |
| Prolonged history of jaundice | No | 537 | 565 | 566 | 566 | 566 | 566 |
| Yes | 44 | 46 | 46 | 46 | 46 | 46 | |
| Fever | No | 364 | 368 | 368 | 368 | 368 | 367 |
| Subfebrile | 64 | 64 | 64 | 64 | 64 | 64 | |
| Febrile | 179 | 179 | 179 | 179 | 179 | 180 | |
| Emesis | No | 503 | 511 | 514 | 514 | 514 | 511 |
| Yes | 97 | 100 | 97 | 97 | 97 | 100 | |
| Incontinence | No | 262 | 273 | 278 | 275 | 274 | 273 |
| Yes | 73 | 78 | 74 | 76 | 76 | 77 | |
| Not assessed | 238 | 260 | 259 | 260 | 261 | 261 | |
| ud-nitrite | No | 511 | 547 | 549 | 549 | 549 | 547 |
| Yes | 62 | 64 | 62 | 62 | 62 | 64 | |
| ud-protein | No | 509 | 546 | 550 | 550 | 550 | 547 |
| Yes | 61 | 65 | 61 | 61 | 61 | 64 | |
| us-erythrocyte | No | 428 | 598 | 494 | 600 | 600 | 598 |
| Yes | 11 | 13 | 117 | 11 | 11 | 13 | |
| us-leukocyte | No | 386 | 502 | 521 | 524 | 524 | 509 |
| Yes | 82 | 109 | 90 | 87 | 87 | 102 | |
| ud-leukocyte | No | 342 | 444 | 483 | 476 | 474 | 450 |
| Yes | 112 | 167 | 128 | 135 | 137 | 161 | |
| us-bacteria | No | 378 | 536 | 551 | 552 | 539 | 538 |
| Yes | 57 | 75 | 60 | 59 | 72 | 73 | |
| ud-erythrocyte | No | 424 | 539 | 548 | 548 | 547 | 533 |
| Yes | 63 | 72 | 63 | 63 | 64 | 78 | |
| ud-l.esterase | No | 367 | 596 | 521 | 599 | 599 | 599 |
| Yes | 12 | 15 | 90 | 12 | 12 | 12 | |
| USG-R-grade | Grade 0 | 384 | 592 | 593 | 593 | 593 | 592 |
| Grade 1 | 14 | 15 | 14 | 14 | 14 | 15 | |
| Grade 2 | 3 | 3 | 3 | 3 | 3 | 3 | |
| Grade 3 | 1 | 1 | 1 | 1 | 1 | 1 | |
| USG-L-grade | Grade 0 | 389 | 602 | 602 | 602 | 602 | 601 |
| Grade 1 | 5 | 5 | 5 | 5 | 5 | 6 | |
| Grade 2 | 2 | 2 | 2 | 2 | 2 | 2 | |
| Grade 3 | 1 | 1 | 1 | 1 | 1 | 1 | |
| Grade 4 | 1 | 1 | 1 | 1 | 1 | 1 | |
| USG-R/L-hydronephrosis | Normal | 340 | 546 | 546 | 548 | 546 | 545 |
| Mild | 36 | 39 | 36 | 36 | 36 | 39 | |
| Heavy | 25 | 26 | 29 | 27 | 29 | 27 | |
| USG-R/L-ureter dilatation | No | 365 | 576 | 578 | 577 | 576 | 576 |
| Yes | 33 | 35 | 33 | 34 | 35 | 35 | |
| USG-bladder wall thickening | No | 353 | 581 | 590 | 590 | 590 | 587 |
| Yes | 21 | 30 | 21 | 21 | 21 | 24 | |
| USG-bladder diverticulum | No | 365 | 600 | 601 | 601 | 601 | 599 |
| Yes | 10 | 11 | 10 | 10 | 10 | 12 | |
| Scar | Normal | 286 | 487 | 506 | 507 | 506 | 495 |
| Yes | 99 | 124 | 105 | 104 | 105 | 116 |
UTI: urinary tract infection; ud: urine dipstick; us: urine sediment; USG: ultrasonography; b: blood; R: right; L: left; AP: anterior-posterior; u-le: urine-leukocyte esterase; : mean; : median; Min: minimum; Max: maximum.