| Literature DB >> 33993876 |
Shuanbao Yu1, Jin Tao1, Biao Dong1, Yafeng Fan1, Haopeng Du1, Haotian Deng1, Jinshan Cui1, Guodong Hong1, Xuepei Zhang2,3.
Abstract
BACKGROUND: Machine learning has many attractive theoretic properties, specifically, the ability to handle non predefined relations. Additionally, studies have validated the clinical utility of mpMRI for the detection and localization of CSPCa (Gleason score ≥ 3 + 4). In this study, we sought to develop and compare machine-learning models incorporating mpMRI parameters with traditional logistic regression analysis for prediction of PCa (Gleason score ≥ 3 + 3) and CSPCa on initial biopsy.Entities:
Keywords: Machine learning; Predictive model; Prostate biopsy; Prostate cancer
Year: 2021 PMID: 33993876 PMCID: PMC8127331 DOI: 10.1186/s12894-021-00849-w
Source DB: PubMed Journal: BMC Urol ISSN: 1471-2490 Impact factor: 2.264
The clinical characteristics of enrolled patients stratified by pathological results between April 2016 and March 2020
| Clinical characteristics | PCa (GS ≥ 3 + 3) | CSPCa (GS ≥ 3 + 4) | ||||
|---|---|---|---|---|---|---|
| No (n = 443)† | Yes (n = 245)† | No (n = 488)† | Yes (n = 200)† | |||
| Age (years) | 66 (61–72) | 70 (63–76) | < 0.001 | 66 (61–72) | 70 (63–76) | < 0.001 |
| tPSA (ng/ml) | 10.5 (6.65–17.0) | 20.8 (9.67–30.3) | < 0.001 | 10.9 (6.70–17.1) | 23.0 (10.9–33.1) | < 0.001 |
| f/tPSA | 0.15 (0.10–0.21) | 0.11 (0.07–0.17) | < 0.001 | 0.15 (0.10–0.21) | 0.11 (0.07–0.17) | < 0.001 |
| PSAD (ng/ml2) | 0.18 (0.11–0.29) | 0.46 (0.25–0.73) | < 0.001 | 0.19 (0.11–0.30) | 0.52 (0.30–0.80) | < 0.001 |
| PV (ml) | 58 (39–82) | 38 (27–58) | < 0.001 | 57 (38–81) | 37 (27–54) | < 0.001 |
| MRI-PCa, No. (%) | < 0.001 | < 0.001 | ||||
| Negative | 250 (56) | 37 (15) | 264 (54) | 23 (12) | ||
| Equivocal | 95 (21) | 29 (12) | 104 (21) | 20 (10) | ||
| Suspicious | 98 (22) | 179 (73) | 120 (25) | 157 (79) | ||
| MRI-SVI, No. (%) | < 0.001 | < 0.001 | ||||
| Negative | 439 (99) | 163 (67) | 481 (99) | 121 (61) | ||
| Equivocal | 1 (0.2) | 7 (3) | 1 (0.2) | 7 (4) | ||
| Suspicious | 3 (0.7) | 75 (31) | 6 (1) | 72 (36) | ||
| MRI-LNI, No. (%) | < 0.001 | < 0.001 | ||||
| Negative | 432 (98) | 202 (82) | 475 (97) | 159 (80) | ||
| Equivocal | 11 (2) | 18 (7) | 12 (2) | 17 (9) | ||
| Suspicious | 0 (0) | 25 (10) | 1 (0.2) | 24 (12) | ||
PCa prostate cancer, CSPCa clinically significant prostate cancer, GS Gleason score, tPSA total prostate-specific antigen, f/tPSA free/total PSA, PV prostate volume, SVI seminal vesicle invasion, LNI lymph node invasion
†Data are presented as median (quartile range) unless other indicated
Fig. 1Receive operating characteristic (ROC) curves of machine-learning and logistic regression models for predicting prostate cancer (PCa) and clinically significant prostate cancer (CSPCa) in the validation cohort. a PCa: Gleason score ≥ 3 + 3; b CSPCa: Gleason score ≥ 3 + 4. Abbreviations ANN artificial neural network, SVM support vector machine, CART classification and regression tree, RF random forest, LR logistic regression
Fig. 2Calibration plot of observed vs predicted rick of prostate cancer (PCa) and clinically significant prostate cancer (CSPCa) using machine-learning and logistic regression models in the validation cohort. a: PCa: Gleason score ≥ 3 + 3; b CSPCa: Gleason score ≥ 3 + 4. Abbreviations ANN artificial neural network, SVM support vector machine, CART classification and regression tree, RF random forest, LR logistic regression
Fig. 3Decision curve analysis (DCA) of machine-learning and logistic regression models for predicting prostate cancer (PCa) and clinically significant prostate cancer (CSPCa) in the validation cohort. a PCa: Gleason score ≥ 3 + 3; b CSPCa: Gleason score ≥ 3 + 4. Abbreviations ANN artificial neural network, SVM support vector machine, CART classification and regression tree, RF random forest, LR logistic regression
Percentage of biopsies that would be spared or delayed using machine-learning and logistic regression models at given sensitivity for detection of CSPCa in the validation cohorts
| Models | Sensitivity for detection of CSPCa | Cut-off for predicted risk (%) | Biopsies sSpared † (n = 208), n (%) | Unnecessary biopsy avoided | Biopsy delayed | |||
|---|---|---|---|---|---|---|---|---|
| GS < 3 + 3 (n = 131), n (%) | GS = 3 + 3 (n = 12), n (%) | GS = 3 + 4 (n = 11), n (%) | GS = 4 + 3 (n = 18), n (%) | GS ≥ 4 + 4 (n = 36), n (%) | ||||
| ANN | 64/65 (98%) | 9 | 34 (16) | 32 (24) | 1 (8) | 0 (0) | 1 (6) | 0 (0) |
| SVM | 64/65 (98%) | 11 | 57 (24) | 55 (42) | 1 (8) | 0 (0) | 0 (0) | 1 (3) |
| CART | 64/65 (98%) | NA | NA | NA | NA | NA | NA | NA |
| RF | 64/65 (98%) | 7 | 61 (29) | 57 (44) | 3 (25) | 1 (9) | 0 (0) | 0 (0) |
| LR | 64/65 (98%) | 7 | 30 (14) | 27 (21) | 2 (17) | 1 (9) | 0 (0) | 0 (0) |
| ANN | 62/65 (95%) | 11 | 56 (25) | 51 (39) | 2 (17) | 1 (9) | 1 (6) | 1 (3) |
| SVM | 62/65 (95%) | 14 | 77 (37) | 71 (54) | 3 (25) | 1 (9) | 1 (6) | 1 (3) |
| CART | 62/65 (95%) | NA | NA | NA | NA | NA | NA | NA |
| RF | 62/65 (95%) | 11 | 79 (38) | 73 (56) | 3 (25) | 1 (9) | 2 (11) | 0 (0) |
| LR | 62/65 (95%) | 10 | 53 (25) | 47 (36) | 3 (25) | 1 (9) | 1 (6) | 1 (3) |
| ANN | 59/65 (91%) | 27 | 109 (52) | 99 (76) | 4 (33) | 3 (27) | 1 (6) | 2 (6) |
| SVM | 59/65 (91%) | 23 | 110 (53) | 100 (76) | 4 (33) | 4 (36) | 1 (6) | 1 (3) |
| CART | 57/65 (88%) | 10 | 104 (50) | 90 (69) | 6 (50) | 4 (36) | 1 (6) | 3 (8) |
| RF | 59/65 (91%) | 20 | 101 (49) | 91 (69) | 4 (33) | 4 (36) | 2 (11) | 0 (0) |
| LR | 59/65 (91%) | 24 | 107 (51) | 97 (74) | 4 (33) | 4 (36) | 1 (6) | 1 (3) |
| ANN | 64/65 (98%) | 7 | 60 (29) | 58 (44) | 1 (8) | 0 (0) | 0 (0) | 1 (3) |
| SVM | 64/65 (98%) | 9 | 69 (33) | 65 (50) | 3 (25) | 0 (0) | 0 (0) | 1 (3) |
| CART | 64/65 (98%) | NA | NA | NA | NA | NA | NA | NA |
| RF | 64/65 (98%) | 7 | 79 (38) | 75 (57) | 3 (25) | 1 (9) | 0 (0) | 0 (0) |
| LR | 64/65 (98%) | 6 | 61 (29) | 57 (44) | 3 (25) | 0 (0) | 1 (6) | 0 (0) |
| ANN | 62/65 (95%) | 8 | 79 (38) | 75 (57) | 1 (8) | 1 (9) | 1 (6) | 1 (3) |
| SVM | 62/65 (95%) | 10 | 82 (39) | 76 (58) | 3 (25) | 1 (9) | 1 (6) | 1 (3) |
| CART | 62/65 (95%) | NA | NA | NA | NA | NA | NA | NA |
| RF | 62/65 (95%) | 8 | 84 (40) | 78 (60) | 3 (25) | 1 (9) | 2 (11) | 0 (0) |
| LR | 62/65 (95%) | 7 | 70 (34) | 64 (49) | 3 (25) | 1 (9) | 1 (6) | 1 (3) |
| ANN | 59/65 (91%) | 9 | 96 (46) | 86 (66) | 4 (33) | 4 (36) | 1 (6) | 1 (3) |
| SVM | 59/65 (91%) | 18 | 124 (60) | 112 (85) | 6 (50) | 4 (36) | 1 (6) | 1 (3) |
| CART | 58/65 (89%) | 10 | 109 (52) | 97 (74) | 5 (42) | 4 (36) | 1 (6) | 2 (6) |
| RF | 59/65 (91%) | 13 | 102 (49) | 92 (70) | 4 (33) | 3 (27) | 3 (17) | 0 (0) |
| LR | 59/65 (91%) | 14 | 105 (50) | 32 (24) | 4 (33) | 3 (27) | 1 (6) | 2 (6) |
PCa prostate cancer, CSPCa clinically significant prostate cancer, GS Gleason score, ANN artificial neural network, SVM support vector machine, CART classification and regression tree, RF random forest, LR logistic regression, NA not applicable
†Number of biopsies spared = number of unnecessary biopsy avoided + number of biopsy delayed