| Literature DB >> 35178455 |
Mladen Marinkovic1, Marina Popovic2, Suzana Stojanovic-Rundic1,3, Milos Nikolic4, Milena Cavic5, Dusica Gavrilovic6, Dusan Teodorovic4,7, Nenad Mitrovic8, Ljiljana Mijatovic Teodorovic2,9.
Abstract
After primary treatment of localized prostate carcinoma (PC), up to a third of patients have disease recurrence. Different predictive models have already been used either for initial stratification of PC patients or to predict disease recurrence. Recently, artificial intelligence has been introduced in the diagnosis and management of PC with a potential to revolutionize this field. The aim of this study was to analyze machine learning (ML) classifiers in order to predict disease progression in the moment of prostate-specific antigen (PSA) elevation during follow-up. The study cohort consisted of 109 PC patients treated with external beam radiotherapy alone or in combination with androgen deprivation therapy. We developed and evaluated the performance of two ML algorithms based on artificial neural networks (ANN) and naïve Bayes (NB). Of all patients, 72.5% was randomly selected for a training set while the remaining patients were used for testing of the models. The presence/absence of disease progression was defined as the output variable. The input variables for models were conducted from the univariate analysis preformed among two groups of patients in the training set. They included two pretreatment variables (UICC stage and Gleason's score risk group) and five posttreatment variables (nadir PSA, time to nadir PSA, PSA doubling time, PSA velocity, and PSA in the moment of disease reevaluation). The area under the receiver operating characteristic curve, sensitivity, specificity, positive predictive value, negative predictive value, and predictive accuracy was calculated to test the models' performance. The results showed that specificity was similar for both models, while NB achieved better sensitivity then ANN (100.0% versus 94.4%). The ANN showed an accuracy of 93.3%, and the matching for NB model was 96.7%. In this study, ML classifiers have shown potential for application in routine clinical practice during follow-up when disease progression was suspected.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35178455 PMCID: PMC8844388 DOI: 10.1155/2022/7943609
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1Flowchart of patient selection.
Comparison of patients', disease, treatment, and follow-up characteristics between two groups of patients.
| Characteristics | N (%) | Wilcoxon rank sum test | |
|---|---|---|---|
| Group 1 | Group 2 | ||
| Age (years) | |||
| Mean (SD) | 70.9 (6.0) | 71.3 (5.8) |
|
| Median (range) | 71.0 (55.0-84.0) | 71.5 (60.0-80.0) | |
| T in clinical TNM | |||
| T1 | 1 (1.3%) | 0 (0%) |
|
| T2 | 70 (88.6%) | 16 (23.3%) | |
| T3 | 8 (10.1%) | 13 (43.3%) | |
| T4 | 0 (0%) | 1 (3.3%) | |
| N in clinical TNM | |||
| N0 | 75 (94.9%) | 29 (96.7) |
|
| N1 | 4 (5.1%) | 1 (3.3) | |
| UICC2 staging | |||
| I | 17 (21.5%) | 4 (13.3%) |
|
| II | 53 (67.1%) | 12 (40.0%) | |
| III | 6 (7.6%) | 13 (43.3%) | |
| IV | 3 (3.8%) | 1 (3.3%) | |
| Baseline PSA1 (ng/mL) | |||
| Mean (SD) | 48.0 (49.8) | 44.4 (57.3) |
|
| Median (range) | 30.1 (4.5-249.4) | 30.8 (7.0-311.0) | |
| Gleason's score (category) | |||
| Low risk (GS3 6) | 8 (10.1%) | 6 (20.0%) |
|
| Intermediate risk (GS3 7) | 42 (53.2%) | 16 (53.3%) | |
| High risk (GS3 8-10) | 29 (36.7%) | 8 (26.7%) | |
| ADT4 | |||
| No | 23 (29.1%) | 12 (40.0%) |
|
| Yes | 56 (70.9%) | 18 (60.0%) | |
| ADT4 duration (months) | |||
| Mean (SD) | 12.4 (2.6) | 12.8 (1.4) |
|
| Median (range) | 12.0 (3.0-24.0) | 12.0 (12.0-15.0) | |
| Radiotherapy (Gy) | |||
| Mean (SD) | 66.6 (2.7) | 66.4 (2.4) |
|
| Median (range) | 65 (65-72) | 65 (65-72) | |
| Nadir PSA1 (ng/mL) | |||
| Mean (SD) | 2.0 (5.6) | 4.0 (9.9) |
|
| Median (range) | 0.4 (0-42.9) | 0.6 (0-49.9) | |
| Time to nadir PSA1 (months) | |||
| Mean (SD) | 7.2 (5.5) | 5.9 (4.6) |
|
| Median (range) | 6.0 (1.0-31.0) | 4.0 (1.0-21.0) | |
| PSA1 doubling (months) | |||
| Mean (SD) | 6.9 (8.4) | 7.6 (9.9) |
|
| Median (range) | 3.9 (0.8-50.0) | 3.7 (0.9-42.0) | |
| PSA1 velocity (ng/mL/month) | |||
| Mean (SD) | 2.2 (4.2) | 2.4 (3.9) |
|
| Median (range) | 0.6 (0-26.1) | 1.0 (0-15.4) | |
| PSA1 in the moment of disease reevaluation | |||
| Mean (SD) | 23.1 (33.5) | 19.5 (24.8) |
|
| Median (range) | 12.3 (0.3-200.4) | 12.5 (0.04-120.0) | |
| Disease progression | |||
| No | 25 (31.6%) | 12 (40.0%) |
|
| Yes | 54 (68.4%) | 18 (60.0%) | |
| Site | |||
| Local recidive | 8 (10.1%) | 1 (3.3%) |
|
| Regional lymph nodes | 7 (8.9%) | 3 (10.0%) |
|
| Distant lymph nodes | 4 (5.1%) | 2 (6.7%) |
|
| Bone | 34 (43.0%) | 12 (40.0%) |
|
| Liver | 1 (1.3%) | 0 (0%) |
|
| Lung | 2 (2.5%) | 0 (0%) |
|
| Biochemical relapse | 11 (13.9%) | 4 (13.3%) |
|
| No. pts | 79 (100%) | 30 (100%) | |
1PSA = prostate-specific antigen; 2UICC = Union for International Cancer Control; 3GS = Gleason's score; 4ADT = androgen deprivation therapy; ∗Fisher exact test; #Pearson χ2 test.
Comparison of characteristics among patients with and without disease progression in the training group.
| Characteristic | The presence/absence of disease progression - training group | ||
|---|---|---|---|
| Presence | Absence | Wilcoxon rank sum test | |
| Age (years) | |||
| Mean (SD) | 70.8 (6.3) | 71.2 (5.3) |
|
| Median (range) | 71.0 (55.0-84.0) | 72.0 (60.0-84.0) | |
| Baseline PSA | |||
| Mean (SD) | 53.2 (53.2) | 36.8 (40.1) |
|
| Median (range) | 35.8 (4.5-249.0) | 27.0 (5.6-190.0) | |
| T in clinical TNM | |||
| T1 and T2 | 8 (14.8%) | 0 (0%) |
|
| T3 and T4 | 46 (85.2%) | 25 (100%) | |
| N in clinical TNM | |||
| N0 | 50 (92.6%) | 25 (100%) |
|
| N1 | 4 (7.4%) | 0 (0%) | |
| UICC1 staging | |||
| I and II | 45 (83.3%) | 25 (100%) |
|
| III and IV | 9 (16.7%) | 0 (0%) | |
| Gleason's score (category) | |||
| Low risk (GS2 6) | 4 (7.4%) | 4 (16.0%) |
|
| Intermediate risk (GS2 7) | 24 (44.4%) | 18 (72.0%) | |
| High risk (GS2 8-10) | 26 (48.2%) | 3 (12.0%) | |
|
| |||
| Mean (SD) | 2.7 (6.6) | 0.6 (1.0) |
|
| Median (range) | 0.7 (0-43.0) | 0.1 (0-4.2) | |
| Time to nadir PSA3 | |||
| Mean (SD) | 5.7 (3.4) | 10.6 (7.4) |
|
| Median (range) | 5.0 (1.0-15.0) | 8.0 (3.0-31.0) | |
| PSA3 doubling time | |||
| Mean (SD) | 3.6 (2.4) | 14.0 (11.8) |
|
| Median (range) | 3.1 (0.8-12.9) | 11.5 (2.1-50.0) | |
| PSA3 velocity | |||
| Mean (SD) | 3.1 (4.9) | 0.2 (0.2) | |
| Median (range) | 1.4 (0.1-26.1) | 0.1 (0-0.8) |
|
| PSA3 in the moment of disease reevaluation | |||
| Mean (SD) | 32.1 (37.2) | 3.6 (3.1) |
|
| Median (range) | 19.1 (3.9-200.4) | 3.1 (0.3-15.5) | |
| Total | 54 (68.4%) | 25 (31.6%) |
|
1UICC: Union for International Cancer Control; 2GS: Gleason's score; 3PSA: prostate-specific antigen; #Fisher exact test.
Results of the ROC analysis for nadir PSA, time to nadir PSA, PSA doubling time, PSA velocity, PSA in the moment of disease reevaluation, and relevant events.
| Characteristics | Nadir PSA | Time to nadir PSA | PSA doubling time | PSA velocity | PSA in the moment of disease reevaluation |
|---|---|---|---|---|---|
| AUC ROCa (95% CI) | 64.3% (51.7-76.9%) | 74.2% (62.6-85.8%) | 87.2% (78.8-95.5%) | 91.8% (86.0-97.6%) | 97.1% (93.3-100%) |
| Likelihood ratio testb |
|
|
|
|
|
| ROC cut-off valuec | 0.3095 | 6.5 | 5.05 | 0.55 | 6.49 |
| Sensitivity (95% CI) | 63.0% (50.0-75.9%) | 66.7% (53.7-77.8%) | 79.6% (68.5-90.7%) | 72.2% (59.2-83.3%) | 94.4% (89.0-100%) |
| Specificity (95% CI) | 68.0% (48.0-84.0%) | 80.0% (64.0-96.0%) | 84.0% (68.0-96.0%) | 92.0% (80.0-100%) | 92.0% (80.0-100%) |
aArea under the ROC curve (DeLong's method); bLikelihood ratio test for AUC ROC; cValue with maximum sensitivity and specificity.
Figure 2ROC curves for the nadir PSA (a), time to nadir PSA (b), PSA doubling time (c), PSA velocity (d), and PSA in the moment of disease reevaluation (e) in relation to disease progression during follow-up.
The value of nadir PSA, time to nadir PSA, PSA doubling time, PSA velocity, and PSA in the moment of disease reevaluation, in prediction of disease progression.
| Characteristic | The presence/absence of disease progression – training group | ||
|---|---|---|---|
| Presence | Absence | Wilcoxon rank sum test | |
| Nadir PSA (ROC cut-off value) | |||
| ≤0.3095 | 20 (37.0%) | 17 (68.0%) |
|
| >0.3095 | 34 (63.0%) | 8 (32.0%) | |
| Time to nadir PSA (ROC cut-off value) | |||
| ≤6.5 | 36 (66.7%) | 5 (20.0%) |
|
| >6.5 | 18 (33.3%) | 20 (80.0%) | |
| PSA doubling time (ROC cut-off value) | |||
| ≤5.05 | 43 (79.6%) | 4 (16.0%) |
|
| >5.05 | 11 (20.4%) | 21 (84.0%) | |
| PSA velocity (ROC cut-off value) | |||
| ≤0.55 | 15 (27.8%) | 23 (92.0%) |
|
| >0.55 | 39 (72.2%) | 2 (8.0%) | |
| PSA in the moment of disease reevaluation (ROC cut-off value) | |||
| ≤6.49 | 3 (5.6%) | 23 (92.0%) |
|
| >6.49 | 51 (94.4%) | 2 (8.0%) | |
| Total | 54 (68.4%) | 25 (31.6%) |
|
Figure 3The artificial neural network architecture for predicting presence versus absence of disease progression.
Figure 4ROC curves of ANN and NB model for prediction of disease progression.
Sensitivity, specificity, positive predictive value, negative predictive value and predictive accuracy for ANN and NB models for prediction of disease progression, and comparison between models.
| Characteristics | ANN | NB | DeLong's test∗ |
|---|---|---|---|
| AUC (95% CI) | 91.7% (76.2-100%) | 97.7% (92.9-100%) | Z = −0.7718, |
| Likelihood ratio testb |
|
| |
| ROC cut-off valuec | 0.766 | 0.983 | — |
| Sensitivity (95% CI) | 94.4% (72.7-99.9%) | 100.0% (80.5-100.0%) | |
| Specificity (95% CI) | 91.7% (61.5-99.8%) | 92.3% (64.0-99.8%) | |
| PPV (95% CI) | 94.4% (72.2-99.1%) | 94.4% (72.1-99.1%) | |
| NPV (95% CI) | 91.7% (61.9-98.7%) | 100% | |
| Predictive accuracy (95% CI) | 93.3% (77.9-99.2%) | 96.7% (82.8-99.9%) |
AUC: area under the curve; CI: confidence interval; NPV: negative predictive value; PPV: positive predictive value; ∗DeLong's test for two correlated ROC curves.
Importance of inputs.
| Input 1 | Input 2 | Input 3 | Input 4 | Input 5 | Input 6 | Input 7 | |
|---|---|---|---|---|---|---|---|
| UICC1 stage | GS2 risk group | Nadir PSA3 | Time to nadir PSA3 | PSA3 doubling time | PSA3 velocity | Follow-up PSA3 | |
| Relative importance (%) | 26.2 | 13.4 | 9.4 | 8.8 | 9.7 | 11.7 | 20.7 |
1UICC: Union for International Cancer Control; 2GS: Gleason's score; 3PSA: prostate-specific antigen.