| Literature DB >> 35875103 |
Sunmeng Chen1, Tengteng Jian1, Changliang Chi1, Yi Liang2, Xiao Liang2, Ying Yu1, Fengming Jiang1, Ji Lu1.
Abstract
Purpose: PSA is currently the most commonly used screening indicator for prostate cancer. However, it has limited specificity for the diagnosis of prostate cancer. We aim to construct machine learning-based models and enhance the prediction of prostate cancer.Entities:
Keywords: machine learning; prediction models; prostate biopsy; prostate cancer; prostate-specific antigen
Year: 2022 PMID: 35875103 PMCID: PMC9299367 DOI: 10.3389/fonc.2022.941349
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 5.738
Characteristics of Patients, stratified by biopsy outcomes.
| Total (n=551) | Non-PCa (n=249) | PCa (n=302) | PCa detection (%) | p value | |
|---|---|---|---|---|---|
|
| < 0.001 | ||||
|
| 157 (28.5) | 96 (38.6) | 61 (20.2) | 38.9 | |
|
| 394 (71.5) | 153 (61.4) | 241 (79.8) | 61.2 | |
|
| 23.6 ± 3.2 | 24.0 ± 3.0 | 23.3 ± 3.4 | 0.013 | |
|
| 0.059 | ||||
|
| 416 (75.5) | 178 (71.5) | 238 (78.8) | 57.2 | |
|
| 135 (24.5) | 71 (28.5) | 64 (21.2) | 47.4 | |
|
| 0.603 | ||||
|
| 511 (92.7) | 233 (93.6) | 278 (92.1) | 54.4 | |
|
| 40 (7.3) | 16 (6.4) | 24 (7.9) | 60.0 | |
|
| 3.7 (2.9-4.8) | 4.0 (3.1-5.0) | 3.7 (2.8-4.6) | 0.024 | |
|
| 1.8 ± 0.7 | 1.8 ± 0.6 | 1.8 ± 0.7 | 0.798 | |
|
| 2.1 (1.5-3.0) | 2.1 (1.5-3.3) | 2.0 (1.5-2.9) | 0.150 | |
|
| 32.0 (14.5-100.0) | 16.2 (10.6-26.1) | 88.8 (34.0-123.2) | < 0.001 | |
|
| < 0.001 | ||||
|
| 13 (2.4) | 10 (4) | 3 (1) | 23.1 | |
|
| 59 (10.7) | 48 (19.3) | 11 (3.6) | 18.6 | |
|
| 122 (22.1) | 90 (36.1) | 32 (10.6) | 26.2 | |
|
| 145 (26.3) | 4 (1.6) | 141 (46.7) | 97.2 | |
|
| 13.6 ± 31.3 | 3.3 ± 5.8 | 22.2 ± 39.9 | < 0.001 | |
|
| 0.280 | ||||
|
| 390 (70.8) | 170 (68.3) | 220 (72.8) | 56.4 | |
|
| 161 (29.2) | 79 (31.7) | 82 (27.2) | 50.9 | |
|
| 55.3 (37.2-79.0) | 65.2 (44.7-91.5) | 48.9 (34.3-70.3) | < 0.001 | |
|
| 0.6 (0.3-1.6) | 0.3 (0.2-0.4) | 1.5 (0.7-2.7) | < 0.001 |
PV, prostate volume; PSAD, prostate-specific antigen density; BMI, body mass index; NLR, neutrophil-to-lymphocyte ratio; tPSA, total prostate specific antigen; fPSA, free prostate specific antigen; PCa, prostate cancer; Non-PCa, non-prostate cancer.
Diagnostic performance of different machine learning models.
| Outcome | Dataset | tPSAlogisticregression | Multivariate logisticregression | Decision Tree | Random Forest | SupportVectorMachine |
|---|---|---|---|---|---|---|
| AUC | Training | 0.842 | 0.910 | 0.922 | 1.00 | 0.884 |
| Test | 0.846 | 0.918 | 0.886 | 0.898 | 0.895 | |
| Sensitivity(%) | Training | 69.2 | 70.5 | 91.2 | 100 | 86.8 |
| Test | 63.9 | 88.0 | 86.7 | 84.0 | 86.7 | |
| Specificity(%) | Training | 88.8 | 95.2 | 83.4 | 100 | 70.6 |
| Test | 93.2 | 87.1 | 69.4 | 79.0 | 85.5 | |
| Accuracy(%) | Training | 78.0 | 81.6 | 87.7 | 100 | 79.5 |
| Test | 77.1 | 87.6 | 78.8 | 81.8 | 86.1 |
Figure 1Performance of machine learning models. (A) Calibration curves of five prediction models in the training dataset. The predicted probabilities are plotted on the X-axis, and the actual probabilities are plotted on the Y-axis. (B) The ROC curves of the tPSA LR, multivariate LR, DT, RF, and SVM models in the test dataset. (C) Clinical decision curve analysis (DCA) of the five models in the test dataset.