| Literature DB >> 35270995 |
Sarah M Ayyad1, Mohamed A Badawy2, Mohamed Shehata3, Ahmed Alksas3, Ali Mahmoud3, Mohamed Abou El-Ghar2, Mohammed Ghazal4, Moumen El-Melegy5, Nahla B Abdel-Hamid1, Labib M Labib1, H Arafat Ali1,6, Ayman El-Baz3.
Abstract
Prostate cancer, which is also known as prostatic adenocarcinoma, is an unconstrained growth of epithelial cells in the prostate and has become one of the leading causes of cancer-related death worldwide. The survival of patients with prostate cancer relies on detection at an early, treatable stage. In this paper, we introduce a new comprehensive framework to precisely differentiate between malignant and benign prostate cancer. This framework proposes a noninvasive computer-aided diagnosis system that integrates two imaging modalities of MR (diffusion-weighted (DW) and T2-weighted (T2W)). For the first time, it utilizes the combination of functional features represented by apparent diffusion coefficient (ADC) maps estimated from DW-MRI for the whole prostate in combination with texture features with its first- and second-order representations, extracted from T2W-MRIs of the whole prostate, and shape features represented by spherical harmonics constructed for the lesion inside the prostate and integrated with PSA screening results. The dataset presented in the paper includes 80 biopsy confirmed patients, with a mean age of 65.7 years (43 benign prostatic hyperplasia, 37 prostatic carcinomas). Experiments were conducted using different well-known machine learning approaches including support vector machines (SVM), random forests (RF), decision trees (DT), and linear discriminant analysis (LDA) classification models to study the impact of different feature sets that lead to better identification of prostatic adenocarcinoma. Using a leave-one-out cross-validation approach, the diagnostic results obtained using the SVM classification model along with the combined feature set after applying feature selection (88.75% accuracy, 81.08% sensitivity, 95.35% specificity, and 0.8821 AUC) indicated that the system's performance, after integrating and reducing different types of feature sets, obtained an enhanced diagnostic performance compared with each individual feature set and other machine learning classifiers. In addition, the developed diagnostic system provided consistent diagnostic performance using 10-fold and 5-fold cross-validation approaches, which confirms the reliability, generalization ability, and robustness of the developed system.Entities:
Keywords: MRI; PSA; computer-aided diagnosis; functional features; prostate cancer; shape features; texture analysis
Mesh:
Year: 2022 PMID: 35270995 PMCID: PMC8915102 DOI: 10.3390/s22051848
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
A brief comparison between previous prostate MRI CAD studies.
| Reference | Year | Type of Approach | Features Type | Classes | Images Sequences | No. of Patients Involved | Accuracy Result |
|---|---|---|---|---|---|---|---|
| [ | 2017 | Handcrafted | Spatial, intensity, and texture | Benign, Gleason 6, Gleason 7, Gleason 8, Gleason 9, Gleason 10 | B2000, ADC, and T2W | 224 | SVM model achieved an AUC value of 0.86, while Random Forest achieved an AUC of 0.93 |
| [ | 2016 | Texture | Malignant or benign | T2W | 45 | It has a value of 0.93 AUC | |
| [ | 2017 | Texture, intensity, edge, and anatomical | Voxel-based classification | DWI, T2W, DCE, and MRSI | 17 | Classification performance of an average AUC of 0.836 ± 0.083 is achieved | |
| [ | 2019 | Texture | High risk patients and low risk patients | T2WI and ADC | 121 | Quadratic kernel based SVM is the best model with an accuracy of 0.92 | |
| [ | 2020 | Texture and intensity | Benign and/or cs PCa vs. non-cs PCa | B50, b400, b800, b1400, T2WI, DCE, and ADC | 206 | It has an average AUC value of 0.838 | |
| [ | 2020 | Shape, texture, and statistical texture | Normal vs. cancerous prostate lesion and clinically significant PCa vs. clinically insignificant PCa | ADC and T2WI | 191 | AUC value for normal vs. cancerous classification is 0.889, while the AUC value for clinically significant PCa vs. clinically insignificant PCa is 0.844 | |
| [ | 2019 | Deep learning-based CAD | Produces a voxel probability map | T2WI | 19 | The model attained an AUC value of 0.995, a recall of 0.928, and an accuracy of 0.894. | |
| [ | 2018 | Produces probability maps to detect prostate cancer | T2WI, ADC, and high | 186 | The model attained an average AUC value of 0.94 in the peripheral zone and an average AUC value of 0.92 in transition zone. | ||
| [ | 2020 | Gives a PI-RADS score to a lesion detected and segmented by a radiologist | T2WI, T1WI, ADC, and (b1500 or b2000) | 687 | Kappa = 0.40, sensitivity = 0.89, and specificity = 0.73. | ||
| [ | 2021 | Probability that patient has prostate cancer | T2WI, b200, ADC in the first dataset, T2WI, ADC in the second dataset | 249 patients in the 1st dataset and 282 patients in the 2nd dataset | AUC value for the first dataset was 0.79, and for the second dataset was 0.86. | ||
| [ | 2021 | Predicting the Gleason grade group and classifying benign vs. csPCa | T1WI and T2WI | 490 cases for training and 75 cases for testing from 2 different datasets | On the lesion level, AUC of 0.96 for both the first and second datasets. On the patient level, AUC of 0.87 and 0.91, for the first and second datasets, respectively. | ||
Figure 1The proposed framework for early detection of prostatic adenocarcinoma.
Figure 2Calculations of voxel-wise apparent diffusion coefficients (ADC) for PCa and the cumulative distribution functions (CDFs) at different b-values from b100 to b1400.
Figure 3CDFs of ADC values for a benign case (solid) vs. a malignant case (dotted) for ADC maps obtained using different b-values from b100 to b1400. Note that region index indicates the different regions where the ADC values within the same range falls into.
Figure 4Illustrative examples of prostatic texture differences showing high gray level heterogeneity in four different malignant cases (first row) and low gray level heterogeneity in four different benign cases (second row).
Figure 5First-order textural features extraction.
Figure 6Second-order GLCM textural features extraction, where the central voxel of interest is shown in blue and the 26-neighbors are shown in red. The spatial relationship in the neighborhood block is obtained at different angles of zero, , , and .
Figure 7Second-order GLRLM textural features extraction, where the central voxel of interest is shown in blue and the 26-neighbors are shown in red. The spatial relationship is investigated to detect groups of sequential horizontal or vertical voxels with the same gray level.
Figure 8Visualization 3D shape differences between four malignant cases in the first row, and four benign cases in the second row.
Figure 9Reconstruction errors differences at different spherical harmonics (SH 01, 10, 50, 70, 85) between a malignant case and a benign case.
Details of the extracted feature sets. Let ST denote the significance threshold.
| Feature Set No. | Representation | Number of Extracted Features |
|---|---|---|
| FS1 | Functional features for whole prostate | 122 |
| FS2 | Texture features for whole prostate | 58 |
| FS3 | Shape features for lesion only | 85 |
| FS4 | Prostatic-specific antigen (PSA) | 1 |
| FS5 | Combined features (FS1 + FS2 + FS3 + FS4) | 266 |
| FS6 | Selected features of FS5 with ST = 0.05 | 101 |
| FS7-Proposed | Selected features of FS5 with ST = 0.1 | 162 |
Figure 10Performance metrics for evaluation of the proposed framework.
PIRADS scores.
| PIRADS Score | Definition |
|---|---|
| 1 | Most probably benign (normal) |
| 2 | Probably benign tumor |
| 3 | Intermediate (the presence of clinically significant cancer is equivocal) |
| 4 | Probably malignant tumor |
| 5 | Most probably malignant tumor |
Comparison of experimental results of classification accuracy (%), sensitivity (%), specificity (%), and AUC (in terms of mean ± standard deviation) using the proposed SVM classification model, where 𝛜 indicates 1.0 × 10−5.
| Feature Set | Validation | Accuracy | Sensitivity | Specificity | AUC |
|---|---|---|---|---|---|
| FS1 | 5-fold | 81.81 ± 2.13 | 71.17 ± 3.6 | 90.96 ± 3.18 | 0.8106 ± 0.0215 |
| 10-fold | 83.75 ± 2.00 | 72.59 ± 2.25 | 93.35 ± 2.89 | 0.8297 ± 0.0197 | |
| Leave-one-out | 82.50 ± 𝛜 | 67.57 ± 𝛜 | 95.35 ± 𝛜 | 0.8146 ± 𝛜 | |
| FS2 | 5-fold | 75.83 ± 1.72 | 61.26 ± 2.01 | 88.37 ± 3 | 0.7482 ± 0.0166 |
| 10-fold | 74.82 ± 2.26 | 61.39 ± 3.45 | 86.38± 2.3 | 0.7389 ± 0.0231 | |
| Leave-one-out | 77.50 ± 𝛜 | 64.86 ± 𝛜 | 88.37 ± 𝛜 | 0.7662 ± 𝛜 | |
| FS3 | 5-fold | 74.28 ± 1.87 | 81.46 ± 2.25 | 68.11 ± 2.97 | 0.7479 ± 0.0183 |
| 10-fold | 74.58 ± 2.00 |
| 69.38 ± 2.48 | 0.75 ± 0.0206 | |
| Leave-one-out | 77.50 ± 𝛜 | 86.49 ± 𝛜 | 69.77 ± 𝛜 | 0.7813 ± 𝛜 | |
| FS4 | 5-fold | 72.50 ± 𝛜 | 51.35 ± 𝛜 | 90.70 ± 𝛜 | 0.7102 ± 𝛜 |
| 10-fold | 72.50 ± 𝛜 | 51.35 ± 𝛜 | 90.70 ± 𝛜 | 0.7102 ± 𝛜 | |
| Leave-one-out | 72.50 ± 𝛜 | 51.35 ± 𝛜 | 90.70 ± 𝛜 | 0.7102 ± 𝛜 | |
| FS5 | 5-fold | 84.37 ± 2.01 | 75.23 ± 4.25 | 92.25 ± 2.57 | 0.8373 ± 0.021 |
| 10-fold | 84.50 ± 1.27 | 76.49 ± 2.72 | 91.39 ± 2.56 | 0.8394 ± 0.0127 | |
| Leave-one-out | 87.50 ± 𝛜 | 81.08 ± 𝛜 | 93.02 ± 𝛜 | 0.8705 ± 𝛜 | |
| FS6 | 5-fold |
| 73.87 ± 1.28 |
| 0.8461 ± 0.0092 |
| 10-fold | 85.94 ± 0.83 | 74.33 ± 1.36 |
| 0.8513 ± 0.0084 | |
| Leave-one-out | 86.25 ± 𝛜 | 75.68 ± 𝛜 | 95.35 ± 𝛜 | 0.8551 ± 𝛜 | |
|
|
| 85.18 ± 1.04 |
| 91.03 ± 1.49 |
|
|
|
| 80.27 ± 2.11 | 93.95 ± 1.54 |
| |
|
|
Comparison of experimental results of classification accuracy (%), sensitivity (%), specificity (%), and AUC (in terms of mean ± standard deviation) using a RF classification model, where 𝛜 indicates 1.0 × 10−5.
| Feature Set | Validation | Accuracy | Sensitivity | Specificity | AUC |
|---|---|---|---|---|---|
| FS1 | 5-fold | 86.67 ± 1.56 | 76.58 ± 1.27 | 95.35 ± 2.68 | 0.8603 ± 0.0152 |
| 10-fold | 86.09 ± 1.59 | 77.03 ± 1.35 | 93.9 ± 2.31 | 0.8546 ± 0.0154 | |
| Leave-one-out | 85.78 ± 1.24 | 76.35 ± 1.79 | 93.9 ± 2.83 | 0.8512 ± 0.0115 | |
| FS2 | 5-fold | 76.25 ± 2.28 | 63.97 ± 4.03 | 86.82 ± 2.89 | 0.7539 ± 0.0234 |
| 10-fold | 76.67 ± 1.38 | 65.76 ± 4.03 | 86.05 ± 2.68 | 0.7591 ± 0.0148 | |
| Leave-one-out | 76.75 ± 1.00 | 65.4 ± 2.02 | 86.51 ± 3.08 | 0.7596 ± 0.0087 | |
| FS3 | 5-fold | 73.25 ± 0.61 | 75.68 ± 1.71 | 71.16 ± 1.14 | 0.7342 ± 0.0065 |
| 10-fold | 72.68 ± 1.45 | 75.68 ± 1.45 | 70.1 ± 1.94 | 0.7289 ± 0.0153 | |
| Leave-one-out | 72.50 ± 𝛜 | 75.68 ± 𝛜 | 69.77 ± 𝛜 | 0.7272 ± 𝛜 | |
| FS4 | 5-fold | 73.75 ± 𝛜 | 51.35 ± 𝛜 | 93.02 ± 𝛜 | 0.7219 ± 𝛜 |
| 10-fold | 73.57 ± 0.44 | 50.96 ± 0.94 | 93.02 ± 𝛜 | 0.72 ± 0.0047 | |
| Leave-one-out | 73.75 ± 𝛜 | 51.35 ± 𝛜 | 93.02 ± 𝛜 | 0.7219 ± 𝛜 | |
| FS5 | 5-fold | 84.82 ± 1.82 | 77.22 ± 1.97 | 91.36 ± 2.05 | 0.8429 ± 0.0182 |
| 10-fold | 87.32 ± 1.56 | 79.54 ± 1.97 | 94.02 ± 1.69 | 0.8678 ± 0.0157 | |
| Leave-one-out | 86.13 ± 1.42 | 77.30 ± 1.32 | 93.72 ± 2.09 | 0.8551 ± 0.0138 | |
| FS6 | 5-fold | 83.75 ± 0.95 | 75.29 ± 1.73 | 91.03 ± 2.30 | 0.8316 ± 0.0089 |
| 10-fold | 84.58 ± 1.56 | 76.58 ± 3.12 | 91.47 ± 1.09 | 0.8402 ± 0.0165 | |
| Leave-one-out | 86.38 ± 1.42 | 78.65 ± 2.24 | 93.02 ± 1.80 | 0.8584 ± 0.0144 | |
| FS7 | 5-fold | 84.86 ± 1.5 | 77.78 ± 2.47 | 90.96 ± 2.31 | 0.8437 ± 0.015 |
| 10-fold | 85.63 ± 1.53 | 77.67 ± 1.31 | 92.73 ± 2.44 | 0.8505 ± 0.0147 | |
| Leave-one-out | 86.25 ± 1.48 | 77.30 ± 1.32 | 93.95 ± 2.59 | 0.8564 ± 0.0141 |
Comparison of experimental results of classification accuracy (%), sensitivity (%), specificity (%), and AUC (in terms of mean ± standard deviation) using a DT classification model, where 𝛜 indicates 1.0 × 10−5.
| Feature Set | Validation | Accuracy | Sensitivity | Specificity | AUC |
|---|---|---|---|---|---|
| FS1 | 5-fold | 75.45 ± 2.86 | 76.35 ± 4.22 | 84.71 ± 6.62 | 0.7553 ± 0.0266 |
| 10-fold | 75.50 ± 1.27 | 77.30 ± 1.32 | 73.95 ± 1.74 | 0.7563 ± 0.0125 | |
| Leave-one-out | 77.50 ± 𝛜 | 72.97 ± 𝛜 | 81.40 ± 𝛜 | 0.7718 ± 𝛜 | |
| FS2 | 5-fold | 70.63 ± 1.88 | 53.60 ± 4.25 | 85.27 ± 4.58 | 0.6944 ± 0.0182 |
| 10-fold | 71.00 ± 0.94 | 54.59 ± 3.15 | 85.12 ± 3.78 | 0.6978 ± 0.0082 | |
| Leave-one-out | 70.00 ± 𝛜 | 45.95 ± 𝛜 | 90.70 ± 𝛜 | 0.6832 ± 𝛜 | |
| FS3 | 5-fold | 66.79 ± 1.13 | 61.78 ± 4.88 | 71.10 ± 3.90 | 0.6644 ± 0.0122 |
| 10-fold | 65.00 ± 2.85 | 62.16 ± 2.96 | 67.44 ± 3.89 | 0.6480 ± 0.0280 | |
| Leave-one-out | 66.25 ± 𝛜 | 70.27 ± 𝛜 | 62.79 ± 𝛜 | 0.6653 ± 𝛜 | |
| FS4 | 5-fold | 66.88 ± 3.59 | 61.71 ± 2.88 | 71.32 ± 5.48 | 0.6652 ± 0.0347 |
| 10-fold | 67.50 ± 1.12 | 58.38 ± 2.76 | 75.35 ± 1.86 | 0.6686 ± 0.0117 | |
| Leave-one-out | 65.00 ± 𝛜 | 56.76 ± 𝛜 | 72.09 ± 𝛜 | 0.6442 ± 𝛜 | |
| FS5 | 5-fold | 78.44 ± 2.32 | 79.39 ± 3.56 | 77.62 ± 4.93 | 0.7851 ± 0.0221 |
| 10-fold | 80.25 ± 3.10 | 79.46 ± 2.16 | 80.93 ± 5.58 | 0.8019 ± 0.0293 | |
| Leave-one-out | 82.50 ± 𝛜 | 83.78 ± 𝛜 | 81.40 ± 𝛜 | 0.8259 ± 𝛜 | |
| FS6 | 5-fold | 79.84 ± 3.09 | 76.01 ± 4.95 | 83.14 ± 4.15 | 0.7958 ± 0.0312 |
| 10-fold | 79.82 ± 1.82 | 79.92 ± 4.30 | 79.73 ± 3.67 | 0.7983 ± 0.0185 | |
| Leave-one-out | 83.75 ± 𝛜 | 83.78 ± 𝛜 | 83.72 ± 𝛜 | 0.8375 ± 𝛜 | |
| FS7 | 5-fold | 81.46 ± 1.97 | 77.93 ± 2.88 | 84.50 ± 3.72 | 0.8121 ± 0.019 |
| 10-fold | 80.36 ± 1.10 | 80.31 ± 2.78 | 80.40 ± 2.44 | 0.8035 ± 0.0112 | |
| Leave-one-out | 83.75 ± 𝛜 | 83.78 ± 𝛜 | 83.72 ± 𝛜 | 0.8375 ± 𝛜 |
Comparison of experimental results of classification accuracy (%), sensitivity (%), specificity (%), and AUC (in terms of mean ± standard deviation) using an LDA classification model, where 𝛜 indicates 1.0 × 10−5.
| Feature Set | Validation | Accuracy | Sensitivity | Specificity | AUC |
|---|---|---|---|---|---|
| FS1 | 5-fold | 79.38 ± 0.88 | 72.97 ± 𝛜 | 84.88 ± 1.64 | 0.7893 ± 0.0082 |
| 10-fold | 79.75± 0.94 | 72.97 ± 𝛜 | 85.58 ± 1.74 | 0.7928 ± 0.0087 | |
| Leave-one-out | 80.00 ± 𝛜 | 72.97 ± 𝛜 | 86.05 ± 𝛜 | 0.7951 ± 𝛜 | |
| FS2 | 5-fold | 73.03 ± 1.13 | 58.69 ± 1.89 | 85.38 ± 1.05 | 0.7203 ± 0.0116 |
| 10-fold | 72.92 ± 0.59 | 59.01 ± 1.86 | 84.89 ± 1.17 | 0.7195 ± 0.0064 | |
| Leave-one-out | 71.25 ± 𝛜 | 56.76 ± 𝛜 | 83.72 ± 𝛜 | 0.7024 ± 𝛜 | |
| FS3 | 5-fold | 72.29 ± 0.86 | 74.33 ± 1.36 | 70.54 ± 2.19 | 0.7243 ± 0.0078 |
| 10-fold | 71.50 ± 1.22 | 74.6 ± 1.33 | 68.84 ± 1.86 | 0.7172 ± 0.0119 | |
| Leave-one-out | 72.50 ± 𝛜 | 75.68 ± 𝛜 | 69.77 ± 𝛜 | 0.7272 ± 𝛜 | |
| FS4 | 5-fold | 73.13 ± 0.88 | 50 ± 1.91 | 93.02 ± 𝛜 | 0.7151 ± 0.0095 |
| 10-fold | 73.39 ± 0.56 | 50.58 ± 1.22 | 93.02 ± 𝛜 | 0.718 ± 0.0061 | |
| Leave-one-out | 73.75 ± 𝛜 | 51.35 ± 𝛜 | 93.02 ± 𝛜 | 0.7219 ± 𝛜 | |
| FS5 | 5-fold | 81.56 ± 0.54 | 73.99 ± 1.31 | 88.08 ± 0.77 | 0.8103 ± 0.0057 |
| 10-fold | 81.75 ± 0.83 | 74.87 ± 1.24 | 87.67 ± 1.06 | 0.8127 ± 0.0083 | |
| Leave-one-out | 82.50 ± 𝛜 | 75.68 ± 𝛜 | 88.37 ± 𝛜 | 0.8202 ± 𝛜 | |
| FS6 | 5-fold | 82.92 ± 0.59 | 73.42 ± 1.01 | 91.09 ± 0.86 | 0.8226 ± 0.0059 |
| 10-fold | 82.32 ± 0.8 | 72.97 ± 2.04 | 90.37 ± 0.82 | 0.8167 ± 0.0087 | |
| Leave-one-out | 82.50 ± 𝛜 | 72.97 ± 𝛜 | 90.70 ± 𝛜 | 0.8184 ± 𝛜 | |
| FS7 | 5-fold | 83.00 ± 0.93 | 75.68 ± 𝛜 | 89.30 ± 1.54 | 0.8249 ± 0.0077 |
| 10-fold | 82.29 ± 0.47 | 75.68 ± 𝛜 | 87.98 ± 0.87 | 0.8183 ± 0.0043 | |
| Leave-one-out | 82.50 ± 𝛜 | 75.68 ± 𝛜 | 88.37 ± 𝛜 | 0.8202 ± 𝛜 |
Comparison of experimental results of classification accuracy (%), sensitivity (%), specificity (%), and AUC (in terms of mean ± standard deviation) using the different classifiers for only our proposed feature set (FS7), where 𝛜 indicates 1.0 × 10−5.
| Classifier | Validation | Accuracy | Sensitivity | Specificity | AUC |
|---|---|---|---|---|---|
| SVM | 5-fold |
|
|
|
|
| 10-fold |
| 80.27 ± 2.11 |
|
| |
| Leave-one-out | 81.08 ± 𝛜 | ||||
| RF | 5-fold | 84.86 ± 1.5 | 77.78 ± 2.47 | 90.96 ± 2.31 | 0.8437 ± 0.015 |
| 10-fold | 85.63 ± 1.53 | 77.67 ± 1.31 | 92.73 ± 2.44 | 0.8505 ± 0.0147 | |
| Leave-one-out | 86.25 ± 1.48 | 77.3 ± 1.32 | 93.95 ± 2.59 | 0.8564 ± 0.0141 | |
| DT | 5-fold | 81.46 ± 1.97 | 77.93 ± 2.88 | 84.50 ± 3.72 | 0.8121 ± 0.019 |
| 10-fold | 80.36 ± 1.1 |
| 80.40 ± 2.44 | 0.8035 ± 0.0112 | |
| Leave-one-out | 83.75 ± 𝛜 | 83.72 ± 𝛜 | 0.8375 ± 𝛜 | ||
| LDA | 5-fold | 83.00 ± 0.93 | 75.68 ± 𝛜 | 89.3 0± 1.54 | 0.8249 ± 0.0077 |
| 10-fold | 82.29 ± 0.47 | 75.68 ± 𝛜 | 87.98 ± 0.87 | 0.8183 ± 0.0043 | |
| Leave-one-out | 82.50 ± 𝛜 | 75.68 ± 𝛜 | 88.37 ± 𝛜 | 0.8202 ± 𝛜 |
Figure 11ROC curves of (a) SVM comparing various feature sets using leave-one-out cross validation, (b) different classifiers comparison using FS7 along with leave-one-out cross validation, (c) different classifiers comparison using FS7 along with 5-fold cross validation, and (d) comparison of classifiers using FS7 along with 10-fold cross validation.