| Literature DB >> 31187216 |
Michela Antonelli1,2, Edward W Johnston3, Nikolaos Dikaios3, King K Cheung3, Harbir S Sidhu3, Mrishta B Appayya3, Francesco Giganti4,5, Lucy A M Simmons5, Alex Freeman6, Clare Allen4, Hashim U Ahmed5, David Atkinson3, Sebastien Ourselin2, Shonit Punwani7,8.
Abstract
OBJECTIVE: The purpose of this study was: To test whether machine learning classifiers for transition zone (TZ) and peripheral zone (PZ) can correctly classify prostate tumors into those with/without a Gleason 4 component, and to compare the performance of the best performing classifiers against the opinion of three board-certified radiologists.Entities:
Keywords: Diagnosis, computer-assisted; Gleason score; Machine learning; Magnetic resonance imaging; Prostate cancer
Mesh:
Year: 2019 PMID: 31187216 PMCID: PMC6682575 DOI: 10.1007/s00330-019-06244-2
Source DB: PubMed Journal: Eur Radiol ISSN: 0938-7994 Impact factor: 5.315
Fig. 1Flow diagram of patient selection for the training cohort
Clinical characteristics
| PZ | TZ | ||||||
|---|---|---|---|---|---|---|---|
| Parameter | Min | Max | Median | Min | Max | Median | |
| TRC | Age (years) | 43 | 79 | 63.4 | 48 | 83.4 | 65.5 |
| PSA (ng/ml) | 2.5 | 19 | 6.6 | 2.7 | 30.3 | 9.6 | |
| GV (ml) | 16 | 77 | 35.2 | 18 | 65.8 | 32.1 | |
| TV (ml) | 0.02 | 5.1 | 0.4 | 0.03 | 10 | 1.2 | |
| TSC | Age (years) | 55.7 | 80.2 | 69.8 | 56.8 | 70 | 63.3 |
| PSA (ng/ml) | 2.7 | 91 | 8.1 | 3.4 | 18 | 8.6 | |
| GV (ml) | 20.8 | 75.9 | 43.8 | 25 | 100 | 35 | |
| TV (ml) | 0.1 | 15 | 0.9 | 0.05 | 9.4 | 0.8 | |
PZ, peripheral zone; TZ, transition zone; PSA, prostate-specific antigen; GV, gland volume; TV, tumor volume; TRC, model derivation cohort; TSC, temporally separated cohort
Description of mp-MRI parameters
| Sequence | Coil | TR | TE | FA degrees | WFS (pix) | BW Hz/Px | FoV (mm) | SL (mm) | Gap | TSE factor | PD | FS | ACQ | TRs (s) | Total scan |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| T2 TSE coronal | Dual | 6128 | 100 | 90 | 2.704 | 160.7 | 180 | 3 | 3 | 16 | R > L | No | 300 × 290 | 05:55.4 | |
| T2 TSE axial | Dual | 5407 | 100 | 90 | 2.704 | 160.7 | 180 | 3 | 0 | 16 | R > L | No | 300 × 290 | 05:13.6 | |
| T1W TSE | Dual | 487 | 8 | 90 | 1.997 | 217.6 | 240 | 3 | 3 | 4 | R > L | No | 184 × 184 | 03:06.8 | |
| DWI 01505001000 | Dual | 2753 | 80 | 90 | 40.353 | 10.8 | 220 | 5 | 0 | A > P | SPAIR | 168 × 169 | 05:16.5 | ||
| DWI b2000 | Dual | 2000 | 78 | 90 | 44.108 | 9.9 | 220 | 5 | 0 | A > P | SPIR | 168 × 169 | 03:40.0 | ||
| DCE | Dual | 5.8 | 2.8 | 10 | 1.766 | 246.1 | 180 | 3 | 0 | R > L | SPAIR | 140 × 162 | 13 | 04:14.1 |
TSE, turbo spin echo; TR, time to repetition; TE, time to echo; FA, flip angle; WFS, water-fat shift; BW, bandwidth; FoV, field of view; DWI, diffusion-weighted imaging; DCE, dynamic contrast-enhanced; TRs, temporal resolution; PD, phasing direction; SL, slice thickness
Fig. 2a Axial T2 TSE of a 64-year-old male showing the volumetric contour of a TZ prostate tumor for extraction of mp-MRI parameters. b Axial post gadolinium dynamic contrast-enhanced image. c Axial (b) = 2000 mm/s2. d ADC “map”
Fig. 3Flow diagram outlining the feature selection validation strategy used in the study. CFS, correlation features selection; ALL, set containing all the features; SELTZ, subset of feature selected for the TZ; SELPZ, subset of feature selected for PZ; AUCALLPZ, area under the curve obtained on PZ using all the features; AUCALLTZ, area under the curve obtained on TZ using all the features; LR, linear regression; FFNN, feed-forward neural network; SVM, support vector machine; NB, naïve Bayes; RF, random forest, AUCSELPZ, area under the curve obtained on PZ obtained using the selected feature; AUCSELTZ, area under the curve obtained on TZ obtained using the selected feature
Fig. 4Flow diagram outlining the model validation strategy used in the study. SELTZ, subset of feature selected for the transition zone; SELPZ, subset of feature selected for PZ; AUCSELPZ, area under the curve obtained on PZ obtained using the selected feature; AUCSELTZ, area under the curve obtained on TZ obtained using the selected feature; LR, linear regression; FFNN, feed-forward neural network; SVM, support vector machine; NB, naïve Bayes; RF, random forest; PZ, peripheral zone; TZ, transition zone
Mean and standard deviation (in brackets) of the AUC obtained on the test set by the five classifiers following the fivefold cross-validation, when all the features (ALL) and only the features selected by CFS (SEL) are used
| TZ | PZ | |||||
|---|---|---|---|---|---|---|
| ALL | SELTZ | ALL | SELPZ | |||
| LR | 0.65 (0.068) | 0.73 (0.004) | < 0.0001 | 0.80 (0.020) | 0.83 (0.028) | < 0.0001 |
| FFNN | 0.62 (0.084) | 0.61 (0.081) | 0.2713 | 0.77 (0.033) | 0.80 (0.032) | < 0.0001 |
| SVM | 0.43 (0.064) | 0.42 (0.069) | 0.2431 | 0.72 (0.035) | 0.73 (0.028) | 0.0431 |
| NB | 0.73 (0.060) | 0.75 (0.047) | < 0.0001 | 0.78 (0.022) | 0.81 (0.018) | < 0.0001 |
| RF | 0.53 (0.061) | 0.53 (0.071) | 0.32983 | 0.80 (0.023) | 0.80 (0.024) | 0.3272 |
TZ, transition zone; PZ, peripheral zone; ALL, all the features; SEL, features selected by CFS for TZ; SEL, features selected by CFS for PZ; LR, linear regression; FFNN, feed-forward neural network; SVM, support vector machine; NB, naïve Bayes; RF, random forest
Results of the statistical tests on AUC distributions obtained on the test set by the 5 classifiers, trained with SEL following fivefold cross-validation
| TZ | PZ | |||||||||
| Friedman rank | Iman and Davenport, | Hypothesis | Friedman rank | Iman and Davenport, | Hypothesis | |||||
| LR | 1.86 | < 0.0001 | Rejected | 1.49 | < 0.0001 | Rejected | ||||
| FFNN | 3.43 | 3.00 | ||||||||
| SVM | 4.95 | 4.89 | ||||||||
| NB | 1.38 | 2.62 | ||||||||
| RF | 3.74 | 3.00 | ||||||||
| Holm post hoc procedure | ||||||||||
| Alpha/i | Hypothesis | Alpha/i | Hypothesis | |||||||
| 4 | SVM | 14.40 | < 0.0001 | 0.0125 | Rejected | SVM | 15.20 | < 0.0001 | 0.0125 | Rejected |
| 3 | TREE | 10.59 | < 0.0001 | 0.00167 | Rejected | FFNN | 6.75 | < 0.0001 | 0.00167 | Rejected |
| 2 | FFNN | 9.19 | < 0.0001 | 0.025 | Rejected | NB | 6.75 | < 0.0001 | 0.025 | Rejected |
| 1 | LR | 2.15 | 0.0318 | 0.05 | Rejected | RF | 5.05 | < 0.0001 | 0.05 | Rejected |
TZ, transition zone; PZ, peripheral zone; LR, linear regression; FFNN, feed-forward neural network; SVM, support vector machine; NB, naïve Bayes; RF, random forest
Fig. 5Mean ROC curve, along with the sensitivity and specificity mean values obtained by the three radiologists and computed at the three cut-off points generated on the test set following the fivefold cross-validation by the best performing classifiers (NB and LR) on TZ (left) and PZ (right). PZ, peripheral zone; TZ, transition zone; LR, linear regression; NB, naïve Bayes; ROC, receiver operator characteristic
Mean values of sensitivity (SN) and specificity (SP) at the three cut-off points obtained by the three radiologists and the best performing classifiers following fivefold cross-validation
| SN | SP | ||
|---|---|---|---|
| TZ | NB point_50 | 0.88 | 0.51 |
| NB point_01 | 0.75 | 0.57 | |
| NB point_RAD | 0.92 | 0.56 | |
| Mean Rad | 0.82 | 0.44 | |
| PZ | LR point_50 | 0.93 | 0.53 |
| LR point_01 | 0.76 | 0.73 | |
| LR point_RD | 0.90 | 0.65 | |
| Mean Rad | 0.72 | 0.40 |