| Literature DB >> 31234363 |
Annarita Fanizzi1, Liliana Losurdo2, Teresa Maria A Basile3, Roberto Bellotti4, Ubaldo Bottigli5, Pasquale Delogu6, Domenico Diacono7, Vittorio Didonna8, Alfonso Fausto9, Angela Lombardi10, Vito Lorusso11, Raffaella Massafra12, Sabina Tangaro13, Daniele La Forgia14.
Abstract
Contrast-Enhanced Spectral Mammography (CESM) is a novelty instrumentation for diagnosing of breast cancer, but it can still be considered operator dependent. In this paper, we proposed a fully automatic system as a diagnostic support tool for the clinicians. For each Region Of Interest (ROI), a features set was extracted from low-energy and recombined images by using different techniques. A Random Forest classifier was trained on a selected subset of significant features by a sequential feature selection algorithm. The proposed Computer-Automated Diagnosis system is tested on 48 ROIs extracted from 53 patients referred to Istituto Tumori "Giovanni Paolo II" of Bari (Italy) from the breast cancer screening phase between March 2017 and June 2018. The present method resulted highly performing in the prediction of benign/malignant ROIs with median values of sensitivity and specificity of 87 . 5 % and 91 . 7 % , respectively. The performance was high compared to the state-of-the-art, even with a moderate/marked level of parenchymal background. Our classification model outperformed the human reader, by increasing the specificity over 8 % . Therefore, our system could represent a valid support tool for radiologists for interpreting CESM images, both reducing the false positive rate and limiting biopsies and surgeries.Entities:
Keywords: background parenchymal enhancement (BPE); breast cancer; computer-automated diagnosis (CADx); contrast-enhanced spectral mammography (CESM); feature extraction; machine learning techniques
Year: 2019 PMID: 31234363 PMCID: PMC6616937 DOI: 10.3390/jcm8060891
Source DB: PubMed Journal: J Clin Med ISSN: 2077-0383 Impact factor: 4.241
Figure 1Typical example of images obtained from a CESM examination [27]: low-energy (a), high-energy (b) and recombined (c) images. A suspicious lesion is pointed by a white arrow on the recombined image, masked by the denser breast parenchyma on the low-energy image.
Figure 2CESM examination: the diagram shows the steps of the image acquisition in the standard CranioCaudal (CC) and MedioLateral Oblique (MLO) views, after the iodinated Contrast Medium (CM) injection. Breast 1 stands for the breast with no pathology, while breast 2 is the breast with one or more lesions.
Figure 3BPE distribution of the patients (%) undergone to CESM examinations and analyzed in this study.
Figure 4Flow-chart of the proposed model. In the first phase, a set of features on each ROI is extracted, then a sub-set of significant features is selected; finally, binary RF classifiers are trained.
Figure 5Haar decomposition schema. (a) one- and (b) two-level Haar decomposition.
Figure 6(a) The spatial relationships of pixels; (b) the GLCM directions. D is the offset and represents the distance between each pixels and the pixel of interest.
Figure 7Scheme of the extraction of each feature set.
Diagnostic performances of human reader on only LE and CESM images with respect to micro-histological investigation. We denote with “CESM images” the joint reading of LE and RC images.
| Diagnostic Test | Only LE | CESM | Micro-Histological Results |
|---|---|---|---|
| No. of selected patients | 53 | 53 | 53/53 |
| No. of selected breasts | 47 | 48 | 47/48 |
| No. of selected lesions | 57 | 58 | 57/58 |
| No. of malignant lesions (TP) | 38 (34) | 38 (34) | 34/34 |
| No. of benign lesions (TN) | 16 (16) | 20 (20) | 23/24 |
| Sensitivity [CI | 91.2% (88.8–93.5%) | 100% (95.0–100%) | |
| Specificity [CI | 69.6% (67.9–71.2%) | 83.3% (81.2–85.4%) | |
| Accuracy [CI | 82.5% (80.4–84.6%) | 93.1% (90.7–95.5%) | |
| MCC |
|
| |
| Size of lesions: mean ± SD (mm) |
|
| |
| Size of the smallest lesion detected (mm) | 6 | 6 |
Figure 8Accuracy for benign/malignant classification with respect to the number of features. The gray box highlights the accuracy peak obtained for , significantly different to the model with only two features (p-value of Wilcoxon–Mann–Whitney test ).
ROI classification into benign and malignant, in terms of accuracy, sensitivity, specificity, and MCC with respect to BPE degree. For each BPE class, the number of benign/malignant (B/M) ROIs is shown.
| BPE I | BPE II | BPE III-IV | Overall Dataset | |
|---|---|---|---|---|
| Accuracy (%) | 86.0 (84.0–88.0) | 95.5 (90.9–100) | 83.3 (81.3–84.3) | 87.5 (85.4–89.6) |
| Sensitivity (%) | 75.0 (70.0–80.0) | 100 (100–100) | 92.9 (85.7–100) | 87.5 (83.3–91.7) |
| Specificity (%) | 96.7 (86.7–100) | 100 (75.0–100) | 80.0 (60.0–80.0) | 91.7 (87.5–91.7) |
| MCC | 0.84 (0.72–0.91) | 0.91 (0.81–1) | 0.68 (0.66–0.71) | 0.76 (0.74–0.79) |
Result comparison between human reader and proposed model.
| Human Reader (24B/34M) | Proposed Model (24B/24M) | |
|---|---|---|
| Accuracy (%) |
|
|
| Sensitivity (%) | 100 |
|
| Specificity (%) |
|
|
| MCC |
|
|
ROI classification into background and lesion, in terms of accuracy, sensitivity and specificity with respect to BPE degrees. The minimal BPE class (I) is not included.
| BPE II | BPE III–IV | Overall Dataset | |
|---|---|---|---|
| Accuracy (%) | 82.9 (80.0–88.6) | 77.3 (77.3–81.8) | 82.5 (79.0–82.5) |
| Sensitivity (%) | 70.0 (65.0–80.0) | 75.0 (66.7–91.7) | 70.3 (68.8–84.4) |
| Specificity (%) | 100 (100–100) | 85.0 (85.0–90.0) | 94.0 (88.0–96.0) |
| MCC | 0.71 (0.67–0.79) | 0.57 (0.55–0.65) | 0.65 (0.63–0.69) |
All features whose selection frequency is significantly different from the chance in the benign/malignant classification (p-value null model test <0.05). Direction and level of the Haar decomposition, GLCM direction, and gradient’s magnitude or direction of the extracted features are shown.
| Feature Set | Feature | ROI Type | Frequency (%) |
|---|---|---|---|
| COUNT | SIFT | LE |
|
| HAAR | Variance_LL2 | RC |
|
| STAT | RelativeSmoothness | RC |
|
| GRAD | RelativeSmoothness_Gmag | LE |
|
| HAAR | Variance_LL1 | RC |
|
| STAT | Variance | RC |
|
| GRAD | Variance_Gmag | LE |
|
| GLCM | ClusterProminence_HL1 ( | RC |
|
| GLCM | Correlation_LH1 ( | RC |
|
| HAAR | Variance_LH1 | LE |
|
| HAAR | RelativeSmoothness_HL2 | RC |
|
| HAAR | Variance_LH2 | LE |
|
| GLCM | Homogeneity_LH1 ( | RC |
|
| STAT | Standard Deviation | RC |
|
| HAAR | Variance_HL2 | RC |
|
| STAT | Maximum − Minimum | RC |
|
| COUNT | MinEigen | RC |
|
All features whose selection frequency is significantly different from the chance in the background/lesion classification (p-value null model test <0.05). Direction and level of the Haar decomposition, and GLCM direction of the extracted features are shown.
| Feature Set | Feature | ROI Type | Frequency (%) |
|---|---|---|---|
| HAAR | Mean_LL2 | RC |
|
| GLCM | SumEntropy_LH1 ( | RC |
|
| GLCM | Entropy_LH1 ( | RC |
|
| HAAR | Mean_LL2 | LE |
|
| HAAR | Variance_LH1 | RC |
|
| GLCM | Entropy_LH1 ( | RC |
|
| HAAR | Entropy_LH1 | RC |
|
| GLCM | SumEntropy_LH1 ( | RC |
|
| HAAR | RelativeSmoothness_LH1 | RC |
|
| HAAR | RelativeSmoothness_LH2 | RC |
|
| HAAR | Variance_LH2 | RC |
|
| GLCM | Energy_LH1 ( | RC |
|
| HAAR | Mean_LL1 | RC |
|
| HAAR | Variance_LL2 | RC |
|
| HAAR | Mean_LL1 | LE |
|
| GLCM | Entropy_LH1 ( | RC |
|
| STAT | Minimum | RC |
|
| GLCM | ClusterProminence_HH1 ( | RC |
|
| COUNT | MSER | RC |
|
| HAAR | Variance_LL1 | RC |
|
| HAAR | RelativeSmoothness_LL1 | RC |
|
| GLCM | Entropy_LH1 ( | RC |
|
Figure 9Accuracy for background/lesion classification with respect to the number of features. The gray box highlights the accuracy obtained for , significantly different to the model with only two features (p-value of Wilcoxon–Mann–Whitney test ).