| Literature DB >> 31545461 |
Eleftherios Trivizakis1, Georgios S Ioannidis1, Vasileios D Melissianos1, Georgios Z Papadakis1, Aristidis Tsatsakis2, Demetrios A Spandidos3, Kostas Marias1.
Abstract
Potentially suspicious breast neoplasms could be masked by high tissue density, thus increasing the probability of a false‑negative diagnosis. Furthermore, differentiating breast tissue type enables patient pre‑screening stratification and risk assessment. In this study, we propose and evaluate advanced machine learning methodologies aiming at an objective and reliable method for breast density scoring from routine mammographic images. The proposed image analysis pipeline incorporates texture [Gabor filters and local binary pattern (LBP)] and gradient‑based features [histogram of oriented gradients (HOG) as well as speeded‑up robust features (SURF)]. Additionally, transfer learning approaches with ImageNet trained weights were also used for comparison, as well as a convolutional neural network (CNN). The proposed CNN model was fully trained on two open mammography datasets and was found to be the optimal performing methodology (AUC up to 87.3%). Thus, the findings of this study indicate that automated density scoring in mammograms can aid clinical diagnosis by introducing artificial intelligence‑powered decision‑support systems and contribute to the 'democratization' of healthcare by overcoming limitations, such as the geographic location of patients or the lack of expert radiologists.Entities:
Mesh:
Year: 2019 PMID: 31545461 PMCID: PMC6787954 DOI: 10.3892/or.2019.7312
Source DB: PubMed Journal: Oncol Rep ISSN: 1021-335X Impact factor: 3.906
Breast density scoring.
| Methodology or study, authors (Refs.) | mini-MIAS (2-class) ACC/AUC (%) | mini-MIAS (3-class) ACC (%) | DDSM (2-class) ACC/AUC (%) | DDSM (3-class) ACC (%) | DDSM (4-class) ACC (%) | No. of images |
|---|---|---|---|---|---|---|
| Machine learning | ||||||
| HOG | 71.8/52.3 | 53.1 | – | – | – | Full |
| LBP | 83.3/78.0 | 74.2 | 67.1/71.4 | 55.1 | 36.6 | Full |
| SURF | 82.6/77.6 | 68.3 | 67.5 | 46.8 | Full | |
| Gabor + LBP | 76.7/68.4 | 61.7 | 62.8/67.1 | 52.1 | 35.8 | Full |
| Selected HOG | 69.0/48.7 | 53.1 | – | – | – | Full |
| Selected LBP | 77.9/71.1 | 70.2 | 73.7/79.2 | 64.5 | 40.7 | Full |
| Selected SURF | 83.8/77.6 | 73.6 | 75.6/81.5 | 62.9 | 46.8 | Full |
| Selected Gabor + LBP | 64.9/60.9 | 50.9 | 62.1/67.7 | 55.4 | 37.7 | Full |
| Bovis | – | – | 96.6/ - | – | 71.4 | 377 |
| Tzikopoulos | – | 70.3 | – | – | – | Full |
| Oliver | – | – | – | – | 40.3–47 | 300 |
| Deep learning | ||||||
| Proposed architecture | Full | |||||
| Inception 3 | 73.6/75.7 | 70.8 | 72.7/79.1 | 49.5 | 48.8 | Full |
| VGG19 | 68.6/67.8 | 72.4 | 72.1/79.3 | 62 | 36.8 | Full |
| InceptionResNetV2 | 69.9/63.7 | 73.1 | 72.7/79.2 | 55.6 | 37.3 | Full |
| DenseNet201 | 75.5/79.6 | 77.9 | 73.1/80.5 | 61.7 | 36.5 | Full |
| NASNetLarge | 66.5/66.3 | 72.8 | 72.3/78.7 | 61.4 | 37.8 | Full |
The table presents a 5-fold cross-validation averages for the examined methodologies. HOG, histogram of oriented gradients; LBP, local binary pattern; SURF, speeded-up robust features. Values in bold font indicate the optimal performing methodologies.
Figure 1.The data stratification methodology for model fitting, hyperparameter optimization and transparent performance evaluation across every examined image analysis process.
Figure 2.Graphical representation of the machine learning workflow illustrating the three-class breast density mammogram classification case.
Figure 3.Overview of the proposed architecture (fully-trained CNN), including the network layout and layer parameters, such as the receptive field, number of filters, convolutional stride, activation function, number of neurons, dropout and classifier. CNN, convolutional neural network.
Figure 4.The examined pipeline for feature extraction and classification using ‘off-the-shelf’ pre-trained methods.