| Literature DB >> 33265900 |
Mohammad I Daoud1, Samir Abdel-Rahman1, Tariq M Bdair2, Mahasen S Al-Najar3, Feras H Al-Hawari1, Rami Alazrai1.
Abstract
This study aims to enable effective breast ultrasound image classification by combining deep features with conventional handcrafted features to classify the tumors. In particular, the deep features are extracted from a pre-trained convolutional neural network model, namely the VGG19 model, at six different extraction levels. The deep features extracted at each level are analyzed using a features selection algorithm to identify the deep feature combination that achieves the highest classification performance. Furthermore, the extracted deep features are combined with handcrafted texture and morphological features and processed using features selection to investigate the possibility of improving the classification performance. The cross-validation analysis, which is performed using 380 breast ultrasound images, shows that the best combination of deep features is obtained using a feature set, denoted by CONV features that include convolution features extracted from all convolution blocks of the VGG19 model. In particular, the CONV features achieved mean accuracy, sensitivity, and specificity values of 94.2%, 93.3%, and 94.9%, respectively. The analysis also shows that the performance of the CONV features degrades substantially when the features selection algorithm is not applied. The classification performance of the CONV features is improved by combining these features with handcrafted morphological features to achieve mean accuracy, sensitivity, and specificity values of 96.1%, 95.7%, and 96.3%, respectively. Furthermore, the cross-validation analysis demonstrates that the CONV features and the combined CONV and morphological features outperform the handcrafted texture and morphological features as well as the fine-tuned VGG19 model. The generalization performance of the CONV features and the combined CONV and morphological features is demonstrated by performing the training using the 380 breast ultrasound images and the testing using another dataset that includes 163 images. The results suggest that the combined CONV and morphological features can achieve effective breast ultrasound image classifications that increase the capability of detecting malignant tumors and reduce the potential of misclassifying benign tumors.Entities:
Keywords: breast cancer; cancer detection; computer-aided diagnosis; convolution neural networks; deep features; deep learning; morphological features; texture features; tumor classification
Year: 2020 PMID: 33265900 PMCID: PMC7730057 DOI: 10.3390/s20236838
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Graphical illustration of the process of extracting deep features from the ROI in the BUS image using the pre-trained VGG19 model.
The six levels that are employed to extract the deep feature sets from the BUS image.
| Deep Features Extraction Level | Feature Sets | Description |
|---|---|---|
| CF Extraction Level 1 | A total of 11,008 convolution features organized into 16 feature sets are extracted from the ROI that includes the tumor, where each feature set corresponds to one of the convolution layers of the VGG19 model. To generate a given feature set, the convolution features | |
| CF Extraction Level 2 | A total of 11,008 convolution features organized into 5 feature sets are extracted from the ROI that includes the tumor, where each feature set corresponds to one of the convolution blocks of the VGG19 model. To generate a given feature set, the feature sets extracted from the layers of the convolution block that corresponds to the feature set are concatenated and normalized. | |
| CF Extraction Level 3 |
| A total of 11,008 convolution features organized into 1 feature set are extracted from the ROI that includes the tumor. To generate the feature set, the feature sets extracted from all convolution blocks of the VGG19 model are concatenated and normalized. |
| FCF Extraction Level 1 | Two feature sets, where each set includes 4096 fully connected features, are extracted from the ROI that includes the tumor. The computation of the two feature sets is achieved by extracting and normalizing the activations of first and second fully connected layers of the VGG19 model. | |
| FCF Extraction Level 2 |
| A feature set that includes 8192 fully connected features is extracted from the ROI that includes the tumor. The computation of the feature set is achieved by concatenating and normalizing the two feature sets extracted from the first and second fully connected layers of the VGG19 model. |
| Combined CF FCF Extraction |
| A feature set that includes 19,200 convolution and fully connected features is extracted from the ROI that includes the tumor. The computation of the feature set is performed by extracting deep features from all convolution blocks and all fully connected layers of the VGG19 model and then concatenating and normalizing the extracted features. |
The handcrafted texture and morphological features that are extracted from the BUS image.
| Type | Features | Description |
|---|---|---|
| Texture features | Autocorrelation [ | A total of 800 texture features are extracted from the ROI that includes the tumor. In particular, 40 GLCMs are generated using 10 distances |
| Morphological features | Tumor area [ | A total of 18 morphological features are extracted from the tumor outline. In particular, 10 morphological features are computed directly based on the tumor outline. Moreover, 2 morphological features are computed based on the NRL of the tumor. In addition, 6 morphological features are computed by fitting an ellipse to the tumor outline. |
The BUS image classification results obtained using the feature sets extracted from the pre-trained VGG19 model at six deep features extraction levels. For the six performance metrics, the mean ± standard deviation values are computed across the ten folds of the cross-validation procedure performed using Dataset 1.
| Deep Features Extraction Level | Feature Set | Total No. of Features | No. of Selected Features | Selected Features | Accuracy | Sensitivity | Specificity | PPV | NPV | MCC |
|---|---|---|---|---|---|---|---|---|---|---|
| CF Extraction Level 1 |
| 128 | 34 |
|
|
|
|
|
| |
|
| 128 | 21 |
|
|
|
|
|
| ||
|
| 256 | 25 |
|
|
|
|
|
| ||
|
| 256 | 38 |
|
|
|
|
|
| ||
|
| 512 | 33 |
|
|
|
|
|
| ||
|
| 512 | 47 |
|
|
|
|
|
| ||
|
| 512 | 29 |
|
|
|
|
|
| ||
|
| 512 | 86 |
|
|
|
|
|
| ||
|
| 1024 | 20 |
|
|
|
|
|
| ||
|
| 1024 | 61 |
|
|
|
|
|
| ||
|
| 1024 | 31 |
|
|
|
|
|
| ||
|
| 1024 | 32 |
|
|
|
|
|
| ||
|
| 1024 | 58 |
|
|
|
|
|
| ||
|
| 1024 | 41 |
|
|
|
|
|
| ||
|
| 1024 | 23 |
|
|
|
|
|
| ||
|
| 1024 | 31 |
|
|
|
|
|
| ||
| CF Extraction Level 2 |
| 256 | 34 |
|
|
|
|
|
| |
|
| 512 | 23 |
|
|
|
|
|
| ||
|
| 2048 | 15 |
|
|
|
|
|
| ||
|
| 4096 | 34 |
|
|
|
|
|
| ||
|
| 4096 | 27 |
|
|
|
|
|
| ||
| CF Extraction Level 3 |
| 11,008 | 25 | |||||||
| FCF Extraction Level 1 |
| 4096 | 36 |
|
|
|
|
|
| |
|
| 4096 | 98 |
|
|
|
|
|
| ||
| FCF Extraction Level 2 |
| 8192 | 36 |
|
|
|
|
|
| |
| Combined CF FCF Extraction |
| 19,200 | 25 |
The BUS image classification results obtained using the feature set compared with the classification results achieved by combining the feature set with handcrafted texture and morphological features. For the six performance metrics, the mean ± standard deviation values are computed across the ten folds of the cross-validation procedure performed using Dataset 1.
| Features | Total No. of Features | No. of Selected Features | Selected Features | Accuracy | Sensitivity | Specificity | PPV | NPV | MCC |
|---|---|---|---|---|---|---|---|---|---|
| 11,008 | 25 | 94.2 ± 2.7 | 93.3 ± 5.1 | 94.9 ± 4.1 | 93.3 ± 5.6 | 94.9 ± 4.4 | 88.2 ± 5.5 | ||
| 11,808 | 25 | 94.2 ± 2.7 | 93.3 ± 5.1 | 94.9 ± 4.1 | 93.3 ± 5.6 | 94.9 ± 4.4 | 88.2 ± 5.5 | ||
| 11,026 | 21 | ||||||||
| 11,826 | 21 |
Performance comparison between the classification results obtained using the feature set with and without features selection, the combined feature set and morphological features with features selection, the handcrafted texture, morphological, and combined texture and morphological features with features selection, and the fine-tuned VGG19 model. For the six performance metrics, the mean ± standard deviation values are computed across the ten folds of the cross-validation procedure performed using Dataset 1.
| Features | Total No. of Features | No. of Selected Features | Accuracy | Sensitivity | Specificity | PPV | NPV | MCC |
|---|---|---|---|---|---|---|---|---|
| 11,008 | 25 | 94.2± 2.7 | 93.3 ± 5.1 | 94.9± 4.1 | 93.3 ± 5.6 | 94.9 ± 4.4 | 88.2 ± 5.5 | |
| 11,026 | 21 | |||||||
| 11,008 | - | 80.5 ± 4.5 | 82.2 ± 9.0 | 79.3 ± 8.0 | 74.9 ± 7.1 | 85.6 ± 7.4 | 60.9 ± 8.6 | |
| Texture features (with features selection) | 800 | 38 | 84.2 ± 4.3 | 81.0 ± 9.6 | 86.6 ± 6.6 | 82.0 ± 10.6 | 85.8 ± 5.5 | 67.7 ± 9.2 |
| Morphological features (with features selection) | 18 | 8 | 87.1 ± 4.7 | 82.2 ± 8.9 | 90.8 ± 7.9 | 87.0 ± 11.9 | 87.2 ± 6.2 | 73.6 ± 9.7 |
| Texture and morphological features (with features selection) | 818 | 29 | 87.9 ± 6.1 | 82.8 ± 12.3 | 91.7 ± 3.9 | 88.2 ± 6.1 | 87.7 ± 7.7 | 75.2 ± 12.7 |
| Fine-tuning | - | - | 88.2 ± 4.5 | 83.4 ± 8.0 | 91.7 ± 5.5 | 88.3 ± 8.1 | 88.1 ± 6.9 | 75.8 ± 9.1 |
Figure 2The ROC curves obtained using the feature set (with features selection), the combined feature set and morphological features (with features selection), the combined handcrafted texture and morphological features (with features selection), and the fine-tuned VGG19 CNN model.