| Literature DB >> 32372771 |
Ashkan Shakarami1, Hadis Tarrah2.
Abstract
Pattern recognition and feature extraction of images always have been important subjects in improving the performance of image classification and Content-Based Image Retrieval (CBIR). Recently, Machine Learning and Deep Learning algorithms are utilized widely in order to achieve these targets. In this research, an efficient method for image description is proposed which is developed by Machine Learning and Deep Learning algorithms. This method is created using combination of an improved AlexNet Convolutional Neural Network (CNN), Histogram of Oriented Gradients (HOG) and Local Binary Pattern (LBP) descriptors. Furthermore, the Principle Component Analysis (PCA) algorithm has been used for dimension reduction. The experimental results demonstrate the superiority of the offered method compared to existing methods by improving the accuracy, mean Average Precision (mAP) and decreasing the complex computation. The experiments have been run on Corel-1000, OT and FP datasets.Entities:
Keywords: Content-based image retrieval (CBIR); Convolutional neural network (CNN); Image classification; Image descriptor; Machine learning; Pattern recognition
Year: 2020 PMID: 32372771 PMCID: PMC7198219 DOI: 10.1016/j.ijleo.2020.164833
Source DB: PubMed Journal: Optik (Stuttg) ISSN: 0030-4026 Impact factor: 2.443
Fig. 1Proposed method.
The number of categories and images per categories for used datasets.
| Datasets | Number of categories | Number of images |
|---|---|---|
| Corel-1000 | 10 | 1000 |
| OT | 8 | 2688 |
| FP (Catlech-101) | 5 | 380 |
Evaluation of AlexNet CNN on Corel-1000 dataset for Image Classification.
| Method | Train data (Accuracy ± Standard division) | Test data (Accuracy ± Standard division) |
|---|---|---|
| Accuracy ± Standard division | Accuracy ± Standard division | |
| AlexNet CNN | 97.80 ± 0.46 | 90.10 ± 1.80 |
Evaluation of proposed method on Corel-1000 dataset for Image Classification.
| Method | Train data (Accuracy ± Standard division) | Test data (Accuracy ± Standard division) | ||||
|---|---|---|---|---|---|---|
| Classifiers | Classifiers | |||||
| Random forest | SVM | KNN | Random forest | SVM | KNN | |
| Proposed method | 100 | 100 | 100 | 96 ± 0.62 | 94.70 ± 1.08 | 95.20 ± 1.04 |
Evaluation AlexNet CNN on OT dataset for Image Classification.
| Method | Train data (Accuracy ± Standard division) | Test data (Accuracy ± Standard division) |
|---|---|---|
| Accuracy ± Standard division | Accuracy ± Standard division | |
| AlexNet CNN | 95.67 ± 0.61 | 89.17 ± 0.48 |
Evaluation proposed method on OT dataset for Image Classification.
| Method | Train data (Accuracy ± Standard division) | Test data (Accuracy ± Standard division) | ||||
|---|---|---|---|---|---|---|
| Classifiers | Classifiers | |||||
| Random forest | SVM | KNN | Random forest | SVM | KNN | |
| Proposed method | 100 | 99.67 ± 0.03 | 100 | 93.86 ± 0.17 | 93.86 ± 0.21 | 93.19 ± 0.09 |
Evaluation AlexNet CNN on FP dataset for Image Classification.
| Method | Train data (Accuracy ± Standard division) | Test data (Accuracy ± Standard division) |
|---|---|---|
| Accuracy ± Standard division | Accuracy ± Standard division | |
| AlexNet CNN | 98.16 ± 0.88 | 88.95 ± 1.55 |
Evaluation proposed method on FP dataset for Image Classification.
| Method | Train data (Accuracy ± Standard division) | Test data (Accuracy ± Standard division) | ||||
|---|---|---|---|---|---|---|
| Classifiers | Classifiers | |||||
| Random forest | SVM | KNN | Random forest | SVM | KNN | |
| Proposed method | 100 | 99.87 ± 0.10 | 100 | 89.74 ± 0.88 | 88.16 ± 0.82 | 89.74 ± 0.52 |
Evaluation proposed method and comparison with AlexNet CNN on Corel-1000 dataset for CBIR.
| Methods | mAP ± Standard division | ||
|---|---|---|---|
| 5-top | 10-top | All relative images | |
| AlexNet CNN | 93.14 | 91.87 | 75.48 |
| Proposed method | 96.02 ± 1.94 | 95.80 ± 1.82 | 91.65 ± 1.22 |
Evaluation proposed method on OT dataset for CBIR.
| Methods | mAP ± Standard division | ||
|---|---|---|---|
| 5-top | 10-top | All relative images | |
| AlexNet CNN | 93 ± 0.48 | 92.30 ± 0.65 | 71.26 ± 1.58 |
| Proposed method | 94.22 ± 0.36 | 93.91 ± 0.37 | 85.64 ± 0.55 |
Evaluation proposed method on FP dataset for CBIR.
| Methods | mAP ± Standard division | ||
|---|---|---|---|
| 5-top | 10-top | All relative images | |
| AlexNet CNN | 83.78 ± 2.34 | 81.80 ± 2.22 | 71.23 ± 2.36 |
| Proposed method | 87.58 ± 1.44 | 86.86 ± 1.20 | 83 ± 0.50 |
Plot 1The mAP-Mean recall plot of the proposed method for Corel-1000 dataset.
Plot 2The mAP-Mean recall plot of the proposed method for OT dataset.
Plot 3The mAP-Mean recall plot of the proposed method for FP dataset.
The visual results of proposed method for CBIR on Corel-1000 dataset.
The visual results of proposed method for CBIR on OT dataset.
The visual results of proposed method for CBIR on FP dataset.
Comparison results of proposed method with other method for image classification.
| Dataset | Method | Accuracy | Training rate%, Test rate% |
|---|---|---|---|
| Corel-1000 | Fuzzy Topological [ | 62.20 | 33, 67 |
| Color Histogram + Fuzzy Neural Network [ | 73.40 | – | |
| OT | Fusion features [ | 63.89 | – |
| Co-occurance matrix + Bayesian classifier [ | 89 | 10-fold cross validation | |
| Hybrid generative + Dense SIFT [ | 91.08 | 50, 50 | |
| FP | Co-occurance matrix + KNN [ | 87.20 | 10-fold cross validation |
Comparison results of proposed method with other method for CBIR.
| Dataset | Method | mAP | Training rate%, Test rate% | Dimension of proposed method |
|---|---|---|---|---|
| Corel-1000 | CCM + DBPSP [ | 76.10 | 90, 10 | 1 × 21 |
| Block Truncation Coding [ | 77.90 | – | 1 × 96 | |
| HOG + SURF [ | 80.61 | 70, 30 | – | |
| Dense SIFT [ | 84.20 | 50, 50 | 1 × 128 | |
| SURF + FREAK [ | 86 | 70, 30 | 1 × 128 | |
| SURF + MSER [ | 88 | 70, 30 | 1 × 128 | |
| Fusion features [ | 83.50 | – | – | |
| AlexNet CNN [ | 93.80 | – | 1 × 4096 | |
| OT | Co-occurance matrix [ | 76.39 | 10-fold cross validation | 1 × 9 |
| Color moment + Angular Radial Transform + Edge histogram [ | 50.59 | 85, 15 | – | |
| Relevance Feedback [ | 79 | – | – | |
| FP | Co-occurance matrix [ | 78.83 | 10-fold cross validation | 1 × 9 |