| Literature DB >> 35957448 |
Cesar Torres1, Claudia I Gonzalez1, Gabriela E Martinez1.
Abstract
Deep neural networks have demonstrated the capability of solving classification problems using hierarchical models, and fuzzy image preprocessing has proven to be efficient in handling uncertainty found in images. This paper presents the combination of fuzzy image edge-detection and the usage of a convolutional neural network for a computer vision system to classify guitar types according to their body model. The focus of this investigation is to compare the effects of performing image-preprocessing techniques on raw data (non-normalized images) with different fuzzy edge-detection methods, specifically fuzzy Sobel, fuzzy Prewitt, and fuzzy morphological gradient, before feeding the images into a convolutional neural network to perform a classification task. We propose and compare two convolutional neural network architectures to solve the task. Fuzzy edge-detection techniques are compared against their classical counterparts (Sobel, Prewitt, and morphological gradient edge-detection) and with grayscale and color images in the RGB color space. The fuzzy preprocessing methodologies highlight the most essential features of each image, achieving favorable results when compared to the classical preprocessing methodologies and against a pre-trained model with both proposed models, as well as achieving a reduction in training times of more than 20% compared to RGB images.Entities:
Keywords: convolutional neural networks; fuzzy edge-detection; guitar recognition
Mesh:
Year: 2022 PMID: 35957448 PMCID: PMC9371199 DOI: 10.3390/s22155892
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Figure 1Coordinates of the input image f.
Figure 2(a) Gradient directions . (b) Coefficients .
Figure 3General approach.
Guitar database description.
| Name | Description |
|---|---|
| Total images | 5400 |
| Training images | 4320 |
| Test images | 1080 |
| Image size | 150 × 150 |
| Data base format | JPG |
Figure 4Sample of the guitar database.
Figure 5Input guitar image.
Figure 6(a) Dh input MF; (b) Dv input MF.
Fuzzy rules base for fuzzy MG edge-detector.
| Inputs | Output | Operator | |||
|---|---|---|---|---|---|
| G1 | G2 | G3 | G4 | Edges | |
| HighG1 | HighG2 | HighG3 | HighG4 | Edge | or |
| MiddleG1 | MiddleG2 | MiddleG3 | MiddleG4 | Edge | or |
| LowG1 | LowG2 | LowG3 | LowG4 | Background | and |
Figure 7(a) Fuzzy Sobel edge-detection output. (b) Fuzzy MG output.
Knowledge base with three fuzzy rules for Sobel/Prewitt fuzzy edge-detector.
| Inputs | Output | Operator | |
|---|---|---|---|
| Dh | Dv | Edges | |
| HighDh | HighDv | Edge | or |
| MiddleDh | MiddleDv | Edge | or |
| LowDh | LowDv | Background | and |
Figure 8Inputs MFs for the Fuzzy MG.
Figure 9Output MF for the Fuzzy MG.
Proposed CNN-I.
| Layer/Type | Filters/Neurons | Filter Size |
|---|---|---|
| 0/Input Convolution + ReLu | 40 | 3 × 3 |
| 1/Max Pooling | -- | 2 × 2 |
| 2/Convolution + ReLu | 40 | 3 × 3 |
| 3/Max Pooling | -- | 2 × 2 |
| 4/Convolution+ ReLu | 60 | 3 × 3 |
| 5/Max Pooling | -- | 2 × 2 |
| 6/Convolution + ReLu | 60 | 3 × 3 |
| 7/Max Pooling | -- | 2 × 2 |
| 8/Dropout | -- | 0.45 |
| 9/Flatten | -- | -- |
| 10/Dense | 300 | -- |
| 11/Dense | 250 | -- |
| 12/Output | 6 | -- |
Proposed CNN-II.
| Layer/Type | Filters/Neurons | Filter Size |
|---|---|---|
| 0/Input Convolution + ReLu | 64 | 3 × 3 |
| 1/Max Pooling | -- | 2 × 2 |
| 2/Convolution + ReLu | 64 | 3 × 3 |
| 3/Max Pooling | -- | 2 × 2 |
| 4/Convolution + ReLu | 128 | 3 × 3 |
| 5/Max Pooling | -- | 2 × 2 |
| 6/Convolution + ReLu | 128 | 3 × 3 |
| 7/Max Pooling | -- | 2 × 2 |
| 8/Dropout | -- | 0.75 |
| 9/Flatten | -- | -- |
| 10/Dense | 512 | -- |
| 11/Output | 6 | -- |
Results of the CNN-I model.
| Preprocessing | Min. | Max. | Average | Standard Deviation |
|---|---|---|---|---|
| RGB | 0.6296 | 0.7127 | 0.6687 | 0.0141 |
| Grayscale | 0.6135 | 0.7037 | 0.6601 | 0.0199 |
| MG | 0.4870 | 0.5759 | 0.5307 | 0.0177 |
| Prewitt | 0.6305 | 0.7129 | 0.6763 | 0.0163 |
| Sobel | 0.5894 | 0.6956 | 0.6472 | 0.0282 |
| Fuzzy MG | 0.6315 | 0.7167 | 0.6708 | 0.0186 |
| Fuzzy Prewitt | 0.6380 | 0.7176 | 0.6775 | 0.0151 |
| Fuzzy Sobel | 0.6380 | 0.7176 | 0.6751 | 0.0196 |
Results of the CNN-II model.
| Preprocessing | Min. | Max. | Average | Standard Deviation |
|---|---|---|---|---|
| RGB | 0.6796 | 0.7491 | 0.7121 | 0.0137 |
| Grayscale | 0.6654 | 0.7463 | 0.7031 | 0.0176 |
| MG | 0.5102 | 0.5991 | 0.5597 | 0.0177 |
| Prewitt | 0.6675 | 0.7481 | 0.7130 | 0.0139 |
| Sobel | 0.6712 | 0.6898 | 0.6789 | 0.01794 |
| Fuzzy MG | 0.6657 | 0.7444 | 0.7041 | 0.0153 |
| Fuzzy Prewitt | 0.6824 | 0.7481 | 0.7157 | 0.0145 |
| Fuzzy Sobel | 0.1667 | 0.7518 | 0.7030 | 0.0479 |
Results of the VGG16.
| Preprocessing | Min. | Max. | Average | Standard Deviation |
|---|---|---|---|---|
| RGB | 0.6398 | 0.7356 | 0.6908 | 0.0203 |
Figure 10ROC curve for CNN-I.
Figure 11ROC curve for CNN-II.
Figure 12Average training time per model in seconds for CNN-I.
Figure 13Average training time per model in seconds for CNN-II.
Hypothesis test results for CNN-I model.
| Preprocessing |
|
|
|
| Z-Score |
|---|---|---|---|---|---|
| Fuzzy Sobel vs. RGB | 0.6708 | 0.0186 | 0.6687 | 0.0141 | 1.4347 |
| Fuzzy Prewitt vs. RGB | 0.6775 | 0.0151 | 0.6687 | 0.0141 | 2.3336 |
| Fuzzy MG vs. RGB | 0.6751 | 0.0196 | 0.6687 | 0.0141 | 0.4892 |
| Fuzzy Sobel vs. grayscale | 0.6708 | 0.0186 | 0.6601 | 0.0199 | 2.9244 |
| Fuzzy Prewitt vs. grayscale | 0.6775 | 0.0151 | 0.6601 | 0.0199 | 3.8144 |
| Fuzzy MG vs. grayscale | 0.6751 | 0.0196 | 0.6601 | 0.0199 | 2.1474 |
Hypothesis test results for CNN-II model.
| Preprocessing |
|
|
|
| Z-Score |
|---|---|---|---|---|---|
| Fuzzy Sobel vs. color | 0.7041 | 0.0153 | 0.7121 | 0.0137 | −0.9961 |
| Fuzzy Prewitt vs. color | 0.7157 | 0.0145 | 0.7121 | 0.0137 | 1.0065 |
| Fuzzy MG vs. color | 0.7030 | 0.0479 | 0.7121 | 0.0137 | −2.1186 |
| Fuzzy Sobel vs. grayscale | 0.7041 | 0.0153 | 0.7031 | 0.0176 | −0.006 |
| Fuzzy Prewitt vs. grayscale | 0.7157 | 0.0145 | 0.7031 | 0.0176 | 3.0494 |
| Fuzzy MG vs. grayscale | 0.7030 | 0.0479 | 0.7031 | 0.0176 | 0.2462 |