| Literature DB >> 34177126 |
Gustavo Z Felipe1, Jacqueline N Zanoni1, Camila C Sehaber-Sierakowski1, Gleison D P Bossolani1, Sara R G Souza1, Franklin C Flores1, Luiz E S Oliveira2, Rodolfo M Pereira3, Yandre M G Costa1.
Abstract
Studies recently accomplished on the Enteric Nervous System have shown that chronic degenerative diseases affect the Enteric Glial Cells (EGC) and, thus, the development of recognition methods able to identify whether or not the EGC are affected by these type of diseases may be helpful in its diagnoses. In this work, we propose the use of pattern recognition and machine learning techniques to evaluate if a given animal EGC image was obtained from a healthy individual or one affect by a chronic degenerative disease. In the proposed approach, we have performed the classification task with handcrafted features and deep learning-based techniques, also known as non-handcrafted features. The handcrafted features were obtained from the textural content of the ECG images using texture descriptors, such as the Local Binary Pattern (LBP). Moreover, the representation learning techniques employed in the approach are based on different Convolutional Neural Network (CNN) architectures, such as AlexNet and VGG16, with and without transfer learning. The complementarity between the handcrafted and non-handcrafted features was also evaluated with late fusion techniques. The datasets of EGC images used in the experiments, which are also contributions of this paper, are composed of three different chronic degenerative diseases: Cancer, Diabetes Mellitus, and Rheumatoid Arthritis. The experimental results, supported by statistical analysis, show that the proposed approach can distinguish healthy cells from the sick ones with a recognition rate of 89.30% (Rheumatoid Arthritis), 98.45% (Cancer), and 95.13% (Diabetes Mellitus), being achieved by combining classifiers obtained on both feature scenarios.Entities:
Keywords: Deep learning; Degenerative chronic diseases; Enteric glial cells; Machine learning; Pattern recognition
Year: 2021 PMID: 34177126 PMCID: PMC8211315 DOI: 10.1007/s00521-021-06164-7
Source DB: PubMed Journal: Neural Comput Appl ISSN: 0941-0643 Impact factor: 5.606
Fig. 1Representation of the approach used in this work, in which handcrafted and non-handcrafted features were approached
Fig. 2Digital image sample of the AIA dataset (left) converted to the grayscale (right)
Fig. 3Digital image sample of the D disease group (a) and the same one pseudo-colored (b) using the HSV color map (c)
Fig. 4Image sample from the C class, from the TW dataset. Sub-figure (a) presents the original data sample, while the remaining sub-figures show the resulting images by applying the edge highlighting methods using the following filters: Laplacian (b), Scharr (c) and Sobel (d)
Fig. 5General scheme of the feature learning
Fig. 6Examples of samples generated using Data Augmentation, by applying four randomly chosen techniques. Some of the techniques experimented are: adding Gaussian noise and/or blur; adding hue and saturation; flipping the image vertically and/or horizontally; changing the color space to BRG; Coarse Dropout; and others. The original image sample can be seen in the top left corner
Fig. 7Image samples from the datasets and both classes. The red coloration is resulted from the immunostaining of the S100 protein. The EGC can be visualized as the most bright points in the samples
Diseases evaluated in this work, according to the categories’ distribution in the experimental groups
| Disease | Abbreviation | Image dimension | Number of samples | |
|---|---|---|---|---|
| Sick (S) | Control (C) | |||
| Arthritis Rheumatoid | AIA | 1024 | 210 | 208 |
| Cancer (Walker’s tumor-256) | TW | 1384 | 192 | 224 |
| Diabetes Mellitus | D | 1384 | 290 | 224 |
F-Measure values found from the experiments using different texture descriptors
| Dataset | Texture Descriptor | ||
|---|---|---|---|
| LBP | RLBP | LPQ | |
| AIA | 0.7895 | 0.7535 | |
| TW | 0.9279 | 0.9231 | |
| D | 0.8520 | 0.8327 | |
F-measure values found from experiments that explored different values for the LPQ’s window size
| Dataset | Window Size | ||||
|---|---|---|---|---|---|
| 3 | 5 | 9 | 11 | 13 | |
| AIA | 0.7751 | 0.7679 | 0.7943 | 0.8014 | |
| TW | 0.9447 | 0.9591 | 0.9447 | 0.9567 | |
| D | 0.7838 | 0.8189 | 0.8690 | 0.8636 | |
Best results found in the experiments that evaluated the performance of handcrafted features, extracted by the use of different texture descriptors
| Dataset | Texture descriptor | F-measure |
|---|---|---|
| LPQ | 0.8014 | |
| AIA | LPQ | |
| RLBP | 0.8062 | |
| LPQ | ||
| TW | LPQ | 0.9567 |
| LPQ | 0.9591 | |
| LPQ | 0.8636 | |
| D | LPQ | 0.8690 |
| LPQ |
Classification rates found by the tests where handcrafted features were extracted from the image samples converted to the grayscale
| Dataset | Texture descriptor | F-measure |
|---|---|---|
| LPQ | 0.8038 | |
| AIA | LPQ | |
| RLBP | 0.7775 | |
| LPQ | ||
| TW | LPQ | 0.9254 |
| LPQ | 0.9303 | |
| LPQ | ||
| D | LPQ | 0.8851 |
| LPQ | 0.8696 |
Results obtained from the tests where handcrafted features were extracted from image samples pseudo-colored with the HSV colormap
| Dataset | Texture descriptor | F-measure |
|---|---|---|
| LPQ | 0.7990 | |
| AIA | LPQ | |
| RLBP | 0.7942 | |
| LPQ | ||
| TW | LPQ | 0.9327 |
| LPQ | 0.9183 | |
| LPQ | ||
| D | LPQ | 0.8811 |
| LPQ | 0.8927 |
Fig. 8Comparison between the best results obtained from the classifications that aimed to make variations in the image samples’ coloration
F-Measure values found from the experiment that highlighted the edges of the image samples, through the use of the Laplacian, Sobel, and Scharr filters
| Disease | Texture descriptor | Filter | ||
|---|---|---|---|---|
| Laplacian | Scharr | Sobel | ||
| LBP | 0.7942 | 0.8062 | ||
| AIA | LPQ | 0.7990 | 0.8158 | 0.8062 |
| RLBP | 0.8038 | 0.8253 | 0.7918 | |
| LBP | 0.8992 | 0.9327 | 0.9423 | |
| TW | LPQ | 0.9424 | 0.9448 | 0.9423 |
| RLBP | 0.9279 | 0.9448 | ||
| LBP | 0.8442 | 0.8105 | 0.8171 | |
| D | LPQ | 0.8304 | 0.8675 | |
| RLBP | 0.8129 | 0.7991 | 0.8128 | |
Fig. 9Comparison between the resulting F-Measure values obtained from the classifications where the image samples had their shapes/edges highlighted and the ones obtained from the unchanged image samples (original)
Classification rates obtained from the experiments that tested the execution of different classification algorithms. The results found by the classification using the SVM algorithm are also shown for comparison reasons
| Dataset | Classification algorithm | ||||
|---|---|---|---|---|---|
| k-NN | RF | NB | GB | SVM | |
| AIA | 0.7249 | 0.7215 | 0.7799 | 0.8325 | |
| TW | 0.8776 | 0.7672 | 0.8968 | 0.9712 | |
| D | 0.8286 | 0.7694 | 0.8576 | 0.8967 | |
Fig. 10Graphical comparison between the F-Measure values obtained by the classifications performed using different classification algorithms
Best results obtained for each CNN architecture, from the experiments that used them to perform the FL and classification
| Dataset | CNN architecture | Learning rate | F-Measure |
|---|---|---|---|
| LeNet5 | 10 | ||
| AIA | MaxNet | 10 | 0.6904 |
| AlexNet | 10 | 0.6984 | |
| LeNet5 | 10 | 0.8035 | |
| TW | MaxNet | 10 | 0.8199 |
| AlexNet | 10 | ||
| LeNet5 | 10 | 0.6772 | |
| D | MaxNet | 10 | 0.7216 |
| AlexNet | 10 |
Best classification rates found from executing the tests using the Transfer Learning method, for each evaluated CNN architecture
| Dataset | CNN | F-Measure | |
|---|---|---|---|
| VGG16 | 4096 | 0.8421 | |
| AIA | InceptionV3 | 4096 | |
| InceptionResNetV2 | 1024 | 0.6579 | |
| VGG16 | 2048 | ||
| TW | InceptionV3 | 4096 | 0.8823 |
| InceptionResNetV2 | 2048 | 0.7444 | |
| VGG16 | 4096 | ||
| D | InceptionV3 | 4096 | 0.7417 |
| InceptionResNetV2 | 2048 | 0.5982 |
Description of the classifiers used to execute the experiments that employed classifiers combination techniques
| Dataset | ID | Feature Extractor | Classification | Observations | F-Measure |
|---|---|---|---|---|---|
| 1 | LPQ | SVM | Samples converted to the greyscale | 0.8325 | |
| 2 | LPQ | RF | Samples converted to the greyscale | 0.7847 | |
| AIA | 3 | LBP | SVM | Border enhancement filter = | 0.8277 |
| 4 | InceptionV3 | SVM | # of | 0.8468 | |
| 5 | VGG16 | SVM | # of | 0.8421 | |
| 6 | LeNet5 | LeNet5 | 0.7224 | ||
| 1 | LPQ | SVM | – | 0.9712 | |
| 2 | LPQ | RF | – | 0.8920 | |
| TW | 3 | RLBP | SVM | Border enhancement filter = | 0.9471 |
| 4 | VGG16 | SVM | # of | 0.9327 | |
| 5 | AlexNet | AlexNet | 0.8721 | ||
| 6 | MaxNet | MaxNet | 0.8199 | ||
| 1 | LPQ | SVM | Pseudo-coloring = HSV | 0.8967 | |
| 2 | LPQ | RF | Pseudo-coloring = HSV | 0.8325 | |
| D | 3 | LPQ | SVM | Border enhancement filter = | 0.8695 |
| 4 | VGG16 | SVM | # of | 0.9043 | |
| 5 | AlexNet | AlexNet | 0.7475 | ||
| 6 | MaxNet | MaxNet | 0.7216 |
Best F-Measure values found from the experiments that combined classifiers
| Dataset | Classifiers (IDs) | Type(s) | Rule | F-Measure |
|---|---|---|---|---|
| 3, 4 | N and H | Max | 0.8863 | |
| 3, 4, 5 | N and H | Max | ||
| AIA | 3, 4, 5 | N and H | Sum | 0.8899 |
| 1, 3, 4, 5 | N and H | Max | 0.8858 | |
| 1, 3, 4, 6 | N and H | Sum | 0.8806 | |
| 1, 2, 5 | N and H | Sum | 0.9818 | |
| 1, 3, 4 | N and H | Sum | 0.9767 | |
| TW | 1, 4, 5 | N and H | Sum | 0.9789 |
| 1, 2, 3, 6 | N and H | Product | 0.9739 | |
| 1, 2, 3, 5, 6 | N and H | Sum | ||
| 1, 2 | H | Max | 0.9413 | |
| 1, 2, 4 | N and H | Sum | ||
| D | 1, 2, 4 | N and H | Product | 0.9495 |
| 1, 2, 4, 5 | N and H | Sum | 0.9459 | |
| 1, 2, 4, 6 | N and H | Sum | 0.9445 |
Type(s) of classifiers involved in the combination.
N stands for “non-handcrafted” features, and H stands for “handcrafted” features
Fig. 11Graphical representation of the configurations architected, so the best classification rates of this work could be achieved, by combining classifiers
Average ranking of the classification results for the handcrafted features
| Disease | Overall Avg. Ranking | |||
|---|---|---|---|---|
| AIA | TW | D | ||
| LBP | 1.50 | 2.50 | 2.33 | 2.11 |
| RLBP | 2.00 | 2.17 | 2.67 | 2.28 |
| LPQ | 2.33 | 1.00 | 1.00 | 1.44 |
Average ranking of the classification results for the non-handcrafted features obtained with pre-configured CNNs
| Disease | Overall Avg. Ranking | |||
|---|---|---|---|---|
| AIA | TW | D | ||
| MaxNet | 2.50 | 2.00 | 2.00 | 2.17 |
| LeNet5 | 2.00 | 3.00 | 3.00 | 2.67 |
| AlexNet | 1.50 | 1.00 | 1.00 | 1.17 |
Average ranking of the classification results for the non-handcrafted features obtained with transfer learning
| Disease | Overall Avg. Ranking | |||
|---|---|---|---|---|
| AIA | TW | D | ||
| VGG16 | 1.40 | 1.00 | 1.00 | 1.13 |
| InceptionV3 | 1.60 | 2.00 | 2.00 | 1.87 |
| InceptionResNetV2 | 3.00 | 3.00 | 3.00 | 3.00 |
Wilcoxon statistical tests for the top-10 classification results with handcrafted features versus non-handcrafted
| Disease | ||
|---|---|---|
| AIA | 19 | 0.1931 |
| TW | 0 | 0.0025 |
| D | 1 | 0.0035 |
Wilcoxon statistical tests comparing the classification results obtained in the different diseases groups
| AIA versus TW | 143 | |
| D versus TW | 210 | |
| AIA versus D | 820 |
Wilcoxon statistical tests comparing the classification results before and after combining the classifiers outputs
| AIA | 0.0 | 0.0216 |
| TW | 0.0 | 0.0216 |
| D | 0.0 | 0.0216 |