| Literature DB >> 27847543 |
Eduardo Ribeiro1, Andreas Uhl2, Georg Wimmer2, Michael Häfner3.
Abstract
Recently, Deep Learning, especially through Convolutional Neural Networks (CNNs) has been widely used to enable the extraction of highly representative features. This is done among the network layers by filtering, selecting, and using these features in the last fully connected layers for pattern classification. However, CNN training for automated endoscopic image classification still provides a challenge due to the lack of large and publicly available annotated databases. In this work we explore Deep Learning for the automated classification of colonic polyps using different configurations for training CNNs from scratch (or full training) and distinct architectures of pretrained CNNs tested on 8-HD-endoscopic image databases acquired using different modalities. We compare our results with some commonly used features for colonic polyp classification and the good results suggest that features learned by CNNs trained from scratch and the "off-the-shelf" CNNs features can be highly relevant for automated classification of colonic polyps. Moreover, we also show that the combination of classical features and "off-the-shelf" CNNs features can be a good approach to further improve the results.Entities:
Mesh:
Year: 2016 PMID: 27847543 PMCID: PMC5101370 DOI: 10.1155/2016/6584725
Source DB: PubMed Journal: Comput Math Methods Med ISSN: 1748-670X Impact factor: 2.238
Figure 1Example images of the two classes (a–d) and the pit-pattern types of these two classes (e–f).
Figure 2An illustration of the CNN architecture for colonic polyp classification.
Number of images and patients per class of the CC-i-Scan databases gathered with and without CC (staining) and computed virtual chromoendoscopy (CVC).
| i-Scan mode | No staining | Staining | ||||||
|---|---|---|---|---|---|---|---|---|
| ¬CVC | i-Scan1 | i-Scan2 | i-Scan3 | ¬CVC | i-Scan1 | i-Scan2 | i-Scan3 | |
|
| ||||||||
| Number of images | 39 | 25 | 20 | 31 | 42 | 53 | 32 | 31 |
| Number of patients | 21 | 18 | 15 | 15 | 26 | 31 | 23 | 19 |
|
| ||||||||
| Number of images | 73 | 75 | 69 | 71 | 68 | 73 | 62 | 54 |
| Number of patients | 55 | 56 | 55 | 55 | 52 | 55 | 52 | 47 |
| Total number of images | 112 | 100 | 89 | 102 | 110 | 126 | 94 | 85 |
CNN configuration for input subimages of size 227 × 227 × 3 and its respective accuracy in %.
| Size of inputs | Number of convolutional filters/size | Connected layer | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Layer 1 | Layer 2 | Layer 3 | Layer 4 | Layer 5 | Layer 6 | Layer 7 | Layer 8 | ||
| 227 × 227 × 3 | 96/11 × 11 | 256/5 × 5 | 384/3 × 3 | 384/3 × 3 | 256/3 × 3 | 384/3 × 3 | 384/3 × 3 | 4096/6 × 6 | 4096 |
|
| |||||||||
| Accuracy: 79.00 | |||||||||
Accuracy results from different CNN configurations for inputs of size 128 × 128 × 3 in %.
| Network index | Number of convolutional filters/size | Connected layer | Acc | ||
|---|---|---|---|---|---|
| Layer 1 | Layer 2 | Layer 3 | |||
| CNN-01 | 48/7 × 7 | 72/4 × 4 | 512/5 × 5 | 512 | 76.00 |
| CNN-02 | 48/11 × 11 | 72/5 × 5 | 512/6 × 6 | 512 | 84.00 |
| CNN-03 | 24/11 × 11 | 48/5 × 5 | 1024/6 × 6 | 1024 | 86.00 |
| CNN-04 | 24/11 × 11 | 72/4 × 4 | 2048/5 × 5 | 2048 | 80.00 |
| CNN-05 | 48/11 × 11 | 72/5 × 5 | 1024/6 × 6 | 1024 |
|
Accuracy of different strides for overlapping subimages in the CNN-05 evaluation for i-Scan1 database in %.
| Stride | Number of subimages | Accuracy |
|---|---|---|
| 1 | 16384 | 89.00 |
| 5 | 676 | 89.00 |
| 20 | 49 | 90.00 |
| 32 | 25 |
|
| 48 | 9 | 87.00 |
| Random | 9 | 87.00 |
| Random | 25 | 89.00 |
| Random | 49 | 89.00 |
Accuracy of CNN-05 architecture comparing to classical features for the i-Scan1 and i-Scan3 databases in %.
| Methods | i-Scan1 | i-Scan3 |
|---|---|---|
| CNN-05 |
|
|
| CNN-05 + SVM − LFCL | 83.00 | 72.55 |
| CNN-05 + SVM − PFCL | 80.00 | 66.67 |
| BSAG-LFD | 86.87 | 82.87 |
| Blob SC | 83.33 | 75.22 |
| Shearlet-Weibull | 76.67 | 86.80 |
| GWT-Weibull | 78.67 | 84.28 |
| LCVP | 66.00 | 77.12 |
| MB-LBP | 80.67 | 83.37 |
Accuracies of the methods for the CC-i-Scan databases in %.
| Methods | No staining | Staining | |||||||
|---|---|---|---|---|---|---|---|---|---|
| ¬CVC | i-Scan1 | i-Scan2 | i-Scan3 | ¬CVC | i-Scan1 | i-Scan2 | i-Scan3 |
| |
| 1: CNN-F | 86.16 | 89.33 | 80.65 | 88.41 | 86.52 | 81.40 | 84.22 | 80.62 | 84.66 |
| 2: CNN-M | 87.45 | 90.67 | 81.38 | 83.58 | 87.99 | 89.55 | 87.40 | 90.53 | 87.31 |
| 3: CNN-S | 88.03 | 90.00 | 87.01 | 77.33 | 87.25 | 82.68 | 87.40 | 75.54 | 84.41 |
| 4: CNN-F MCN | 88.84 | 82.00 | 73.15 | 90.73 | 85.78 | 89.55 | 89.72 | 83.15 | 85.36 |
| 5: CNN-M MCN | 89.53 | 90.67 |
|
| 86.97 | 89.29 | 87.40 | 90.53 |
|
| 6: CNN-S MCN | 90.12 |
| 81.38 | 79.85 | 89.18 |
| 81.10 | 84.77 | 86.41 |
| 7: GoogleLeNet | 79.65 | 90.67 | 72.43 | 74.51 | 88.27 | 80.46 | 75.60 | 84.08 | 80.70 |
| 8: VGG-VD16 | 87.45 | 85.33 | 86.38 | 79.65 |
| 89.80 |
|
| 88.59 |
| 9: VGG-VD19 | 83.49 | 82.67 | 83.88 | 87.71 |
| 83.98 | 94.46 | 85.59 | 86.78 |
| 10: AlexNet |
| 87.33 | 75.65 | 89.32 | 87.71 | 83.03 | 84.22 | 79.24 | 84.73 |
| 11: AlexNet MCN | 89.42 | 84.67 | 78.88 | 83.78 | 89.36 | 83.55 | 81.10 | 78.32 | 83.63 |
|
| 87.41 | 87.70 | 80.88 | 84.50 | 88.54 | 86.07 | 86.17 | 84.06 | 85.67 |
|
| |||||||||
| 12: BSAG-LFD |
|
|
| 82.87 | 70.20 |
| 78.78 | 71.39 |
|
| 13: Blob SC | 77.67 | 83.33 | 82.10 | 75.22 | 59.28 | 78.83 | 66.13 | 59.83 | 72.79 |
| 14: Shearlet-Weibull | 73.72 | 76.67 | 79.60 |
|
| 69.91 | 72.38 |
| 78.00 |
| 15: GWT-Weibull | 79.75 | 78.67 | 70.25 | 84.28 |
| 74.54 | 77.17 | 83.39 | 78.66 |
| 16: LCVP | 76.60 | 66.00 | 47.75 | 77.12 | 77.45 | 79.00 | 70.01 | 69.56 | 70.43 |
| 17: MB-LBP | 78.26 | 80.67 | 81.38 | 83.37 | 69.29 | 70.60 | 77.22 | 78.32 | 77.38 |
|
| 78.71 | 78.70 | 74.28 | 81.61 | 73.13 | 75.58 | 73.61 | 74.35 | 76.24 |
|
| |||||||||
| Fusion 5/8 | 88.84 | 85.33 | 83.88 | 92.14 | 93.12 | 90.49 | 96.88 | 94.00 | 90.58 |
| Fusion 5/12 | 92.79 |
| 88.88 |
| 87.71 | 90.49 | 88.26 | 90.53 | 91.03 |
| Fusion 5/8/12 |
| 90.00 | 88.88 | 92.14 | 92.30 | 91.43 | 97.63 |
| 93.22 |
| Fusion 5/8/14 | 91.51 | 88.67 | 87.10 | 93.75 |
| 91.43 |
| 95.85 | 92.67 |
| Fusion 5/8/15 | 90.91 | 90.00 | 88.88 | 92.14 | 93.94 | 89.80 | 96.88 | 95.61 | 92.27 |
| Fusion 5/8/12/14 | 93.38 | 88.00 |
| 93.75 | 93.49 |
| 97.63 | 94.92 | 93.08 |
| Fusion 5/8/12/17 | 93.38 | 90.00 |
| 93.75 | 92.75 |
| 97.63 |
|
|
|
| |||||||||
| CNN-05 | — | 91.00 | — | 89.00 | — | — | — | — | — |
| CNN-05 + SVM | — | 83.00 | — | 72.55 | — | — | — | — | — |
Results from i-Scan1 database with images resized to 224 × 224 and cropped in 25 patches of size 224 × 224.
| CNN-F | CNN-M | CNN-S | CNN-F | CNN-M | CNN-S | Google LeNet | VGG VD16 | VGG VD19 | AlexNet | AlexNet MCN |
| |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Resizing image | 89.33 | 90.67 | 90.00 | 82.00 | 90.67 | 91.42 | 90.67 | 85.33 | 82.67 | 87.33 | 84.67 |
|
| Cropping 25 images | 84.00 | 82.67 | 84.67 | 78.67 | 84.67 | 88.67 | 91.29 | 89.67 | 78.67 | 85.33 | 85.33 | 84.87 |
Results from i-Scan1 database with images resized to 224 × 224 using the last fully connected layer and the prior fully connected layer.
| CNN-F | CNN-M | CNN-S | CNN-F MCN | CNN-M MCN | CNN-S MCN | Google LeNet | VGG VD16 | VGG VD19 | AlexNet | AlexNet MCN |
| |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Prior fully connected layer | 89.33 | 90.67 | 90.00 | 82.00 | 90.67 | 91.42 | 90.67 | 85.33 | 82.67 | 87.33 | 84.67 |
|
| Last fully connected layer | 90.67 | 84.67 | 85.33 | 78.67 | 88.00 | 89.33 | 90.67 | 84.67 | 79.33 | 81.33 | 90.67 | 85.75 |
Figure 3Results of the McNemar test for the i-Scan1 (a) and i-Scan3 (b) databases without staining. A black square in the matrix means that the methods are significantly different with significance level α = 0.01 and a grey square in (a) means that the methods are significantly different with significance level α = 0.05. If the square is white then there is no significant difference between the methods.
Figure 4Example results of the classification in agreement from the methods tested in the McNemar test for each prediction outcome.