| Literature DB >> 36262620 |
Akella S Narasimha Raju1, Kayalvizhi Jayavel1, Thulasi Rajalakshmi2.
Abstract
Human colorectal disorders in the digestive tract are recognized by reference colonoscopy. The current system recognizes cancer through a three-stage system that utilizes two sets of colonoscopy data. However, identifying polyps by visualization has not been addressed. The proposed system is a five-stage system called ColoRectalCADx, which provides three publicly accessible datasets as input data for cancer detection. The three main datasets are CVC Clinic DB, Kvasir2, and Hyper Kvasir. After the image preprocessing stages, system experiments were performed with the seven prominent convolutional neural networks (CNNs) (end-to-end) and nine fusion CNN models to extract the spatial features. Afterwards, the end-to-end CNN and fusion features are executed. These features are derived from Discrete Wavelet Transform (DWT) and Vector Support Machine (SVM) classification, that was used to retrieve time and spatial frequency features. Experimentally, the results were obtained for five stages. For each of the three datasets, from stage 1 to stage 3, end-to-end CNN, DenseNet-201 obtained the best testing accuracy (98%, 87%, 84%), ((98%, 97%), (87%, 87%), (84%, 84%)), ((99.03%, 99%), (88.45%, 88%), (83.61%, 84%)). For each of the three datasets, from stage 2, CNN DaRD-22 fusion obtained the optimal test accuracy ((93%, 97%) (82%, 84%), (69%, 57%)). And for stage 4, ADaRDEV2-22 fusion achieved the best test accuracy ((95.73%, 94%), (81.20%, 81%), (72.56%, 58%)). For the input image segmentation datasets CVC Clinc-Seg, KvasirSeg, and Hyper Kvasir, malignant polyps were identified with the UNet CNN model. Here, the loss score datasets (CVC clinic DB was 0.7842, Kvasir2 was 0.6977, and Hyper Kvasir was 0.6910) were obtained.Entities:
Mesh:
Year: 2022 PMID: 36262620 PMCID: PMC9576362 DOI: 10.1155/2022/4325412
Source DB: PubMed Journal: Comput Intell Neurosci
Literature survey.
| Author | Year | Advantages | Limitations |
|---|---|---|---|
| Souaidi and Ansari [ | 2022 | (i) To detect abnormalities in the polyp region of WCE and colonoscopy localization and visualization proposal. | (i) There is no discussion of the numerous CNN models. |
| (ii) Here a detector for deep polyps, such as MP-FSSD, is suggested. | (ii) Only CNN models like VGG-16 are used. | ||
| (iii) In this polyp detection work, VGG-16 backbones are used. | (iii) Only WCE and CVC clinic DB dataset are reviewed for polyp recognition. | ||
| Nisha and Palanisamy [ | 2022 | (i) We automatically detect colorectal polyps with image enhancement. | (i) This method is only effective for a limited number of CNN models. |
| (ii) The proposed work is the dual CNN path for classifying polyps and nonpolyps' patches in colonoscopy images. | (ii) This method is not discussed except with a CNN model. | ||
| (iii) To enhance the image, the dual-path CNN and sigmoid classifier is used to efficiently detect polyps. | (iii) Only two sets of colonoscopy image data were proposed, such as CVC clinic DB and ETIS-Larib datasets. | ||
| (iv) The proposed method is promising, and detects with accuracy of 99.60% and 90.81% with CVC clinic DB and ETIS-Larib datasets, respectively. | (iv) Images in the datasets are enhanced owing to which the accuracy of polyp detection will reduce. | ||
| (v) The number of images is increased or live image datasets are used, and the suggested method for its operation is not addressed. | |||
| Guo et al. [ | 2022 | (i) The two major challenges for the segmentation of colonoscopy image polyps are blurred boundaries and a close resemblance between the polyps and surrounding tissue. | (i) Here, five datasets are tested with a new UnX methodology, so it takes a long time to obtain the results. |
| (ii) This system proposed a new transformer-based encounter network known as the uncertainty eXploration (UnX). | (ii) The precision levels of the results are good, but obtaining results is time-consuming. | ||
| (iii) With this method, the system identified the uncertainty areas of polyps. | (iii) The complexity of the system is increased while comprehension of the system is much more tedious to a layman. | ||
| (iv) This removes the uncertain elements of the images and emphatically recognizes the level of precision of malignant polyps. | (iv) There are inconsistent color distributions in the colonoscopy image system that displays poor results. | ||
| Yeung et al. [ | 2021 | (i) The concept here is the segmentation of polyps and the identification of malignant polyps. | (i) With the five datasets, each image segmentation entails considerable time to obtain the results. |
| (ii) The proposed method is CNN based on double attention for segmenting polyps using Focus-UNet. | (ii) Visualization quality may be good for certain datasets. | ||
| (iii) This system combines efficient attention based on the spatial channel into a single focus gate selective deep learning of polyp characteristics. | (iii) The proposed focus-UNet system should have been upgraded to a lightweight design. | ||
| (iv) Here for experimentation with the proposed methodology, inputs are provided using five colonoscopy datasets. | (iv) It is a complicated system. | ||
| (v) The obtained results, such as the dice similarity coefficient, are 0.941 and 0910. | |||
| Attallah and Sharkas [ | 2021 | (i) Proposed a system called Gastro-CADx to classify several gastrointestinal diseases using deep learning approaches. | (i) Two datasets named dataset I and II, which are Kvasir and Hyper Kvasir, are used to assess the performance of Gastro-CADx. |
| (ii) There are three phases to this system. These four different CNNs are used as feature extractors to extract spatial functionality. | (ii) However, this system has not been used on the numerous datasets. | ||
| (iii) The properties extracted in the first stage are applied to the discrete wave transform (DWT) and the discrete cosine transform (DCT), which are used to extract temporal-frequency and spatial-frequency features. | (iii) The system is not even under discussion for the semantic segmentation concept for locating and identifying malignant polyps. | ||
| Jha et al. [ | 2021 | (i) The design is the detection, localization, and segmentation of polyps in real-time. | (i) The system uses more than just a single dataset for experimentation and recognizing malignant polyps. |
| (ii) This work calls for deep learning in technology. | (ii) The system provides moderate results (not highly accurate). | ||
| (iii) The proposed solution to retrieve polyps from colonoscopy images developed ColonSegNet, which is a decoder-encoder architecture. | (iii) Architecture is complex to comprehend for laymen. | ||
| (iv) detection, location, and segmentation are evaluated using standard computer vision measures. | |||
| (v) The system has a high processing rate of 182.38 frames per second. | |||
| Ahraf et al. [ | 2020 | (i) Suggested automated classification as a new technique for illustrating medical images using deep learning technology. | (i) Vast data of colonoscopy images are classified with different convolutional neural networks and the results are achieved differently. |
| (ii) It helps to categorize the diverse medical images of several organs of the body. | (ii) The notions of interest are not addressed here and this has to be comprehensively addressed. | ||
| (iii) It contains a summary of data and other health image classification applications, which support radiologists' efforts to improve diagnosis. | |||
| Poudel et al. [ | 2020 | (i) Provides a good architecture for classifying endoscopic images using an expanded efficient convolutional neural network. | (i) However, colorectal disorders are classified using convolutional neural networks. |
| (ii) Proposed an architecture to classify endoscopic images using an effective convolutional neural network (CNN). | (ii) However, algorithms integrated with the various algorithms are compared with certain parameters. | ||
| (iii) This is a highly accessible domain of assessing deeper layers by accumulating and reducing the expansion factor of spatial elements. | (iii) The results obtained are regarded as the most accurate and best algorithm for the identification of colorectal cancer (CRC). | ||
| (iv) The investigator compares and evaluates the methodology using a variety of parameters. | |||
| Zhou and Gao [ | 2019 | (i) Here we discuss how CNN technologies enable intelligent recognition of medical motion images. | (i) However, there are no discussions on obtaining colorectal medical images from the colonoscopy screening images. |
| (ii) Now large-scale intelligent recognition of medical motion images is assisted by CNN algorithms. | (ii) There is no explanation of the procedure to retrieve and categorize and then convert to results based on their image characteristics. | ||
| (iii) Here, the features of the dense trajectory are initially learned followed by the features of depth, and the dense path functions are merged into the DL methods. | (iii) The techniques involved are time-consuming and require extensive computer statistics. | ||
| (iv) Finally, extreme learning is functional in CNN where the descriptions of the bottom layer to the top layer are determined for medical image recognition | |||
| Yang et al. [ | 2019 | (i) Proposed a health-based device for categorizing and segmenting CT images for lung disease and hemorrhagic stroke, termed HTSCS for Health Images. | (i) This technique provides an advanced method of categorization and segmentation using art. |
| (ii) Internet Health of-Things (IHoT) uses transferable model learning, based on deep learning concepts with traditional methodologies for the best precision for medical image classification and segmentation | (ii) This Internet of medical Things has worked with various IoT devices with the connection of computed tomography devices. |
Figure 1The flow diagram of the colorectalCADx system.
Figure 2Proposed colorectalCADx block diagram.
Figure 3CVC clinic DB sample dataset.
Figure 4Kvasir2 sample dataset.
Figure 5Hyper kvasir sample dataset.
Figure 6CNN architecture.
Fusion models and their suggested names.
| Fusion model | Suggested name |
|---|---|
| AlexNet + DarkNet-19 + ResNet-50v2 + DenseNet-201 + Efficientnet-B7 + VGG-16 + VGG-19 | ADaRDEV2-22 |
| ResNet-50v2 + DensNet-201 + EfficientNet-B7 + VGG-16 + VGG19 | RDEV2-22 |
| AlexNet + DarkNet-19 + DenseNet-201 + ResNet-50V2 | ADaDR-22 |
| AlexNet + DarkNet-19 + ResNet-50V2 | ADaR-22 |
| DarkNet-19 + ResNet-50V2 + DenseNet-201 | DaRD-22 |
| AlexNet + DarkNet-19 | ADa-22 |
| ResNet-50V2 + DenseNet-20 | RD-22 |
| AlexNet + DenseNet-201 | AD-22 |
| DarkNet-19 + ResNet-50V2 | DaR-22 |
The number of parameters of CNNs.
| CNN architecture models | Introduced year | Total params | Trainable params | Nontrainable params | Layers | |
|---|---|---|---|---|---|---|
| AlexNet [ | 2012 | 2,81,02,775 | 2,80,81,639 | 21,136 | 23 | |
| DarkNet-19 [ | 2017 | 1,60,45,847 | 1,60,32,983 | 12,864 | 19 | |
| ResNet-50v2 [ | 2016 | 2,59,33,975 | 23,69,175 | 2,35,64,800 | 50 | |
| DenseNet-201 [ | 2018 | 1,94,29,463 | 11,07,479 | 1,83,21,984 | 201 | |
| Efficientnet-B7 [ | 2019 | 6,55,73,799 | 14,76,119 | 6,40,97,680 | 813 | |
| VGG-16 [ | 2014 | 1,53,14,391 | 5,99,703 | 1,47,14,688 | 16 | |
| VGG-19 [ | 2014 | 2,06,24,087 | 5,99,703 | 2,00,24,384 | 13 | |
| Proposed fusion models | ADaRDEV2-22 | 2022 | 19,10,28,063 | 7,02,94,911 | 12,07,33,152 | |
| RDEV2-22 | 14,68,78,383 | 2,61,79,231 | 12,06,99,152 | |||
| ADaDR-22 | 8,94,87,664 | 4,75,66,880 | 4,19,20,784 | |||
| ADaR-22 | 2,59,33,000 | 23,68,200 | 2,35,64,800 | |||
| DaRD-22 | 6,13,99,840 | 1,95,00,192 | 4,18,99,648 | |||
| ADa-22 | 4,41,26,048 | 4,40,92,048 | 34,000 | |||
| RD-22 | 4,53,61,624 | 34,74,840 | 4,18,86,784 | |||
| AD-22 | 4,75,16,384 | 2,91,73,264 | 1,83,43,120 | |||
| DaR-22 | 4,19,67,694 | 1,83,90,030 | 2,35,77,664 | |||
Figure 7Families belonging to the DWT.
Figure 8Semantic segmentation using UNet.
System specifications.
| System | Precision tower T5810 |
|---|---|
| Company | Dell |
| Processor | Intel® Xeon® CPU core i7 E5-2630 |
| Speed | 2.20 GHz |
| RAM | 32 GB |
| GPU | GPU NVIDIA Xp. |
| Software environment | Google Colab Pro with python 3.7.12 |
| Software Python packages | Keras and TensorFlow 2.7.0 |
Training and testing split of three datasets.
| Datasets | Training set | Validation set | Test sets | Total images |
|---|---|---|---|---|
| CVC-clinic DB [ | 900 | 102 | 516 | 1518 |
| Kvasir2 [ | 5120 | 480 | 2400 | 8000 |
| Hyper Kvasir Labeled [ | 7470 | 634 | 2577 | 10681 |
Hyperparameters for colorectalCADx system.
| Dataset | Epochs | Batch sizes | Learning rate | Optimizer | Momentum | Dropout |
|---|---|---|---|---|---|---|
| CVC clinic DB | 10 | 16 | 0.0001 | sgd | 0.9 | 0.5 |
| Kvasir2 | 10 | 64 | 0.0001 | sgd | 0.9 | 0.5 |
| Hyper Kvasir labeled | 10 | 64 | 0.0001 | sgd | 0.9 | 0.5 |
End-to-end CNN for CVC clinic DB.
| End-to-end CNNs | Accuracy in % |
|---|---|
| AlexNet | 73.00 |
| DarkNet-19 | 68.00 |
| ResNet-50v2 | 89.00 |
| DenseNet-201 | 98.00 |
| Efficientnet-B7 | 91.00 |
| VGG-16 | 83.00 |
| VGG-19 | 86.00 |
End-to-end CNN for Kvasir 2.
| End-to-end CNNs | Accuracy in % |
|---|---|
| AlexNet | 74.00 |
| DarkNet-19 | 32.00 |
| ResNet-50v2 | 83.00 |
| DenseNet-201 | 87.00 |
| Efficientnet-B7 | 77.00 |
| VGG-16 | 74.00 |
| VGG-19 | 67.00 |
End-to-end CNN for hyper kvasir.
| End-to-end CNNs | Accuracy in % |
|---|---|
| AlexNet | 71.00 |
| DarkNet-19 | 43.00 |
| ResNet-50v2 | 78.00 |
| DenseNet-201 | 84.00 |
| Efficientnet-B7 | 75.00 |
| VGG-16 | 75.00 |
| VGG-19 | 65.00 |
Figure 9End-to-end CNN for CVC clinic DB graphical results.
Figure 10End-to-end CNN for Kvasir 2 graphical results.
Figure 11End-to-end CNN for Hyper Kvasir graphical results.
Comparison accuracies of the end-to-end and fusion CNNs of CVC clinic DB dataset.
| Accuracy in % (training) | Accuracy in % (testing) | SVM in % (training) | SVM in % (testing) | AUC in % | |
|---|---|---|---|---|---|
|
| |||||
| AlexNet | 74.64 | 73.00 | 57.83 | 59.00 | 73.40 |
| DarkNet-19 | 81.61 | 68.00 | 78.65 | 78.00 | 68.38 |
| ResNet-50v2 | 88.61 | 89.00 | 92.53 | 89.00 | 89.18 |
| DenseNet-201 | 97.78 | 98.00 | 95.64 | 97.00 | 98.06 |
| Efficientnet-B7 | 83.36 | 91.00 | 73.40 | 84.00 | 90.52 |
| VGG-16 | 82.38 | 83.00 | 85.59 | 88.00 | 82.73 |
| VGG-19 | 80.87 | 86.00 | 80.60 | 84.00 | 85.88 |
|
| |||||
|
| |||||
| ADaRDEV2-22 | 94.9 | 95.0 | 95.0 | 97.0 | 94.56 |
| RDEV2-22 | 90.1 | 95.0 | 69.2 | 95.0 | 94.97 |
| ADaDR-22 | 92.5 | 93.0 | 95.3 | 97.0 | 93.22 |
| ADaR-22 | 77.1 | 79.0 | 79.9 | 79.0 | 78.64 |
| DaRD-22 | 88.8 | 92.0 | 89.4 | 92.0 | 91.65 |
| ADa-22 | 77.4 | 82.0 | 75.9 | 64.0 | 82.18 |
| RD-22 | 96.1 | 97.0 | 94.2 | 96.0 | 96.51 |
| AD-22 | 50.1 | 50.0 | 68.7 | 54.0 | 50.00 |
| DaR-22 | 81.9 | 85.0 | 86.2 | 78.0 | 85.46 |
Comparison of precision and support of CVC clinic DB classes.
| Classes | High performed CNN models | ||
|---|---|---|---|
| DenseNet-201 | ADaDR-22 | Support | |
| Precision | Precision | ||
| Nonpolyps | 0.97 | 0.91 | 257 |
| Polyps | 1 | 0.96 | 259 |
The comparison accuracies of the end-to-end and fusion CNNs of Kvasir 2 dataset.
| Accuracy in % (training) | Accuracy in % (testing) | SVM in % (training) | SVM in % (testing) | AUC in % | |
|---|---|---|---|---|---|
|
| |||||
| AlexNet | 71.54 | 74.00 | 36.02 | 31.00 | 96.89 |
| DarkNet-19 | 73.00 | 32.00 | 77.11 | 43.00 | 87.58 |
| ResNet-50v2 | 67.95 | 83.00 | 62.52 | 84.00 | 98.03 |
| DenseNet-201 | 82.20 | 87.00 | 78.89 | 87.00 | 98.95 |
| Efficientnet-B7 | 62.14 | 77.00 | 54.16 | 68.00 | 97.16 |
| VGG-16 | 54.29 | 74.00 | 69.66 | 77.00 | 96.79 |
| VGG-19 | 51.04 | 67.00 | 62.52 | 72.00 | 95.68 |
|
| |||||
|
| |||||
| ADaRDEV2-22 | 82.04 | 83.00 | 80.46 | 85.00 | 98.52 |
| RDEV2-22 | 68.04 | 76.00 | 74.75 | 81.00 | 97.32 |
| ADaDR-22 | 80.79 | 74.00 | 76.84 | 80.00 | 97.31 |
| ADaR-22 | 71.54 | 68.00 | 69.29 | 75.00 | 93.82 |
| DaRD-22 | 81.64 | 82.00 | 78.39 | 84.00 | 97.91 |
| ADa-22 | 57.79 | 60.00 | 53.92 | 48.00 | 93.52 |
| RD-22 | 64.95 | 66.00 | 67.70 | 75.00 | 94.75 |
| AD-22 | 66.23 | 50.00 | 63.75 | 64.00 | 87.21 |
| DaR-22 | 62.54 | 70.00 | 65.11 | 60.00 | 94.95 |
Comparison of precision and support of Kvasir 2 classes.
| Classes | High performed CNN model | ||
|---|---|---|---|
| DenseNet-201 | DaRD-22 | Support | |
| Precision | Precision | ||
| Dyed-lifted-polyps | 0.78 | 0.81 | 300 |
| Dyed-resection-margins | 0.87 | 0.82 | 300 |
| Esophagitis | 0.79 | 0.73 | 300 |
| Normal-cecum | 0.97 | 0.92 | 300 |
| Normal-pylorus | 0.91 | 0.95 | 300 |
| Normal- | 0.78 | 0.72 | 300 |
| Polyps | 0.9 | 0.76 | 300 |
| Ulcerative-colitis | 0.93 | 0.92 | 300 |
Comparing accuracies of the end-to-end and fusion CNNs of Hyper Kvasir dataset.
| Accuracy in % (training) | Accuracy in % (testing) | SVM in % (training) | SVM in % (testing) | AUC in % | |
|---|---|---|---|---|---|
|
| |||||
| AlexNet | 71.74 | 71.00 | 10.75 | 11.00 | 96.70 |
| DarkNet-19 | 75.56 | 43.00 | 77.93 | 60.00 | 87.05 |
| ResNet-50v2 | 61.30 | 78.00 | 71.06 | 78.00 | 94.81 |
| DenseNet-201 | 77.12 | 84.00 | 75.94 | 84.00 | 94.48 |
| Efficientnet-B7 | 58.29 | 75.00 | 53.76 | 70.00 | 94.05 |
| VGG-16 | 54.48 | 68.00 | 70.41 | 75.00 | 93.73 |
| VGG-19 | 49.97 | 65.00 | 63.29 | 70.00 | 91.60 |
|
| |||||
|
| |||||
| ADaRDEV2-22 | 69.2 | 68.0 | 67.8 | 63.0 | 83.0 |
| RDEV2-22 | 56.8 | 60.0 | 64.8 | 69.0 | 80.8 |
| ADaDR-22 | 55.6 | 51.0 | 59.1 | 62.0 | 80.1 |
| ADaR-22 | 61.8 | 55.0 | 62.5 | 63.0 | 80.2 |
| DaRD-22 | 69.4 | 69.0 | 64.6 | 57.0 | 80.7 |
| ADa-22 | 57.5 | 36.0 | 48.3 | 36.0 | 75.5 |
| RD-22 | 54.6 | 57.0 | 56.1 | 61.0 | 78.2 |
| AD-22 | 63.8 | 62.0 | 62.5 | 65.0 | 77.4 |
| DaR-22 | 62.7 | 60.0 | 57.5 | 54.0 | 82.1 |
Comparison of precision and support of Hyper Kvasir classes.
| Classes | High performed CNN models | ||
|---|---|---|---|
| DenseNet-201 | DaRD-22 | Support | |
| Precision | Precision | ||
| Barretts | 0 | 0 | 13 |
| Barretts-short-segment | 0 | 0 | 16 |
| Bbps-0-1 | 0.93 | 0.87 | 194 |
| Bbps-2-3 | 0.96 | 0.87 | 345 |
| Cecum | 0.9 | 0.51 | 303 |
| Dyed-lifted-polyps | 0.81 | 0.21 | 301 |
| Dyed-resection-margins | 0.81 | 0.48 | 297 |
| Esophagitis-a | 0.46 | 0 | 121 |
| Esophagitis-b-d | 0.63 | 0 | 78 |
| Hemorrhoids | 0 | 0 | 6 |
| Ileum | 0 | 0 | 3 |
| Impacted-stool | 0.85 | 0 | 40 |
| Polyps | 0.82 | 0.93 | 309 |
| Pylorus | 0.94 | 0.91 | 300 |
| Retroflex-rectum | 0.89 | 0.86 | 117 |
| Retroflex-stomach | 0.99 | 0.99 | 230 |
| Ulcerative-colitis-grade-0-1 | 0 | 0 | 11 |
| Ulcerative-colitis-grade-1 | 0.5 | 0 | 61 |
| Ulcerative-colitis-grade-1-2 | 0 | 0 | 4 |
| Ulcerative-colitis-grade-2 | 0.54 | 0.69 | 133 |
| Ulcerative-colitis-grade-2-3 | 0 | 0 | 9 |
| Ulcerative-colitis-grade-3 | 1 | 0 | 40 |
|
| 0.69 | 0.53 | 280 |
Figure 12Comparing accuracies of end-to-end CNNs with CVC clinic DB dataset.
Figure 13Comparing accuracies of fusion CNNs with CVC clinic DB dataset.
Figure 14Comparing accuracies of end-to-end CNNs with Kvasir 2 dataset.
Figure 15Comparing accuracies of fusion CNNs with Kvasir 2 dataset.
Figure 16Comparing accuracies of end-to-end CNNs with Kvasir 2 dataset.
Figure 17Comparing accuracies of fusion CNNs with Kvasir 2 dataset.
Comparison accuracies of the end-to-end of CVC clinic DB dataset.
| DWT | |||||
|---|---|---|---|---|---|
| End-to-end CNN | Accuracy in % (training) | Accuracy in % (testing) | SVM in % (training) | SVM in % (testing) | AUC in % |
| AlexNet | 77.67 | 52.13 | 70.64 | 68.00 | 51.94 |
| DarkNet-19 | 85.23 | 86.04 | 83.27 | 77.00 | 86.05 |
| ResNet-50v2 | 90.39 | 96.13 | 90.48 | 95.00 | 96.32 |
| DenseNet-201 | 97.33 | 99.03 | 95.37 | 99.00 | 99.03 |
| Efficientnet-B7 | 84.34 | 90.69 | 71.80 | 71.00 | 90.68 |
| VGG-16 | 80.69 | 82.36 | 84.88 | 86.00 | 82.33 |
| VGG-19 | 78.65 | 86.43 | 80.78 | 88.00 | 86.44 |
Comparison accuracies of the end-to-end Kvasir 2 dataset.
| DWT | |||||
|---|---|---|---|---|---|
| End-to-end CNN | Accuracy in % (training) | Accuracy in % (testing) | SVM in % (training) | SVM in % (testing) | AUC in % |
| AlexNet | 69.72 | 47.50 | 45.44 | 36.00 | 92.92 |
| DarkNet-19 | 74.26 | 65.66 | 74.15 | 57.00 | 96.10 |
| ResNet-50v2 | 66.12 | 83.20 | 74.15 | 83.00 | 98.09 |
| DenseNet-201 | 81.01 | 88.45 | 80.53 | 88.00 | 99.04 |
| Efficientnet-B7 | 61.67 | 78.45 | 54.16 | 70.00 | 97.36 |
| VGG-16 | 57.24 | 73.87 | 71.32 | 78.00 | 96.63 |
| VGG-19 | 50.11 | 69.00 | 63.83 | 77.00 | 95.79 |
Comparison accuracies of the end-to-end of Hyper Kvasir dataset.
| DWT | |||||
|---|---|---|---|---|---|
| End-to-end CNN | Accuracy in % (training) | Accuracy in % (testing) | SVM in % (training) | SVM in % (testing) | AUC in % |
| AlexNet | 74.62 | 58.54 | 10.75 | 11.00 | 95.61 |
| DarkNet-19 | 76.32 | 50.70 | 78.15 | 51.00 | 93.72 |
| ResNet-50v2 | 62.84 | 79.32 | 72.84 | 79.00 | 94.94 |
| DenseNet-201 | 77.71 | 83.61 | 78.17 | 84.00 | 93.39 |
| Efficientnet-B7 | 57.84 | 75.02 | 50.51 | 68.00 | 92.86 |
| VGG-16 | 53.59 | 68.29 | 69.44 | 75.00 | 93.25 |
| VGG-19 | 49.45 | 64.96 | 62.85 | 71.00 | 92.09 |
Figure 18Comparing accuracies of DWT end-to-end with CVC clinic DB dataset.
Figure 19Comparing accuracies of DWT end-to-end with Kvasir 2 dataset.
Figure 20Comparing accuracies of DWT end-to-end with Hyper Kvasir dataset.
The comparison accuracies of the fusion CNNs of the CVC clinic DB dataset.
| DWT | |||||
|---|---|---|---|---|---|
| Fusion CNNs | Accuracy in % (training) | Accuracy in % (testing) | SVM in % (training) | SVM in % (testing) | AUC in % |
| ADaRDEV2-22 | 94.84 | 95.73 | 92.62 | 94.00 | 95.73 |
| RDEV2-22 | 97.24 | 98.00 | 93.42 | 95.00 | 98.06 |
| ADaDR-22 | 93.77 | 95.54 | 91.99 | 93.00 | 95.54 |
| ADaR-22 | 84.16 | 89.53 | 89.32 | 93.00 | 89.52 |
| DaRD-22 | 95.46 | 96.70 | 93.86 | 96.00 | 96.70 |
| ADa-22 | 83.10 | 88.37 | 70.37 | 67.00 | 88.38 |
| RD-22 | 97.24 | 98.44 | 94.48 | 96.00 | 98.45 |
| AD-22 | 90.57 | 95.93 | 92.88 | 96.00 | 95.92 |
| DaR-22 | 88.43 | 93.60 | 91.28 | 93.00 | 93.59 |
Comparison accuracies of the fusion CNNs of the Kvasir 2 dataset.
| DWT | |||||
|---|---|---|---|---|---|
| Fusion CNNs | Accuracy in % (training) | Accuracy in % (testing) | SVM in % (training) | SVM in % (testing) | AUC in % |
| ADaRDEV2-22 | 79.61 | 81.20 | 79.79 | 81.00 | 98.34 |
| RDEV2-22 | 71.01 | 77.66 | 75.57 | 80.00 | 97.81 |
| ADaDR-22 | 80.52 | 82.29 | 79.31 | 82.00 | 98.33 |
| ADaR-22 | 64.77 | 57.79 | 62.24 | 64.00 | 93.93 |
| DaRD-22 | 78.52 | 80.37 | 77.01 | 82.00 | 97.81 |
| ADa-22 | 64.84 | 57.79 | 53.99 | 44.00 | 90.14 |
| RD-22 | 68.88 | 70.29 | 70.23 | 76.00 | 96.54 |
| AD-22 | 75.27 | 76.83 | 74.66 | 65.00 | 97.49 |
| DaR-22 | 67.62 | 70.83 | 63.97 | 63.00 | 95.50 |
Comparison accuracies of the fusion CNNs of the Hyper Kvasir dataset.
| DWT | |||||
|---|---|---|---|---|---|
| Fusion CNNs | Accuracy in % (training) | Accuracy in % (testing) | SVM in % (training) | SVM in % (testing) | AUC in % |
| ADaRDEV2-22 | 69.54 | 72.56 | 70.44 | 58.00 | 82.30 |
| RDEV2-22 | 60.0% | 62.41 | 62.38 | 66.00 | 88.16 |
| ADaDR-22 | 50.87 | 56.36 | 56.43 | 60.00 | 81.95 |
| ADaR-22 | 63.44 | 60.19 | 62.68 | 48.00 | 79.06 |
| DaRD-22 | 65.53 | 57.20 | 59.02 | 60.00 | 80.54 |
| ADa-22 | 60.82 | 37.02 | 57.63 | 37.00 | 82.70 |
| RD-22 | 53.23 | 55.96 | 52.01 | 54.00 | 77.34 |
| AD-22 | 57.97 | 48.36 | 52.32 | 36.00 | 74.53 |
| DaR-22 | 57.79 | 55.27 | 56.88 | 50.00 | 77.34 |
Figure 21Comparing accuracies of DWT fusion CNN with CVC clinic DB dataset.
Figure 22Comparing accuracies of DWT fusion CNN with Kvasir 2 dataset.
Figure 23Comparing accuracies of DWT fusion CNN with Hyper Kvasir dataset.
Comparison of results related studies for CVC clinic DB, Kvasir 2 , and Hyper Kvasir datasets with previous state-of-the-art methods.
| Dataset | Author | Method | Accuracy in % |
|---|---|---|---|
| CVC clinic DB | Attallah and Sharkas [ | GastroCADx | — |
| Liew et al. [ | Ensemble classifier (ResNet50+Adaboost) | 97.91 | |
| Sharma et al. [ | Ensemble classifier | 98.3 | |
| Nisha and Palanisamy [ | DP-CNN | 99.60 | |
| Souaidi and Ansari [ | MP-FSSD | 91.56 | |
| Ours | ColoRectalCADx (proposed) | 99.00 | |
|
| |||
| Kvasir2 | Attallah and Sharkas [ | GastroCADx | 97.3 |
| Sharma et al. (2022) [ | Ensemble classifier | 97 | |
| Ours | ColoRectalCADx (proposed) | 88.00 | |
|
| |||
| Hyper Kvasir | Attallah and Sharkas [ | GastroCADx | 99.7 |
| Ours | ColoRectalCADx (proposed) | 84.00 | |
Figure 24Confusion matrices of (a) CVC clinic DB dataset. (b) Kvasir 2. (c) Hyper Kvasir.
Figure 25ROC curves of (a) CVC clinic DB dataset. (b) Kvasir 2. (c) Hyper Kvasir labeled.
Parameters of UNet for semantic segmentation.
| Dataset | Epochs | Learning rate | Batch size | Train loss | Test loss | Total time taken for model (s) |
|---|---|---|---|---|---|---|
| CVC clinic DB | 40 | 0.001 | 64 | 0.2998 | 0.7842 | 862.61 |
| Kvasir2 | 40 | 0.001 | 64 | 0.412 | 0.6977 | 1477.32 |
| Hyper Kvasir | 40 | 0.001 | 64 | 0.4005 | 0.691 | 1374.04 |
Figure 26Semantic segmentation for predicted polyps. (a) CVC clinic DB. (b) Kvasir2. (c) Hyper Kvasir.
Figure 27Loss graph. (a) CVC clinic DB. (b) Kvasir2. (c) Hyper Kvasir.