| Literature DB >> 35559018 |
Pallabi Sharma1, Bunil Kumar Balabantaray1, Kangkana Bora2, Saurav Mallik3, Kunio Kasugai4, Zhongming Zhao3,5,6.
Abstract
Colorectal cancer (CRC) is the third leading cause of cancer death globally. Early detection and removal of precancerous polyps can significantly reduce the chance of CRC patient death. Currently, the polyp detection rate mainly depends on the skill and expertise of gastroenterologists. Over time, unidentified polyps can develop into cancer. Machine learning has recently emerged as a powerful method in assisting clinical diagnosis. Several classification models have been proposed to identify polyps, but their performance has not been comparable to an expert endoscopist yet. Here, we propose a multiple classifier consultation strategy to create an effective and powerful classifier for polyp identification. This strategy benefits from recent findings that different classification models can better learn and extract various information within the image. Therefore, our Ensemble classifier can derive a more consequential decision than each individual classifier. The extracted combined information inherits the ResNet's advantage of residual connection, while it also extracts objects when covered by occlusions through depth-wise separable convolution layer of the Xception model. Here, we applied our strategy to still frames extracted from a colonoscopy video. It outperformed other state-of-the-art techniques with a performance measure greater than 95% in each of the algorithm parameters. Our method will help researchers and gastroenterologists develop clinically applicable, computational-guided tools for colonoscopy screening. It may be extended to other clinical diagnoses that rely on image.Entities:
Keywords: colonoscopy; colorectal cancer; deep learning; ensemble classifier; polyp detection
Year: 2022 PMID: 35559018 PMCID: PMC9086187 DOI: 10.3389/fgene.2022.844391
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
FIGURE 1Workflow of the proposed system.
Summary of datasets used in this study.
| Dataset | # Frames | # Frames with polyps | ||
|---|---|---|---|---|
| Informative | Uninformative | Cancerous | Noncancerous | |
| Aichi-Medical dataset | 397 | 500 | 125 | 272 |
| Kvasir dataset | 500 | 500 | - | - |
| Depeca colonoscopy dataset | — | — | 55 | 21 |
Performance measures for evaluating the detection model.
| Measures | Formula | Description |
|---|---|---|
| Accuracy |
| The ratio of the number of correct prediction with respect to total observations |
| Precision |
| The ratio of the number of correct positive prediction with respect to total positive prediction |
| Recall/Sensitivity |
| The ratio of number of correct positive prediction with respect to actual positive observation |
| F1 score/Dice-coefficient |
| F1 score is the harmonic mean of both precision and recall |
FIGURE 2Five-fold cross-validation accuracy and loss of each individual classifier for (A) informative frame detection and (B) cancerous and noncancerous polyp categorization.
FIGURE 3Test results of all four classifiers for (A) informative frame detection and (B) cancerous and noncancerous polyp classification.
FIGURE 4Confusion matrix of each individual classifier for (A) informative frame detection and (B) cancerous and noncancerous polyp classification.
FIGURE 5Area under the ROC curve analysis for (A) informative frame detection and (B) cancerous and noncancerous polyp classification.
FIGURE 6Performance comparison of Ensemble classifiers. (A) Performance of Ensemble classifier on significant frame detection. (B) Performance of Ensemble classifier on classification of cancerous and noncancerous polyps.
Classification performance in comparison with similar work.
| Objective | Methods | Algorithm | Accuracy | Precision | Recall | F1 Score | Specificity |
|---|---|---|---|---|---|---|---|
| Informative frame detection | Proposed Ensemble | CNN |
| 98.6 |
|
| 98.66 |
|
| CNN | 90.28 | 74.34 | 68.32 | 71.20 | 94.97 | |
|
| Ensemble (SVM + CNN) | 98.0 |
| 97.6 | 98.00 | - | |
|
| CNN | 86.69 | 86.28 | 28.90 | 43.30 | 99.02 | |
|
| Ensemble (ResNet50 + Adaboost) | 97.91 | 99.35 | 96.45 | — |
| |
| Cancerous and noncancerous polyp identification | Proposed Ensemble | CNN |
|
|
|
|
|
|
| CNN | 90.00 | — | 88.1 | — | — | |
|
| Ensemble (SVM + CNN) | 85.90 | 87.30 | 87.60 | 87.00 | — | |
|
| CNN | 90.00 | — | 94.50 | — | — | |
|
| CNN | 83.00 | 81.00 | 86.00 | 83.00 | — |
*Bold values indicate the best performance.
Significance of Ensemble classifier decision in comparison with individual classifiers.
| Classifier | Chi-squared Value |
|
|---|---|---|
| ResNet101 vs. Ensemble | 4.16 | 0.041 |
| GoogLeNet vs. Ensemble | 2.28 | 0.039 |
| Xception vs. Ensemble | 6.75 | 0.009 |
| ReNet101 vs. Ensemble | 2.25 | 0.033 |
| GoogLeNet vs. Ensemble | 2.25 | 0.033 |
| Xception vs. Ensemble | 5.81 | 0.015 |
p-value is based on the McNemar test.