| Literature DB >> 34689442 |
Bofan Song1, Shaobai Li1, Sumsum Sunny2, Keerthi Gurushanth3, Pramila Mendonca4, Nirza Mukhia3, Sanjana Patrick5, Shubha Gurudath3, Subhashini Raghavan3, Imchen Tsusennaro6, Shirley T Leivon6, Trupti Kolur4, Vivek Shetty4, Vidya Bushan4, Rohan Ramesh6, Tyler Peterson1, Vijay Pillai4, Petra Wilder-Smith7, Alben Sigamani4, Amritha Suresh2,4, Moni Abraham Kuriakose8, Praveen Birur3,5, Rongguang Liang1.
Abstract
SIGNIFICANCE: Early detection of oral cancer is vital for high-risk patients, and machine learning-based automatic classification is ideal for disease screening. However, current datasets collected from high-risk populations are unbalanced and often have detrimental effects on the performance of classification. AIM: To reduce the class bias caused by data imbalance. APPROACH: We collected 3851 polarized white light cheek mucosa images using our customized oral cancer screening device. We use weight balancing, data augmentation, undersampling, focal loss, and ensemble methods to improve the neural network performance of oral cancer image classification with the imbalanced multi-class datasets captured from high-risk populations during oral cancer screening in low-resource settings.Entities:
Keywords: deep learning; ensemble learning; imbalanced multi-class datasets; mobile screening device; oral cancer
Mesh:
Year: 2021 PMID: 34689442 PMCID: PMC8536945 DOI: 10.1117/1.JBO.26.10.105001
Source DB: PubMed Journal: J Biomed Opt ISSN: 1083-3668 Impact factor: 3.758
Fig. 1Distribution of imbalanced oral cheek mucosa image dataset collected from high-risk population.
Fig. 2The bootstrap aggregation ensemble method for imbalanced dataset.
The results table of experiments in the study.
| Dataset/loss function | Original/CE | Original/weight-balanced CE | Original/focal loss | Oversampled/CE | RUS and oversampled/CE |
|---|---|---|---|---|---|
| 0.11 | 0.17 | 0.30 | 0.62 | 0.68 | |
| 0.80 | 0.80 | 0.80 | 0.81 | 0.84 | |
| 0.89 | 0.88 | 0.89 | 0.94 | 0.95 | |
| 0.74 | 0.72 | 0.72 | 0.71 | 0.73 | |
| Recall (benign) | 0.10 | 0.12 | 0.23 | 0.57 | 0.69 |
| Recall (malignancy) | 0.75 | 0.67 | 0.67 | 0.71 | 0.75 |
| Recall (normal) | 0.94 | 0.92 | 0.93 | 0.92 | 0.92 |
| Recall (premalignancy) | 0.73 | 0.74 | 0.71 | 0.82 | 0.80 |
| Precision (benign) | 0.36 | 0.35 | 0.41 | 0.67 | 0.69 |
| Precision (malignancy) | 0.86 | 0.98 | 0.98 | 0.94 | 0.95 |
| Precision (normal) | 0.85 | 0.85 | 0.86 | 0.95 | 0.98 |
| Precision (premalignancy) | 0.75 | 0.71 | 0.73 | 0.63 | 0.67 |
| Macroaverage precision | 0.70 | 0.73 | 0.75 | 0.80 | 0.83 |
| Macroaverage recall | 0.62 | 0.61 | 0.64 | 0.76 | 0.79 |
| Macroaverage | 0.64 | 0.65 | 0.68 | 0.77 | 0.81 |
| Total accuracy | 0.81 | 0.80 | 0.81 | 0.78 | 0.81 |
| Balanced accuracy | 0.62 | 0.61 | 0.64 | 0.75 | 0.80 |
Fig. 3Data augmentation examples of our oral cheek mucosa dataset.
Fig. 4Confusion matrix: (a) trained with RUS and data augmented balanced dataset using focal loss and (b) trained with bootstrap aggregation ensemble method.
Fig. 5ROC curve of each class and micro-/macroaverage combined: (a) trained with RUS and data augmented balanced dataset using focal loss and (b) trained with bootstrap aggregation ensemble method (In this figure, classes 0 to 4 represent class benign, malignant, normal, and premalignant, respectively).