| Literature DB >> 36001628 |
Kritsasith Warin1, Wasit Limprasert2, Siriwan Suebnukarn1, Suthin Jinaporntham3, Patcharapon Jantana4, Sothana Vicharueang4.
Abstract
Artificial intelligence (AI) applications in oncology have been developed rapidly with reported successes in recent years. This work aims to evaluate the performance of deep convolutional neural network (CNN) algorithms for the classification and detection of oral potentially malignant disorders (OPMDs) and oral squamous cell carcinoma (OSCC) in oral photographic images. A dataset comprising 980 oral photographic images was divided into 365 images of OSCC, 315 images of OPMDs and 300 images of non-pathological images. Multiclass image classification models were created by using DenseNet-169, ResNet-101, SqueezeNet and Swin-S. Multiclass object detection models were fabricated by using faster R-CNN, YOLOv5, RetinaNet and CenterNet2. The AUC of multiclass image classification of the best CNN models, DenseNet-196, was 1.00 and 0.98 on OSCC and OPMDs, respectively. The AUC of the best multiclass CNN-base object detection models, Faster R-CNN, was 0.88 and 0.64 on OSCC and OPMDs, respectively. In comparison, DenseNet-196 yielded the best multiclass image classification performance with AUC of 1.00 and 0.98 on OSCC and OPMD, respectively. These values were inline with the performance of experts and superior to those of general practictioners (GPs). In conclusion, CNN-based models have potential for the identification of OSCC and OPMDs in oral photographic images and are expected to be a diagnostic tool to assist GPs for the early detection of oral cancer.Entities:
Mesh:
Year: 2022 PMID: 36001628 PMCID: PMC9401150 DOI: 10.1371/journal.pone.0273508
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Fig 1Examples of the OSCC and OPMDs images from the dataset showing.
(A) OSCC image; (B) annotation of OSCC image by surgeons; (C) OPMDs image; (D) annotation of OPMDs image by surgeons.
Multi-class image classification performances of CNN algorithms on the test sets compared with the average performance of clinicians; ‘oral and maxillofacial surgeons’ vs. ‘GPs’.
| Class | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| OSCC | OPMDs | |||||||||
| Precision | Recall (Sensitivity) | Specificity | F1 score | AUC of ROC curve | Precision | Recall (Sensitivity) | Specificity | F1 score | AUC of ROC curve | |
|
| 0.98 | 0.99 | 0.99 | 0.98 | 1.0 | 0.95 | 0.95 | 0.97 | 0.95 | 0.98 |
|
| 0.96 | 0.92 | 0.94 | 0.94 | 0.99 | 0.97 | 0.97 | 0.94 | 0.97 | 0.97 |
|
| 0.85 | 0.72 | 0.92 | 0.78 | 0.88 | 0.76 | 0.78 | 0.88 | 0.77 | 0.87 |
|
| 0.69 | 0.73 | 0.83 | 0.71 | 0.71 | 0.63 | 0.74 | 0.88 | 0.68 | 0.80 |
|
| - | 0.90 | 0.89 | - | - | - | 0.74 | 0.93 | - | - |
|
| - | 0.77 | 0.87 | - | - | - | 0.68 | 0.86 | - | - |
AUC, area under the curve; ROC, receiver operating characteristics; GPs, General practictioners.
Multi-class object detection performances of CNN algorithms on the test sets.
| Class | ||||||||
|---|---|---|---|---|---|---|---|---|
| OSCC | OPMDs | |||||||
| Precision | Recall (Sensitivity) | F1 score | AUC of precision—recall curve | Precision | Recall (Sensitivity) | F1 score | AUC of precision—recall curve | |
|
| 0.84 | 0.90 | 0.87 | 0.88 | 0.60 | 0.71 | 0.65 | 0.64 |
|
| 0.88 | 0.86 | 0.87 | 0.84 | 0.74 | 0.39 | 0.51 | 0.34 |
|
| 0.98 | 0.82 | 0.89 | 0.81 | 0.92 | 0.57 | 0.70 | 0.55 |
|
| 0.64 | 0.92 | 0.76 | 0.91 | 0.49 | 0.60 | 0.54 | 0.58 |
|
| - | 0.90 | - | - | - | 0.74 | - | - |
|
| - | 0.77 | - | - | - | 0.68 | - | - |
AUC, area under the curve; GPs, General practictioners
Fig 2Example of the Grad-CAM visualization of the DenseNet-169.
(A) Image with OSCC lesion; (B) The model correctly classified OSCC and labeled the correct location. (C) Image with OPMDs lesion (D) The model correctly classified OPMDs and labeled the correct location.
Fig 3(A-B) Bounding box ground truth based on surgeons’ annotations of the imaging of the patient with OSCC at retromolar trigone and lateral tongue, respectively; (C-D) Bounding box ground truth based on surgeons’ annotations of the imaging of the patient with OPMDs at retromolar trigone and lateral tongue, respectively; (E-H) The true positive outputs from the faster R-CNN detection; (I-L) The true positive outputs from the YOLOv5 detection; (M-P) The true positive outputs from the RetinaNet detection; (Q-T) The true positive outputs from the CenterNet2 detection.