| Literature DB >> 31594753 |
Hao Xiong1, Peiliang Lin1, Jin-Gang Yu2, Jin Ye3, Lichao Xiao2, Yuan Tao4, Zebin Jiang5, Wei Lin6, Mingyue Liu3, Jingjing Xu7, Wenjie Hu7, Yuewen Lu7, Huaifeng Liu7, Yuanqing Li8, Yiqing Zheng9, Haidi Yang10.
Abstract
OBJECTIVE: To develop a deep convolutional neural network (DCNN) that can automatically detect laryngeal cancer (LCA) in laryngoscopic images.Entities:
Mesh:
Year: 2019 PMID: 31594753 PMCID: PMC6838439 DOI: 10.1016/j.ebiom.2019.08.075
Source DB: PubMed Journal: EBioMedicine ISSN: 2352-3964 Impact factor: 8.143
Details of the image sets used for experiments.
| Image sets | LCA | PRELCA | BLT | NORM | Total |
|---|---|---|---|---|---|
| DS1 | 1776 (164) | 1476 (162) | 5127 (619) | 2513 (506) | 10,892 (1451) |
| DS2 | 517 (42) | 331 (41) | 1321 (155) | 660 (127) | 2829 (365) |
| DS1 + DS2 | 2293 (206) | 1807 (203) | 6448 (774) | 3191 (633) | 13,721 (1816) |
| DS3 | 132 (44) | 129 (43) | 504 (168) | 411 (137) | 1176 (392) |
| Total | 2425 (250) | 1936 (246) | 6952 (942) | 3602 (770) | 14,897 (2208) |
Fig. 1Overview of the deep learning architecture. Parameters pre-trained on the external ImageNet dataset are used to initialize the deep convolutional neural network, which is then fine-tuned on the target dataset.
Fig. 2Illustration of the changes of the loss function value (A) and the classification accuracy (B) over the training and testing sets.
Fig. 3The sensitivity-specificity curve for Urgent versus Non-urgent binary classification.
Fig. 4Confusion matrix for 4-class categorization.
Fig. 5Comparison between the deep learning algorithm and three human experts in Urgent versus Non-urgent binary classification. Expert 1: expert with 10–20 years of experience. Expert 2: expert with ~3 years of experience. Expert 3: expert with 3–10 years of experience.
Fig. 6Four-class confusion matrices obtained by our DCNN model (A) and the three human experts (B-D). Expert 1: expert with 10–20 years of experience. Expert 2: expert with ~3 years of experience. Expert 3: expert with 3–10 years of experience.
Summary of the quantitative experimental results.
| Evaluation settings/Methods | 2-Class | 4-Class | |||
|---|---|---|---|---|---|
| Sensitivity | Specificity | Accuracy | Accuracy | ||
| DS1 training, DS2 testing | DCNN | 0.731 | 0.922 | 0.867 | 0.745 |
| DS1 + DS2 training, DS3 testing | DCNN | 0.720 | 0.948 | 0.897 | 0.773 |
| Verikas et al. [ | 0.603 | 0.820 | 0.651 | 0.460 | |
| Expert 1 | 0.761 | 0.946 | 0.906 | 0.750 | |
| Expert 2 | 0.875 | 0.801 | 0.817 | 0.647 | |
| Expert 3 | 0.702 | 0.902 | 0.858 | 0.704 | |
Fig. 7Representative attention maps obtained by the DCNN model on the classes of NORM, BLT, PRELCA and LCA from top to bottom respectively. Attention maps are displayed as heat maps overlay upon the original images, where warmer colors indicate higher saliency, i.e., higher contribution to the classification decision.