| Literature DB >> 35485691 |
Tetsuya Tsukamoto1, Atsushi Teramoto2, Ayumi Yamada2, Yuka Kiriyama1,3, Eiko Sakurai1, Ayano Michiba1, Kazuyoshi Imaizumi4, Hiroshi Fujita5.
Abstract
OBJECTIVE: It is essential to accurately diagnose and classify histological subtypes into adenocarcinoma (ADC), squamous cell carcinoma (SCC), and small cell lung carcinoma (SCLC) for the appropriate treatment of lung cancer patients. However, improving the accuracy and stability of diagnosis is challenging, especially for non-small cell carcinomas. The purpose of this study was to compare multiple deep convolutional neural network (DCNN) technique with subsequent additional classifiers in terms of accuracy and characteristics in each histology.Entities:
Keywords: Artificial intelligence; Liquid-based cytology; Machine Learning
Mesh:
Year: 2022 PMID: 35485691 PMCID: PMC9375620 DOI: 10.31557/APJCP.2022.23.4.1315
Source DB: PubMed Journal: Asian Pac J Cancer Prev ISSN: 1513-7368
Figure 1Data Augmentation Methods
Figure 2Architectures of Deep Convolutional Neural Networks and Subsequent Classifiers Used in this Study
Number of Images in Each Dataset for Cross Validation
| SET 1 | SET 2 | SET 3 | Total number of | ||||
|---|---|---|---|---|---|---|---|
| Original | Augmented | Original | Augmented | Original | Augmented | ||
| Adenocarcinoma | 23 | 5000 | 27 | 5000 | 25 | 5000 | 75 |
| Squamous cell carcinoma | 44 | 5000 | 40 | 5000 | 46 | 5000 | 130 |
| Small cell carcinoma | 26 | 5000 | 26 | 5000 | 32 | 5000 | 84 |
| Total | 289 | ||||||
Parameters of the Used Classifiers
| Parameter | Value | |
|---|---|---|
| Naïve Bayes | - | - |
| Support vector machine | Regression loss epsion | 0.1 |
| Cost | 1.00 | |
| Kernel | Polynomial | |
| Random forest | Number of trees | 20 |
| Neural network | Number of Hidden layers | 2 (100-100) |
| Activation function | tanh | |
| Optimizer | SGD | |
| Number of iteration | 300 |
Confusion Matrix of Classification Results by Fine-Tuned DCNN Architectures
| AlexNet | Predicted | Total | Overall accuracy | |||
|---|---|---|---|---|---|---|
| ADC | SqCC | SmCLC | ||||
| Actual | ADC | 57 (76.0%) | 10 (13.3%) | 8 (10.7%) | 75 (100%) | |
| SqCC | 37 (28.5%) | 75 (57.7%) | 18 (13.8%) | 130 (100%) | ||
| SmCLC | 1 (1.2%) | 2 (2.4%) | 81 (96.4%) | 84 (100%) | 73.7% | |
| GoogLeNet (InceptionV3) | Predicted | Total | Overall accuracy | |||
| ADC | SqCC | SmCLC | ||||
| Actual | ADC | 46 (61.3%) | 17 (22.7%) | 12 (16.0%) | 75 (100%) | |
| SqCC | 36 (20.0%) | 80 (61.5%) | 14 (10.8%) | 130 (100%) | ||
| SmCLC | 13 (15.5%) | 4 (4.8%) | 67 (79.8%) | 84 (100%) | 66.8% | |
| VGG16 | Predicted | Total | Overall accuracy | |||
| ADC | SqCC | SmCLC | ||||
| Actual | ADC | 56 (74.7%) | 12 (16.0%) | 7 (9.3%) | 75 (100%) | |
| SqCC | 25 (19.2%) | 89 (68.5%) | 16 (12.3%) | 130 (100%) | ||
| SmCLC | 3 (3.6%) | 4 (4.8%) | 77 (91.7%) | 84 (100%) | 76.8% | |
| ResNet50 | Predicted | Total | Overall accuracy | |||
| ADC | SqCC | SmCLC | ||||
| Actual | ADC | 56 (74.7%) | 14 (18.7%) | 5 (6.7%) | 75 (100%) | |
| SqCC | 30 (23.1%) | 88 (67.7%) | 12 (9.2%) | 130 (100%) | ||
| SmCLC | 5 (6.0%) | 9 (10.7%) | 70 (83.3%) | 84 (100%) | 74.0% | |
ADC, adenocarcinoma; SqCC, squamous cell carcinoma; SmCLC, small cell lung carcinoma
Kappa Coefficients between Every 2 Fine-Tuned DCNNs
| AlexNet | Inception V3 | VGG16 | ResNet50 | |
|---|---|---|---|---|
| AlexNet | – | 0.565±0.040 | 0.715±0.035 | 0.654±0.037 |
| Inception V3 | – | – | 0.543±0.041 | 0.548±0.041 |
| VGG16 | – | – | – | 0.672±0.037 |
| ResNet50 | – | – | – | – |
–, not applicable
Figure 3Representative Images of Adenocarcinoma. v and x represent correct- and mis-classifications, respectively. Papanicolaou staining. Ad, adenocarcinoma; Sq, squamous cell carcinoma; Sm, small cell carcinoma
Figure 4Representative Images of Squamous Cell Carcinoma. v and x represent correct- and mis-classifications, respectively. Papanicolaou staining
Figure. 5Representative Images of Small Cell Carcinoma. v and x represent correct- and mis-classifications, respectively. Papanicolaou staining
Figure 6Two Resembling Non-Small Cell Lung Carcinomas, v and x Represent Correct- and mis-classifications, respectively. Papanicolaou and immunostaining for TTF-1 and p40
Figure 7Non-Resembling Images of Squamous Cell Carcinomas from a Same Patient. v and x represent correct- and mis-classifications, respectively. Papanicolaou and immunostaining for TTF-1 and p40
Confusion Matrix of Classification Results by Additional Classifiers
| Naïve Bayes | Predicted | total | Overall accuracy | |||
|---|---|---|---|---|---|---|
| ADC | SqCC | SmCLC | ||||
| Actual | ADC | 59 (78.7%) | 11 (14.7%) | 5 (6.7%) | 75 (100%) | |
| SqCC | 30 (23.1%) | 92 (70.8%) | 8 (6.2%) | 130 (100%) | ||
| SmCLC | 5 (6.0%) | 13 (15.5%) | 66 (75.6%) | 84 (100%) | 75.1% | |
| Support vector machine | Predicted | total | Overall accuracy | |||
| ADC | SqCC | SmCLC | ||||
| Actual | ADC | 44 (58.7%) | 26 (36.7%) | 5 (6.7%) | 75 (100%) | |
| SqCC | 15(11.5%) | 110 (84.6%) | 5 (3.8%) | 130 (100%) | ||
| SmCLC | 0 (0%) | 14 (16.7%) | 70 (83.3%) | 84 (100%) | 77.5% | |
| Random forest | Predicted | total | Overall accuracy | |||
| ADC | SqCC | SmCLC | ||||
| Actual | ADC | 51 (68.0%) | 19 (25.3%) | 5 (6.7%) | 75 (100%) | |
| SqCC | 17 (13.1%) | 107 (82.3%) | 6 (4.6%) | 130 (100%) | ||
| SmCLC | 2 (2.4%) | 14 (16.7%) | 68 (81.0%) | 84 (100%) | 78.2% | |
| Neural network | Predicted | total | Overall accuracy | |||
| ADC | SqCC | SmCLC | ||||
| Actual | ADC | 54 (72.0%) | 17 (22.7%) | 4 (5.3%) | 75 (100%) | |
| SqCC | 21 (16.2%) | 98 (75.4%) | 11 (8.5%) | 130 (100%) | ||
| SmCLC | 4 (4.8%) | 4 (4.8%) | 76 (90.5%) | 84 (100%) | 78.9% | |
ADC, adenocarcinoma; SqCC, squamous cell carcinoma; SmCLC, small cell lung carcinoma