| Literature DB >> 36199768 |
Xiaoya Fan1, Lihui Yu2, Xin Qi2, Xue Lin2, Junjun Zhao2, Dong Wang2, Jing Zhang2.
Abstract
Background: Accurate pathological diagnosis of gastric endoscopic biopsy could greatly improve the opportunity of early diagnosis and treatment of gastric cancer. The Japanese "Group classification" of gastric biopsy corresponds well with the endoscopic diagnostic system and can guide clinical treatment. However, severe shortage of pathologists and their heavy workload limit the diagnostic accuracy. This study presents the first attempt to investigate the applicability and effectiveness of AI-aided system for automated Japanese "Group classification" of gastric endoscopic biopsy.Entities:
Mesh:
Year: 2022 PMID: 36199768 PMCID: PMC9529421 DOI: 10.1155/2022/6899448
Source DB: PubMed Journal: Comput Math Methods Med ISSN: 1748-670X Impact factor: 2.809
Figure 1Workflow of the study.
Figure 2Typical examples of Groups 1~5 according to the Japanese “Group classification.” Framed in the left corner of each subfigure is the typical appearance of the corresponding class.
Description of Japanese “Group classification”.
| Group | Description |
|---|---|
| Group 1 | Normal tissue or nonneoplastic lesion tissue |
| Group 2 | Difficult to make diagnosis between neoplastic and nonneoplastic lesions |
| Group 3 | Adenoma |
| Group 4 | Can be diagnosed as neoplastic lesion and is suspected to be carcinoma |
| Group 5 | Carcinoma |
Figure 3An example of annotated WSI, segmented with labelme. Outlined in red was annotated as Group 5 while in green was annotated as Group 1.
Information of datasets.
| Group 1 | Group 2 | Group 3 | Group 4 | Group 5 | |
|---|---|---|---|---|---|
| Number of WSIs | 166 | 84 | 45 | 68 | 58 |
| Dataset 1 (400 × 400) | 45001 | 10274 | 17999 | 20845 | 41521 |
| Dataset 2 (600 × 600) | 21430 | 4307 | 7584 | 8967 | 17834 |
| Dataset 3 (800 × 800) | 10875 | 2291 | 4204 | 4931 | 9944 |
| Dataset 4 (1000 × 1000) | 7243 | 1387 | 2652 | 3044 | 6318 |
| Dataset 5 (mixed size∗) | 39548 | 7985 | 14440 | 16942 | 34096 |
∗Tiles with various sizes (600 × 600, 800 × 800, and 1000 × 1000) were mixed to construct Dataset 5.
Test accuracy of five different models∗.
| Model | AUC | Acc (%) |
|---|---|---|
| ResNet50 | 0.988 | 89.5 |
| VGG16 | 0.970 | 83.1 |
| VGG19 | 0.949 | 76.6 |
| Xception | 0.894 | 66.2 |
| InceptionV3 | 0.881 | 63.0 |
∗The models were trained on Dataset 1.
Figure 4Performance (accuracy, recall, precision, and F1 score) of five different models for each group evaluated on the test set. All models were trained on Dataset 1. The values of the metrics for ResNet50 are highlighted above the corresponding green bars.
AUC and accuracy of ResNet-50 models trained on different datasets.
| Dataset | AUC | Acc (%) |
|---|---|---|
| Dataset 1 (400 × 400) | 0.988 | 89.48% |
| Dataset 2 (600 × 600) | 0.989 | 90.64% |
| Dataset 3 (800 × 800) | 0.983 | 88.09% |
| Dataset 4 (1000 × 1000) | 0.979 | 87.21% |
| Dataset 5 (mixed size) | 0.994 | 93.16% |
Figure 5Performance (accuracy, recall, precision, and F1 score) of ResNet50 evaluated on the test set. The models were trained on five different datasets. The values of the metrics for ResNet50 trained on Dataset 5 are highlighted above the green bars.
Figure 6Normalized confusion matrix of ResNet50 on the test set. The model was trained on Dataset 5.
The false-positive rate (FPR) and false-negative rate (FNR) of each group.
| Group 1 | Group 2 | Group 3 | Group 4 | Group 5 | |
|---|---|---|---|---|---|
| FPR | 5.94% | 16.93% | 5.60% | 10.01% | 4.32% |
| FNR | 7.24% | 11.52% | 8.93% | 9.30% | 3.18% |