| Literature DB >> 35875502 |
Xinyi Zhu1, Cancan Chen2, Qiang Guo3, Jianhui Ma4, Fenglong Sun2, Haizhen Lu1.
Abstract
Introduction: The pathological rare category of thyroid is a type of lesion with a low incidence rate and is easily misdiagnosed in clinical practice, which directly affects a patient's treatment decision. However, it has not been adequately investigated to recognize the rare, benign, and malignant categories of thyroid using the deep learning method and recommend the rare to pathologists.Entities:
Keywords: WSI; deep learning model; pathology; rare category; thyroid cancer
Year: 2022 PMID: 35875502 PMCID: PMC9298848 DOI: 10.3389/fbioe.2022.857377
Source DB: PubMed Journal: Front Bioeng Biotechnol ISSN: 2296-4185
Dataset information.
| Ground truth | Common/rare | Subtype | WSI count | Percentage (%) |
|---|---|---|---|---|
| Malignant | Common | PTC | 536 | 39.01 |
| Malignant | Rare | Other TC | 5 | 0.36 |
| Intermediate | Rare | TAL | 72 | 5.24 |
| Intermediate | Rare | TFCN | 45 | 3.28 |
| Benign | Rare | Other BTL | 25 | 1.82 |
| Benign | Common | NG | 691 | 50.29 |
| Total | — | — | 1374 | 100 |
WSI counts for training validation and test datasets.
| Ground truth | Subtype | Training | Validation | Test1 | Test2 |
|---|---|---|---|---|---|
| Malignant | PTC | 200 | 53 | 283 | 283 |
| Malignant | Other TC | 0 | 0 | 0 | 5 |
| Intermediate | TAL | 0 | 0 | 0 | 72 |
| Intermediate | TFCN | 0 | 0 | 0 | 45 |
| Benign | Other BTL | 0 | 0 | 0 | 25 |
| Benign | NG | 296 | 61 | 334 | 334 |
| Total | — | 496 | 114 | 617 | 764 |
Test 1 simulates the dataset for research. Test 2 simulates the real-world dataset in clinical practice.
FIGURE 1An overview of the proposed WSI diagnostic framework presented in this study. (A) The WSI slide with the region of interest (green line) and carcinoma region (blue line). (B) The process of patch-based UNet model training. (C) The process of patch-based UNet inference. (D) The WSI heatmap. (E) A random forest was selected for the WSI-based classification task. (F) The proposed triple classification model.
FIGURE 4Decision tree for distinguishing the rare from the common benign and malignant categories.
The test dataset in clinical practice was relabeled based on our binary classification results and the ground truth for evaluating the triple classification model performance.
| Subtype | The common benign | The common malignant | The rare | Total |
|---|---|---|---|---|
| PTC | 0 | 270 | 13 | 283 |
| Other TC | 0 | 5 | 0 | 5 |
| TAL | 0 | 0 | 72 | 72 |
| TFCN | 0 | 0 | 45 | 45 |
| Other BTL | 20 | 0 | 5 | 25 |
| NG | 325 | 0 | 9 | 334 |
| Total | 345 | 275 | 144 | 764 |
The common benign and malignant represented the types correctly classified by patch-UNet, for the benign (NG) and malignant (PTC) types, respectively. The rare represented the misclassified PTC, other TC, hard determined NG, other BTL, and the intermediate WSIs.
FIGURE 2(A) AUC for the different test datasets. The Test 1 dataset contained common benign and malignant pathological subtypes (PTC and NG). Test 2 contained not only the common PTC and NG types included in Test 1 but also two intermediate types (TAL and TFCN) and two other rare types (other TC and BTL). (B) The confusion matrix for the benign, malignant, and intermediate subtypes.
FIGURE 3Misclassification examples of selected slides in the Test 1 dataset. Examples of true positive, false negative, false positive, and other subtype slides are represented. Our model performed efficiently in tumor carcinomas for true positives (PTC), but the false positives (NG) were mainly caused by the fibrotic tissue in both the normal and carcinoma regions. Fibrotic tissue sometimes has a larger area or diameter than certain carcinoma regions for false negatives (PTC). Because fibrotic tissue is quite common in thyroid WSIs, the TAL, TFCN, and other BTL slides outside our training slides showed clear differences in structural features from PTC and NG slides, resulting in a heatmap far from the ground truth.