| Literature DB >> 33796455 |
Yiqing Hou1, Chao Chen2, Lu Zhang1, Wei Zhou1, Qinyang Lu2, Xiaohong Jia1, Jingwen Zhang1, Cen Guo2, Yuxiang Qin2, Lifeng Zhu3, Ming Zuo3, Jing Xiao2, Lingyun Huang2, Weiwei Zhan1.
Abstract
OBJECTIVE: The aim of this study is to develop a model using Deep Neural Network (DNN) to diagnose thyroid nodules in patients with Hashimoto's Thyroiditis.Entities:
Keywords: Hashimoto’s thyroiditis; deep learning; diagnosis; thyroid nodule; ultrasound
Year: 2021 PMID: 33796455 PMCID: PMC8008116 DOI: 10.3389/fonc.2021.614172
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Figure 1Architecture of our proposed model. DenseNet-161(k = 48) used as the backbone. Different ROI expansions adopted for annotated nodule images of different sizes.
Rules of expanding nodule ROI.
| Longer side length of nodule ROI | Expanded square ROI size |
|---|---|
| 0 < |
|
| 65 < |
|
| 150 < |
|
|
|
|
Different ROI expansions adopted for nodule images of different sizes in pixels.
Baseline characteristics.
| Training Set | Test Set | |
|---|---|---|
|
| 2,364 | 568 |
| Patients with HT | 1,334 (56.4%) | 332 (58.5%) |
| Patients without HT | 1,030 (43.6%) | 236 (41.5%) |
|
| 7,301 | 1,805 |
| Images from patietns with HT | 4,128 (56.5%) | 1,086 (60.2%) |
| Images from patietns without HT | 3,173 (43.5%) | 722 (39.8%) |
|
| 2,924 | 710 |
| Benign nodules | 1,920 (65.7%) | 476 (67%) |
| malignant nodules | 1,004 (34.3%) | 234 (33%) |
|
| ||
| Benign nodules | 1.09 (0.86) | 1.08 (0.89) |
| malignant nodules | 1.08 (0.63) | 1.06 (0.61) |
|
| ||
| Male | 539 (22.8%) | 136 (23.9%) |
| Female | 1825 (77.2%) | 432 (76.1%) |
|
| 45.29 ± 12.45 | 45.09 ± 12.41 |
Figure 2Comparison of ROC curves and AUC of two DNN models. Baseline DNN model learned only the nodule area. Modified DNN model learned the nodule area as well as the surrounding parenchyma.
Figure 3Youden Index and threshold for modified DNN model.
Performance of model in diagnosing malignant nodules on test set and its subsets.
| AUC | Accuracy | Sensitivity | Specificity | Precision | |
|---|---|---|---|---|---|
| Test set | 0.924 (0.006) | 0.851 (0.018) | 0.881 (0.027) | 0.839 (0.031) | 0.673 (0.038) |
| HT subset | 0.924 (0.010) | 0.852 (0.026) | 0.881 (0.035) | 0.846 (0.036) | 0.540 (0.053) |
| Normal subset | 0.906 (0.010) | 0.843 (0.011) | 0.871 (0.033) | 0.822 (0.029) | 0.784 (0.024) |
| P-Value | 0.587 | 0.938 | 0.178 | <0.01 |
P-Value is that of diagnostic performance on HT subset versus normal subset; AUC, Areas under the ROC curve. All metrics were the average of 10-fold, presented as Mean (SD).
Performance metrics of DNN model in diagnosing malignant nodules of different sizes, evaluated on normal subset versus HT subset.
| HT Subset | Normal subset | ||
|---|---|---|---|
| Average size (SD) | 0.975 (0.51) | 1.25 (0.77) | |
| <5 mm | AUC | 0.915 | 0.895 |
| Accuracy | 0.83 | 0.825 | |
| Sensitivity | 0.859 | 0.82 | |
| Specificity | 0.828 | 0.826 | |
| Precision | 0.327 | 0.651 | |
| 5–10 mm | AUC | 0.909 | 0.895 |
| Accuracy | 0.82 | 0.846 | |
| Sensitivity | 0.902 | 0.868 | |
| Specificity | 0.794 | 0.822 | |
| Precision | 0.577 | 0.841 | |
| 10–20 mm | AUC | 0.883 | 0.907 |
| Accuracy | 0.832 | 0.837 | |
| Sensitivity | 0.854 | 0.878 | |
| Specificity | 0.824 | 0.792 | |
| Precision | 0.652 | 0.827 | |
| >20 mm | AUC | 0.871 | 0.845 |
| Accuracy | 0.836 | 0.801 | |
| Sensitivity | 0.722 | 0.724 | |
| Specificity | 0.864 | 0.837 | |
| Precision | 0.594 | 0.688 |
AUC, Areas under the ROC curve. All metrics were the average of 10-folds.
Figure 4Comparison of ROC curves and performance metrics of DNN model under different nodule sizes. (A, C) ROC curves and performance metrics for different nodule sizes under HT subset. (B, D) ROC curves and performance metrics for different nodule sizes under normal subset.
Performance of model versus radiologists of clinical experience <5 years, 5–10 years, and >10 years in diagnosing malignant nodules on the test set and its subsets.
| Diagnostic method | AUC | Accuracy | Sensitivity | Specificity | Precision | |
|---|---|---|---|---|---|---|
| Test set | Model | 0.924 | 0.851 | 0.881 | 0.839 | 0.673 |
| Radiologist <5 yr | 0.818 | 0.868 | 0.707 | 0.928 | 0.784 | |
| Radiologist 5–10 yr | 0.843 | 0.864 | 0.798 | 0.888 | 0.726 | |
| Radiologist >10 yr | 0.848 | 0.858 | 0.826 | 0.87 | 0.701 | |
| P-Value* | <0.01 | 0.781 | <0.01 | <0.01 | 0.016 | |
| P-Value** | <0.01 | 1.000 | 0.001 | 0.04 | 0.346 | |
| P-Value*** | <0.01 | 0.733 | 0.777 | 0.3 | 0.752 | |
| HT subset | Model | 0.924 | 0.852 | 0.881 | 0.846 | 0.540 |
| Radiologist <5 yr | 0.824 | 0.897 | 0.723 | 0.924 | 0.588 | |
| Radiologist 5–10 yr | 0.857 | 0.875 | 0.831 | 0.882 | 0.514 | |
| Radiologist >10 yr | 0.863 | 0.863 | 0.862 | 0.863 | 0.487 | |
| P-Value* | <0.01 | 0.401 | 0.001 | 0.003 | 0.226 | |
| P-Value** | <0.01 | 0.928 | 0.060 | 0.312 | 0.811 | |
| P-Value*** | <0.01 | 0.787 | 0.486 | 1.000 | 0.874 | |
| Normal subset | Model | 0.906 | 0.843 | 0.871 | 0.822 | 0.784 |
| Radiologist <5 yr | 0.825 | 0.842 | 0.712 | 0.938 | 0.893 | |
| Radiologist 5–10 yr | 0.846 | 0.853 | 0.797 | 0.894 | 0.847 | |
| Radiologist >10 yr | 0.844 | 0.85 | 0.804 | 0.885 | 0.837 | |
| P-Value* | <0.01 | 0.603 | 0 | 0 | 0.017 | |
| P-Value** | <0.01 | 0.916 | 0.01 | 0.035 | 0.179 | |
| P-Value*** | <0.01 | 0.833 | 0.015 | 0.072 | 0.272 |
P-Value* is that of model versus radiologist with <5 years’ clinical experience; P-Value** is that of model versus radiologist with 5–10 years’ clinical experience; P-Value*** is that of model versus radiologist with >10 years’ clinical experience; AUC, Areas under the ROC curve.
All metrics were the average of 10-folds.
Figure 5Performance of DNN model and three groups of radiologists in diagnosing malignant nodules under test set (A), normal subset (B), and HT subset (C).