| Literature DB >> 31780753 |
Vivian Y Park1, Kyunghwa Han1, Yeong Kyeong Seong2, Moon Ho Park2, Eun-Kyung Kim1, Hee Jung Moon1, Jung Hyun Yoon1, Jin Young Kwak3.
Abstract
Computer-aided diagnosis (CAD) systems hold potential to improve the diagnostic accuracy of thyroid ultrasound (US). We aimed to develop a deep learning-based US CAD system (dCAD) for the diagnosis of thyroid nodules and compare its performance with those of a support vector machine (SVM)-based US CAD system (sCAD) and radiologists. dCAD was developed by using US images of 4919 thyroid nodules from three institutions. Its diagnostic performance was prospectively evaluated between June 2016 and February 2017 in 286 nodules, and was compared with those of sCAD and radiologists, using logistic regression with the generalized estimating equation. Subgroup analyses were performed according to experience level and separately for small thyroid nodules 1-2 cm. There was no difference in overall sensitivity, specificity, positive predictive value (PPV), negative predictive value and accuracy (all p > 0.05) between radiologists and dCAD. Radiologists and dCAD showed higher specificity, PPV, and accuracy than sCAD (all p < 0.001). In small nodules, experienced radiologists showed higher specificity, PPV and accuracy than sCAD (all p < 0.05). In conclusion, dCAD showed overall comparable diagnostic performance with radiologists and assessed thyroid nodules more effectively than sCAD, without loss of sensitivity.Entities:
Mesh:
Year: 2019 PMID: 31780753 PMCID: PMC6882804 DOI: 10.1038/s41598-019-54434-1
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Characteristics of the validation data set.
| Characteristic | Experienced Radiologist | Inexperienced Radiologist | p-Value |
|---|---|---|---|
| Age (years)a | 47.9 ± 13.3 | 45.9 ± 13.0 | 0.239 |
| No. of men | 38 (22.4) | 14 (14.7) | 0.134 |
| No. of women | 132 (77.6) | 81 (85.3) | |
| Nodule size (mm)c | 16.14 ± 0.80 | 16.49 ± 1.07 | 0.795 |
| No. of benign nodules | 86 (46.7) | 44 (43.1) | 0.569 |
| No. of malignant nodules | 98 (53.3) | 58 (56.9) | |
aThe independent two-sample t-test was used for comparison.
bThe chi-square test was used for comparison.
cFor nodule-based comparison, the generalized estimating equations (GEE) method was used.
Overall diagnostic performance of CAD systems and radiologists for diagnosing thyroid malignancy in the validation data set (n = 286).
| Performance measures | Radiologists | dCAD | sCAD | p-Value | p-Value | ||
|---|---|---|---|---|---|---|---|
| Radiologists vs. dCAD | Radiologists vs. sCAD | dCAD vs. sCAD | |||||
| Sensitivity | 94.2% (89.3, 97.0) | 91.0% (85.5, 94.6) | 90.4% (84.7, 94.1) | 0.137 | |||
| Specificity | 76.9% (68.6, 83.6) | 80.0% (72.4, 85.9) | 58.5% (49.9, 66.6) | <0.001 | 0.431 | 0.001 | <0.001 |
| PPV | 83.1% (76.5, 88.0) | 84.5% (78.3, 89.2) | 72.3% (65.6, 78.2) | <0.001 | 0.552 | 0.001 | <0.001 |
| NPV | 91.7% (84.9, 95.7) | 88.1% (81.0, 92.9) | 83.5% (74.4, 89.8) | 0.084 | |||
| Accuracy | 86.4% (81.7, 90.0) | 86.0% (81.6, 89.5) | 75.9 (70.6, 80.5) | <0.001 | 0.862 | <0.001 | <0.001 |
Note – 95% confidence intervals are shown in parentheses.
Diagnostic performance according to the experience level of the radiologists.
| Performance measures | Radiologists | dCAD | sCAD | p | |||
|---|---|---|---|---|---|---|---|
| Radiologists vs. dCAD | Radiologists vs. sCAD | dCAD vs. | |||||
| Sensitivity | 92.9% (85.8, 96.6) | 90.8% (83.3, 95.2) | 90.8% (83.3, 95.2) | 0.599 | |||
| Specificity | 87.2% (78.3, 92.8) | 84.9% (75.9, 90.9) | 58.1% (47.7, 67.9) | <0.001 | 0.527 | <0.001 | <0.001 |
| PPV | 89.2% (81.5, 93.9) | 87.3% (79.4, 92.4) | 71.2% (62.7, 78.4) | <0.001 | 0.476 | <0.001 | <0.001 |
| NPV | 91.5% (83.1, 95.9) | 89.0% (80.2, 94.2) | 84.8% (73.2, 91.9) | 0.318 | |||
| Accuracy | 90.8% (83.3, 95.2) | 88.0% (82.6, 92.0) | 75.5% (68.8, 81.2) | <0.001 | 0.284 | <0.001 | <0.001 |
| Sensitivity | 96.6% (87.5, 99.1) | 91.4% (81.3, 96.3) | 89.7% (78.9, 95.3) | 0.145 | |||
| Specificity | 56.8% (41.6, 70.9) | 70.5% (55.9, 81.8) | 59.1% (44.0, 72.7) | 0.221 | |||
| PPV | 74.7% (62.9, 83.7) | 80.3% (68.8, 88.3) | 74.3% (62.5, 83.3) | 0.270 | |||
| NPV | 92.6% (74.8, 98.1) | 86.1% (70.7, 94.1) | 81.3% (63.9, 91.4) | 0.241 | |||
| Accuracy | 79.4% (70.1, 86.4) | 82.4% (73.9, 88.5) | 76.5% (67.2, 83.8) | 0.409 | |||
Note – 95% confidence intervals are shown in parentheses.
Figure 1Flowchart of study population.
Figure 2Example of ROI correction using semiautomatic segmentation by the first version of the CAD software (sCAD). (a) Image of a 51-year-old female patient with a 4.6-cm FNA-proven benign mass at the right thyroid. (b) When the user selected two points indicating the top-left and bottom-right points of a ROI box enclosing the thyroid nodule of interest, the initial semiautomatic segmentation results calculated by the CAD software included the adjacent normal thyroid tissue and trachea. (c) The user then manually selected a point at the correct nodule margin where the contour was miscalculated, and the CAD software correctly recalculated the contour of the nodule. The segmentation results shown in (c) were used for analysis. The nodule was assessed as possibly benign by both dCAD and sCAD.
Figure 3A conceptual figure of the development of the thyroid US CAD system using deep learning.