| Literature DB >> 30621704 |
Lei Wang1,2, Shujian Yang1, Shan Yang3, Cheng Zhao4, Guangye Tian3, Yuxiu Gao4, Yongjian Chen3, Yun Lu5,6.
Abstract
BACKGROUND: In this study, images of 2450 benign thyroid nodules and 2557 malignant thyroid nodules were collected and labeled, and an automatic image recognition and diagnosis system was established by deep learning using the YOLOv2 neural network. The performance of the system in the diagnosis of thyroid nodules was evaluated, and the application value of artificial intelligence in clinical practice was investigated.Entities:
Keywords: Artificial intelligence; Computer-aided diagnosis systems; Thyroid nodules; Ultrasound; YOLOv2 neural network
Mesh:
Year: 2019 PMID: 30621704 PMCID: PMC6325802 DOI: 10.1186/s12957-019-1558-z
Source DB: PubMed Journal: World J Surg Oncol ISSN: 1477-7819 Impact factor: 2.754
Fig. 1Improved network structure of the YOLOv2 network
Summary of demographic feature
| Pathological findings | |||
|---|---|---|---|
| Benign | Malignant | ||
| No. of patients | 95 | 181 | |
| Age, years, | 50.0 ± 10.6 | 44.3 ± 11.5 | < 0.001 |
| Sex | 0.337 | ||
| Male | 15 (15.79%) | 38 (20.99%) | |
| Female | 80 (84.21%) | 143 (79.01%) | |
| No. of nodules | 109 | 242 | |
| Size, cm | 3.37 ± 1.81 | 1.17 ± 0.87 | < 0.001 |
| < 0.5 | 8 (7.34%) | 23 (9.50%) | |
| 0.5–1.0 | 13 (11.93%) | 107 (44.21%) | |
| 1.0–4.0 | 47 (43.12%) | 106 (43.80%) | |
| ≥ 4.0 | 41 (37.61%) | 6 (2.48%) | |
Fig. 2a, b We analyzed the performance of benign and malignant nodules target detectors using the Precision-Recall curve. In order to get the PR curve, we set the IOU between the bounding box and ground truth greater than 0.3 to be true positive, otherwise false positive. The AP value is used to evaluate the performance of the detector. The AP value can be obtained by calculating the area under the PR curve. As can be seen from the PR curve, the AP values of malignant and benign nodules were 87.94% and 83.90% respectively, and the average (mAP) of the two types of AP was 85.92%
Fig. 3a, b Images of benign nodules. c, d Images of malignant nodules. The artificial intelligence automatic image recognition and diagnosis system can automatically identify thyroid nodules, label the nodules with rectangular frames, and discriminate the nodules and output
Fig. 4Areas under the ROC curve between the artificial intelligence system and radiologist were compared using the method of DeLong et al. The results indicated that the artificial intelligence system had a higher accuracy than radiologists (p = 0.0434)
Performances of the artificial intelligence system and ultrasound radiologists in diagnosing thyroid cancer
| Size, cm | AI system | Radiologist | ||
|---|---|---|---|---|
| Sensitivity, % | < 1.0 | 88.46 | 93.08 | 0.284 |
| 1.0–4.0 | 94.34 | 95.28 | 1.000 | |
| ≥ 4.0 | 66.67 | 83.33 | 1.000 | |
| All | 90.5 | 93.80 | 0.237 | |
| Specificity, % | < 1.0 | 57.14 | 14.29 | 0.009 |
| 1.0–4.0 | 95.74 | 89.36 | 0.435 | |
| ≥ 4.0 | 100 | 97.56 | 1.000 | |
| All | 89.91 | 77.98 | 0.026 | |
| PPV, % | < 1.0 | 92.74 | 87.05 | 0.156 |
| 1.0–4.0 | 98.04 | 95.28 | 0.446 | |
| ≥ 4.0 | 100 | 83.33 | 1.000 | |
| All | 95.22 | 90.44 | 0.053 | |
| NPV, % | < 1.0 | 44.44 | 25 | 0.305 |
| 1.0–4.0 | 88.24 | 89.36 | 1.000 | |
| ≥ 4.0 | 95.35 | 97.56 | 1.000 | |
| All | 80.99 | 85 | 0.477 | |
| Accuracy, % | < 1.0 | 84.11 | 82.12 | 0.759 |
| 1.0–4.0 | 94.77 | 93.46 | 0.809 | |
| ≥ 4.0 | 95.74 | 95.74 | 1.000 | |
| All | 90.31 | 88.89 | 0.621 |
PPV positive predictive value, NPV negative predictive value
Fig. 5A 54-year-old woman with left thyroid nodule. a US images show a 0.3 cm × 0.2 cm solid hypoechoic nodule with macrocalcification in left thyroid gland. Radiologist diagnosed it as a benign nodule. b AI system diagnosed it as a malignant nodule. Histology confirmed diagnosis of papillary thyroid carcinoma