| Literature DB >> 30774015 |
Antonin Prochazka1, Sumeet Gulati2, Stepan Holinka3, Daniel Smutek1,3.
Abstract
In recent years, several computer-aided diagnosis systems emerged for the diagnosis of thyroid gland disorders using ultrasound imaging. These systems based on machine learning algorithms may offer a second opinion to radiologists by evaluating a malignancy risk of thyroid tissue, thus increasing the overall diagnostic accuracy of ultrasound imaging. Although current computer-aided diagnosis systems exhibit promising results, their use in clinical practice is limited. One of the main limitations is that the majority of them use direction-dependent features. Our intention has been to design a computer-aided diagnosis system, which will use only direction-independent features, that is, it will not be dependent on the orientation and the inclination angle of the ultrasound probe when acquiring the image. We have, therefore, applied histogram analysis and segmentation-based fractal texture analysis algorithm, which calculates direction-independent features only. In our study, 40 thyroid nodules (20 malignant and 20 benign) were used to extract several features, such as histogram parameters, fractal dimension, and mean brightness value in different grayscale bands (obtained by 2-threshold binary decomposition). The features were then used in support vector machine and random forests classifiers to differentiate nodules into malignant and benign classes. Using leave-one-out cross-validation method, the overall accuracy was 92.42% for random forests and 94.64% for support vector machine. Results show that both methods are useful in practice; however, support vector machine provides better results for this application. Proposed computer-aided diagnosis system can provide support to radiologists in their current diagnosis of thyroid nodules, whereby it can optimize the overall accuracy of ultrasound imaging.Entities:
Keywords: classification; computer-aided diagnosis; texture analysis; thyroid; ultrasound
Mesh:
Year: 2019 PMID: 30774015 PMCID: PMC6379796 DOI: 10.1177/1533033819830748
Source DB: PubMed Journal: Technol Cancer Res Treat ISSN: 1533-0338
Comparison of Most Recently Published Studies With Our Study.
| Study | Subjects Included | Features | Classifier | Accuracy (%) |
|---|---|---|---|---|
|
[ | 69 benign, 56 malignant | Statistical and textural features | SVM | 93.6 |
|
[ | 10 benign, 10 malignant | Texture and DWT features form CEUS images | kNN | 98.9 |
|
[ | 10 benign, 10 malignant | Texture and DWT features | AdaBoost | 100 |
|
[ | 10 benign, 10 malignant | Fractal dimension, Fourier spectrum descriptor, local binary patterns, Laws texture energy form 3-D HRUS and CEUS images | Gaussian mixture model, Fuzzy, SVM | 98.1-100 |
|
[ | 29 benign, 30 malignant | Texture features | SVM | 98.3 |
|
[ | 211 benign, 31 malignant | Texture features with safe-level SMOTE | C4.5 | 94.3 |
| Our study | 20 benign, 20 malignant | Histogram features, SFTA | SVM, RF | 94.64 |
Abbreviations: CEUS, contrast-enhanced ultrasonography; DWT, discrete wavelet transform; HRUS, high-resolution ultrasound; kNN, k-nearest neighbor; RF, random forests; SFTA, segmentation-based fractal texture analysis; SMOTE, Synthetic Minority Oversampling Technique; SVM, support vector machine.
Figure 1.Sample images of benign and malignant nodules. Dimensions of a nodule are marked with crosses. (A) Malignant nodule from Phillips device; (B) malignant nodule from GE device; (C) benign nodule from Philips device; (D) benign nodule from GE device.
Figure 2.Output of the SFTA algorithm with the number of Otsu’s thresholds n = 4 (labeled T1-T4). Top left: An example of a benign nodule. Also in the top row: Output of 2 threshold band decomposition. Bottom row: Result of simple thresholding using T1 to T4. In total, using the SFTA algorithm, we get 2n − 1 binary images. SFTA indicates segmentation-based fractal texture analysis.
Figure 3.Dependency of total accuracy (evaluated by LOOCV) on number of thresholds of 2-threshold binary decomposition for both RF and SVM classifiers. LOOCV indicates leave-one-out cross-validation; RF, random forests; SVM, support vector machine.
Figure 4.Graph showing importance of features in random forests classifier when number of thresholds of SFTA is equal to 6. Importance score for each variable is scaled to have a maximum value of 100. Principle of the importance calculation is as follows: For each tree, the prediction accuracy of the out-of-bag portion of the data is recorded. The same is then done again after permuting each predictor variable. Importance score for variable is the difference between the 2 accuracies, averaged over all trees and normalized by the standard error.[36] SFTA indicates segmentation-based fractal texture analysis.