Literature DB >> 25960851

Application of texture analysis method for classification of benign and malignant thyroid nodules in ultrasound images.

Ali Abbasian Ardakani1, Akbar Gharbali2, Afshin Mohammadi3.   

Abstract

BACKGROUND: The aim of this study was to evaluate computer aided diagnosis (CAD) system with texture analysis (TA) to improve radiologists' accuracy in identification of thyroid nodules as malignant or benign.
METHODS: A total of 70 cases (26 benign and 44 malignant) were analyzed in this study. We extracted up to 270 statistical texture features as a descriptor for each selected region of interests (ROIs) in three normalization schemes (default, 3s and 1%-99%). Then features by the lowest probability of classification error and average correlation coefficients (POE+ACC), and Fisher coefficient (Fisher) eliminated to 10 best and most effective features. These features were analyzed under standard and nonstandard states. For TA of the thyroid nodules, Principle Component Analysis (PCA), Linear Discriminant Analysis (LDA) and Non-Linear Discriminant Analysis (NDA) were applied. First Nearest-Neighbour (1-NN) classifier was performed for the features resulting from PCA and LDA. NDA features were classified by artificial neural network (A-NN). Receiver operating characteristic (ROC) curve analysis was used for examining the performance of TA methods.
RESULTS: The best results were driven in 1-99% normalization with features extracted by POE+ACC algorithm and analyzed by NDA with the area under the ROC curve ( Az) of 0.9722 which correspond to sensitivity of 94.45%, specificity of 100%, and accuracy of 97.14%.
CONCLUSION: Our results indicate that TA is a reliable method, can provide useful information help radiologist in detection and classification of benign and malignant thyroid nodules.

Entities:  

Keywords:  Artificial Intelligence; Diagnosis, Computer-Assisted; thyroid nodule; ultrasonography

Year:  2015        PMID: 25960851      PMCID: PMC4411473     

Source DB:  PubMed          Journal:  Iran J Cancer Prev        ISSN: 2008-2398


Introduction

Thyroid nodules have an incidence of 33-68% in the general population and can be malignant or benign; however, 5-15% of these nodules are malignant [1, 2]. Although the incident of palpable thyroid nodules in the adult population ranges from 4 to 8%, at autopsy 50% of the population has thyroid nodules [3]. Many techniques like physical examination, fine needle aspiration cytological examination and imaging are used to detect thyroid nodules. A fine needle aspiration biopsy (FNAB) is a non-surgical and accepted diagnostic method for confirming the nature of the nodules. However, it leads to 10% to 20% thyroid nodules with no diagnosis and 15-30% of results are indeterminate. Non-diagnostic FNAB indicates that thyroid nodules have a high probability of malignancy and aspiration should be repeated [2, 4]. Radiographs, computed tomography (CT), magnetic resonance imaging (MRI), positron emission tomography (PET), ultrasound and scintigraphy are diagnostic imaging modalities available for the evaluation of thyroid nodules. Ultrasound is the chosen imaging technique for assessment of thyroid nodules, owing to being in real time, radiation-free, non-invasive, widely available, low cost and ability to detect non-palpable nodules [5]. Nowadays, high frequency transducers provide high definition images and spatial resolution. Grey-scale ultrasound contains useful features to quantitatively characterize thyroid nodules such as texture features, whereas Doppler provides an intra-nodal vascular pattern [6, 7]. Although there is no precise and mathematical definition of texture, simply conceived by human. Image texture can be described by various patterns: coarse, fine, smooth, spatial variations in pixel intensity of objects within an image. Contrary to the computer, human detection of textures may be qualitative and limited by vision. Likewise such patterns within the image may be different but will be perceived by humans as the same texture. Computerized texture analysis (TA) is a mathematical technique that increases and provides rich quantification and information of spatial grey-level intensities in pixels within the image [8-10]. Initial diagnosis by a radiologist is qualitative and may contain errors. To increase and improve the confidence of a radiologist in differentiating between benign and malignant thyroid nodules, computer aided diagnosis (CAD) systems have been developed. Hong et al. [11] used real-time ultrasound elastography to classify benign from malignant thyroid nodules with a sensitivity of 88%, specificity of 90%, PPV of 81%, NPV of 93% and an area under the ROC curve (Az) of 0.94. Ding et al. reported that sensitivity, specificity and accuracy were 94.6%, 92.8% and 93.6% respectively when the energy features of co-occurrence matrix were used. Tsantis et al. [12] Tsantis et al. used speckle-free feature set based on nodules’ boundary shape and ‘wavelet local maxima located’ for differentiating between benign and malignant thyroid nodules with a sensitivity of 93%, specificity of 98%and an area under the ROC curve (Az ) of 0.96. Gopinath et al. [13] utilized texture features derived from the Gabor filter bank at various wavelengths and angles for the classification of benign and malignant thyroid nodules. They used SVM classifier and achieved the sensitivity of 95%, specificity of 100% and accuracy of 96.7%. Gopinath et al. [14] reported that accuracy, sensitivity, and specificity were 90%, 100%, and 90% respectively when two-level wavelet decomposition based on the texture characteristics of the thyroid cells was used to differentiate between benign and malignant thyroid nodules. In this study, we evaluated a non-invasive method based on TA to improve radiologists’ diagnosis in the classification of thyroid nodules as benign or malignant in ultrasound images. Although our study is not the first attempt to use a TA method, as far as we know no other study has employed TA via all groups and classes of texture features.

Materials and Methods

Ultrasound images of 70 patients (53 female, and 17 male) comprising 26 benign (colloid and adenomatous nodules) and 44 malignant thyroid nodules (41 papillary thyroid carcinoma and 3 hurtle cell carcinoma) which had been approved by FNAB. Ultrasound guided needle aspiration were used based on clinical indication. Thyroid nodules images saved in ultrasonography system and the images were compared with biopsy and the reference for statistical analysis. The databases were selected in the time interval from June 2013 to August 2014 in the Radiology department of Imam Khomeini hospital, Urmia, Iran. Ultrasound images were acquired by Accuvix V20 sonography system (Medison, Seoul, Korea) equipped by L5-13IS linear array transducer working in the range 5-13MHz. Ultrasound images were input to MaZda software (MaZda 4.6, The Technical University of Lodz, Institute of Electronics) for TA [15]. We have selected one region of interest (ROI) for each ultrasound image; hence 70 non-overlapping ROIs (26 benign and 44 malignant) were selected in Mazda for TA. We evaluated and analyzed texture features coming from six main categories for differentiating between benign and malignant thyroid nodules: Histogram (statistical class), Absolute gradient (statistical class), Run-length matrix (statistical class), Co-occurrence matrix (statistical class), Auto-Regressive model (model class) and Wavelets (transform class) [9]. Up to 270 texture features are extracted for each ROI in three normalization schemes. N1: default. Default images had the same appearance. An intensity range will be from 1 to 2k, where k is the number of the bits per pixel. N2: 3s, limiting image intensities in the range [μ-3σ, μ+3σ] where μ is the mean grey level value and σ the standard deviation of the grey levels’ intensity inside the ROI. N3: the ROIs grey level ranges between the darkness level at which the image’s accumulated histogram is equal to 1% of its total to the brightness level, where accumulated histogram is equal to 99% of its total. Automated Feature Selection All 270 parameters are not suitable and effective for differentiating between benign and malignant thyroid nodules. By using two reduction algorithms, Fisher and lowest probability of classification error and average correlation coefficients (POE+ACC), these parameters were reduced to the best ten texture features to show the best discrimination between benign and malignant thyroid nodules [16, 17]. Fisher algorithm selected up to ten features, with the highest being a ratio of between-class variance (D) to within-class variance (V). Where μk and VK are mean and variance of class k, F is Fisher coefficient and PK is probability of class k. Maximization of Fisher coefficient is desirable. A POE+ACC algorithm produced set up ten features with minimization probability of classification error (POE) and average correlation coefficients (ACC) between chosen features. The first feature, f1, is selected to minimize POE for all classes: Where POE(fi) is the classification error probability for feature . This probability is defined as follows: Figure 1 shows the sample distribution of feature fi for two classes. The samples marked can be assigned to both class 1 or 2. In this case, these samples cannot be properly classified. The next feature (second feature) is then selected by minimizing the sum of all features except, excluding f1:
Figure 1

A sample distribution of feature fi for two classes. The samples marked cannot be properly classified.

The is the absolute value of the correlation coefficient between the previously calculated feature f1 and the new feature fi. The n-th feature is selected by minimizing the following sum for all remaining features except for the already chosen features: Where the averaged sum is extended for all correlation coefficients between previously selected features and feature fi. This sum is defined as an average correlation coefficient (ACC). In brief, the POE+ACC algorithm introduces ten features with high discriminatory potential and a least correlation with features that are already selected. Texture Analysis Methods LDA, PCA and NDA investigate these features and transform data to lower-dimensional spaces, and increase the discriminative power under two standardization states (standard and nonstandard) [18, 19]. PCA and LDA features were classified by 1-NN classifier and A-NN classifier was performed for the features resulting from NDA [20]. ROC curve analysis was employed to compare the discrimination performance of the applied TA methods using the area under the ROC curve (AZ ) [21]. In addition, six objectives that indices sensitivity (SEN), specificity (SPC), overall accuracy (ACC), positive predictive value (PPV) and negative predictive value (NPV)were also applied to assess the performance of the proposed methods. These indices are defined as follows: Where NTP is the number of malignant cases and NTN is the number of benign cases that have been diagnosed correctly. NFP and NFN are the number of incorrectly diagnosed benign and malignant cases, respectively. Figure 2 shows the steps of CAD processing.
Figure 2

Overview of general texture analysis process in the ultrasound thyroid image.

Results

In this study a total of 70 patients including 26 benign and 44 malignant thyroid nodules were approved by FNAB to ensure the classification accuracy of the proposed method. As mentioned previously, in each normalization scheme the same TA method and the same classifier were applied to the set of features resulting from both POE+ACC and Fisher algorithm. In default normalization, we found out the features extracted by POE+ACC algorithm and analyzed by NDA to be higher than the other two TA methods, PCA and LDA. In this case we reached a sensitivity of 94.28%, specificity of 96.42%, accuracy of 95.6%, PPV of 94.28% and NPV of 96.42% (Table 1). In terms of the normalization role, in 3 sigma normalization the NDA has shown the best discrimination power with features resulting just from the Fisher algorithm, where sensitivity, specificity, accuracy, PPV and NPV were 86.36%, 100%, 91.42%, 100% and 81.25%, respectively (Table 2). In 1-99% normalization and features resulting from POE+ACC algorithm analyzed by NDA have the highest discrimination power to distinguish between benign and malignant thyroid nodules, with which sensitivity, specificity, accuracy, PPV and NPV were 94.45%, 100%, 97.14%, 100% and 92.86% respectively (Table 3).
Table 1

Summary of performance for different features and Method of feature reduction in default normalization.

Feature selection methodMethod of feature reductionSEN (%)SPC (%)ACC (%)PPV (%)NPV (%)AZvalue
FisherN. S. PCA72.2238.4651.4261.935.710.5534
S. PCA79.5473.0777.1483.3367.85 0.763
N. S. LDA81.8265.3875.7180680.736
S. LDA79.5469.2375.7181.3966.67.7438
NDA94.2896.4295.694.2896.420.9535
POE+ACCN. S. PCA85.7185.7185.7178.9490.560.8571
S. PCA85.7187.586.8181.0890.740.866
N. S. LDA91.4291.0791.286.4894.440.9124
S. LDA91.4291.0791.286.4894.440.9124
NDA94.2898.2196.797.0596.490.9625

SEN = sensitivity; SPC = specificity; ACC = accuracy; PPV = positive predictive value; NPV = negative predictive value; AZ= area under ROC curve.

Table 2

Summary of performance for different features and Method of feature reduction in 3sigma normalization.

Feature selection methodMethod of feature reductionSEN (%)SPC (%)ACC (%)PPV (%)NPV (%)AZvalue
FisherN. S. PCA7561.547076.7459.260.6827
S. PCA77.2769.2374.288164.280.7325
N. S. LDA84.0965.3877.1480.4370.630.7473
S. LDA84.0965.3877.1480.4370.630.7473
NDA86.3610091.4210081.250.9318
POE+ACCN. S. PCA70.4553.8564.2872.1051.850.6215
S. PCA81.825872.8677.5565.220.6991
N. S. LDA77.2857.77075.55600.6749
S. LDA77.2857.77075.55600.6749
NDA93.1892.392.8595.3488.890.9274

SEN = sensitivity; SPC = specificity; ACC = accuracy; PPV = positive predictive value; NPV = negative predictive value; AZ= area under ROC curve.

Table 3

Summary of performance for different features and Method of feature reduction in 1-99% normalization.

Feature selection methodMethod of feature reductionSEN (%)SPC (%)ACC (%)PPV (%)NPV (%)value
FisherN. S. PCA72.7353.8565.7172.7353.850.6329
S. PCA72.7353.8565.7172.7353.850.6329
N. S. LDA84.0980.7782.8588.01750.8243
S. LDA84.0980.7782.8588.01750.8243
NDA86.3610091.4310081.250.9118
POE+ACCN. S. PCA72.7353.8568.5772.7353.850.6279
S. PCA86.365072.8674.568.420.6818
N. S. LDA81.8269.2377.1481.8269.230.7552
S. LDA81.8269.2377.1481.8269.230.7552
NDA94.4510097.1410092.860.9722

SEN = sensitivity; SPC = specificity; ACC = accuracy; PPV = positive predictive value; NPV = negative predictive value; AZ= area under ROC curve.

Figure 3, 4 and 5 respectively represent ROC curves plotted on the same graph for each normalization and features resulting from reduction method to compare the discriminating power for classification of benign and malignant thyroid nodules. In general, the NDA TA method has an advantage over PCA and LDA in each state with respect to a greater AZ value.
Figure 3

The diagram of the ROC curve for each texture analysis method in default. (A) Fisher features. (B) POE+ACC features.

Figure 4

The diagram of the ROC curve for each texture analysis method in 3sigma normalization. (A) Fisher features. (B) POE+ACC features.

Figure 5

The diagram of the ROC curve for each texture analysis method in 1%-99% normalization. (A) Fisher features. (B) POE+ACC features.

Discussion

In this study, three normalizations, two reduction algorithms, two standardization states and three TA methods provide a total of 36 states for each ROI case study. In our study, LDA has good performance with the features resulting from POE+ACC in default normalization in which value is 0.9124 (Figure 3). However, PCA has undesirable results in comparison with LDA and NDA. From Figure 3, 4 and 5 one can see that normalization has reflected a positive impact on the performance of the classifier. Hence the best performance in this study was obtained in 1-99% with NDA. The best results were driven in 1-99% normalization with features extracted by POE+ACC algorithm and analyzed by NDA with AZ value of 0.9722, which corresponds to a sensitivity of 94.45%, specificity of 100%, and accuracy of 97.14%. The ROC curve analysis indicates that of all features in the reduction methods and normalizations, NDA is privileged over PCA and LDA in terms of differentiating between benign and malignant nodules (Figure 3, 4 and 5). The proposed method demonstrates a more reliable performance in comparison with other studies [11-14, 22] already conducted on the subject of ultrasonic differentiation between benign and malignant thyroid nodules, with an accuracy ranging from 90% to 96.7%, sensitivity ranging from 88% to 94.6% and specificity ranging from 90% to 100%. In this study, we assessed all six groups of texture features and achieved higher accuracy in respect to the studies that employed one texture subset or combined some texture subsets to differentiate between benign and malignant thyroid nodules. Our results also indicate that texture analysis possess significantly more discriminative ability than the other methods, morphology and elastography [11, 12]. The ten texture parameters leading to the best performance in our study are as follows: Co-occurrence matrix-based features: S (0, 2) Sum entropy (S (0,2) SumEntrp) and S (1,1) Sum of squares (S (1,1) SumOfSqs) where S (i, j) shows the direction of matrix construction and inter pixel distance i along the rows and j along the columns of matrix; Energy of wavelet coefficient in ‘low-high’ energy components in first level wavelet decomposition (WavEnLH_s-1),‘low-low’ energy components in second level wavelet decomposition (WavEnLL_s-2),‘low-high’ energy components in third level wavelet decomposition (WavEnLH_s-3), ‘low-high’ energy components in fifth level wavelet decomposition (WavEnLH_s-5) and ‘high-low’ energy components in sixth level wavelet decomposition (WavEnHL_s-6); Absolute gradient matrix-based features: gradient Kurtosis (GrKurtosis); Auto-Regressive model: Teta1, which is the vector of AR model parameters; Histogram-based features: Kurtosis. Our research indicates that a combination of texture features (Co-occurrence matrix, wavelet, Absolute gradient matrix, Auto-Regressive model and Histogram-based texture features) has a higher level of accuracy than the previous studies that employed one texture group [12-14, 22]. In all conditions, feature standardization affects PCA and leads to an improvement in performance. It does not have any impact on LDA except in default and Fisher features. In order to decrease the training time in ANN classifier, all input feature parameters were standardized previously in NDA. Endocrinologist referred the cases based on clinical indication. Since this study is based on texture analysis and our gold standard is pathology, other nodule characteristic feature does not have any additional information and are not considered. This method will not only be used as an alternative for biopsy, but also helps radiologist to select high malignancy risk of thyroid nodules for FNAB in patients with multi-nodular goiter which is the most common endocrine disease. One of the most important clinical challenges is that which nodules in patient with multi-nodular goiter should be FNAB and our methods can aid to radiologist to select the target nodule between multiple nodules in patients with multi-nodular goiter. The main advantage of this method is that there is no operator dependency, because analysis is performed by the computer and also it does not have any additional time or costs. In this study some limitations should be clearly noted. First, the data group was small, so further investigation with a larger data set is needed. Second, our ultrasound image classifications were performed in comparison with FNAB, although FNAB is highly sensitive for the diagnosis of neoplastic thyroid nodules, for definitive results surgical pathology may be needed. Third, some of our data from the study were excluded due to indeterminate FNAB. Fourth, feature combination tools were not available in MaZda. For example, averaging run-length matrix features of four different orientations were hard to perform. Fifth, only nodules larger than 15 mm were used in this study, because the sizes of the ROIs for texture analysis are 12 mm × 12 mm. Sixth, in this preliminary study all the patients suspicious to thyroiditis or any other inflammatory process were excluded to avoid external perturbation of the blood flow perfusing the thyroid gland and affect texture.

Conclusion

MaZda software was originally developed in 1998 at the Institute of Electronics, Technical University of Lodz for the purpose of automatic TA of magnetic resonance imaging [15]. We also used it for ultrasound images. Generally, our results indicate that MaZda software can provide useful information contributing to the discrimination of thyroid nodules by ultrasound images, and also has the potential to help radiologists in the detection and classification of benign and malignant thyroid nodules.
  15 in total

1.  Nondiagnostic thyroid fine-needle aspiration cytology: management dilemmas.

Authors:  L S Chow; H Gharib; J R Goellner; J A van Heerden
Journal:  Thyroid       Date:  2001-12       Impact factor: 6.568

2.  Shear wave elastography: a new ultrasound imaging mode for the differential diagnosis of benign and malignant thyroid nodules.

Authors:  F Sebag; J Vaillant-Lombard; J Berbis; V Griset; J F Henry; P Petit; C Oliver
Journal:  J Clin Endocrinol Metab       Date:  2010-09-29       Impact factor: 5.958

3.  Management of thyroid nodules detected at US: Society of Radiologists in Ultrasound consensus conference statement.

Authors:  Mary C Frates; Carol B Benson; J William Charboneau; Edmund S Cibas; Orlo H Clark; Beverly G Coleman; John J Cronan; Peter M Doubilet; Douglas B Evans; John R Goellner; Ian D Hay; Barbara S Hertzberg; Charles M Intenzo; R Brooke Jeffrey; Jill E Langer; P Reed Larsen; Susan J Mandel; William D Middleton; Carl C Reading; Steven I Sherman; Franklin N Tessler
Journal:  Radiology       Date:  2005-12       Impact factor: 11.105

Review 4.  Texture analysis of medical images.

Authors:  G Castellano; L Bonilha; L M Li; F Cendes
Journal:  Clin Radiol       Date:  2004-12       Impact factor: 2.350

5.  Morphological and wavelet features towards sonographic thyroid nodules evaluation.

Authors:  Stavros Tsantis; Nikos Dimitropoulos; Dionisis Cavouras; George Nikiforidis
Journal:  Comput Med Imaging Graph       Date:  2008-12-25       Impact factor: 4.790

6.  Revised American Thyroid Association management guidelines for patients with thyroid nodules and differentiated thyroid cancer.

Authors:  David S Cooper; Gerard M Doherty; Bryan R Haugen; Bryan R Hauger; Richard T Kloos; Stephanie L Lee; Susan J Mandel; Ernest L Mazzaferri; Bryan McIver; Furio Pacini; Martin Schlumberger; Steven I Sherman; David L Steward; R Michael Tuttle
Journal:  Thyroid       Date:  2009-11       Impact factor: 6.568

7.  Receiver operating characteristic (ROC) analysis: basic principles and applications in radiology.

Authors:  A R van Erkel; P M Pattynama
Journal:  Eur J Radiol       Date:  1998-05       Impact factor: 3.528

8.  Support Vector Machine based diagnostic system for thyroid cancer using statistical texture features.

Authors:  B Gopinath; N Shanthi
Journal:  Asian Pac J Cancer Prev       Date:  2013

9.  Ultrasonographic characteristics of thyroid nodules: prediction of malignancy.

Authors:  E Koike; S Noguchi; H Yamashita; T Murakami; A Ohshima; H Kawamoto; H Yamashita
Journal:  Arch Surg       Date:  2001-03

10.  Risk of malignancy in nonpalpable thyroid nodules: predictive value of ultrasound and color-Doppler features.

Authors:  Enrico Papini; Rinaldo Guglielmi; Antonio Bianchini; Anna Crescenzi; Silvia Taccogna; Francesco Nardi; Claudio Panunzi; Roberta Rinaldi; Vincenzo Toscano; Claudio M Pacella
Journal:  J Clin Endocrinol Metab       Date:  2002-05       Impact factor: 5.958

View more
  4 in total

1.  CAD system based on B-mode and color Doppler sonographic features may predict if a thyroid nodule is hot or cold.

Authors:  Ali Abbasian Ardakani; Ahmad Bitarafan-Rajabi; Afshin Mohammadi; Sepideh Hekmat; Aylin Tahmasebi; Mohammad Bagher Shiran; Ali Mohammadzadeh
Journal:  Eur Radiol       Date:  2019-01-09       Impact factor: 5.315

Review 2.  Radiomic Detection of Malignancy within Thyroid Nodules Using Ultrasonography-A Systematic Review and Meta-Analysis.

Authors:  Eoin F Cleere; Matthew G Davey; Shane O'Neill; Mel Corbett; John P O'Donnell; Sean Hacking; Ivan J Keogh; Aoife J Lowery; Michael J Kerin
Journal:  Diagnostics (Basel)       Date:  2022-03-24

3.  The Diagnostic Efficiency of Ultrasound Computer-Aided Diagnosis in Differentiating Thyroid Nodules: A Systematic Review and Narrative Synthesis.

Authors:  Nonhlanhla Chambara; Michael Ying
Journal:  Cancers (Basel)       Date:  2019-11-08       Impact factor: 6.639

4.  Automated Breast Volume Scanner (ABVS)-Based Radiomic Nomogram: A Potential Tool for Reducing Unnecessary Biopsies of BI-RADS 4 Lesions.

Authors:  Shi-Jie Wang; Hua-Qing Liu; Tao Yang; Ming-Quan Huang; Bo-Wen Zheng; Tao Wu; Chen Qiu; Lan-Qing Han; Jie Ren
Journal:  Diagnostics (Basel)       Date:  2022-01-12
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.