Literature DB >> 34149343

Characterizing Brain Tumor Regions Using Texture Analysis in Magnetic Resonance Imaging.

Yun Yu1,2, Xi Wu1, Jiu Chen2, Gong Cheng3, Xin Zhang1,4, Cheng Wan1, Jie Hu1, Shumei Miao1,4, Yuechuchu Yin1,4, Zhongmin Wang1,4, Tao Shan1,4, Shenqi Jing1,4, Wenming Wang1,4, Jianjun Guo1,4, Xinhua Hu5, Yun Liu1,4.   

Abstract

PURPOSE: To extract texture features from magnetic resonance imaging (MRI) scans of patients with brain tumors and use them to train a classification model for supporting an early diagnosis.
METHODS: Two groups of regions (control and tumor) were selected from MRI scans of 40 patients with meningioma or glioma. These regions were analyzed to obtain texture features. Statistical analysis was conducted using SPSS (version 20.0), including the Shapiro-Wilk test and Wilcoxon signed-rank test, which were used to test significant differences in each feature between the tumor and healthy regions. T-distributed stochastic neighbor embedding (t-SNE) was used to visualize the data distribution so as to avoid tumor selection bias. The Gini impurity index in random forests (RFs) was used to select the top five out of all features. Based on the five features, three classification models were built respectively with three machine learning classifiers: RF, support vector machine (SVM), and back propagation (BP) neural network.
RESULTS: Sixteen of the 25 features were significantly different between the tumor and healthy areas. Through the Gini impurity index in RFs, standard deviation, first-order moment, variance, third-order absolute moment, and third-order central moment were selected to build the classification model. The classification model trained using the SVM classifier achieved the best performance, with sensitivity, specificity, and area under the curve of 94.04%, 92.3%, and 0.932, respectively.
CONCLUSION: Texture analysis with an SVM classifier can help differentiate between brain tumor and healthy areas with high speed and accuracy, which would facilitate its clinical application.
Copyright © 2021 Yu, Wu, Chen, Cheng, Zhang, Wan, Hu, Miao, Yin, Wang, Shan, Jing, Wang, Guo, Hu and Liu.

Entities:  

Keywords:  MRI; SVM; brain tumor; t-SNE; texture analysis

Year:  2021        PMID: 34149343      PMCID: PMC8209330          DOI: 10.3389/fnins.2021.634926

Source DB:  PubMed          Journal:  Front Neurosci        ISSN: 1662-453X            Impact factor:   4.677


Introduction

Brain cancer remains a diagnostic challenge for clinicians and radiologists because malignant brain tumor cells can invade into the neighboring cells in the brain and spinal cord with fuzzy borders and have a high progression rate (Wild, 2014; Vargo, 2017; Tandel et al., 2019). Treatment of advanced brain tumors is difficult; therefore, early diagnosis is of great importance in clinical settings. The approaches currently employed for the diagnosis of brain tumors include both invasive and noninvasive methods. Although the invasive diagnostic method—biopsy—is viewed as the golden standard for the diagnosis of brain tumors, noninvasive diagnostic methods including magnetic resonance imaging (MRI) are safer and more widely used (Zhao and Jia, 2016). Determination of the accurate location and segmentation of the brain tumor on MRI scans are essential for treatment planning (Mahaley et al., 1989). Several studies have found MRI features capable of differentiating between the tumor and healthy regions (May et al., 1991; Drape et al., 1992; Mullen and Huang, 2017). However, in most cases, the diagnostic accuracy only depends on the proficiency of the medical practitioner reading the MRI scan (Hayward et al., 2008). Many complex patterns, also called image textures, remain imperceptible to the naked eye. Texture analysis is a practical approach for image pattern recognition by extracting objective information through the analysis of the spatial distribution of intensity variations on images (Haralick and Shanmugam, 1973; Haralick, 1979). Furthermore, several studies have confirmed the efficiency of texture analysis (Bayanati et al., 2015; Hodgdon et al., 2015; Skogen et al., 2016). To increase the diagnostic precision and efficiency, many computer-assisted methods have been developed and introduced, including machine learning (ML) and deep learning (DL) (Zhao and Jia, 2016; Boissoneault et al., 2017; Salvador et al., 2017). Texture analysis combined with ML methods has been widely used to evaluate medical images and yielded promising results (Fetit et al., 2015; Li et al., 2016; Bisdas et al., 2018). However, to the best of our knowledge, there are a few reports on the use of t-distributed stochastic neighbor embedding (t-SNE), which is a new dimensionality reduction and visualization technique to foresee data for preventing problems such as incorrect marking of images and that can help increase the accuracy of the classification. We hypothesized that some texture features acquired from MRI scans would serve as classification features and markedly improve classification efficiency. To test our hypothesis, the Gini impurity index in the random forests (RFs) was applied to select features, which were then used to develop classification models. Finally, the performance of the features and the models in confirming our hypothesis was assessed.

Subjects and Methods

Subjects

The data used were collected from the Affiliated Nanjing Brain Hospital of Nanjing Medical University. Patients in whom meningioma or glioma was histopathologically confirmed between January 2014 and December 2014 were selected. In all, 40 patients (average age: 51.10 years) comprising 22 men (average age: 52.36 years) and 18 women (average age: 47.33 years) were included. The exclusion criteria were as follows: (1) presence of other organic mental disorders and nervous system diseases and (2) a history of major physical illnesses. All of the patients met the above criteria. The study was approved by the medical ethics committee of Nanjing Medical University. All patients provided signed written informed consent.

MRI Acquisition

All images were acquired using a 3T Siemens MRI system. The patients were instructed to relax, keep their eyes closed, stay awake, and remain still. Patient compliance was confirmed after scanning was completed. The images were recorded axially for 6 min by using an echo-planar imaging sequence with the following parameters: TR = 1900 ms, TE = 2.49 ms, slice thickness = 1 mm, flip angle = 90°, and matrix size = 256 × 256. All patients underwent MRI without reporting discomfort during or after the procedure.

Classification Based on 25 Texture Features

Preparation Before Classification

For the experimental preparation, the raw sample image format was changed from DICOM to JPG. In the texture analysis, the tumor region in the coronal MRI image was selected as the experimental group, and the symmetrical healthy region on the other side of the brain was selected as the control group. There were 40 tumor regions in the experimental group and 40 healthy regions in the control group. In each group, 25 texture features (belonging to three categories) were calculated, as shown in Table 1.
TABLE 1

The selected texture features in the three categories.

Category (-based parameters)Texture features
Histogram(1) First-order moment; (2) second-order moment; (3) third-order moment; (4) fourth-order moment; (5) the central moment of the four features; (6) the absolute moment of the four features
Run-length matrix(1) Long run emphasis; (2) short run emphasis; (3) gray level nonuniformity; (4) total run-length percentage
Co-occurrence matrix(1) Energy; (2) inertia moment; (3) correlation; (4) entropy; (5) mean; (6) variance; (7) standard deviation; (8) homogeneity; (9) dissimilarity
The selected texture features in the three categories. The 25 texture features were recorded as mean ± SD. Statistical analysis was performed using SPSS (version 20.0), including the Shapiro–Wilk test and Wilcoxon signed-rank test, which was used for testing significant differences in each feature between tumors and healthy areas. Meanwhile, an RF model was employed to predict whether each sample was a tumor or a healthy area and for importance rankings of 25 texture features according to the Gini impurity index in the RF (Menze et al., 2009; Liu et al., 2018). All texture features were selected as predictors to compare the experimental results from the Wilcoxon signed-rank test and RF prediction. In addition, t-SNE, a new dimension reduction and visualization technique for high-dimensionality data, was performed in the exploratory analysis (Li et al., 2017). It was applied to all 40 pairs of samples with 25 features to delete the samples that would apparently have a negative effect on the latter classification.

Classification

The samples were randomly divided into training (70%) and test sets (30%). This was iterated five times to provide five unique training and testing groups. The training set was used to generate classification models with three different classifiers: RF, BP, and SVM. The RF is fast, is flexible, and has become a standard tool in biomedical informatics. Each classifier in the ensemble is a decision tree classifier and is generated using random selection of attributes at each node to determine the split. During classification, each tree votes, and the most popular class is returned. The BP iteratively processes a set of training tuples and compares the network’s prediction with the actual known target value. For each training tuple, the weights are modified to minimize the mean squared error between the network’s prediction and the actual target value. Modifications are made in the backwards direction. The process will reach the terminating condition when the error is very small. The SVM is a classification method for both linear and nonlinear data. It uses nonlinear mapping to transform the original training data into a higher dimension. With the new dimension, it searches for the linear optimal separating hyperplane. With an appropriate nonlinear mapping to a sufficiently high dimension, data from two classes can always be separated by a hyperplane. SVM finds this hyperplane using support vectors and margins. Four indexes were used to evaluate each model, including the area under the curve (AUC), error rate, sensitivity, and specificity. Moreover, the receiver operating characteristic (ROC) curve was constructed for each model.

Results

Texture Feature Analysis

Using the Wilcoxon signed-rank test, 25 texture features could reveal regions with higher and lower texture values when comparing the experimental (tumor region) and control groups (health region), as shown in Tables 2–4. We obtained the importance rankings of the 25 texture features according to the Gini impurity index in the RF with a training set (80%). The top five features were standard deviation, first-order moment, variance, third-order absolute moment, and third-order central moment, as shown in Table 5.
TABLE 2

Wilcoxon signed-rank test results (histogram).

No.Texture featuresTumor regionHealth regionP-value
1First-order moment10.661 ± 3.4143.664 ± 5.1110.000*
2First-order central moment2.327 ± 0.2801.071 ± 0.9340.000*
3First-order absolute moment2.327 ± 1.2801. 071 ± 0.9340.000*
4Second-order moment159.912 ± 51.20554.958 ± 76.6570.000*
5Second-order central moment19,442.302 ± 13,715.9266,490.148 ± 13,691.2110.000*
6Second-order absolute moment19,442.302 ± 13,715.9266,490.148 ± 13,691.2110.000*
7Third-order moment2,398.676 ± 768.069824.368 ± 1,149.8550.000*
8Third-order central moment−14,586,918,376.546 ± 12,193,828,145.790−5,165,830,452.742 ± 11,313,615,571.9150.000*
9Third-order absolute moment14,586,918,376.546 ± 12,193,828,145.7905,165,830,452.768 ± 11,313,615,571.9030.000*
10Fourth-order moment35,980.138 ± 11,521.03112,365.514 ± 17,247.8260.000*
11Fourth-order central moment2,201,613,996,782,910,720.000 ± 2,158,254,619,227,325,440.000827,213,133,922,581,760.000 ± 1,882,686,712,587,172,610.0000.000*
12Fourth-order absolute moment2,201,613,996,782,910,720.000 ± 2,158,254,619,227,325,440.000827,213,133,922,581,760.000 ± 1,882,686,712,587,172,610.0000.000*
TABLE 4

Wilcoxon signed-rank test results (co-occurrence matrix).

No.Texture featuresTumor regionHealthy regionP-value
1Energy0.637 ± 0.1980.812 ± 0.1610.000*
2Entropy0.663 ± 0.3280.404 ± 0.2880.002*
3Inertia moment15.809 ± 10.32213.5043 ± 12.04880.313
4Correlation0.105 ± 0.2760.093 ± 0.1500.026*
5Homogeneity0.939 ± 0.0400.948 ± 0.0470.313
6Dissimilarity0.988 ± 0.6450.844 ± 0.7530.313
7Mean12.444 ± 3.6454.876 ± 5.4650.000*
8Variance152.277 ± 47.37153.902 ± 71.0280.000*
9Standard deviation15.004 ± 2.0848.575 ± 4.6770.000*
TABLE 5

Gini impurity index in the RF.

No.Texture featuresMean decrease GiniRank
1First-order moment1.78974537
2First-order central moment0.849584116
3First-order absolute moment0.787365419
4Second-order moment1.80264586
5Second-order central moment1.514700010
6Second-order absolute moment1.375771413
7Third-order moment1.64303359
8Third-order central moment1.80379725
9Third-order absolute moment1.83163614
10Fourth-order moment2.08927202
11Fourth-order central moment1.060980014
12Fourth-order absolute moment1.412492212
13Energy0.476390925
14Entropy0.516579823
15Inertia moment0.761897220
16Correlation0.735506721
17Homogeneity0.838510617
18Dissimilarity0.734335722
19Mean1.75269298
20Variance1.94829263
21Standard deviation3.11264581
22Long-run emphasis0.866438415
23Short-run emphasis1.429772111
24Total run-length percentage0.831474518
25Gray-level nonuniformity0.491833724
Wilcoxon signed-rank test results (histogram). Wilcoxon signed-rank test results (run-length matrix). Wilcoxon signed-rank test results (co-occurrence matrix). Gini impurity index in the RF. The t-SNE test results are shown in Figure 1. In Figure 1A, the original features were those found in the Wilcoxon signed-rank test (19 features in total), and in Figure 1B, the original features were the top five features determined in the RF’s importance rankings. However, the data distributions after t-SNE were similar. All samples were evidently divided into two clusters, except 12 samples (1, 9, 10, 11, 17, 19, 23, 30, 35, 44, 73, and 79), which were seemingly distributed mistakenly. In addition, t-test was used to examine 40 samples to determine whether their features were relatively different between the tumor and healthy regions. We found that the mean P-value was 0.2390645. The P-values of seven samples—1, 9, 10, 19, 23, 30, and 35—were greater than the mean P-value, and these samples were also mistakenly distributed in the t-SNE picture and were deleted.
FIGURE 1

Feature distributions after t-SNE: (A) features based on the Wilcoxon signed-rank test; (B) features based on the RF’s importance rankings (red figures represent the healthy regions and blue figures represent the tumor regions).

Feature distributions after t-SNE: (A) features based on the Wilcoxon signed-rank test; (B) features based on the RF’s importance rankings (red figures represent the healthy regions and blue figures represent the tumor regions).

Classifier Evaluation

On the basis of the results obtained above, we selected the five features (standard deviation, first-order moment, variance, third-order absolute moment, and third-order central moment) identified in the RF to set up classifiers, which helped save calculation time and resources. Three classification models (RF, SVM, and BP) were applied, and five features were used to train each classifier. A detailed summary of the model’s performance is presented in Table 6.
TABLE 6

Three classifiers evaluation.

ClassifiersAUCError rate (%)Sensitivity (%)Specificity (%)
RF0.85614.182.888.3
SVM0.932  6.994.0492.3
BP0.88411.487.289.6
Three classifiers evaluation. All three models showed satisfactory AUCs of 0.85–0.95. The RF and the BP shared a similar performance based on the AUC, error rate, sensitivity, and specificity. The model trained by the SVM classifier demonstrated the best performance among the three models, with markedly better AUC, error rate, sensitivity, and specificity, indicating that this model could correctly classify the tumor and healthy regions. Receiver operating characteristic (ROC) curves were constructed for the three models to compare their performance directly, as shown in Figure 2.
FIGURE 2

Receiver operating characteristic of the three models: (A) RF model, (B) SVM model, and (C) BP model.

Receiver operating characteristic of the three models: (A) RF model, (B) SVM model, and (C) BP model.

Discussion

Some studies have reported the same methods to select features, and the validity of this approach has been proven. Wang et al. (2018) evaluated the importance of spectra lines based on RFs and then used a support vector machine (SVM) classifier to classify the laser-induced plasma spectra (LIBS) of bacteria species. The primary objective of this study was to characterize tumor regions using MRI-based texture analysis. We used texture analysis to compute 25 texture features from MRI images. Using the Wilcoxon signed-rank test, we confirmed that 19 texture features of the total 25 features were different between the healthy and tumor regions. Using the t-SNE technique, the dataset was divided into two clusters, indicating that there is a high possibility to set up a classification model with these 19 features. However, training a model with high-dimensionality data requires a lot of time and space. To facilitate faster and more accurate classification, the importance rankings of the features in the RF were calculated, and the top five features were found to show the same classification effectiveness as the 19 features selected before. The images for the t-SNE test results showed some seemingly noisy dots. Considering the possibility that all mistakenly distributed samples may be deleted incorrectly, the t-test was applied to generally examine whether the healthy and tumor regions showed significant differences in the 25 texture features for each sample. To determine the modified number of samples that would be deleted, the mean P-value was set as the deletion standard, and seven samples were excluded on the basis of this standard. Since the samples were marked manually and these samples were likely to be marked mistakenly, this was a limitation that has been mentioned in many previous studies. On the basis of the five features, three class-action models were built by training three ML classifiers, namely, RF, SVM, and BP. The SVM classifier was superior to the RF and BP classifiers, as shown in Table 6, since it provided better performance in terms of AUC, error rate, sensitivity, and specificity. These results were shown to be convincing through fivefold confirmation, which was consistent with the findings of previous studies (Zhang et al., 2017). The model in this article was superior to the previous models since it depended on only five features while showing the same AUC. Since the software that is needed to perform texture analysis and build classification models is readily available, clinicians can easily perform such analyses in clinical settings. This study had some limitations. First, the dataset was modified, since the model was trained with only 80 samples. Its robustness needs further examination. Second, some degree of selection bias may exist. Different categories of brain tumors have different texture features. Some unique features were excluded, which may have influenced the results of our analysis. Third, a manual approach was adopted to segment tumors in this study. Although manual segmentation generally works better than automatic methods, segmentation errors could still exist. Some noise dots may have been mistakenly marked manually, negatively influencing the formation of our model.

Conclusion

In conclusion, we hypothesized that a few of the textures acquired from the MRI images could serve the role of classification features, thereby significantly improving the classification efficiency. The Gini impurity index in the RF was applied to select features. On the basis of the five features, three class-action models were built by training three ML classifiers, including RF, SVM, and BP. The classifier model in this article was superior to the previous models, since it depended on only five features. On the basis of our initial findings, tumor regions characterized on the basis of MRI-based texture analysis may have clinical usefulness in differentiating brain tumors.

Data Availability Statement

The datasets presented in this article are not readily available because the data cannot be used out of the hospital. Requests to access the datasets should be directed to med.info@njmu.edu.cn.

Ethics Statement

The studies involving human participants were reviewed and approved by the Medical Ethics Committee of Nanjing Medical University. The patients/participants provided their written informed consent to participate in this study.

Author Contributions

All authors listed have made a substantial, direct and intellectual contribution to the work, and approved it for publication.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
TABLE 3

Wilcoxon signed-rank test results (run-length matrix).

No.Texture featuresTumor regionHealthy regionP-value
1Long run emphasis628.833 ± 512.533782.519 ± 639.5390.188
2Short run emphasis0.228 ± 0.0760.234 ± 0.1000.582
3Total run-length percentage0.084 ± 0.03850.078 ± 0.0460.476
4Gray level nonuniformity207.211 ± 129.511154.085 ± 86.2130.011*
  20 in total

Review 1.  Biomarkers for Musculoskeletal Pain Conditions: Use of Brain Imaging and Machine Learning.

Authors:  Jeff Boissoneault; Landrew Sevel; Janelle Letzen; Michael Robinson; Roland Staud
Journal:  Curr Rheumatol Rep       Date:  2017-01       Impact factor: 4.592

2.  [Utility of texture analysis of magnetic resonance imaging in differential diagnosis of common pediatric cerebellar tumors in children].

Authors:  Y Li; M M Li; Y Zhang; J L Cheng; Z G Shang; C X Bu
Journal:  Zhonghua Yi Xue Za Zhi       Date:  2016-06-21

3.  Prediction of ESRD in IgA Nephropathy Patients from an Asian Cohort: A Random Forest Model.

Authors:  Yexin Liu; Yan Zhang; Di Liu; Xia Tan; Xiaofang Tang; Fan Zhang; Ming Xia; Guochun Chen; Liyu He; Letian Zhou; Xuejing Zhu; Hong Liu
Journal:  Kidney Blood Press Res       Date:  2018-12-07       Impact factor: 2.687

4.  National survey of patterns of care for brain-tumor patients.

Authors:  M S Mahaley; C Mettlin; N Natarajan; E R Laws; B B Peace
Journal:  J Neurosurg       Date:  1989-12       Impact factor: 5.115

Review 5.  Brain Tumors and Metastases.

Authors:  Mary M Vargo
Journal:  Phys Med Rehabil Clin N Am       Date:  2017-02       Impact factor: 1.784

6.  A comparison of random forest and its Gini importance with standard chemometric methods for the feature selection and classification of spectral data.

Authors:  Bjoern H Menze; B Michael Kelm; Ralf Masuch; Uwe Himmelreich; Peter Bachert; Wolfgang Petrich; Fred A Hamprecht
Journal:  BMC Bioinformatics       Date:  2009-07-10       Impact factor: 3.169

7.  Quantitative CT texture and shape analysis: can it differentiate benign and malignant mediastinal lymph nodes in patients with primary lung cancer?

Authors:  Hamid Bayanati; Rebecca E Thornhill; Carolina A Souza; Vineeta Sethi-Virmani; Ashish Gupta; Donna Maziak; Kayvan Amjadi; Carole Dennie
Journal:  Eur Radiol       Date:  2014-09-13       Impact factor: 5.315

8.  3D Texture Analysis of Heterogeneous MRI Data for Diagnostic Classification of Childhood Brain Tumours.

Authors:  Ahmed E Fetit; Jan Novak; Daniel Rodriguez; Dorothee P Auer; Chris A Clark; Richard G Grundy; Tim Jaspan; Andrew C Peet; Theodoros N Arvanitis
Journal:  Stud Health Technol Inform       Date:  2015

9.  Multiscale CNNs for Brain Tumor Segmentation and Diagnosis.

Authors:  Liya Zhao; Kebin Jia
Journal:  Comput Math Methods Med       Date:  2016-03-16       Impact factor: 2.238

Review 10.  A Review on a Deep Learning Perspective in Brain Cancer Classification.

Authors:  Gopal S Tandel; Mainak Biswas; Omprakash G Kakde; Ashish Tiwari; Harman S Suri; Monica Turk; John R Laird; Christopher K Asare; Annabel A Ankrah; N N Khanna; B K Madhusudhan; Luca Saba; Jasjit S Suri
Journal:  Cancers (Basel)       Date:  2019-01-18       Impact factor: 6.639

View more
  2 in total

1.  MRI whole-lesion texture analysis on ADC maps for the prognostic assessment of ischemic stroke.

Authors:  Yuan Zhang; Yuzhong Zhuang; Yaqiong Ge; Pu-Yeh Wu; Jing Zhao; Hao Wang; Bin Song
Journal:  BMC Med Imaging       Date:  2022-07-01       Impact factor: 2.795

2.  Differentiating Glioblastoma Multiforme from Brain Metastases Using Multidimensional Radiomics Features Derived from MRI and Multiple Machine Learning Models.

Authors:  Salar Bijari; Amin Jahanbakhshi; Parham Hajishafiezahramini; Parviz Abdolmaleki
Journal:  Biomed Res Int       Date:  2022-09-28       Impact factor: 3.246

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.