| Literature DB >> 26089977 |
Sabina Tangaro1, Nicola Amoroso2, Massimo Brescia3, Stefano Cavuoti3, Andrea Chincarini4, Rosangela Errico5, Paolo Inglese1, Giuseppe Longo6, Rosalia Maglietta7, Andrea Tateo8, Giuseppe Riccio3, Roberto Bellotti2.
Abstract
Neurodegenerative diseases are frequently associated with structural changes in the brain. Magnetic resonance imaging (MRI) scans can show these variations and therefore can be used as a supportive feature for a number of neurodegenerative diseases. The hippocampus has been known to be a biomarker for Alzheimer disease and other neurological and psychiatric diseases. However, it requires accurate, robust, and reproducible delineation of hippocampal structures. Fully automatic methods are usually the voxel based approach; for each voxel a number of local features were calculated. In this paper, we compared four different techniques for feature selection from a set of 315 features extracted for each voxel: (i) filter method based on the Kolmogorov-Smirnov test; two wrapper methods, respectively, (ii) sequential forward selection and (iii) sequential backward elimination; and (iv) embedded method based on the Random Forest Classifier on a set of 10 T1-weighted brain MRIs and tested on an independent set of 25 subjects. The resulting segmentations were compared with manual reference labelling. By using only 23 feature for each voxel (sequential backward elimination) we obtained comparable state-of-the-art performances with respect to the standard tool FreeSurfer.Entities:
Mesh:
Year: 2015 PMID: 26089977 PMCID: PMC4450305 DOI: 10.1155/2015/814104
Source DB: PubMed Journal: Comput Math Methods Med ISSN: 1748-670X Impact factor: 2.238
The 315 features extracted from the 3D MRI images. Of each group of 66 Haralick features, 13 are the gradients along the 13 diagonals, 5 are the principal moments, and the rest are the three sets of 16 textural features, one set for each plane of the voxels. The gradients for each voxel are measured in all directions at one voxel distance and the relative 3D positions are included as features.
| Number | Description |
|---|---|
| 1 | Position |
| 1 | Grey level |
| 66 | Haralick features for mask 3 × 3 |
| 66 | Haralick features for mask 5 × 5 |
| 66 | Haralick features for mask 7 × 7 |
| 66 | Haralick features for mask 9 × 9 |
| 49 | Haar-like 3D features |
Classification result on all 315 input features using Näive Bayes Classifier in 5-fold cross validation based on confusion matrix.
| 315 input features | Completeness of a class | Purity of a class | Contamination of a class |
|---|---|---|---|
| Hippocampus | 79% | 62% | 38% |
| Not-hippocampus | 63% | 80% | 20% |
| Efficiency |
| ||
Classification result on the first 197 PCs using Näive Bayes Classifier using in 5-fold cross validation based on confusion matrix.
| 197 PCs | Completeness of a class | Purity of a class | Contamination of a class |
|---|---|---|---|
| Hippocampus | 60% | 68% | 32% |
| Not-hippocampus | 78% | 72% | 28% |
| Efficiency |
| ||
Classification result on all 315 PCs using Näive Bayes Classifier using in 5-fold cross validation based on confusion matrix.
| 315 PCs | Completeness of a class | Purity of a class | Contamination of a class |
|---|---|---|---|
| Hippocampus | 86% | 51% | 49% |
| Not-hippocampus | 36% | 78% | 22% |
| Efficiency |
| ||
Classification result on 57 features selected through Kolmogorov-Smirnov test using Näive Bayes Classifier using in 5-fold cross validation based on confusion matrix.
| 57 features | Completeness of a class | Purity of a class | Contamination of a class |
|---|---|---|---|
| Hippocampus | 84% | 57% | 43% |
| Not-hippocampus | 52% | 81% | 19% |
| Efficiency |
| ||
Figure 1Best dice index of all the possible combinations of the relevant step in (a) sequential forward selection and in (b) sequential backward elimination methods.
Details of the 36 features resulting by the forward selection method using Näive Bayes Classifier.
| 36 features | Haralick features | Haar-like features | Statistical features | |||
|---|---|---|---|---|---|---|
| Orientation | Coordinate | Mask size | Type | Mask size | Entry | |
| Contrast* | 135 |
| 3 | |||
| Gradient* | 5 |
| ||||
| Correlation | 135 |
| 3 | |||
| Position* | Coordinates | |||||
| Normalized gray level* | Value | |||||
| Correlation* | 45 |
| 5 | |||
| Gradient* | 5 |
| ||||
| Correlation* | 90 |
| 9 | |||
| Correlation | 45 |
| 7 | |||
| Skewness* | 7 | |||||
| Homogeneity* | 90 |
| 9 | |||
| Correlation | 0 |
| 5 | |||
| Correlation | 90 |
| 5 | |||
| Correlation* | 45 |
| 3 | |||
| Correlation | 135 |
| 9 | |||
| Correlation | 90 |
| 5 | |||
| Correlation | 135 |
| 5 | |||
| Correlation | 0 |
| 7 | |||
| Correlation | 90 |
| 7 | |||
| Correlation | 90 |
| 9 | |||
| Correlation | 0 |
| 3 | |||
| Correlation | 135 |
| 3 | |||
| Correlation | 0 |
| 9 | |||
| Template* | 1 | |||||
| Skewness* | 5 | |||||
| Correlation | 90 |
| 3 | |||
| Correlation | 45 |
| 5 | |||
| Gradient | 3 |
| ||||
| Template | 2 | |||||
| Correlation* | 45 |
| 9 | |||
| Correlation | 45 |
| 5 | |||
| Correlation | 90 |
| 7 | |||
| Correlation | 45 |
| 5 | |||
| Gradient | 9 |
| ||||
| Homogeneity | 0 |
| 9 | |||
| Correlation | 0 |
| 9 | |||
The asterisk indicates the entries also present in the list of 23 SBE features. For Haralick features, the orientation in degrees, reference coordinate, and the size of the cubic mask used are reported. In case of Haar-like features, the entry value indicates the template type used (see Figure 2). For statistical/positional kind the size of the cubic mask used or the self-explained value is lister, depending on the specific feature type. In particular for gradients, the column named Entry indicates the segment of the reference diagonal as shown in Figure 3. All the features are listed in top-down order of their inclusion during the SFS procedure execution.
Figure 2Haar-like template types 1 (a) and 2 (b) used in the experiments.
Figure 3Representation of a generic cubic mask used for calculating the gradient features. The labeled points are either the vertexes of the cube or the median points of the segments.
Details of the 23 features resulting by the backward elimination method using Näive Bayes Classifier.
| 23 features | Haralick features | Haar-like features | Statistical features | |||
|---|---|---|---|---|---|---|
| Orientation | Coordinate | Mask size | Type | Mask size | Entry | |
| Position* | Coordinates | |||||
| Normalized gray level* | Value | |||||
| Correlation | 0 |
| 7 | |||
| Correlation* | 45 |
| 3 | |||
| Correlation* | 45 |
| 5 | |||
| Correlation | 45 |
| 7 | |||
| Correlation* | 45 |
| 9 | |||
| Correlation | 45 |
| 9 | |||
| Correlation | 45 |
| 5 | |||
| Correlation* | 90 |
| 9 | |||
| Homogeneity | 135 |
| 3 | |||
| Gradient* | 5 |
| ||||
| Gradient | 7 |
| ||||
| Contrast* | 135 |
| 3 | |||
| Gradient | 9 |
| ||||
| Homogeneity* | 90 |
| 9 | |||
| Gradient | 3 |
| ||||
| Skewness* | 7 | |||||
| Gradient* | 5 |
| ||||
| Gradient | 3 |
| ||||
| Template* | 1 | |||||
| Skewness* | 5 | |||||
| Gradient | 5 |
| ||||
The asterisk indicates the entries also present in the list of 36 SFS features. For Haralick features, the orientation in degrees, reference coordinate, and the size of the cubic mask used are reported. In case of Haar-like features, the entry value indicates the template type used (see Figure 2). For statistical/positional kind, the size of the cubic mask used and/or the self-explained value is listed, depending on the specific feature type. In particular for gradients, the column named Entry indicates the segment of the reference diagonal as shown in Figure 3.
Classification result on 36 features selected through forward selection method using Näive Bayes Classifier in 5-fold cross validation based on confusion matrix.
| 36 features | Completeness of a class | Purity of a class | Contamination of a class |
|---|---|---|---|
| Hippocampus | 82% | 70% | 30% |
| Not-hippocampus | 73% | 84% | 16% |
| Efficiency |
| ||
Classification result on 23 features selected through backward elimination method using Näive Bayes Classifier in 5-fold cross validation based on confusion matrix.
| 23 features | Completeness of a class | Purity of a class | Contamination of a class |
|---|---|---|---|
| Hippocampus | 83% | 70% | 30% |
| Not-hippocampus | 73% | 85% | 15% |
| Efficiency |
| ||
Classification result on 222 features selected through Random Forest method using Näive Bayes Classifier in 5-fold cross validation based on confusion matrix.
| 222 features | Completeness of a class | Purity of a class | Contamination of a class |
|---|---|---|---|
| Hippocampus | 80% | 62% | 38% |
| Not-hippocampus | 62% | 80% | 20% |
| Efficiency |
| ||
Figure 4Distribution of 2000 random Dice values compared with true Dice (shown with the dashed red line) concerning 36 features obtained by the sequential forward selection.
For each implemented method, size of selected group, mean Dice index (evaluated using Näive Bayes Classifier), and related σ are shown.
| Method | Size of selected group | Dice index |
|---|---|---|
| Original dataset | 315 | 0.69 ± 0.04 |
| PCA selection | 197 | 0.62 ± 0.07 |
| K-S selection | 57 | 0.67 ± 0.04 |
| Forward selection | 36 | 0.75 ± 0.02 |
| Backward elimination | 23 | 0.75 ± 0.02 |
| Random Forest | 222 | 0.69 ± 0.04 |
Figure 5Dice index comparison for the following methods: original dataset (315 for each voxel); PCA (197 selected features); K-S test (57 selected features); SFS (36 selected features); SBE (23 selected features); Random Forest (222 selected features). Boxes have lines at the lower quartile, median, and upper quartile values, with whiskers extending to 1.5 times the interquartile range. Outliers are indicated by a plus sign.