| Literature DB >> 27747593 |
Abstract
Feature selection plays a key role in multi-voxel pattern analysis because functional magnetic resonance imaging data are typically noisy, sparse, and high-dimensional. Although the conventional evaluation criterion is the classification accuracy, selecting a stable feature set that is not sensitive to the variance in dataset may provide more scientific insights. In this study, we aim to investigate the stability of feature selection methods and test the stability-based feature selection scheme on two benchmark datasets. Top-k feature selection with a ranking score of mutual information and correlation, recursive feature elimination integrated with support vector machine, and L1 and L2-norm regularizations were adapted to a bootstrapped stability selection framework, and the selected algorithms were compared based on both accuracy and stability scores. The results indicate that regularization-based methods are generally more stable in StarPlus dataset, but in Haxby dataset they failed to perform as well as others.Entities:
Keywords: Feature selection; Functional MRI; Multi-voxel pattern analysis; Stability
Year: 2016 PMID: 27747593 PMCID: PMC4999569 DOI: 10.1007/s40708-016-0048-0
Source DB: PubMed Journal: Brain Inform ISSN: 2198-4026
Fig. 1A demonstration of MVPA of fMRI data in cognitive experiments. Visual stimuli are presented to subjects during experiment tests and fMRI data are collected over time. Informative voxels are identified as a pattern used for classification among visual stimuli
Fig. 2The feature (beta values) matrix is extracted by applying a general linear model to fMRI BOLD signals. Subject 1 in Haxby dataset is used as an illustrative example
The cross-validation settings of datasets
| Dataset | Training | Test | Validation | Replication |
|---|---|---|---|---|
| StarPlus | 60 | 10 | 10 | 10 |
| Haxby | 6 | 5 | 1 | 12 |
Note that StarPlus dataset is measured in samples, while the Haxby dataset is measured in trials
Summary of results—subject 04799 in StarPlus dataset
| Method | Mean accuracy (%) | STD (%) | Average number of selected features | Stability |
|---|---|---|---|---|
| SVM-MI | 50.00 | 17.00 | 56 | 0.32 |
| SVM-Corr | 46.00 | 16.47 | 51 | 0.32 |
| SVM-RFE | 55.00 | 17.16 | 70 | 0.40 |
| LASSO ( | 41.00 | 12.87 | 6 | 0.16 |
| LASSO ( | 48.00 | 11.35 | 4 | 0.23 |
| LASSO ( | 48.00 | 11.35 | 7 | 0.36 |
| LASSO ( | 49.00 | 3.16 | 6 | 0.06 |
| ENet ( | 43.00 | 13.37 | 10 | 0.20 |
| ENet ( | 45.00 | 15.81 | 6 | 0.21 |
| ENet ( | 48.00 | 13.17 | 8 | 0.41 |
| ENet ( | 47.00 | 10.59 | 5 | 0.10 |
Summary of results—subject 6 in Haxby dataset
| Method | Mean accuracy (%) | STD (%) | Average number of selected features | Stability |
|---|---|---|---|---|
| SVM-MI | 87.50 | 10.66 | 176 | 0.73 |
| SVM-Corr | 79.17 | 17.94 | 179 | 0.73 |
| SVM-RFE | 86.46 | 12.45 | 279 | 0.89 |
| LASSO ( | 69.79 | 17.24 | 42 | 0.60 |
| LASSO ( | 62.50 | 17.68 | 43 | 0.59 |
| LASSO ( | 63.54 | 15.50 | 39 | 0.52 |
| LASSO ( | 52.08 | 14.92 | 47 | 0.47 |
| ENet ( | 57.29 | 18.04 | 160 | 0.71 |
| ENet ( | 65.63 | 17.78 | 151 | 0.70 |
| ENet ( | 67.71 | 15.50 | 161 | 0.67 |
| ENet ( | 72.92 | 12.87 | 152 | 0.61 |
Fig. 3An illustration of the distribution of voxels selected by each method in the visual cortex area for (a) subject 1 in Haxby dataset and (b) subject 04820 in StarPlus dataset
Summary of results—subject 04820 in StarPlus dataset
| Method | Mean accuracy (%) | STD (%) | Average number of selected features | Stability |
|---|---|---|---|---|
| SVM-MI | 90.00 | 10.54 | 164 | 0.40 |
| SVM-Corr | 83.00 | 15.67 | 1845 | 0.98 |
| SVM-RFE | 91.00 | 11.01 | 127 | 0.34 |
| LASSO ( | 85.00 | 8.50 | 8 | 0.78 |
| LASSO ( | 85.00 | 8.50 | 8 | 0.71 |
| LASSO ( | 84.00 | 8.43 | 10 | 0.49 |
| LASSO ( | 73.00 | 14.94 | 6 | 0.24 |
| ENet ( | 85.00 | 10.80 | 14 | 0.64 |
| ENet ( | 85.00 | 10.80 | 12 | 0.71 |
| ENet ( | 85.00 | 10.80 | 15 | 0.92 |
| ENet ( | 86.00 | 8.43 | 14 | 0.89 |
Summary of results—subject 04847 in StarPlus dataset
| Method | Mean accuracy (%) | STD (%) | Average number of selected features | Stability |
|---|---|---|---|---|
| SVM-MI | 80.00 | 4.71 | 64 | 0.59 |
| SVM-Corr | 82.00 | 10.33 | 1660 | 0.97 |
| SVM-RFE | 83.00 | 9.49 | 50 | 0.39 |
| LASSO ( | 77.00 | 8.23 | 4 | 0.60 |
| LASSO ( | 76.00 | 8.43 | 7 | 0.69 |
| LASSO ( | 79.00 | 9.94 | 5 | 0.82 |
| LASSO ( | 79.00 | 9.94 | 5 | 0.90 |
| ENet ( | 77.00 | 11.60 | 7 | 0.46 |
| ENet ( | 78.00 | 10.33 | 13 | 0.53 |
| ENet ( | 77.00 | 8.23 | 9 | 0.56 |
| ENet ( | 80.00 | 9.43 | 13 | 0.69 |
Summary of results—subject 05675 in StarPlus dataset
| Method | Mean accuracy (%) | STD (%) | Average number of selected features | Stability |
|---|---|---|---|---|
| SVM-MI | 87.00 | 6.75 | 70 | 0.47 |
| SVM-Corr | 78.00 | 11.35 | 2059 | 0.88 |
| SVM-RFE | 90.00 | 8.16 | 50 | 0.39 |
| LASSO ( | 89.00 | 7.38 | 11 | 0.60 |
| LASSO ( | 87.00 | 10.59 | 11 | 0.54 |
| LASSO ( | 86.00 | 9.66 | 11 | 0.54 |
| LASSO ( | 82.00 | 9.19 | 18 | 0.61 |
| ENet ( | 90.00 | 6.67 | 25 | 0.75 |
| ENet ( | 88.00 | 10.33 | 19 | 0.73 |
| ENet ( | 85.00 | 8.50 | 25 | 0.61 |
| ENet ( | 82.00 | 10.33 | 23 | 0.60 |
Summary of results—subject 05680 in StarPlus dataset
| Method | Mean accuracy (%) | STD (%) | Average number of selected features | Stability |
|---|---|---|---|---|
| SVM-MI | 74.00 | 8.43 | 85 | 0.52 |
| SVM-Corr | 73.00 | 14.18 | 2211 | 0.99 |
| SVM-RFE | 75.00 | 15.81 | 298 | 0.19 |
| LASSO ( | 80.00 | 8.16 | 4 | 1.00 |
| LASSO ( | 80.00 | 8.16 | 4 | 1.00 |
| LASSO ( | 80.00 | 8.16 | 4 | 1.00 |
| LASSO ( | 80.00 | 8.16 | 4 | 1.00 |
| ENet ( | 79.00 | 7.38 | 6 | 0.82 |
| ENet ( | 78.00 | 7.89 | 9 | 0.84 |
| ENet ( | 80.00 | 8.16 | 8 | 0.76 |
| ENet ( | 80.00 | 8.16 | 8 | 0.72 |
Summary of results—subject 05710 in StarPlus dataset
| Method | Mean accuracy (%) | STD (%) | Average number of selected features | Stability |
|---|---|---|---|---|
| SVM-MI | 83.00 | 9.49 | 52 | 0.54 |
| SVM-Corr | 70.00 | 6.67 | 1861 | 0.99 |
| SVM-RFE | 76.00 | 10.75 | 93 | 0.27 |
| LASSO ( | 88.00 | 13.17 | 10 | 0.76 |
| LASSO ( | 86.00 | 12.65 | 8 | 0.64 |
| LASSO ( | 84.00 | 12.65 | 9 | 0.68 |
| LASSO ( | 79.00 | 11.01 | 8 | 0.71 |
| ENet ( | 91.00 | 8.76 | 12 | 0.78 |
| ENet ( | 90.00 | 9.43 | 13 | 0.87 |
| ENet ( | 86.00 | 12.65 | 11 | 0.78 |
| ENet ( | 86.00 | 12.65 | 12 | 0.66 |
Summary of results—subject 1 in Haxby dataset
| Method | Mean accuracy (%) | STD (%) | Average number of selected features | Stability |
|---|---|---|---|---|
| SVM-MI | 90.63 | 12.07 | 122 | 0.70 |
| SVM-Corr | 84.38 | 16.96 | 338 | 0.58 |
| SVM-RFE | 84.38 | 16.96 | 219 | 0.49 |
| LASSO ( | 79.17 | 14.43 | 88 | 0.68 |
| LASSO ( | 77.08 | 13.93 | 75 | 0.67 |
| LASSO ( | 76.04 | 11.25 | 87 | 0.71 |
| LASSO ( | 71.88 | 16.10 | 95 | 0.61 |
| ENet ( | 38.54 | 26.36 | 255 | 0.71 |
| ENet ( | 59.38 | 20.03 | 235 | 0.70 |
| ENet ( | 62.50 | 21.98 | 255 | 0.67 |
| ENet ( | 80.21 | 11.25 | 232 | 0.67 |
Summary of results—subject 2 in Haxby dataset
| Method | Mean accuracy (%) | STD (%) | Average number of selected features | Stability |
|---|---|---|---|---|
| SVM-MI | 70.83 | 13.41 | 123 | 0.57 |
| SVM-Corr | 71.88 | 12.07 | 357 | 0.86 |
| SVM-RFE | 78.13 | 14.23 | 195 | 0.67 |
| LASSO ( | 55.21 | 6.44 | 94 | 0.57 |
| LASSO ( | 48.96 | 17.24 | 90 | 0.52 |
| LASSO ( | 48.96 | 17.24 | 97 | 0.46 |
| LASSO ( | 43.75 | 12.50 | 104 | 0.46 |
| ENet ( | 32.29 | 16.39 | 269 | 0.64 |
| ENet ( | 50.00 | 18.46 | 264 | 0.62 |
| ENet ( | 53.13 | 22.06 | 214 | 0.58 |
| ENet ( | 47.92 | 12.87 | 252 | 0.55 |
Summary of results—subject 3 in Haxby dataset
| Method | Mean accuracy (%) | STD (%) | Average number of selected features | Stability |
|---|---|---|---|---|
| SVM-MI | 82.29 | 18.04 | 195 | 0.78 |
| SVM-Corr | 80.21 | 22.27 | 260 | 0.87 |
| SVM-RFE | 85.42 | 13.93 | 157 | 0.66 |
| LASSO ( | 68.75 | 14.60 | 75 | 0.60 |
| LASSO ( | 71.88 | 19.31 | 80 | 0.57 |
| LASSO ( | 64.58 | 18.34 | 79 | 0.58 |
| LASSO ( | 60.42 | 19.09 | 70 | 0.57 |
| ENet ( | 40.63 | 17.78 | 242 | 0.71 |
| ENet ( | 61.46 | 18.04 | 276 | 0.67 |
| ENet ( | 62.50 | 15.08 | 263 | 0.64 |
| ENet ( | 62.50 | 10.66 | 231 | 0.62 |
Summary of results—subject 4 in Haxby dataset
| Method | Mean accuracy (%) | STD (%) | Average number of selected features | Stability |
|---|---|---|---|---|
| SVM-MI | 68.75 | 12.50 | 58 | 0.58 |
| SVM-Corr | 71.88 | 17.78 | 141 | 0.77 |
| SVM-RFE | 71.88 | 14.23 | 188 | 0.56 |
| LASSO ( | 56.25 | 22.30 | 30 | 0.52 |
| LASSO ( | 45.83 | 21.54 | 31 | 0.51 |
| LASSO ( | 42.71 | 22.90 | 28 | 0.36 |
| LASSO ( | 27.08 | 12.87 | 34 | 0.29 |
| ENet ( | 51.04 | 17.24 | 136 | 0.56 |
| ENet ( | 60.42 | 17.54 | 132 | 0.55 |
| ENet ( | 62.50 | 19.94 | 149 | 0.54 |
| ENet ( | 55.21 | 17.24 | 124 | 0.50 |
Summary of results—subject 5 in Haxby dataset
| Method | Mean accuracy (%) | STD (%) | Average number of selected features | Stability |
|---|---|---|---|---|
| SVM-MI | 64.77 | 30.53 | 142 | 0.62 |
| SVM-Corr | 68.18 | 29.24 | 255 | 0.77 |
| SVM-RFE | 65.91 | 29.63 | 237 | 0.72 |
| LASSO ( | 51.14 | 24.01 | 24 | 0.58 |
| LASSO ( | 46.59 | 21.72 | 23 | 0.57 |
| LASSO ( | 39.77 | 22.23 | 21 | 0.43 |
| LASSO ( | 15.91 | 9.83 | 22 | 0.12 |
| ENet ( | 45.45 | 21.12 | 66 | 0.54 |
| ENet ( | 39.77 | 27.28 | 67 | 0.52 |
| ENet ( | 46.59 | 23.78 | 65 | 0.53 |
| ENet ( | 48.86 | 27.07 | 76 | 0.59 |