| Literature DB >> 30153294 |
Maqbool Ali1,2, Syed Imran Ali1, Dohyeong Kim1, Taeho Hur1, Jaehun Bang1, Sungyoung Lee1, Byeong Ho Kang2, Maqbool Hussain3.
Abstract
Feature selection is considered to be one of the most critical methods for choosing appropriate features from a larger set of items. This task requires two basic steps: ranking and filtering. Of these, the former necessitates the ranking of all features, while the latter involves filtering out all irrelevant features based on some threshold value. In this regard, several feature selection methods with well-documented capabilities and limitations have already been proposed. Similarly, feature ranking is also nontrivial, as it requires the designation of an optimal cutoff value so as to properly select important features from a list of candidate features. However, the availability of a comprehensive feature ranking and a filtering approach, which alleviates the existing limitations and provides an efficient mechanism for achieving optimal results, is a major problem. Keeping in view these facts, we present an efficient and comprehensive univariate ensemble-based feature selection (uEFS) methodology to select informative features from an input dataset. For the uEFS methodology, we first propose a unified features scoring (UFS) algorithm to generate a final ranked list of features following a comprehensive evaluation of a feature set. For defining cutoff points to remove irrelevant features, we subsequently present a threshold value selection (TVS) algorithm to select a subset of features that are deemed important for the classifier construction. The uEFS methodology is evaluated using standard benchmark datasets. The extensive experimental results show that our proposed uEFS methodology provides competitive accuracy and achieved (1) on average around a 7% increase in f-measure, and (2) on average around a 5% increase in predictive accuracy as compared with state-of-the-art methods.Entities:
Mesh:
Year: 2018 PMID: 30153294 PMCID: PMC6112679 DOI: 10.1371/journal.pone.0202705
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1uEFS methodology.
Fig 2UFS algorithm [19].
Fig 3TVS algorithm.
Predictive accuracy (in %age) of classifiers using benchmark datasets.
| %age of Features Retained | Naive Bayes | J48 | kNN | JRip | SVM | Naive Bayes | J48 | kNN | JRip | SVM | Naive Bayes | J48 | kNN | JRip | SVM |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 72.22 | 57.78 | 74.44 | 65.19 | 81.67 | 76.3 | 73.83 | 70.18 | 76.04 | 77.34 | 97.3 | 99.49 | 99.88 | 99.3 | 97.17 | |
| 72.41 | 57.78 | 74.81 | 67.41 | 82.04 | 76.56 | 73.96 | 65.76 | 73.57 | 77.47 | 96.99 | 99.35 | 99.83 | 99.23 | 97.08 | |
| 72.41 | 57.78 | 75 | 66.85 | 82.04 | 76.56 | 73.96 | 65.76 | 73.57 | 77.47 | 96.78 | 99.06 | 99.64 | 99.01 | 96.93 | |
| 72.41 | 57.78 | 75.93 | 66.3 | 82.59 | 76.17 | 73.57 | 65.76 | 73.96 | 76.69 | 96.62 | 99.06 | 99.55 | 99.03 | 96.93 | |
| 72.59 | 57.78 | 76.11 | 66.3 | 82.96 | 76.17 | 73.57 | 65.76 | 73.96 | 76.69 | 96.61 | 98.91 | 99.44 | 98.89 | 96.95 | |
| 71.67 | 57.78 | 76.48 | 66.85 | 82.22 | 76.17 | 73.57 | 65.76 | 73.96 | 76.69 | 96.61 | 98.91 | 99.44 | 98.89 | 96.95 | |
| 71.3 | 57.78 | 76.11 | 68.15 | 80.37 | 74.87 | 72.4 | 67.45 | 71.88 | 74.48 | 96.89 | 98.64 | 99.04 | 98.45 | 96.94 | |
| 71.85 | 56.67 | 77.04 | 67.78 | 79.81 | 74.87 | 72.4 | 67.45 | 71.88 | 74.48 | 96.36 | 98.3 | 98.7 | 98 | 95.94 | |
| 72.04 | 56.67 | 77.04 | 70.19 | 80 | 74.87 | 72.53 | 66.93 | 72.4 | 74.48 | 96.38 | 97.88 | 97.99 | 97.89 | 95.94 | |
| 69.81 | 56.67 | 77.04 | 64.26 | 80.19 | 74.87 | 72.53 | 66.93 | 72.4 | 74.48 | 94.75 | 97.59 | 97.16 | 97.37 | 95.94 | |
| 70 | 56.67 | 76.3 | 66.85 | 80.74 | 74.87 | 72.53 | 66.93 | 72.4 | 74.48 | 94.75 | 97.59 | 97.16 | 97.37 | 95.94 | |
| 70 | 56.67 | 77.41 | 65.19 | 79.81 | 75.13 | 72.53 | 67.84 | 72.79 | 75.39 | 95.94 | 96.89 | 96.1 | 96.68 | 95.94 | |
| 70.19 | 56.67 | 78.89 | 65.93 | 80 | 75.13 | 72.53 | 67.84 | 72.79 | 75.39 | 95.94 | 95.93 | 94.96 | 96 | 95.94 | |
| 69.44 | 56.67 | 81.48 | 61.85 | 76.48 | 74.61 | 72.53 | 67.84 | 72.4 | 75.26 | 95.94 | 95.94 | 95.87 | 95.95 | 95.94 | |
| 69.63 | 56.67 | 80.93 | 56.3 | 76.48 | 74.61 | 72.53 | 67.84 | 72.4 | 75.26 | 95.94 | 95.94 | 95.92 | 95.94 | 95.94 | |
| 70.19 | 56.67 | 80 | 57.41 | 78.7 | 74.61 | 72.53 | 67.84 | 72.4 | 75.26 | 95.94 | 95.94 | 95.92 | 95.94 | 95.94 | |
| 70.19 | 56.67 | 80 | 61.11 | 78.7 | 67.19 | 67.84 | 67.32 | 67.19 | 65.1 | 95.94 | 95.94 | 95.99 | 95.94 | 95.94 | |
| 70 | 56.67 | 80.56 | 60 | 77.96 | 67.19 | 67.84 | 67.32 | 67.19 | 65.1 | 95.94 | 95.94 | 95.94 | 95.94 | 95.94 | |
| 74.63 | 57.78 | 74.26 | 60.37 | 77.96 | 65.1 | 65.1 | 65.1 | 65.1 | 65.1 | 95.94 | 95.94 | 95.94 | 95.94 | 95.94 | |
| 61.48 | 57.78 | 54.81 | 57.78 | 76.85 | 65.1 | 65.1 | 65.1 | 65.1 | 65.1 | 95.94 | 95.94 | 95.94 | 95.94 | 95.94 | |
| 67.79 | 71.15 | 86.54 | 73.08 | 75.96 | 80 | 75.08 | 73.62 | 79.2 | 86.68 | 44.8 | 72.46 | 69.86 | 68.56 | 74.35 | |
| 68.27 | 70.19 | 85.1 | 73.56 | 78.37 | 80.04 | 75.28 | 73.4 | 79.88 | 86.58 | 44.68 | 73.17 | 69.27 | 64.66 | 72.34 | |
| 68.75 | 70.67 | 85.1 | 75 | 77.88 | 79.98 | 75.5 | 74.08 | 79.54 | 86.78 | 44.33 | 73.17 | 69.39 | 67.26 | 71.28 | |
| 68.27 | 74.04 | 86.06 | 74.04 | 77.88 | 80 | 75.86 | 74.64 | 79.7 | 86.76 | 45.27 | 73.17 | 70.57 | 65.84 | 71.51 | |
| 71.15 | 76.44 | 85.58 | 72.12 | 79.81 | 79.98 | 76.16 | 74.72 | 80.38 | 86.76 | 44.44 | 71.75 | 72.46 | 69.15 | 71.75 | |
| 71.63 | 76.44 | 84.62 | 73.56 | 79.33 | 79.96 | 76.22 | 75.32 | 79.7 | 86.7 | 43.85 | 71.63 | 73.29 | 67.73 | 71.28 | |
| 71.15 | 74.04 | 83.65 | 71.15 | 75 | 79.96 | 75.98 | 75.22 | 79.1 | 86.74 | 45.04 | 71.28 | 72.34 | 68.68 | 70.57 | |
| 71.15 | 74.04 | 82.69 | 74.04 | 77.4 | 80 | 76.02 | 76.28 | 79.26 | 86.92 | 44.56 | 69.86 | 71.63 | 66.9 | 70.21 | |
| 68.75 | 71.15 | 82.69 | 77.88 | 75.48 | 80.08 | 76.36 | 77.38 | 79.48 | 86.9 | 44.8 | 70.21 | 72.81 | 67.02 | 69.5 | |
| 65.38 | 72.12 | 79.81 | 76.44 | 73.08 | 80.1 | 76.3 | 77.5 | 79.62 | 86.8 | 46.45 | 70.69 | 71.75 | 65.13 | 68.32 | |
| 65.38 | 71.63 | 84.13 | 74.52 | 74.04 | 80.06 | 76.36 | 78.08 | 80.02 | 86.86 | 46.45 | 70.69 | 71.75 | 65.13 | 68.32 | |
| 67.31 | 72.12 | 81.25 | 75 | 73.56 | 80.36 | 76.96 | 78.7 | 80.06 | 86.8 | 48.23 | 71.99 | 71.04 | 67.73 | 67.73 | |
| 67.79 | 75.96 | 79.33 | 72.6 | 72.6 | 80.2 | 77.06 | 77.82 | 79.16 | 86 | 48.58 | 71.75 | 70.57 | 67.85 | 66.67 | |
| 64.9 | 76.92 | 78.37 | 71.63 | 75 | 80.16 | 74.78 | 75.56 | 78 | 84.12 | 50.24 | 70.21 | 67.85 | 67.38 | 54.96 | |
| 64.42 | 71.15 | 80.29 | 73.08 | 72.12 | 80.12 | 74.74 | 73.22 | 77.2 | 83.24 | 46.81 | 61.7 | 63.83 | 60.64 | 50.47 | |
| 62.98 | 70.67 | 73.56 | 69.23 | 73.56 | 75.24 | 72.92 | 69.62 | 74.42 | 79.86 | 44.92 | 61.58 | 61.58 | 57.68 | 47.52 | |
| 63.46 | 71.63 | 69.23 | 71.15 | 74.52 | 66.3 | 64.62 | 58.28 | 66.82 | 70.52 | 43.85 | 57.33 | 53.31 | 54.49 | 46.57 | |
| 58.65 | 69.23 | 64.9 | 66.83 | 69.23 | 59.14 | 57.58 | 51.32 | 57.42 | 61.22 | 41.49 | 50.12 | 49.29 | 42.08 | 42.55 | |
| 56.73 | 62.02 | 57.69 | 57.69 | 58.17 | 51.78 | 50.42 | 42.28 | 48.54 | 51.78 | 40.07 | 43.62 | 40.9 | 32.62 | 30.85 | |
| 55.29 | 50.48 | 53.85 | 54.33 | 56.73 | 39.02 | 38.56 | 34.44 | 36.06 | 38.38 | 25.65 | 25.65 | 25.65 | 25.65 | 25.65 | |
| 48.6 | 66.82 | 70.56 | 68.69 | 56.07 | 62.39 | 64.38 | 52.88 | 70.8 | 70.13 | ||||||
| 50.47 | 67.29 | 77.1 | 66.36 | 51.87 | 63.05 | 65.27 | 52.65 | 69.69 | 70.35 | ||||||
| 50.47 | 67.29 | 77.1 | 66.36 | 51.87 | 61.95 | 63.5 | 51.77 | 68.58 | 69.91 | ||||||
| 47.66 | 70.09 | 77.1 | 62.15 | 51.87 | 60.84 | 61.95 | 51.33 | 70.13 | 70.35 | ||||||
| 47.66 | 70.09 | 77.1 | 62.15 | 51.87 | 60.4 | 64.38 | 51.77 | 69.91 | 71.02 | ||||||
| 46.26 | 72.9 | 73.36 | 60.28 | 51.87 | 59.51 | 64.82 | 51.11 | 68.81 | 70.8 | ||||||
| 46.26 | 72.9 | 73.36 | 60.28 | 51.87 | 61.28 | 63.27 | 50.22 | 69.47 | 72.12 | ||||||
| 47.66 | 71.5 | 72.9 | 62.62 | 51.4 | 61.95 | 61.95 | 49.34 | 68.81 | 71.46 | ||||||
| 47.66 | 71.5 | 72.9 | 62.62 | 51.4 | 59.96 | 61.95 | 50.22 | 67.26 | 70.13 | ||||||
| 50.93 | 74.3 | 74.77 | 64.49 | 51.4 | 59.73 | 63.27 | 50.22 | 70.58 | 68.14 | ||||||
| 50.93 | 74.3 | 74.77 | 64.49 | 51.4 | 59.73 | 63.27 | 49.56 | 65.49 | 69.47 | ||||||
| 50.93 | 74.3 | 74.77 | 64.49 | 51.4 | 60.62 | 63.72 | 49.78 | 69.47 | 68.58 | ||||||
| 46.73 | 66.36 | 72.9 | 67.76 | 46.73 | 61.5 | 62.61 | 48.23 | 68.36 | 69.25 | ||||||
| 46.73 | 66.36 | 72.9 | 67.76 | 46.73 | 62.17 | 64.38 | 47.79 | 68.14 | 68.36 | ||||||
| 43.46 | 63.55 | 57.01 | 60.28 | 35.51 | 59.07 | 61.5 | 45.35 | 65.93 | 63.94 | ||||||
| 43.46 | 63.55 | 57.01 | 60.28 | 35.51 | 59.29 | 61.95 | 44.03 | 65.93 | 63.27 | ||||||
| 35.98 | 54.67 | 47.2 | 52.8 | 35.51 | 61.5 | 61.95 | 46.24 | 66.15 | 63.27 | ||||||
| 35.98 | 54.67 | 47.2 | 52.8 | 35.51 | 63.05 | 61.5 | 52.65 | 65.04 | 61.73 | ||||||
| 35.51 | 35.51 | 35.51 | 35.51 | 35.51 | 63.05 | 54.2 | 52.21 | 65.04 | 61.5 | ||||||
| 35.51 | 35.51 | 35.51 | 35.51 | 35.51 | 60.18 | 49.34 | 47.12 | 61.5 | 61.5 | ||||||
Fig 4An average predictive accuracy graph using the 10-fold cross-validation technique for threshold value identification.
Fig 5An average predictive accuracy graph using training datasets for threshold value identification.
Selected nontext datasets’ characteristics.
| Nontext Dataset | No. of Instances | No. of Attributes | No. of Distinct Classes |
|---|---|---|---|
| Cylinder-bands | 540 | 40 | 2 |
| Diabetes | 768 | 9 | 2 |
| Letter | 20,000 | 17 | 2 |
| Sonar | 208 | 61 | 2 |
| Waveform | 5,000 | 41 | 3 |
| Vehicle | 846 | 19 | 4 |
| Glass | 214 | 10 | 6 |
| Arrhythmia | 452 | 280 | 13 |
Selected text datasets’ characteristics.
| Text Dataset | No. of Features | No. of Documents | No. of Distinct Classes |
|---|---|---|---|
| MiniNewsGroups | 27,419 | 1,600 | 4 |
| Course-Cotrain | 13,919 | 1,051 | 2 |
| Trec05p-1 | 12,578 | 62,499 | 2 |
| SpamAssassin | 9,351 | 3,000 | 2 |
Fig 6Predictive accuracies of classifiers against benchmark datasets with varying percentages of retained features.
Selected classifier parameters.
| Classifier | Function | Kernel Type | Epsilon | Tolerance | Exponent | Random Seed |
|---|---|---|---|---|---|---|
| SVM | SMO | Polynomial | 1.0E-12 | 0.001 | 1 | 1 |
Fig 7Comparisons of F-measure with existing FS measures.
Comparisons of average classifier precision with existing FS measures.
| Nontext Dataset | Feature Selection Measures | Proposed Methodology | ||||
|---|---|---|---|---|---|---|
| IG | GR | CS | SU | S | uEFS | |
| Cylinder-bands | 0.805 | 0.801 | 0.797 | 0.803 | 0.801 | |
| Diabetes | 0.753 | 0.753 | 0.753 | 0.753 | 0.738 | |
| Letter | 0.920 | 0.962 | 0.920 | 0.962 | 0.920 | |
| Sonar | 0.789 | 0.791 | 0.789 | 0.791 | 0.789 | |
| Waveform | 0.869 | 0.869 | 0.868 | 0.869 | 0.868 | |
| Vehicle | 0.586 | 0.604 | 0.605 | 0.534 | ||
| Glass | 0.477 | 0.484 | 0.451 | 0.550 | ||
| Arrhythmia | 0.640 | 0.647 | 0.639 | 0.640 | 0.639 | |
a IG: information gain,
b GR: gain ratio,
c CS: chi-squared,
d SU: symmetrical uncertainty,
e S: significance
Comparisons of average classifier recall with existing FS measures.
| Nontext Dataset | Feature Selection Measures | Proposed Methodology | ||||
|---|---|---|---|---|---|---|
| IG | GR | CS | SU | S | uEFS | |
| Cylinder-bands | 0.806 | 0.802 | 0.798 | 0.804 | 0.802 | |
| Diabetes | 0.759 | 0.759 | 0.759 | 0.759 | 0.758 | |
| Letter | 0.959 | 0.961 | 0.959 | 0.961 | 0.959 | |
| Sonar | 0.788 | 0.789 | 0.788 | 0.789 | 0.788 | |
| Waveform | 0.868 | 0.868 | ||||
| Vehicle | 0.617 | 0.632 | 0.655 | 0.631 | 0.540 | |
| Glass | 0.579 | 0.584 | 0.481 | 0.584 | ||
| Arrhythmia | 0.719 | 0.723 | 0.717 | 0.719 | 0.719 | |
a IG: information gain,
b GR: gain ratio,
c CS: chi-squared,
d SU: symmetrical uncertainty,
e S: significance
Comparisons of predictive accuracy (in %age) of the uEFS with existing FS measures.
| Nontext Dataset | Feature Selection Measures | Proposed Methodology | One-Sample T-Test | Paired-Samples T-Test | ||||
|---|---|---|---|---|---|---|---|---|
| IG | GR | CS | SU | S | uEFS | p {Sig. (two-tailed)} | p {Sig. (two-tailed)} | |
| Cylinder-bands | 80.56 | 80.19 | 79.81 | 80.37 | 80.19 | |||
| Diabetes | 75.91 | 75.91 | 75.91 | 75.91 | 75.89 | |||
| Letter | 95.94 | 96.08 | 95.94 | 96.08 | 95.94 | |||
| Sonar | 78.85 | 78.86 | 78.85 | 78.86 | 78.85 | |||
| Waveform | 86.88 | 86.88 | 86.86 | 86.88 | 86.86 | |||
| Vehicle | 61.7 | 63.24 | 65.48 | 63.12 | 54.02 | 0.093 | ||
| Glass | 57.94 | 58.41 | 48.13 | 58.41 | 0.400 | |||
| Arrhythmia | 71.9 | 72.35 | 71.68 | 71.9 | 71.9 | |||
a IG: information gain,
b GR: gain ratio,
c CS: chi-squared,
d SU: symmetrical uncertainty,
e S: significance
Paired-samples t-test results.
| State-of-the-art Filter-based Measures’ Mean | Proposed uEFS Methodology | |
|---|---|---|
| Mean | 75.970 | |
| Variance | 164.664 | |
| Pearson Correlation | ||
| Hypothesized Mean Difference | 0 | |
| df | 7 | |
| t Stat | -2.739 | |
| P(T¡ = t) one-tailed | ||
| P(T¡ = t) two-tailed |
Comparisons of time measure (in seconds) with existing FS measures.
| Nontext Dataset | Feature Selection Measures | Proposed Methodology | ATSM | TD | ATD | ||||
|---|---|---|---|---|---|---|---|---|---|
| IG | GR | CS | SU | S | uEFS | (sec) | (sec) | (sec) | |
| Cylinder-bands | 3.28 | 3.82 | 3.79 | 3.59 | 3.72 | 0.81 | |||
| Diabetes | 0.11 | 0.12 | 0.12 | 0.12 | 0.12 | 0.05 | |||
| Letter | 4.60 | 4.12 | 4.28 | 4.60 | 4.45 | 0.32 | |||
| Sonar | 0.06 | 0.05 | 0.06 | 0.06 | 0.06 | 0.08 | |||
| Waveform | 1.11 | 1.09 | 1.11 | 0.98 | |||||
| Vehicle | 0.28 | 0.30 | 0.28 | 0.30 | 0.3 | 0.09 | |||
| Glass | 0.33 | 0.34 | 0.33 | 0.34 | 0 | ||||
| Arrhythmia | 2.67 | 2.68 | 2.54 | 2.64 | 2.65 | 0.66 | |||
a IG: information gain,
b GR: gain ratio,
c CS: chi-squared,
d SU: symmetrical uncertainty,
e S: significance,
f ATSM: average time of state-of-the-art measures,
g TD: time difference,
h ATD: average time difference
Comparisons of predictive accuracy (in %age) with existing FS methods.
| Nontext Dataset | Feature Selection Methods | Proposed Methodology | |
|---|---|---|---|
| OneR | ReliefF | uEFS | |
| Cylinder-bands | 79.63 | 80.37 | |
| Diabetes | 75.39 | 75.52 | |
| Letter | 96.91 | 96.97 | |
| Sonar | 77.88 | 75.96 | |
| Waveform | 86.76 | ||
| Vehicle | 64.89 | 63.83 | |
| Glass | 49.07 | 57.01 | |
| Arrhythmia | 71.02 | 71.46 | |
Comparisons of state-of-the-art ensemble methodologies with the proposed uEFS methodology.
| State-of-the-art ensemble methodology—I | State-of-the-art ensemble methodology—II | ||
|---|---|---|---|
| Borda method [ | uEFS methodology | EMFFS method [ | uEFS methodology |
| 1. Consider three filter measures (IG, symmetric uncertainty, chi-squared) | 1. Consider three filter measures (IG, symmetric uncertainty, chi-squared) | 1. Consider four filter measures (IG, gain ratio, chi-squared, ReliefF) | 1. Consider four filter measures (IG, gain ratio, chi-squared, ReliefF) |
| 2. Compute the ranks using each filter measure | 2. Compute the ranks using each filter measure | 2. Compute the ranks using each filter measure | 2. Compute the ranks using each filter measure |
| 3. Sort the computed ranks in an ascending order | 3. Compute the scaled ranks of each computed ranks | 3. Sort the computed ranks in an ascending order | 3. Compute the scaled ranks of each of the computed ranks |
| 4. Assign a score to each feature in a list based on its position | 4. Compute the combined sum of all computed ranks | 4. Select the top one-third split of each filter measure’s output | 4. Compute the combined sum of all computed ranks |
| 5. Compute the sum of all the positional scores from all the lists | 5. For each feature, compute the combined rank by adding all computed scaled ranks | 5. Define the feature count threshold | 5. For each feature, compute the combined rank by adding all computed scaled ranks |
| 6. Sort the computed sum in an ascending order to generate the final ranked feature set | 6. Sort the list in an ascending order after computing the score, weight, and priority of each feature | 6. Compute the feature occurrence rate among the filter measures | 6. Sort the list in an ascending order after computing the score, weight, and priority of each feature |
| 7. If the feature count is less than the threshold, drop the feature; otherwise, select the feature | 7. Determine the threshold value using the proposed TVS method | ||
| 8. Apply the threshold value to drop the irrelevant features and to select the final ranked feature set | |||
Comparisons of predictive accuracy and F-measure with the Borda method [15].
| Nontext Dataset | Predictive Accuracy (%) | F-measure | ||
|---|---|---|---|---|
| Borda method [ | uEFS (three filter measures) | Borda method [ | uEFS (three filter measures) | |
| Cylinder-bands | 57.78 | 0.423 | ||
| Diabetes | 65.10 | 0.513 | ||
| Letter | ||||
| Sonar | 66.83 | 0.667 | ||
| Waveform | 31.80 | 0.311 | ||
| Vehicle | 59.22 | 0.58 | ||
| Glass | 40.19 | 0.316 | ||
| Arrhythmia | 64.60 | 0.564 | ||
Comparisons of predictive accuracy and F-measure with the EMFFS method [18].
| Nontext Dataset | Predictive Accuracy (%) | F-measure | ||
|---|---|---|---|---|
| EMFFS method [ | uEFS (four filter measures) | EMFFS method [ | uEFS (four filter measures) | |
| Cylinder-bands | 80.74 | 0.805 | ||
| Diabetes | 75.52 | 0.739 | ||
| Letter | ||||
| Sonar | 78.37 | 0.784 | ||
| Waveform | 86.48 | 0.864 | ||
| Vehicle | 41.73 | 0.392 | ||
| Glass | 54.67 | 0.491 | ||
| Arrhythmia | 71.68 | 0.658 | ||
Fig 8Comparisons of F-measure with existing FS measures [29, 37, 39, 48].
Fig 9Comparisons of predictive accuracy with existing FS measures [29, 37, 39, 48].
Comparisons of average classifier precision with existing FS methods [29, 37, 39, 48].
| Text Dataset | Feature Selection Algorithms | Proposed Methodology | |||
|---|---|---|---|---|---|
| IG | Relief | DRB-FS | GR- | uEFS | |
| Course-Cotrain | 0.668 | 0.609 | 0.648 | 0.669 | |
| Trec05p-1 | 0.836 | 0.375 | 0.423 | 0.721 | |
| MiniNewsGroups | 0.730 | 0.708 | 0.272 | 0.764 | |
| SpamAssassin | 0.708 | 0.710 | 0.857 | 0.701 | |
Comparisons of average classifier recall with existing FS methods [29, 37, 39, 48].
| Text Dataset | Feature Selection Algorithms | Proposed Methodology | |||
|---|---|---|---|---|---|
| IG | Relief | DRB-FS | GR- | uEFS | |
| Course-Cotrain | 0.717 | 0.711 | 0.776 | 0.768 | |
| Trec05p-1 | 0.731 | 0.410 | 0.764 | 0.451 | |
| MiniNewsGroups | 0.669 | 0.636 | 0.327 | 0.686 | |
| SpamAssassin | 0.766 | 0.778 | 0.863 | 0.727 | |