| Literature DB >> 36160366 |
Jiefang Jiang1,2, Xianyong Zhang1,2,3, Jilin Yang2,4.
Abstract
Feature selection facilitates intelligent information processing, and the unsupervised learning of feature selection has become important. In terms of unsupervised feature selection, the Laplacian score (LS) provides a powerful measurement and optimization method, and good performance has been achieved using the recent forward iterative Laplacian score (FILS) algorithm. However, there is still room for advancement. The aim of this paper is to improve the FILS algorithm, and thus, feature significance (SIG) is mainly introduced to develop a high-quality selection method, i.e., the incremental forward iterative Laplacian score (IFILS) algorithm. Based on the modified LS, the metric difference in the incremental feature process motivates SIG. Therefore, SIG offers a dynamic characterization by considering initial and terminal states, and it promotes the current FILS measurement on only the terminal state. Then, both the modified LS and integrated SIG acquire granulation nonmonotonicity and uncertainty, especially on incremental feature chains, and the corresponding verification is achieved by completing examples and experiments. Furthermore, a SIG-based incremental criterion of minimum selection is designed to choose optimization features, and thus, the IFILS algorithm is naturally formulated to implement unsupervised feature selection. Finally, an in-depth comparison of the IFILS algorithm with the FILS algorithm is achieved using data experiments on multiple datasets, including a nominal dataset of COVID-19 surveillance. As validated by the experimental results, the IFILS algorithm outperforms the FILS algorithm and achieves better classification performance.Entities:
Keywords: Feature selection; Feature significance; Forward iterative Laplacian score; Granulation nonmonotonicity and uncertainty; Incremental forward iterative Laplacian score; Unsupervised learning
Year: 2022 PMID: 36160366 PMCID: PMC9484723 DOI: 10.1007/s10462-022-10274-6
Source DB: PubMed Journal: Artif Intell Rev ISSN: 0269-2821 Impact factor: 9.588
Measurement values of the modified LS and SIG on incremental chain of Table 2 (the Immunotherapy dataset)
| Measure | Feature-incremental chain | ||||||
|---|---|---|---|---|---|---|---|
| 0.001 | 0.0054 | 0.1013 | 0.0488 | 0.1494 | 0.1745 | 0.2213 | |
| – | |||||||
| − 0.0199 | − 0.0503 | − 0.0705 | − 0.0700 | − 0.0488 | − 0.0468 | 0 | |
Fig. 2Three-way non-monotonicity/uncertainty changes of the modified LS and SIG on incremental chain of Table 2 (the Immunotherapy dataset)
-based three-way value table of the modified LS and SIG on incremental chains of nine UCI datasets
| No. | Dataset | Local or complete feature-incremental chains | ||||||
|---|---|---|---|---|---|---|---|---|
| (1) | Blood | [0.0291, − 0.0029, + 0.017] | [0.032, + 0.005, + 0.0107] | [0.027, + 0.0042,+ 0.0042] | [0.0228, –, 0] | – | – | – |
| (2) | Ccbr | [0.2519, + 0.0543, + 0.1636] | [0.1976, − 0.0178,+ 0.0611] | [0.2153, + 0.0008,+ 0.0141] | [0.2145, − 0.0395, − 0.0191] | [0.254, − 0.0362, − 0.0143] | [0.2902, − 0.034, − 0.0046] | [0.3242, + 0.0258, − 0.0079] |
| (3) | Ecoli | [0.0016, − 0.0162, − 0.0177] | [0.0178, − 0.0003, − 0.0254] | [0.018, − 0.0234, − 0.0296] | [0.0414, − 0.0252, − 0.0249] | [0.0666, − 0.0143, − 0.0241] | [0.0809, − 0.0114, − 0.0114] | [0.0923, –, 0] |
| (4) | Glass | [0.0285, − 0.0601, − 0.034] | [0.0885, + 0.0406, − 0.0152] | [0.0479, − 0.0331, − 0.0322] | [0.081, − 0.027, − 0.0211] | [0.108, − 0.0249, − 0.0194] | [0.133, − 0.0041, − 0.0143] | [0.1371, − 0.0309, − 0.014] |
| (5) | Hayes-roth | [0.0018, − 0.0048, − 0.0256] | [0.0066, − 0.0628, − 0.0676] | [0.0694, − 0.0785, − 0.082] | [0.1479, − 0.0654, − 0.0654] | [0.2133, –, 0] | – | – |
| (6) | Iris | [0.0031, − 0.0401, − 0.0135] | [0.0431, + 0.0074, + 0.0015] | [0.0358, + 0.0012, + 0.0012] | [0.0345, –, 0] | – | – | – |
| (7) | Lung cancer | [0.656, + 0.3857, + 0.549] | [0.2703, − 0.0571, + 0.1239] | [0.3274, − 0.0408, + 0.0638] | [0.3682, + 0.0462, + 0.0562] | [0.3221, − 0.0609, − 0.0496] | [0.3829, − 0.0613, − 0.0551] | [0.4443, − 0.075, − 0.0498] |
| (8) | Wine | [0.0057, − 0.0276, − 0.019] | [0.0333, − 0.0468, − 0.0288 ] | [0.0801, − 0.042, − 0.0239] | [0.1221, − 0.0478, − 0.0164] | [0.1699, − 0.0211, − 0.0042] | [0.1911, − 0.0004, − 0.0046] | [0.1915, − 0.0197, − 0.0013] |
| (9) | COVID-19 | [0.6099, − 0.1356, − 0.175] | [0.7455, − 0.0765, − 0.1013] | [0.822, − 0.0859, − 0.0859] | [0.9079, + 0.0254, + 0.0798] | [0.8825, − 0.0503, + 0.0225] | [0.9328, + 0.032, + 0.032] | [0.9008, –, 0] |
Fig. 3-based three-way value figures of the modified LS and SIG on incremental chains of nine UCI datasets
Fig. 1Procedural flowchart of the IFILS algorithm
Basic descriptions of experimental ten UCI datasets
| No. | Dataset name | Sample number | Feature number | Category number | Data type |
|---|---|---|---|---|---|
| (0) | Immunotherapy | 90 | 7 | 2 | Numeric |
| (1) | Blood | 748 | 4 | 2 | Numeric |
| (2) | Ccbr (Cervical cancer behavior risk) | 72 | 19 | 2 | Numeric |
| (3) | Ecoli | 336 | 7 | 8 | Numeric |
| (4) | Glass | 214 | 9 | 6 | Numeric |
| (5) | Hayes-roth | 132 | 5 | 3 | Numeric |
| (6) | Iris | 150 | 4 | 3 | Numeric |
| (7) | Lung cancer | 32 | 56 | 3 | Numeric |
| (8) | Wine | 178 | 13 | 3 | Numeric |
| (9) | COVID-19 (COVID-19 surveillance) | 14 | 7 | 3 | Nominal |
| (10) | Brain (Multi-view Brain Networks Data Set) | 70 | 70 | 10 | Numeric |
| (11) | Parkinsons | 195 | 22 | 2 | Numeric |
Decision subsystem on the Immunotherapy dataset’s 1st sample subset
| 1.00 | 0.10 | 0.45 | 0.06 | 0.00 | 0.24 | 0.09 | 1 | |
| 1.00 | 0.41 | 1.00 | 0.28 | 1.00 | 0.03 | 0.04 | 0 | |
| 1.00 | 0.00 | 0.41 | 0.61 | 0.00 | 0.05 | 0.07 | 1 | |
| 1.00 | 0.00 | 0.50 | 1.00 | 0.00 | 0.06 | 0.07 | 1 | |
| 0.00 | 0.56 | 1.00 | 0.72 | 0.00 | 0.09 | 0.06 | 0 | |
| 1.00 | 0.93 | 0.82 | 0.00 | 0.50 | 0.03 | 0.34 | 1 | |
| 0.00 | 0.07 | 0.98 | 0.22 | 0.50 | 0.01 | 0.04 | 1 | |
| 1.00 | 0.76 | 0.61 | 0.39 | 0.00 | 0.04 | 0.01 | 1 | |
| 1.00 | 0.20 | 0.52 | 0.28 | 0.00 | 0.01 | 0.00 | 1 |
The FILS algorithm’s procedure on the Immunotherapy dataset’s 1st random sample subset ()
| No. | Selective feature | Meet loop condition | Accuracy | |||
|---|---|---|---|---|---|---|
| (1) | [0.9955, 0.9754, 0.9759, 0.9141, 0.9955, 0.4510, 0.8910] | Yes | 77.78% | |||
| (2) | [0.8769, 0.5850, 0.5617, 0.4852, 0.7436, 0.4437] | Yes | ||||
| (3) | [0.7268, 0.3770, 0.3992, 0.2624, 0.6088] | Yes | ||||
| (4) | [0.5531, 0.1142, 0.1026, 0.3755] | Yes | ||||
| (5) | [0.3286, 0.1330, 0.2067] | No |
The IFILS algorithm’s procedure on the Immunotherapy dataset’s 1st random sample subset ()
| No. | Meet loop condition | Selective feature | Accuracy | |||
|---|---|---|---|---|---|---|
| (1) | Yes | [44.1750, 44.1549, 44.1554, 44.0937, 44.1750, 43.6305, 44.0705] | 100% | |||
| (2) | Yes | [0.5741, 0.8660, 0.8892, 0.9657, 0.7073, 1.0072] | ||||
| (3) | Yes | [1.0814, 1.0928, 1.0789, 1.0355, 1.1500] | ||||
| (4) | Yes | [1.1643, 1.1698, 1.1687, 1.2348] | ||||
| (5) | No | – | – |
Fig. 4The FILS and IFILS algorithms’ accuracies on the Immunotherapy dataset ()
The IFILS algorithm’s 10-fold-average accuracies and standard deviations on the Immunotherapy dataset (with K, r)
| 1 | 2 | 3 | 4 | 5 | 6 | |
|---|---|---|---|---|---|---|
| 1 | 62.22 ± 13.04 | 52.22 ± 13.91 | 72.22 ± 12.00 | 62.22 ± 15.89 | 76.67 ± 14.30 | 73.33 ± 11.94 |
| 2 | 63.33 ± 12.88 | 51.11 ± 11.94 | 67.78 ± 9.73 | 55.56 ± 9.07 | 73.33 ± 11.94 | |
| 3 | 63.33 ± 17.41 | 55.56 ± 17.37 | 66.67 ± 15.71 | 65.56 ± 15.23 | 72.22 ± 12.00 | 65.56 ± 12.23 |
| 4 | 57.78 ± 21.47 | 64.44 ± 14.63 | 71.11 ± 15.89 | 68.89 ± 8.76 | 71.11 ± 10.73 | |
| 5 | 75.56 ± 11.97 | 68.89 ± 19.12 | 74.45 ± 12.22 | 72.23 ± 10.24 | 77.78 ± 13.15 | 75.56 ± 12.96 |
| 6 | 77.78 ± 13.15 | |||||
| 7 | 74.45 ± 12.22 | 72.23 ± 10.24 | 75.56 ± 6.67 | 74.45 ± 5.09 | 74.45 ± 8.68 | |
Fig. 5The FILS algorithm’ accuracies on the Immunotherapy dataset (K, r)
Fig. 6The FILS and IFILS algorithms’ r-optimal () accuracies on eight UCI datasets (with 10-fold times and three classifiers)
The FILS and IFILS algorithms’ 10-fold-average accuracies and r-optimization statistics on eight UCI datasets based on KNN
| No. | Dataset | Algorithm | Optimization realization on feature set ( | Optimization on feature proper subset ( | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| (1) | Blood | 1 | 2 | 3 | 4 | – | – | – | |||
| FILS | 70.05 ± 3.94 | 73.53 ± 4.75 | 75.94 ± 5.72 | 76.87 ± 5.71 | – | – | – | 75.94 ± 5.72 ( | |||
| IFILS | 70.05 ± 3.94 | 75.00 ± 6.73 | 76.75 ± 5.96 | 76.87 ± 5.71 | – | – | – | ||||
| (2) | Ccbr | 1 | 4 | 7 | 10 | 13 | 16 | 19 | |||
| FILS | 29.29 ± 24.57 | 76.43 ± 16.10 | 77.68 ± 21.02 | 79.29 ± 20.00 | 81.96 ± 16.42 | 83.39 ± 17.38 | 87.68 ± 13.79 | 83.39 ± 17.38 (16) | |||
| IFILS | 29.29 ± 24.57 | 65.36 ± 16.56 | 82.14 ± 14.68 | 91.96 ± 11.33 | 89.11 ± 14.30 | 87.68 ± 13.79 | 87.68 ± 13.79 | – | |||
| (3) | Ecoli | 1 | 2 | 3 | 4 | 5 | 6 | 7 | |||
| FILS | 43.15 ± 3.62 | 49.09 ± 6.66 | 50.87 ± 8.85 | 62.75 ± 10.21 | 80.92 ± 10.24 | 84.78 ± 6.95 | 86.28 ± 5.17 | 84.78 ± 6.95 ( | |||
| IFILS | 49.39 ± 7.54 | 50.56 ± 7.06 | 63.10 ± 6.23 | 77.40 ± 5.17 | 83.32 ± 4.72 | 84.80 ± 5.20 | 86.28 ± 5.17 | ||||
| (4) | Glass | 1 | 3 | 5 | 7 | 9 | – | – | |||
| FILS | 35.54 ± 9.80 | 57.92 ± 10.04 | 62.14 ± 10.83 | 65.35 ± 10.17 | 64.48 ± 8.71 | – | – | – | 65.35 ± 10.17 ( | ||
| IFILS | 34.11 ± 7.94 | 52.81 ± 11.10 | 66.77 ± 8.50 | 67.23 ± 9.34 | 64.48 ± 8.71 | – | – | – | |||
| (5) | Hayes-roth | 1 | 2 | 3 | 4 | 5 | – | – | |||
| FILS | 42.53 ± 9.99 | 34.18 ± 11.16 | 44.67 ± 10.90 | 45.44 ± 10.51 | 36.43 ± 10.28 | – | – | – | 45.44 ± 10.51 ( | ||
| IFILS | 42.53 ± 9.99 | 54.34 ± 14.39 | 56.87 ± 10.36 | 71.15 ± 13.42 | 36.43 ± 10.28 | – | – | – | |||
| (6) | Iris | 1 | 2 | 3 | 4 | – | – | – | |||
| FILS | 52.67 ± 12.75 | 88.67 ± 8.92 | 92.00 ± 8.20 | 96.00 ± 5.62 | – | – | – | 92.00 ± 8.20 ( | |||
| IFILS | 52.67 ± 12.75 | 96.00 ± 5.62 | 96.67 ± 4.71 | 96.00 ± 5.62 | – | – | – | – | |||
| (7) | Lung cancer | 1 | 10 | 19 | 28 | 37 | 46 | 56 | |||
| FILS | 27.50 ± 27.79 | 23.33 ± 31.62 | 27.50 ± 35.15 | 34.17 ± 28.45 | 53.33 ± 24.91 | 47.50 ± 28.34 | 42.50 ± 27.34 | – | 53.33 ± 24.91 (37) | ||
| IFILS | 27.50 ± 27.79 | 55.83 ± 22.92 | 50.00 ± 22.22 | 54.17 ± 32.22 | 53.33 ± 24.91 | 41.67 ± 23.57 | 42.50 ± 27.34 | – | |||
| (8) | Wine | 1 | 3 | 5 | 7 | 9 | 11 | 13 | |||
| FILS | 49.48 ± 12.10 | 68.01 ± 11.02 | 84.35 ± 9.02 | 95.49 ± 2.38 | 94.35 ± 3.82 | 94.90 ± 4.22 | 96.08 ± 3.77 | ||||
| IFILS | 49.48 ± 12.10 | 83.76 ± 12.66 | 91.01 ± 9.52 | 92.68 ± 7.46 | 94.90 ± 4.22 | 95.46 ± 3.67 | 96.08 ± 3.77 | 95.46 ± 3.67 (11) | |||
The FILS and IFILS algorithms’ 10-fold-average accuracies and r-optimization statistics on eight UCI datasets based on CART
| No. | Dataset | Algorithm | Optimization realization on feature set ( | Optimization on feature proper subset ( | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| (1) | Blood | 1 | 2 | 3 | 4 | – | – | – | |||
| FILS | 75.95 ± 5.76 | 73.53 ± 4.22 | 74.87 ± 5.34 | 74.87 ± 5.34 | – | – | – | – | 75.95 ± 5.76 ( | ||
| IFILS | 75.95 ± 5.76 | 75.68 ± 5.56 | 75.68 ± 4.66 | 74.87 ± 5.34 | – | – | – | – | 75.95 ± 5.76 ( | ||
| (2) | Ccbr | 1 | 4 | 7 | 10 | 13 | 16 | 19 | |||
| FILS | 74.82 ± 17.66 | 70.71 ± 10.25 | 68.04 ± 13.37 | 77.68 ± 12.06 | 84.82 ± 10.14 | 83.39 ± 17.37 | 80.54 ± 16.67 | – | 84.82 ± 10.14 (13) | ||
| IFILS | 74.82 ± 17.66 | 74.64 ± 17.83 | 79.11 ± 12.08 | 83.39 ± 8.67 | 86.25 ± 13.10 | 86.25 ± 11.23 | 80.54 ± 16.67 | – | |||
| (3) | Ecoli | 1 | 2 | 3 | 4 | 5 | 6 | 7 | |||
| FILS | 43.44 ± 3.56 | 45.81 ± 4.44 | 47.93 ± 5.87 | 58.33 ± 7.86 | 74.06 ± 10.23 | 76.13 ± 7.78 | 80.60 ± 8.67 | 76.13 ± 7.78 ( | |||
| IFILS | 55.34 ± 6.92 | 46.11 ± 4.41 | 59.82 ± 6.99 | 73.51 ± 3.80 | 78.83 ± 8.20 | 80.02 ± 9.50 | 80.60 ± 8.67 | ||||
| (4) | Glass | 1 | 3 | 5 | 7 | 9 | – | – | |||
| FILS | 45.26 ± 16.10 | 63.98 ± 11.92 | 68.33 ± 10.32 | 68.64 ± 12.23 | 67.36 ± 9.80 | – | – | – | 68.64 ± 12.23 ( | ||
| IFILS | 45.26 ± 16.10 | 58.27 ± 10.98 | 68.64 ± 10.79 | 68.77 ± 10.91 | 67.36 ± 9.80 | – | – | – | |||
| (5) | Hayes-roth | 1 | 2 | 3 | 4 | 5 | – | – | |||
| FILS | 46.98 ± 12.90 | 45.44 ± 11.85 | 46.15 ± 15.28 | 59.07 ± 9.04 | 77.15 ± 14.58 | – | – | 59.07 ± 9.04 ( | |||
| IFILS | 46.98 ± 12.90 | 53.74 ± 11.85 | 65.17 ± 9.60 | 77.15 ± 14.58 | 77.15 ± 14.58 | – | – | ||||
| (6) | Iris | 1 | 2 | 3 | 4 | – | – | – | |||
| FILS | 51.33 ± 8.34 | 84.00 ± 10.04 | 94.00 ± 2.11 | 95.33 ± 4.50 | – | – | – | 94.00 ± 2.11 ( | |||
| IFILS | 51.33 ± 8.34 | 94.00 ± 5.84 | 96.00 ± 4.66 | 95.33 ± 4.50 | – | – | – | – | |||
| (7) | Lung cancer | 1 | 10 | 19 | 28 | 37 | 46 | 56 | |||
| FILS | 45.00 ± 18.92 | 45.00 ± 31.48 | 30.83 ± 25.47 | 35.00 ± 37.02 | 50.83 ± 39.37 | 47.50 ± 32.88 | 49.17 ± 32.50 | – | 50.83 ± 39.37 (37) | ||
| IFILS | 45.00 ± 18.92 | 45.83 ± 23.33 | 64.17 ± 24.23 | 56.67 ± 21.08 | 56.67 ± 21.08 | 62.50 ± 29.72 | 49.17 ± 2.50 | – | |||
| (8) | Wine | 1 | 3 | 5 | 7 | 9 | 11 | 13 | |||
| FILS | 45.59 ± 9.12 | 61.83 ± 10.30 | 76.99 ± 11.79 | 89.90 ± 5.09 | 90.46 ± 10.49 | 88.72 ± 7.94 | 88.76 ± 6.93 | – | 90.46 ± 10.49 (9) | ||
| IFILS | 45.59 ± 9.12 | 83.66 ± 9.35 | 87.03 ± 9.95 | 91.57 ± 5.42 | 90.42 ± 5.93 | 89.31 ± 6.67 | 88.76 ± 6.93 | – | |||
The FILS and IFILS algorithms’ 10-fold-average accuracies and r-optimization statistics on eight UCI datasets based on SVM
| No. | Dataset | Algorithm | Optimization realization on feature set ( | Optimization on feature proper subset ( | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| (1) | Blood | 1 | 2 | 3 | 4 | – | – | – | |||
| FILS | 76.21 ± 6.05 | 76.21 ± 6.05 | 76.48 ± 6.85 | 76.62 ± 6.85 | – | – | – | 76.48 ± 6.85( | |||
| IFILS | 76.21 ± 6.05 | 76.62 ± 6.74 | 76.75 ± 6.88 | 76.62 ± 6.85 | – | – | – | – | |||
| (2) | Ccbr | 1 | 4 | 7 | 10 | 13 | 16 | 19 | |||
| FILS | 73.57 ± 19.30 | 77.68 ± 13.81 | 76.43 ± 18.71 | 79.28 ± 11.76 | 87.32 ± 14.20 | 88.93 ± 11.19 | 88.93 ± 13.06 | 88.93 ± 11.19(16) | |||
| IFILS | 73.57 ± 19.30 | 73.57 ± 19.30 | 77.50 ± 25.52 | 90.54 ± 14.63 | 90.54 ± 14.63 | 89.11 ± 14.30 | 88.93 ± 13.06 | – | |||
| (3) | Ecoli | 1 | 2 | 3 | 4 | 5 | 6 | 7 | |||
| FILS | 43.15 ± 3.62 | 47.60 ± 3.93 | 53.89 ± 8.66 | 65.15 ± 4.46 | 79.44 ± 9.15 | 83.01 ± 7.19 | 85.69 ± 6.46 | 83.01 ± 7.19 ( | |||
| IFILS | 50.85 ± 4.21 | 48.78 ± 3.53 | 64.29 ± 4.43 | 78.86 ± 7.35 | 84.81 ± 4.96 | 85.71 ± 5.21 | 85.69 ± 6.46 | – | |||
| (4) | Glass | 1 | 3 | 5 | 7 | 9 | – | – | |||
| FILS | 41.08 ± 13.08 | 42.60 ± 10.37 | 56.45 ± 13.64 | 61.19 ± 9.16 | 61.64 ± 8.80 | – | – | 61.19 ± 9.16 ( | |||
| IFILS | 38.70 ± 11.95 | 46.86 ± 13.57 | 59.37 ± 8.66 | 65.39 ± 8.79 | 61.64 ± 8.80 | – | – | – | |||
| (5) | Hayes-roth | 1 | 2 | 3 | 4 | 5 | – | – | |||
| FILS | 34.29 ± 13.47 | 33.52 ± 10.06 | 46.04 ± 11.70 | 58.41 ± 15.58 | 63.08 ± 13.57 | – | – | 58.41 ± 15.58 ( | |||
| IFILS | 34.29 ± 13.47 | 47.86 ± 15.00 | 56.87 ± 8.24 | 73.46 ± 6.55 | 63.08 ± 13.57 | – | – | – | |||
| (6) | Iris | 1 | 2 | 3 | 4 | – | – | – | |||
| FILS | 54.67 ± 15.01 | 83.33 ± 13.79 | 84.67 ± 9.46 | 90.00 ± 11.44 | – | – | – | 84.67 ± 9.46 ( | |||
| IFILS | 54.67 ± 15.01 | 92.67 ± 5.84 | 92.00 ± 6.88 | 90.00 ± 11.44 | – | – | – | – | |||
| (7) | Lung cancer | 1 | 10 | 19 | 28 | 37 | 46 | 56 | |||
| FILS | 45.00 ± 18.92 | 38.33 ± 21.94 | 48.33 ± 37.02 | 44.17 ± 27.79 | 46.67 ± 24.91 | 50.83 ± 28.45 | 66.67 ± 23.90 | 50.83 ± 28.45 (46) | |||
| IFILS | 45.00 ± 18.92 | 60.00 ± 31.87 | 56.67 ± 21.08 | 50.83 ± 17.77 | 50.83 ± 28.45 | 51.67 ± 25.09 | 66.67 ± 23.90 | ||||
| (8) | Wine | 1 | 3 | 5 | 7 | 9 | 11 | 13 | |||
| FILS | 44.38 ± 9.45 | 64.12 ± 9.95 | 84.38 ± 9.34 | 94.35 ± 5.32 | 95.46 ± 5.83 | 97.71 ± 4.05 | 98.30 ± 2.74 | ||||
| IFILS | 44.38 ± 9.45 | 86.50 ± 9.54 | 89.80 ± 11.19 | 94.93 ± 4.88 | 96.57 ± 4.82 | 97.16 ± 4.85 | 98.30 ± 2.74 | 97.16 ± 4.85 (11) | |||
Fig. 7The FILS and IFILS algorithms’ 10-fold-average classification accuracies on eight UCI datasets (with number r and three classifiers)
The FILS and IFILS algorithms’ r-optimal () three-way indexes (i.e., number, accuracy, time) on eight UCI datasets based on KNN, CART, SVM
| No. | Dataset | Algorithm | KNN | CART | SVM | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Number | Accuracy ( | Time (s) | Number | Accuracy ( | Time (s) | Number | Accuracy( | Time (s) | |||
| 1 | Blood | FILS | 3 | 75.94 ± 5.72 | 2234.34 ± 60.75 | 1 | 75.95 ± 5.76 | 856.29 ± 46.24 | 3 | 76.48 ± 6.85 | 2234.67 ± 60.76 |
| IFILS | 3 | 76.75 ± 5.96 | 2270.09 ± 48.89 | 1 | 75.95 ± 5.76 | 878.86 ± 57.67 | 3 | 76.75 ± 6.88 | 2270.47 ± 48.91 | ||
| 2 | Ccbr | FILS | 16 | 83.39 ± 17.38 | 67.22 ± 2.09 | 13 | 84.82 ± 10.14 | 75.49 ± 1.54 | 16 | 88.93 ± 11.19 | 67.31 ± 2.09 |
| IFILS | 10 | 91.96 ± 11.33 | 87.64 ± 2.89 | 13 | 86.25 ± 13.10 | 103.24 ± 3.19 | 10 | 90.54 ± 14.63 | 87.71 ± 2.88 | ||
| 3 | Ecoli | FILS | 6 | 84.78 ± 6.95 | 455.74 ± 25.63 | 6 | 76.13 ± 7.78 | 455.82 ± 25.63 | 6 | 83.01 ± 7.19 | 455.92 ± 25.63 |
| IFILS | 6 | 84.80 ± 5.20 | 655.07 ± 28.10 | 6 | 80.02 ± 9.50 | 655.15 ± 28.09 | 6 | 85.71 ± 5.21 | 655.23 ± 28.09 | ||
| 4 | Glass | FILS | 7 | 65.35 ± 10.17 | 234.02 ± 5.81 | 7 | 68.33 ± 10.32 | 234.11 ± 5.82 | 7 | 61.19 ± 9.16 | 234.16 ± 5.82 |
| IFILS | 7 | 67.23 ± 9.34 | 318.13 ± 2.28 | 7 | 68.77 ± 10.91 | 318.23 ± 2.29 | 7 | 65.39 ± 8.79 | 318.27 ± 2.29 | ||
| 5 | Hayesroth | FILS | 4 | 45.44 ± 10.51 | 17.62 ± 0.42 | 4 | 59.07 ± 9.04 | 17.70 ± 0.43 | 4 | 58.41 ± 15.58 | 17.71 ± 0.43 |
| IFILS | 4 | 71.15 ± 13.42 | 24.59 ± 0.65 | 4 | 77.15 ± 14.58 | 24.69 ± 0.65 | 4 | 73.46 ± 6.55 | 24.69 ± 0.64 | ||
| 6 | Iris | FILS | 3 | 92.00 ± 8.20 | 14.90 ± 0.02 | 3 | 94.00 ± 2.11 | 14.91 ± 0.03 | 3 | 84.67 ± 9.46 | 14.91 ± 0.004 |
| IFILS | 3 | 96.67 ± 4.71 | 21.61 ± 0.01 | 3 | 96.00 ± 4.66 | 21.62 ± 0.01 | 2 | 92.67 ± 5.84 | 16.55 ± 0.02 | ||
| 7 | Lungcancer | FILS | 37 | 53.33 ± 24.91 | 101.06 ± 5.59 | 37 | 50.83 ± 39.37 | 101.08 ± 5.59 | 46 | 50.83 ± 28.45 | 109.36 ± 4.590 |
| IFILS | 10 | 55.83 ± 22.92 | 45.10 ± 1.70 | 19 | 64.17 ± 24.23 | 83.65 ± 2.68 | 10 | 60.00 ± 31.87 | 45.19 ± 1.702 | ||
| 8 | Wine | FILS | 7 | 95.49 ± 2.38 | 234.52 ± 3.33 | 9 | 90.46 ± 10.49 | 289.98 ± 9.75 | 11 | 97.70 ± 14.05 | 307.67 ± 10.513 |
| IFILS | 11 | 95.46 ± 3.67 | 451.72 ± 21.38 | 7 | 91.57 ± 5.42 | 348.67 ± 5.64 | 11 | 97.16 ± 4.85 | 451.83 ± 21.376 | ||
| (1)-(8) | Average | FILS | 10.38 | 74.46 ± 20.96 | 419.93 ± 704.26 | 10 | 74.95 ± 20.82 | 255.67 ± 270.04 | 12 | 75.15 ± 20.25 | 430.21 ± 701.85 |
| IFILS | 484.24 ± 713.86 | 304.26 ± 299.02 | 483.74 ± 714.38 | ||||||||
The FILS and IFILS algorithms’ 3-fold-average accuracies and standard deviations on COVID-19 dataset (with K, r)
| Algorithm | |||||
|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | ||
| 1 | FILS | 58.33 ± 14.34 | |||
| IFILS | 58.33 ± 14.34 | 58.33 ± 14.34 | 58.33 ± 14.34 | ||
| 2 | FILS | 56.67 ± 4.71 | 56.67 ± 4.71 | ||
| IFILS | 58.33 ± 14.34 | 58.33 ± 14.34 | 58.33 ± 14.34 | ||
| 3 | FILS | 58.33 ± 14.34 | |||
| IFILS | |||||
| 4 | FILS | 50.00 ± 8.16 | |||
| IFILS | 58.33 ± 14.34 | 56.67 ± 4.71 | 58.33 ± 14.34 | ||
| 5 | FILS | 41.67 ± 14.34 | 50.00 ± 8.16 | 50.00 ± 8.16 | 50.00 ± 8.16 |
| IFILS | 58.33 ± 14.34 | 50.00 ± 8.16 | 63.33 ± 12.47 | ||
| 6 | FILS | 41.67 ± 14.34 | 50.00 ± 8.16 | 50.00 ± 24.49 | |
| IFILS | 50.00 ± 8.16 | 35.00 ± 17.80 | 50.00 ± 8.16 | ||
| 7 | FILS | 50.00 ± 8.16 | 50.00 ± 8.16 | 35.00 ± 17.80 | |
| IFILS | 50.00 ± 8.16 | 50.00 ± 8.16 | 35.00 ± 17.80 | ||
| FILS | |||||
| IFILS | |||||
Fig. 8The FILS and IFILS algorithms’ accuracies on the COVID-19 dataset
The FILS and IFILS algorithms’ 10-fold-average accuracies and r-optimization statistics on the Brain and Parkinsons datasets based on KNN, CART and SVM
| No. | Dataset | Classifier | Algorithm |
| Optimization realization on feature set ( | Optimization on feature proper subset ( | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 12 | 23 | 34 | 45 | 56 | 70 | ||||||
| (10) | Brain | KNN | FILS | 24.29 ± 15.13 | 44.29 ± 15.72 | 57.14 ± 21.3 | 50.00 ± 24.51 | 58.57 ± 20.7 | 61.43 ± 26.13 | 85.71 ± 11.66 | 61.43 ± 26.13(56) | |
| IFILS | 24.29 ± 15.13 | 67.14 ± 26.98 | 72.86 ± 20.7 | 77.14 ± 23.52 | 77.14 ± 20.43 | 72.86 ± 19.58 | 85.71 ± 11.66 | |||||
| CART | FILS | 11.43 ± 11.27 | 54.29 ± 24.09 | 54.29 ± 26.77 | 52.86 ± 25.24 | 54.29 ± 22.13 | 60.00 ± 24.09 | 62.86 ± 18.07 | 60.00 ± 24.09(56) | |||
| IFILS | 11.43 ± 11.27 | 44.29 ± 14.21 | 61.43 ± 19.11 | 54.29 ± 18.81 | 58.57 ± 19.58 | 54.29 ± 16.22 | 62.86 ± 18.07 | |||||
| SVM | FILS | 5.71 ± 12.05 | 45.71 ± 13.13 | 68.57 ± 19.98 | 67.14 ± 21.35 | 72.86 ± 20.70 | 82.86 ± 16.22 | 88.57 ± 13.13 | 82.86 ± 16.22(56) | |||
| IFILS | 5.71 ± 12.05 | 78.57 ± 27.15 | 85.71 ± 16.50 | 82.86 ± 14.75 | 85.71 ± 13.47 | 84.29 ± 12.51 | 88.57 ± 13.13 | |||||