| Literature DB >> 33286568 |
Ping Zhang1,2, Wanfu Gao1,2,3, Juncheng Hu1,2, Yonghao Li1,2.
Abstract
Multi-label data often involve features with high dimensionality and complicated label correlations, resulting in a great challenge for multi-label learning. Feature selection plays an important role in multi-label learning to address multi-label data. Exploring label correlations is crucial for multi-label feature selection. Previous information-theoretical-based methods employ the strategy of cumulative summation approximation to evaluate candidate features, which merely considers low-order label correlations. In fact, there exist high-order label correlations in label set, labels naturally cluster into several groups, similar labels intend to cluster into the same group, different labels belong to different groups. However, the strategy of cumulative summation approximation tends to select the features related to the groups containing more labels while ignoring the classification information of groups containing less labels. Therefore, many features related to similar labels are selected, which leads to poor classification performance. To this end, Max-Correlation term considering high-order label correlations is proposed. Additionally, we combine the Max-Correlation term with feature redundancy term to ensure that selected features are relevant to different label groups. Finally, a new method named Multi-label Feature Selection considering Max-Correlation (MCMFS) is proposed. Experimental results demonstrate the classification superiority of MCMFS in comparison to eight state-of-the-art multi-label feature selection methods.Entities:
Keywords: Max-Correlation; information theory; multi-label feature selection; multi-label learning
Year: 2020 PMID: 33286568 PMCID: PMC7517369 DOI: 10.3390/e22070797
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
Figure 1The correlation between feature and the label set for the first-order and second-order label correlations.
Figure 2The correlation between feature and the label set for the high-order label correlations.
The time complexity of six methods.
| Methods | Time Complexity |
|---|---|
| MCMFS |
|
| SCLS |
|
| D2F |
|
| MDMR |
|
| PMU |
|
| LRFS |
|
Figure 3Experimental framework.
An artificial data.
| O |
|
|
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 1 | 1 | 1 | 0 | 0 |
|
| 1 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 1 |
|
| 1 | 0 | 1 | 1 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 1 | 0 |
|
| 1 | 1 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
|
| 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 |
|
| 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 |
|
| 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 |
|
| 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 |
Experimental results on the artificial data set.
| Methods | Feature Ranking | HL↓ | ZOL↓ | Macro-F1 | Micro-F1 | Macro-F1 | Micro-F1 |
|---|---|---|---|---|---|---|---|
| MCMFS |
|
|
|
|
|
|
|
| D2F |
| 0.3433 | 0.9583 | 0.3342 | 0.3985 | 0.2350 | 0.3148 |
| LRFS |
| 0.3625 | 0.9292 | 0.3550 | 0.4191 | 0.2883 | 0.3829 |
| MDMR |
| 0.3342 | 0.9333 | 0.3533 | 0.4261 | 0.3617 | 0.4646 |
| PMU |
| 0.3758 | 0.9875 | 0.2733 | 0.3395 | 0.1917 | 0.2823 |
| SCLS |
| 0.3408 | 0.9583 | 0.3517 | 0.4321 | 0.3375 | 0.4220 |
Description of data sets.
| No. | Data Set | #Instances | #Features | #Labels | #Training | #Test |
|---|---|---|---|---|---|---|
| 1 | medical | 978 | 1449 | 45 | 333 | 645 |
| 2 | scene | 2407 | 294 | 6 | 1211 | 1196 |
| 3 | Enron | 1702 | 1001 | 53 | 1123 | 579 |
| 4 | Arts | 5000 | 462 | 26 | 2000 | 3000 |
| 5 | Business | 5000 | 438 | 30 | 2000 | 3000 |
| 6 | Education | 5000 | 550 | 33 | 2000 | 3000 |
| 7 | Entertain | 5000 | 640 | 21 | 2000 | 3000 |
| 8 | Health | 5000 | 612 | 32 | 2000 | 3000 |
| 9 | Recreation | 5000 | 606 | 22 | 2000 | 3000 |
| 10 | Reference | 5000 | 793 | 33 | 2000 | 3000 |
| 11 | Science | 5000 | 743 | 40 | 2000 | 3000 |
| 12 | Social | 5000 | 1047 | 39 | 2000 | 3000 |
Experimental results of multi-label feature selection methods in terms of Hamming Loss (HL) (mean ± std).
| Data set | MCMFS | PPT + MI | PPT + CHI | MIFS | D2F | MDMR | PMU | SCLS | LRFS |
|---|---|---|---|---|---|---|---|---|---|
| medical |
| 0.018 ± 0.001 | 0.017 ± 0.002 | 0.017 ± 0.002 | 0.02 ± 0.001 | 0.018 ± 0.001 | 0.02 ± 0.001 | 0.023 ± 0 | 0.018 ± 0.001 |
| scene |
| 0.167 ± 0.006 | 0.167 ± 0.007 | 0.17 ± 0.01 | 0.149 ± 0.006 | 0.145 ± 0.007 | 0.147 ± 0.007 | 0.173 ± 0.003 | 0.142 ± 0.01 |
| enron |
| 0.053 ± 0.002 | 0.059 ± 0.001 | 0.057 ± 0.001 | 0.052 ± 0.001 | 0.053 ± 0.003 | 0.052 ± 0.001 | 0.053 ± 0.001 | 0.055 ± 0.003 |
| Arts |
| 0.062 ± 0.001 | 0.062 ± 0.001 | 0.061 ± 0.001 | 0.064 ± 0.001 | 0.061 ± 0.001 | 0.064 ± 0.001 | 0.063 ± 0.001 | 0.061 ± 0.001 |
| Business | 0.029 ± 0.000 | 0.029 ± 0.001 | 0.029 ± 0.000 |
| 0.029 ± 0.001 | 0.029 ± 0.001 | 0.029 ± 0.000 | 0.029 ± 0 | 0.029 ± 0.001 |
| Education |
| 0.043 ± 0.001 | 0.043 ± 0.001 | 0.044 ± 0.001 | 0.044 ± 0.001 | 0.043 ± 0.001 | 0.045 ± 0.001 | 0.044 ± 0.001 | 0.043 ± 0.001 |
| Entertain |
| 0.064 ± 0.001 | 0.065 ± 0.001 | 0.066 ± 0.001 | 0.066 ± 0.001 | 0.063 ± 0.002 | 0.067 ± 0.001 | 0.066 ± 0.001 | 0.063 ± 0.001 |
| Health |
| 0.046 ± 0.001 | 0.045 ± 0.002 | 0.05 ± 0.001 | 0.048 ± 0.001 | 0.045 ± 0.001 | 0.049 ± 0.001 | 0.049 ± 0.001 | 0.045 ± 0.001 |
| Recreation |
| 0.062 ± 0.001 | 0.062 ± 0.001 | 0.062 ± 0.001 | 0.062 ± 0.001 | 0.062 ± 0.001 | 0.065 ± 0.001 | 0.064 ± 0.001 | 0.061 ± 0.001 |
| Reference |
| 0.032 ± 0.001 | 0.032 ± 0.001 | 0.031 ± 0.001 | 0.032 ± 0.001 | 0.031 ± 0.001 | 0.034 ± 0.001 | 0.033 ± 0 | 0.031 ± 0.001 |
| Science |
| 0.036 ± 0.001 | 0.036 ± 0.000 | 0.036 ± 0.000 | 0.036 ± 0.000 | 0.035 ± 0.000 | 0.036 ± 0.000 | 0.036 ± 0.000 | 0.035 ± 0.001 |
| Social |
| 0.028 ± 0.001 | 0.03 ± 0.001 | 0.032 ± 0.001 | 0.03 ± 0.001 | 0.028 ± 0.001 | 0.031 ± 0 | 0.029 ± 0.001 | 0.027 ± 0.001 |
| Average |
| 0.053 | 0.054 | 0.055 | 0.053 | 0.051 | 0.053 | 0.055 | 0.051 |
Experimental results of multi-label feature selection methods in terms of Zero-One Loss (ZOL) (mean ± std).
| Data set | MCMFS | PPT + MI | PPT + CHI | MIFS | D2F | MDMR | PMU | SCLS | LRFS |
|---|---|---|---|---|---|---|---|---|---|
| medical |
| 0.59 ± 0.05 | 0.55 ± 0.06 | 0.55 ± 0.08 | 0.66 ± 0.04 | 0.58 ± 0.04 | 0.66 ± 0.04 | 0.83 ± 0.01 | 0.58 ± 0.04 |
| scene |
| 0.78 ± 0.08 | 0.8 ± 0.09 | 0.83 ± 0.12 | 0.61 ± 0.06 | 0.61 ± 0.07 | 0.61 ± 0.07 | 0.74 ± 0.04 | 0.6 ± 0.08 |
| enron |
| 0.9 ± 0.03 | 0.98 ± 0 | 0.98 ± 0.01 | 0.9 ± 0.02 | 0.91 ± 0.03 | 0.9 ± 0.03 | 0.94 ± 0.03 | 0.93 ± 0.04 |
| Arts |
| 0.93 ± 0.03 | 0.95 ± 0.02 | 0.92 ± 0.03 | 0.95 ± 0.01 | 0.92 ± 0.03 | 0.97 ± 0.02 | 0.95 ± 0.01 | 0.92 ± 0.03 |
| Business |
| 0.48 ± 0.01 | 0.47 ± 0.01 | 0.47 ± 0.01 | 0.48 ± 0.01 | 0.47 ± 0.01 | 0.48 ± 0.01 | 0.48 ± 0.01 | 0.47 ± 0.01 |
| Education |
| 0.91 ± 0.02 | 0.94 ± 0.02 | 0.95 ± 0.03 | 0.95 ± 0.01 | 0.9 ± 0.02 | 0.95 ± 0.01 | 0.93 ± 0.01 | 0.9 ± 0.02 |
| Entertain |
| 0.87 ± 0.04 | 0.9 ± 0.03 | 0.93 ± 0.03 | 0.91 ± 0.01 | 0.85 ± 0.03 | 0.94 ± 0.01 | 0.9 ± 0.01 | 0.86 ± 0.03 |
| Health |
| 0.73 ± 0.06 | 0.67 ± 0.01 | 0.8 ± 0.09 | 0.77 ± 0.05 | 0.71 ± 0.05 | 0.77 ± 0.05 | 0.74 ± 0.04 | 0.71 ± 0.05 |
| Recreation |
| 0.89 ± 0.02 | 0.89 ± 0.02 | 0.88 ± 0.03 | 0.92 ± 0.01 | 0.87 ± 0.02 | 0.97 ± 0.01 | 0.95 ± 0.01 | 0.88 ± 0.02 |
| Reference | 0.74 ± 0.08 | 0.74 ± 0.08 |
| 0.78 ± 0.07 | 0.8 ± 0.04 | 0.76 ± 0.06 | 0.81 ± 0.05 | 0.83 ± 0.04 | 0.76 ± 0.06 |
| Science |
| 0.94 ± 0.01 | 0.96 ± 0.01 | 0.93 ± 0.03 | 0.97 ± 0.01 | 0.94 ± 0.01 | 0.98 ± 0.01 | 0.95 ± 0.01 | 0.94 ± 0.01 |
| Social |
| 0.72 ± 0.05 | 0.76 ± 0.13 | 0.88 ± 0.09 | 0.73 ± 0.09 | 0.72 ± 0.05 | 0.78 ± 0.07 | 0.74 ± 0.04 | 0.72 ± 0.05 |
| Average |
| 0.79 | 0.80 | 0.82 | 0.80 | 0.77 | 0.82 | 0.83 | 0.77 |
Experimental results of multi-label feature selection methods in terms of Macro-F1 (mean ± std) using the Support Vector Machine (SVM) classifier.
| Data set | MCMFS | PPT + MI | PPT + CHI | MIFS | D2F | MDMR | PMU | SCLS | LRFS |
|---|---|---|---|---|---|---|---|---|---|
| medical |
| 0.25 ± 0.05 | 0.26 ± 0.04 | 0.22 ± 0.05 | 0.19 ± 0.05 | 0.32 ± 0.07 | 0.19 ± 0.06 | 0.08 ± 0.01 | 0.32 ± 0.07 |
| scene |
| 0.22 ± 0.09 | 0.21 ± 0.1 | 0.21 ± 0.15 | 0.46 ± 0.08 | 0.43 ± 0.07 | 0.47 ± 0.09 | 0.26 ± 0.05 | 0.44 ± 0.08 |
| enron | 0.12 ± 0.03 | 0.1 ± 0.03 | 0.07 ± 0.02 | 0.07 ± 0.02 |
| 0.11 ± 0.03 | 0.13 ± 0.05 | 0.12 ± 0.03 | 0.1 ± 0.03 |
| Arts |
| 0.06 ± 0.02 | 0.07 ± 0.02 | 0.07 ± 0.02 | 0.03 ± 0.00 | 0.08 ± 0.03 | 0.01 ± 0.01 | 0.03 ± 0.00 | 0.07 ± 0.02 |
| Business |
| 0.05 ± 0.00 | 0.05 ± 0.00 | 0.04 ± 0.00 | 0.05 ± 0.00 | 0.05 ± 0.00 | 0.03 ± 0.00 | 0.04 ± 0.00 | 0.05 ± 0.00 |
| Education |
| 0.06 ± 0.01 | 0.05 ± 0.01 | 0.03 ± 0.02 | 0.05 ± 0.01 | 0.06 ± 0.01 | 0.03 ± 0.01 | 0.04 ± 0.01 | 0.06 ± 0.01 |
| Entertain |
| 0.11 ± 0.03 | 0.09 ± 0.02 | 0.06 ± 0.02 | 0.08 ± 0.01 | 0.12 ± 0.02 | 0.05 ± 0.00 | 0.07 ± 0.01 | 0.12 ± 0.02 |
| Health |
| 0.13 ± 0.03 | 0.14 ± 0.03 | 0.06 ± 0.03 | 0.09 ± 0.01 | 0.14 ± 0.03 | 0.08 ± 0.01 | 0.09 ± 0.01 | 0.14 ± 0.03 |
| Recreation |
| 0.1 ± 0.02 | 0.1 ± 0.02 | 0.09 ± 0.03 | 0.08 ± 0.01 | 0.11 ± 0.02 | 0.03 ± 0.00 | 0.04 ± 0.00 | 0.11 ± 0.02 |
| Reference |
| 0.07 ± 0.01 | 0.07 ± 0.02 | 0.06 ± 0.02 | 0.04 ± 0.00 | 0.07 ± 0.01 | 0.03 ± 0.01 | 0.02 ± 0.00 | 0.07 ± 0.01 |
| Science |
| 0.05 ± 0.02 | 0.05 ± 0.01 | 0.04 ± 0.02 | 0.02 ± 0.00 | 0.05 ± 0.02 | 0.01 ± 0.01 | 0.02 ± 0.00 | 0.05 ± 0.02 |
| Social |
| 0.09 ± 0.02 | 0.09 ± 0.02 | 0.05 ± 0.03 | 0.07 ± 0.01 | 0.1 ± 0.03 | 0.05 ± 0.01 | 0.05 ± 0.01 | 0.1 ± 0.03 |
| Average |
| 0.11 | 0.10 | 0.09 | 0.11 | 0.14 | 0.09 | 0.07 | 0.14 |
Experimental results of multi-label feature selection methods in terms of Macro-F1 (mean ± std) using the 3-Nearest Neighbors (3NN) classifier.
| Data set | MCMFS | PPT + MI | PPT + CHI | MIFS | D2F | MDMR | PMU | SCLS | LRFS |
|---|---|---|---|---|---|---|---|---|---|
| medical |
| 0.16 ± 0.03 | 0.19 ± 0.02 | 0.16 ± 0.02 | 0.12 ± 0.02 | 0.19 ± 0.03 | 0.11 ± 0.02 | 0.06 ± 0.01 | 0.19 ± 0.03 |
| scene |
| 0.37 ± 0.08 | 0.36 ± 0.08 | 0.29 ± 0.14 | 0.49 ± 0.05 | 0.51 ± 0.06 | 0.49 ± 0.07 | 0.37 ± 0.03 | 0.53 ± 0.07 |
| enron |
| 0.12 ± 0.02 | 0.07 ± 0.01 | 0.09 ± 0.01 | 0.12 ± 0.01 | 0.12 ± 0.02 | 0.12 ± 0.02 | 0.11 ± 0.01 | 0.11 ± 0.02 |
| Arts |
| 0.08 ± 0.02 | 0.1 ± 0.03 | 0.08 ± 0.03 | 0.06 ± 0.01 | 0.1 ± 0.02 | 0.06 ± 0.01 | 0.07 ± 0.02 | 0.1 ± 0.02 |
| Business |
| 0.08 ± 0.01 | 0.09 ± 0.01 | 0.09 ± 0.02 | 0.07 ± 0.01 | 0.09 ± 0.01 | 0.05 ± 0.01 | 0.07 ± 0.01 | 0.08 ± 0.01 |
| Education |
| 0.08 ± 0.02 | 0.08 ± 0.02 | 0.04 ± 0.02 | 0.06 ± 0.01 | 0.07 ± 0.01 | 0.06 ± 0.01 | 0.06 ± 0.01 | 0.07 ± 0.01 |
| Entertain |
| 0.13 ± 0.02 | 0.11 ± 0.02 | 0.08 ± 0.02 | 0.11 ± 0.01 | 0.13 ± 0.02 | 0.08 ± 0.01 | 0.09 ± 0.01 | 0.14 ± 0.02 |
| Health |
| 0.11 ± 0.02 | 0.12 ± 0.03 | 0.05 ± 0.03 | 0.09 ± 0.01 | 0.12 ± 0.02 | 0.09 ± 0.01 | 0.09 ± 0.01 | 0.12 ± 0.02 |
| Recreation |
| 0.1 ± 0.01 | 0.11 ± 0.02 | 0.12 ± 0.03 | 0.08 ± 0.01 | 0.12 ± 0.02 | 0.05 ± 0.01 | 0.07 ± 0.01 | 0.12 ± 0.02 |
| Reference |
| 0.07 ± 0.01 | 0.08 ± 0.02 | 0.07 ± 0.01 | 0.04 ± 0 | 0.07 ± 0.01 | 0.03 ± 0.01 | 0.04 ± 0.01 | 0.07 ± 0.01 |
| Science |
| 0.05 ± 0.01 | 0.07 ± 0.01 | 0.06 ± 0.02 | 0.04 ± 0.01 | 0.07 ± 0.02 | 0.03 ± 0.01 | 0.03 ± 0 | 0.06 ± 0.01 |
| Social |
| 0.08 ± 0.01 | 0.1 ± 0.01 | 0.07 ± 0.03 | 0.06 ± 0.01 | 0.09 ± 0.01 | 0.05 ± 0.01 | 0.05 ± 0 | 0.09 ± 0.01 |
| Average |
| 0.12 | 0.12 | 0.10 | 0.11 | 0.14 | 0.10 | 0.09 | 0.14 |
Experimental results of multi-label feature selection methods in terms of Micro-F1 (mean ± std) using the SVM classifier.
| Data set | MCMFS | PPT + MI | PPT + CHI | MIFS | D2F | MDMR | PMU | SCLS | LRFS |
|---|---|---|---|---|---|---|---|---|---|
| medical |
| 0.73 ± 0.05 | 0.74 ± 0.07 | 0.71 ± 0.11 | 0.63 ± 0.07 | 0.76 ± 0.05 | 0.63 ± 0.08 | 0.37 ± 0.01 | 0.76 ± 0.05 |
| scene |
| 0.25 ± 0.1 | 0.24 ± 0.11 | 0.24 ± 0.16 | 0.48 ± 0.07 | 0.46 ± 0.07 | 0.49 ± 0.08 | 0.3 ± 0.05 | 0.47 ± 0.07 |
| enron |
| 0.47 ± 0.04 | 0.35 ± 0.02 | 0.37 ± 0.03 | 0.51 ± 0.03 | 0.47 ± 0.05 | 0.5 ± 0.04 | 0.49 ± 0.03 | 0.45 ± 0.06 |
| Arts |
| 0.14 ± 0.05 | 0.12 ± 0.04 | 0.17 ± 0.05 | 0.08 ± 0.01 | 0.17 ± 0.05 | 0.03 ± 0.02 | 0.07 ± 0.02 | 0.16 ± 0.05 |
| Business |
| 0.68 ± 0.00 | 0.68 ± 0.00 | 0.67 ± 0.00 | 0.67 ± 0.00 | 0.68 ± 0.00 | 0.67 ± 0.00 | 0.67 ± 0 | 0.68 ± 0.00 |
| Education |
| 0.2 ± 0.04 | 0.13 ± 0.04 | 0.12 ± 0.06 | 0.12 ± 0.02 | 0.21 ± 0.05 | 0.08 ± 0.01 | 0.14 ± 0.02 | 0.21 ± 0.04 |
| Entertain |
| 0.23 ± 0.06 | 0.17 ± 0.05 | 0.11 ± 0.05 | 0.16 ± 0.01 | 0.26 ± 0.06 | 0.1 ± 0.01 | 0.15 ± 0.02 | 0.25 ± 0.06 |
| Health |
| 0.45 ± 0.07 | 0.47 ± 0.03 | 0.39 ± 0.05 | 0.42 ± 0.01 | 0.47 ± 0.04 | 0.39 ± 0.03 | 0.41 ± 0 | 0.48 ± 0.03 |
| Recreation |
| 0.19 ± 0.03 | 0.17 ± 0.04 | 0.18 ± 0.05 | 0.14 ± 0.02 | 0.2 ± 0.04 | 0.04 ± 0 | 0.07 ± 0.01 | 0.2 ± 0.04 |
| Reference | 0.32 ± 0.04 | 0.35 ± 0.07 |
| 0.33 ± 0.1 | 0.31 ± 0.04 | 0.34 ± 0.06 | 0.27 ± 0.05 | 0.26 ± 0.04 | 0.34 ± 0.06 |
| Science |
| 0.12 ± 0.03 | 0.09 ± 0.03 | 0.11 ± 0.05 | 0.05 ± 0.01 | 0.13 ± 0.03 | 0.02 ± 0.02 | 0.06 ± 0.01 | 0.13 ± 0.03 |
| Social |
| 0.42 ± 0.07 | 0.38 ± 0.14 | 0.2 ± 0.12 | 0.4 ± 0.07 | 0.43 ± 0.07 | 0.31 ± 0.07 | 0.38 ± 0.05 | 0.43 ± 0.07 |
| Average |
| 0.35 | 0.33 | 0.30 | 0.33 | 0.38 | 0.29 | 0.28 | 0.38 |
Experimental results of multi-label feature selection methods in terms of Micro-F1 (mean ± std) using the 3NN classifier.
| Data set | MCMFS | PPT + MI | PPT + CHI | MIFS | D2F | MDMR | PMU | SCLS | LRFS |
|---|---|---|---|---|---|---|---|---|---|
| medical |
| 0.62 ± 0.04 | 0.64 ± 0.06 | 0.61 ± 0.1 | 0.53 ± 0.04 | 0.64 ± 0.03 | 0.52 ± 0.04 | 0.35 ± 0.01 | 0.64 ± 0.03 |
| scene |
| 0.39 ± 0.06 | 0.38 ± 0.06 | 0.34 ± 0.11 | 0.49 ± 0.04 | 0.52 ± 0.05 | 0.5 ± 0.05 | 0.38 ± 0.02 | 0.54 ± 0.05 |
| enron |
| 0.45 ± 0.01 | 0.34 ± 0.03 | 0.41 ± 0.02 | 0.47 ± 0.03 | 0.44 ± 0.04 | 0.47 ± 0.02 | 0.44 ± 0.03 | 0.42 ± 0.05 |
| Arts |
| 0.17 ± 0.05 | 0.24 ± 0.04 | 0.18 ± 0.05 | 0.15 ± 0.03 | 0.25 ± 0.04 | 0.14 ± 0.03 | 0.17 ± 0.03 | 0.25 ± 0.04 |
| Business |
| 0.67 ± 0.00 | 0.66 ± 0.01 | 0.65 ± 0.08 | 0.66 ± 0.00 | 0.67 ± 0.01 | 0.65 ± 0.04 | 0.60 ± 0.12 | 0.67 ± 0.01 |
| Education |
| 0.24 ± 0.04 | 0.28 ± 0.04 | 0.16 ± 0.06 | 0.19 ± 0.03 | 0.23 ± 0.03 | 0.18 ± 0.04 | 0.19 ± 0.03 | 0.23 ± 0.03 |
| Entertain | 0.27 ± 0.05 |
| 0.21 ± 0.05 | 0.22 ± 0.08 | 0.24 ± 0.03 | 0.26 ± 0.04 | 0.22 ± 0.05 | 0.22 ± 0.03 | 0.27 ± 0.03 |
| Health | 0.37 ± 0.09 |
| 0.37 ± 0.14 | 0.2 ± 0.07 | 0.37 ± 0.05 | 0.38 ± 0.06 | 0.36 ± 0.04 | 0.37 ± 0.06 | 0.38 ± 0.05 |
| Recreation |
| 0.19 ± 0.02 | 0.21 ± 0.04 | 0.23 ± 0.05 | 0.16 ± 0.02 | 0.23 ± 0.04 | 0.09 ± 0.02 | 0.12 ± 0.02 | 0.23 ± 0.03 |
| Reference |
| 0.41 ± 0.05 | 0.39 ± 0.13 | 0.35 ± 0.09 | 0.36 ± 0.05 | 0.43 ± 0.05 | 0.35 ± 0.04 | 0.29 ± 0.05 | 0.43 ± 0.04 |
| Science |
| 0.17 ± 0.03 | 0.12 ± 0.03 | 0.17 ± 0.02 | 0.12 ± 0.02 | 0.16 ± 0.03 | 0.1 ± 0.02 | 0.15 ± 0.03 | 0.16 ± 0.03 |
| Social |
| 0.4 ± 0.06 | 0.44 ± 0.1 | 0.39 ± 0.05 | 0.39 ± 0.05 | 0.42 ± 0.06 | 0.36 ± 0.05 | 0.37 ± 0.04 | 0.41 ± 0.05 |
| Average |
| 0.36 | 0.36 | 0.33 | 0.34 | 0.38 | 0.33 | 0.30 | 0.38 |
Figure 4Classification performance on Arts data set: (a) Hamming Loss, (b) Zero-One Loss, (c) Macro-F1 on SVM classifier, (d) Macro-F1 on 3NN classifier, (e) Micro-F1 on SVM classifier, (f) Micro-F1 on 3NN classifier.
Figure 5Classification performance on medical data set: (a) Hamming Loss, (b) Zero-One Loss, (c) Macro-F1 on SVM classifier, (d) Macro-F1 on 3NN classifier, (e) Micro-F1 on SVM classifier, (f) Micro-F1 on 3NN classifier.
Figure 6Classification performance on scene data set: (a) Hamming Loss, (b) Zero-One Loss, (c) Macro-F1 on SVM classifier, (d) Macro-F1 on 3NN classifier, (e) Micro-F1 on SVM classifier, (f) Micro-F1 on 3NN classifier.
Running time (seconds).
| Data Set | MCMFS | D2F | MDMR | PMU | SCLS | LRFS | PPT + MI | PPT + CHI | MIFS |
|---|---|---|---|---|---|---|---|---|---|
| medical | 142.2 | 10,698.5 | 9910.0 | 11521.6 | 38.2 | 6961.9 | 1.2 | 8.6 | 30.3 |
| scene | 23.5 | 244.1 | 246.1 | 257.0 | 5.5 | 147.0 | 0.4 | 1.8 | 47.4 |
| enron | 267.3 | 22,164.0 | 20,583.0 | 25749.6 | 93.6 | 16,631.9 | 0.8 | 6.4 | 41.2 |
| Arts | 111.9 | 4355.8 | 4185.4 | 5002.2 | 39.6 | 3421.2 | 1.0 | 4.1 | 35.5 |
| Business | 107.4 | 4536.1 | 4361.8 | 5354.0 | 41.5 | 4079.8 | 1.0 | 3.1 | 41.9 |
| Education | 161.2 | 8397.3 | 7700.0 | 9037.2 | 59.1 | 6323.1 | 1.2 | 4.3 | 99.3 |
| Entertain | 194.4 | 6809.9 | 6539.2 | 7386.6 | 54.3 | 4519.1 | 1.4 | 5.0 | 98.7 |
| Health | 195.1 | 9376.0 | 9086.8 | 10,797.1 | 68.4 | 7523.4 | 1.3 | 4.8 | 17.2 |
| Recreation | 183.4 | 6394.2 | 6127.6 | 7016.8 | 51.7 | 4369.0 | 1.3 | 4.6 | 39.5 |
| Reference | 313.5 | 16,269.8 | 15,637.3 | 18,483.6 | 96.2 | 11,685.1 | 1.8 | 5.7 | 57.5 |
| Science | 291.4 | 17,419.9 | 16,665.2 | 20,137.1 | 102.7 | 14,344.9 | 1.6 | 6.6 | 60.0 |
| Social | 553.8 | 34,108.8 | 33,195.6 | 37,246.0 | 156.1 | 23,754.5 | 2.8 | 9.0 | 43.4 |
Figure 7The running time of nine methods: (a) all methods, (b) Multi-label Feature Selection considering the Max-Correlation (MCMFS), Pruned Problem Transformation (PPT) + mutual information (MI), PPT + CHI, Multi-label Informed Feature Selection (MIFS) and Scalable Criterion for a Large Label Set (SCLS).