| Literature DB >> 33267251 |
Zhi-Yi Duan1, Li-Min Wang1, Musa Mammadov2, Hua Lou3, Ming-Hui Sun4.
Abstract
Machine learning techniques have shown superior predictive power, among which Bayesian network classifiers (BNCs) have remained of great interest due to its capacity to demonstrate complex dependence relationships. Most traditional BNCs tend to build only one model to fit training instances by analyzing independence between attributes using conditional mutual information. However, for different class labels, the conditional dependence relationships may be different rather than invariant when attributes take different values, which may result in classification bias. To address this issue, we propose a novel framework, called discriminatory target learning, which can be regarded as a tradeoff between probabilistic model learned from unlabeled instance at the uncertain end and that learned from labeled training data at the certain end. The final model can discriminately represent the dependence relationships hidden in unlabeled instance with respect to different possible class labels. Taking k-dependence Bayesian classifier as an example, experimental comparison on 42 publicly available datasets indicated that the final model achieved competitive classification performance compared to state-of-the-art learners such as Random forest and averaged one-dependence estimators.Entities:
Keywords: Bayesian network; discriminatory target learning; unlabeled instance
Year: 2019 PMID: 33267251 PMCID: PMC7515026 DOI: 10.3390/e21050537
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
Figure 1The distributions of on Waveform dataset, where . The x-axis represents the index of each instance, the y-axis represents the value of .
Figure 2Example of (a) unrestricted BNC, and (b) restricted BNC.
Figure 3The framework of discriminatory target learning.
Figure 4Example of (a) kdb1, and (b) kdb2.
Datasets. Imbalanced datasets are annotated with the symbol “*”.
| Index | Dataset | Instance | Attribute | Class | Index | Dataset | Instance | Attribute | Class |
|---|---|---|---|---|---|---|---|---|---|
| 1 | Contact-lenses | 24 | 4 | 3 | 22 | Kr-vs-kp | 3196 | 36 | 2 |
| 2 | Labor | 57 | 16 | 2 | 23 | Dis * | 3772 | 29 | 2 |
| 3 | Echocardiogram | 131 | 6 | 2 | 24 | Hypo | 3772 | 29 | 4 |
| 4 | Lymphography | 148 | 18 | 4 | 25 | Sick * | 3772 | 29 | 2 |
| 5 | Sonar | 208 | 60 | 2 | 26 | Abalone * | 4177 | 8 | 3 |
| 6 | Glass-id | 214 | 9 | 3 | 27 | Waveform-5000 | 5000 | 40 | 3 |
| 7 | New-thyroid * | 215 | 5 | 3 | 28 | Phoneme | 5438 | 7 | 50 |
| 8 | Heart-disease-c | 303 | 13 | 2 | 29 | Wall-following | 5456 | 24 | 4 |
| 9 | Soybean-large | 307 | 35 | 19 | 30 | Page-blocks | 5473 | 10 | 5 |
| 10 | Ionosphere * | 351 | 34 | 2 | 31 | Satellite * | 6435 | 36 | 6 |
| 11 | Dermatology | 366 | 34 | 6 | 32 | Thyroid | 9169 | 29 | 20 |
| 12 | House-votes-84 * | 435 | 16 | 2 | 33 | Pendigits | 10,992 | 16 | 10 |
| 13 | Chess * | 551 | 39 | 2 | 34 | Sign | 12,546 | 8 | 3 |
| 14 | Soybean * | 683 | 35 | 19 | 35 | Nursery | 12,960 | 8 | 5 |
| 15 | Breast-cancer-w | 699 | 9 | 2 | 36 | Magic | 19,020 | 10 | 2 |
| 16 | Tic-tac-toe | 958 | 9 | 2 | 37 | Letter-recog | 20,000 | 16 | 26 |
| 17 | Vowel | 990 | 13 | 11 | 38 | Adult * | 48,842 | 14 | 2 |
| 18 | Car * | 1728 | 6 | 4 | 39 | Shuttle * | 58,000 | 9 | 7 |
| 19 | Mfeat-mor | 2000 | 6 | 10 | 40 | Connect-4 | 67,557 | 42 | 3 |
| 20 | Segment | 2310 | 19 | 7 | 41 | Waveform * | 100,000 | 21 | 3 |
| 21 | Hypothyroid * | 3163 | 25 | 2 | 42 | Localization | 164,860 | 5 | 11 |
Experimental results of average zero-one loss.
| Dataset | NB | TAN | KDB | AODE | RF | kdb | KDB |
|---|---|---|---|---|---|---|---|
| Contact-lenses | 0.3750 | 0.3750 |
| 0.3750 | 0.3438 | 0.3750 | 0.2917 |
| Labor |
| 0.0526 |
| 0.0526 | 0.0939 |
|
|
| Echocardiogram | 0.3359 | 0.3282 | 0.3435 | 0.3206 | 0.3489 |
|
|
| Lymphography |
| 0.1757 | 0.2365 | 0.1689 | 0.2132 | 0.1757 | 0.2095 |
| Sonar | 0.2308 | 0.2212 | 0.2452 | 0.2260 |
| 0.2452 | 0.2308 |
| Glass-id | 0.2617 | 0.2196 | 0.2196 | 0.2523 |
| 0.2243 | 0.2150 |
| New-thyroid | 0.0512 | 0.0651 | 0.0698 |
| 0.0816 | 0.0605 | 0.0605 |
| Heart-disease-c |
| 0.2079 | 0.2244 | 0.2013 | 0.2212 | 0.1947 | 0.2079 |
| Soybean-large | 0.1238 | 0.1107 | 0.0879 |
| 0.1107 | 0.1270 | 0.0814 |
| Ionosphere | 0.1054 | 0.0684 | 0.0741 | 0.0741 | 0.0766 | 0.0912 |
|
| Dermatology | 0.0191 | 0.0328 | 0.0656 |
| 0.0367 | 0.0546 | 0.0519 |
| House-votes-84 | 0.0943 | 0.0552 | 0.0506 | 0.0529 |
| 0.0575 | 0.0437 |
| Chess | 0.1125 |
| 0.0998 | 0.0998 | 0.1074 |
|
|
| Soybean | 0.0893 |
| 0.0556 |
| 0.0703 | 0.0542 | 0.0527 |
| Breast-cancer-w |
| 0.0415 | 0.0744 | 0.0358 | 0.0386 | 0.0401 | 0.0629 |
| Tic-tac-toe | 0.3069 | 0.2286 | 0.2035 | 0.2651 | 0.2115 |
| 0.2004 |
| Vowel | 0.4242 |
| 0.1818 | 0.1495 | 0.1674 | 0.1788 | 0.1626 |
| Car | 0.1400 | 0.0567 |
| 0.0816 | 0.0772 | 0.0596 | 0.0411 |
| Mfeat-mor | 0.3140 |
| 0.3060 | 0.3145 | 0.3000 | 0.3015 | 0.3035 |
| Segment | 0.0788 | 0.0390 | 0.0472 |
| 0.0413 | 0.0355 | 0.0433 |
| Hypothyroid | 0.0149 | 0.0104 | 0.0107 | 0.0136 | 0.0122 |
| 0.0095 |
| Kr-vs-kp | 0.1214 | 0.0776 | 0.0416 | 0.0842 |
| 0.0460 | 0.0382 |
| Dis | 0.0159 | 0.0159 | 0.0138 | 0.0130 | 0.0133 | 0.0127 |
|
| Hypo | 0.0138 | 0.0141 | 0.0114 |
| 0.0122 | 0.0098 | 0.0098 |
| Sick | 0.0308 | 0.0257 |
| 0.0273 | 0.0263 | 0.0270 | 0.0233 |
| Abalone | 0.4762 | 0.4587 | 0.4563 |
| 0.4823 | 0.4534 | 0.4484 |
| Waveform-5000 | 0.2006 | 0.1844 | 0.2000 |
| 0.1558 | 0.1782 | 0.1756 |
| Phoneme | 0.2615 | 0.2733 | 0.1984 | 0.2392 |
| 0.3139 | 0.1931 |
| Wall-following | 0.1054 | 0.0554 | 0.0401 | 0.0370 |
| 0.0398 | 0.0387 |
| Page-blocks | 0.0619 | 0.0415 | 0.0391 | 0.0338 |
| 0.0323 | 0.0322 |
| Satellite | 0.1806 | 0.1214 | 0.1080 | 0.1148 | 0.1085 | 0.1265 |
|
| Thyroid | 0.1111 | 0.0720 | 0.0706 | 0.0701 | 0.0750 |
| 0.0642 |
| Pendigits | 0.1181 | 0.0321 | 0.0294 |
| 0.0339 | 0.0202 | 0.0248 |
| Sign | 0.3586 | 0.2755 | 0.2539 | 0.2821 |
| 0.2685 | 0.2419 |
| Nursery | 0.0973 | 0.0654 | 0.0289 | 0.0730 |
| 0.0509 | 0.0356 |
| Magic | 0.2239 | 0.1675 | 0.1637 | 0.1752 | 0.1674 | 0.1716 |
|
| Letter-recog | 0.2525 | 0.1300 | 0.0986 | 0.0883 | 0.0902 |
| 0.0861 |
| Adult | 0.1592 | 0.1380 | 0.1383 | 0.1493 |
| 0.1315 | 0.1316 |
| Shuttle | 0.0039 | 0.0015 | 0.0009 | 0.0008 |
| 0.0006 | 0.0007 |
| Connect-4 | 0.2783 | 0.2354 | 0.2283 | 0.2420 |
| 0.2337 | 0.2268 |
| Waveform | 0.0220 | 0.0202 | 0.0256 |
| 0.1558 | 0.0194 | 0.0193 |
| Localization | 0.4955 | 0.3575 | 0.2964 | 0.3596 | 0.2976 |
| 0.2743 |
Experimental results of average RMSE.
| Dataset | NB | TAN | KDB | AODE | RF | kdb | KDB |
|---|---|---|---|---|---|---|---|
| Contact-lenses | 0.3778 | 0.4496 |
| 0.4066 | 0.4098 | 0.4086 | 0.3825 |
| Labor |
| 0.2185 | 0.1685 | 0.1900 | 0.2824 | 0.3647 | 0.2271 |
| Echocardiogram | 0.4896 | 0.4886 | 0.4889 | 0.4903 |
| 0.4782 | 0.4813 |
| Lymphography |
| 0.2684 | 0.3031 | 0.2478 | 0.2701 | 0.2729 | 0.2680 |
| Sonar | 0.4421 | 0.4131 | 0.4084 | 0.4285 |
| 0.4071 | 0.3959 |
| Glass-id | 0.3540 | 0.3332 | 0.3395 | 0.3439 |
| 0.3311 | 0.3275 |
| New-thyroid |
| 0.1731 | 0.1797 | 0.1614 | 0.1560 | 0.1689 | 0.1714 |
| Heart-disease-c | 0.3743 | 0.3775 | 0.3963 | 0.3659 | 0.3696 |
| 0.3690 |
| Soybean-large | 0.1032 | 0.0963 | 0.0858 | 0.0858 | 0.1143 | 0.1051 |
|
| Ionosphere | 0.3157 | 0.2615 | 0.2714 | 0.2506 |
| 0.2822 | 0.2523 |
| Dermatology |
| 0.0851 | 0.1206 | 0.0692 | 0.1303 | 0.1857 | 0.1313 |
| House-votes-84 | 0.2997 | 0.2181 | 0.1969 | 0.1994 |
| 0.1962 | 0.1847 |
| Chess | 0.2944 |
| 0.2615 | 0.2725 | 0.2771 | 0.2937 | 0.2642 |
| Soybean | 0.0933 |
| 0.0654 | 0.0656 | 0.0922 | 0.0754 | 0.0643 |
| Breast-cancer-w |
| 0.1928 | 0.2497 | 0.1848 | 0.1796 | 0.2194 | 0.2137 |
| Tic-tac-toe | 0.4309 | 0.4023 | 0.3772 | 0.3995 |
| 0.3830 | 0.3693 |
| Vowel | 0.2270 |
| 0.1582 | 0.1425 | 0.1581 | 0.1685 | 0.1516 |
| Car | 0.2252 | 0.1617 |
| 0.2005 | 0.1782 | 0.1749 | 0.1505 |
| Mfeat-mor | 0.2086 |
| 0.1974 | 0.1985 | 0.2074 | 0.1948 | 0.1954 |
| Segment | 0.1398 | 0.0967 | 0.1034 |
| 0.1061 | 0.0957 | 0.0919 |
| Hypothyroid | 0.1138 | 0.0955 | 0.0937 | 0.1036 |
| 0.0979 | 0.0913 |
| Kr-vs-kp | 0.3022 | 0.2358 | 0.1869 | 0.2638 |
| 0.2626 | 0.2091 |
| Dis | 0.1177 | 0.1103 | 0.1024 | 0.1080 |
| 0.1074 | 0.1021 |
| Hypo | 0.0766 | 0.0738 | 0.0671 | 0.0650 | 0.0715 | 0.0719 |
|
| Sick | 0.1700 | 0.1434 |
| 0.1572 | 0.1487 | 0.1489 | 0.1394 |
| Abalone | 0.4630 | 0.4250 | 0.4277 |
| 0.4539 | 0.4220 | 0.4220 |
| Waveform-5000 | 0.3348 | 0.2947 | 0.3149 |
| 0.3036 | 0.2950 | 0.2869 |
| Phoneme | 0.0880 | 0.0902 | 0.0784 | 0.0885 |
| 0.0952 | 0.0783 |
| Wall-following | 0.2177 | 0.1586 | 0.1363 | 0.1292 |
| 0.1315 | 0.1210 |
| Page-blocks | 0.1450 | 0.1187 | 0.1128 | 0.1021 | 0.0974 |
| 0.0991 |
| Satellite | 0.2400 | 0.1851 | 0.1777 | 0.1800 | 0.1682 | 0.1865 |
|
| Thyroid | 0.0967 | 0.0746 | 0.0744 | 0.0745 | 0.0770 |
| 0.0679 |
| Pendigits | 0.1427 | 0.0725 | 0.0687 |
| 0.0979 | 0.0793 | 0.0646 |
| Sign | 0.3984 | 0.3505 | 0.3334 | 0.3524 |
| 0.3468 | 0.3300 |
| Nursery | 0.1766 | 0.1385 | 0.1121 | 0.1571 |
| 0.1372 | 0.1217 |
| Magic | 0.3974 | 0.3461 | 0.3470 | 0.3541 | 0.3571 | 0.3514 |
|
| Letter-recog | 0.1184 | 0.0860 | 0.0768 | 0.0707 | 0.0896 | 0.0756 |
|
| Adult | 0.3409 | 0.3076 | 0.3089 | 0.3245 | 0.3274 | 0.3021 |
|
| Shuttle | 0.0298 | 0.0182 | 0.0146 | 0.0126 | 0.0142 |
| 0.0125 |
| Connect-4 | 0.3587 | 0.3315 | 0.3247 | 0.3370 |
| 0.3409 | 0.3279 |
| Waveform | 0.1176 | 0.0951 | 0.1145 | 0.0860 |
| 0.0999 | 0.0901 |
| Localization | 0.2390 | 0.2095 | 0.1960 | 0.2095 | 0.1939 |
| 0.1846 |
W/D/L comparison results of zero-one loss on all datasets.
| NB | TAN | KDB | AODE | kdb | |
|---|---|---|---|---|---|
| TAN | 29/7/6 | - | - | - | - |
| KDB | 30/5/7 | 20/9/13 | - | - | - |
| AODE | 33/5/4 | 16/14/12 | 20/6/16 | - | - |
| kdb | 30/5/7 | 17/18/7 | 20/11/11 | 13/15/14 | - |
| KDB | 34/3/5 | 23/13/6 | 26/13/3 | 22/10/10 | 14/20/8 |
W/D/L comparison results of RMSE on all datasets.
| NB | TAN | KDB | AODE | kdb | |
|---|---|---|---|---|---|
| TAN | 32/4/6 | - | - | - | - |
| KDB | 32/4/6 | 16/19/7 | - | - | - |
| AODE | 29/9/4 | 16/19/7 | 15/15/12 | - | - |
| kdb | 30/5/7 | 9/21/12 | 11/17/14 | 7/19/16 | - |
| KDB | 34/3/5 | 20/19/3 | 17/20/5 | 17/17/8 | 21/21/0 |
Figure 5Scatter plot of zero-one loss and RMSE comparisons for KDB, KDB and AODE.
Experimental results of average bias.
| Dataset | NB | TAN | KDB | AODE | RF | kdb | KDB |
|---|---|---|---|---|---|---|---|
| Contact-lenses | 0.2163 | 0.1825 | 0.3175 | 0.2850 |
| 0.1863 | 0.2850 |
| Labor | 0.0289 | 0.0211 | 0.0279 | 0.0347 | 0.0409 |
| 0.0279 |
| Echocardiogram | 0.2844 | 0.2642 | 0.3065 | 0.2751 |
| 0.2602 | 0.2686 |
| Lymphography |
| 0.1027 | 0.1041 | 0.0933 | 0.1288 | 0.0951 | 0.0996 |
| Sonar | 0.1672 | 0.1646 | 0.1686 | 0.1696 |
| 0.1829 | 0.1762 |
| Glass-id | 0.2901 | 0.2756 | 0.2713 | 0.2785 |
| 0.2730 | 0.2732 |
| New-thyroid | 0.0290 |
| 0.0348 |
| 0.0285 | 0.0279 | 0.0396 |
| Heart-disease-c | 0.1297 | 0.1263 | 0.1299 | 0.1138 | 0.1304 |
| 0.1274 |
| Soybean-large | 0.1070 | 0.1422 | 0.1086 |
| 0.1213 | 0.1717 | 0.1112 |
| Ionosphere | 0.1220 | 0.0804 | 0.0855 | 0.0744 |
| 0.0912 | 0.0862 |
| Dermatology | 0.0079 | 0.0274 | 0.0489 |
| 0.0190 | 0.0541 | 0.0451 |
| House-votes-84 | 0.0899 | 0.0410 |
| 0.0430 | 0.0327 | 0.0457 | 0.0301 |
| Chess | 0.1413 | 0.1437 | 0.1119 | 0.1290 |
| 0.1265 | 0.1192 |
| Soybean | 0.1015 | 0.0522 |
| 0.0524 | 0.0586 | 0.0971 | 0.0502 |
| Breast-cancer-w |
| 0.0384 | 0.0449 | 0.0338 | 0.0301 | 0.0221 | 0.0348 |
| Tic-tac-toe | 0.2614 | 0.1746 |
| 0.2005 | 0.0270 | 0.1434 | 0.1390 |
| Vowel | 0.3301 | 0.1942 | 0.1745 | 0.1895 |
| 0.1845 | 0.1736 |
| Car | 0.0937 | 0.0478 | 0.0387 | 0.0556 | 0.0389 | 0.0389 |
|
| Mfeat-mor | 0.2624 |
| 0.2142 | 0.2477 | 0.2311 | 0.2223 | 0.2166 |
| Segment | 0.0785 | 0.0491 | 0.0453 | 0.0367 |
| 0.0387 | 0.0419 |
| Hypothyroid | 0.0116 | 0.0104 | 0.0096 | 0.0094 | 0.0516 |
| 0.0094 |
| Kr-vs-kp | 0.1107 | 0.0702 | 0.0417 | 0.0747 |
| 0.0434 | 0.0407 |
| Dis |
| 0.0193 | 0.0191 | 0.0170 | 0.0203 | 0.0192 | 0.0191 |
| Hypo | 0.0092 | 0.0124 | 0.0077 |
| 0.0083 | 0.0098 | 0.0073 |
| Sick | 0.0246 | 0.0207 | 0.0198 | 0.0224 |
| 0.0254 | 0.0196 |
| Abalone | 0.4180 | 0.3126 |
| 0.3201 | 0.3257 | 0.3195 | 0.3132 |
| Waveform-5000 | 0.1762 | 0.1232 | 0.1157 | 0.1235 |
| 0.1219 | 0.1147 |
| Phoneme | 0.2216 | 0.2394 | 0.1572 | 0.2207 |
| 0.2927 | 0.1551 |
| Wall-following | 0.0951 | 0.0491 | 0.0257 | 0.0251 |
| 0.0296 | 0.0245 |
| Page-blocks | 0.0451 | 0.0308 | 0.0280 | 0.0251 |
| 0.0277 | 0.0264 |
| Satellite | 0.1746 | 0.0950 | 0.0808 | 0.0902 | 0.0874 | 0.1011 |
|
| Thyroid | 0.0994 | 0.0587 | 0.0553 | 0.0611 | 0.0516 |
| 0.0531 |
| Pendigits | 0.1095 | 0.0314 | 0.0207 | 0.0228 | 0.0216 | 0.0196 |
|
| Sign | 0.3257 | 0.2420 | 0.2161 | 0.2531 |
| 0.2322 | 0.2132 |
| Nursery | 0.0928 | 0.0521 | 0.0281 | 0.0651 |
| 0.0400 | 0.0322 |
| Magic | 0.2111 | 0.1252 |
| 0.1600 | 0.1244 | 0.1323 | 0.1265 |
| Letter-recog | 0.2207 | 0.1032 | 0.0806 | 0.0876 |
| 0.0700 | 0.0732 |
| Adult | 0.1649 | 0.1312 | 0.1220 | 0.1437 |
| 0.1240 | 0.1226 |
| Shuttle | 0.0040 | 0.0008 | 0.0007 |
|
|
|
|
| Connect-4 | 0.2660 | 0.2253 | 0.2022 | 0.2264 |
| 0.2169 | 0.2075 |
| Waveform | 0.0219 |
| 0.0210 | 0.0156 | 0.0158 | 0.0172 | 0.0161 |
| Localization | 0.4523 | 0.3106 | 0.2134 | 0.3129 | 0.2047 |
| 0.2038 |
Experimental results of average variance.
| Dataset | NB | TAN | KDB | AODE | RF | kdb | KDB |
|---|---|---|---|---|---|---|---|
| Contact-lenses | 0.1713 | 0.1925 | 0.1700 |
| 0.2013 | 0.2138 | 0.1775 |
| Labor | 0.0395 | 0.0632 | 0.0721 |
| 0.0758 | 0.0605 | 0.0721 |
| Echocardiogram | 0.1272 |
| 0.1400 | 0.1319 | 0.1469 | 0.1374 | 0.1337 |
| Lymphography |
| 0.1116 | 0.1408 | 0.0476 | 0.1352 | 0.0927 | 0.1249 |
| Sonar |
| 0.1165 | 0.1199 | 0.0942 | 0.1189 | 0.0983 | 0.1107 |
| Glass-id |
| 0.1075 | 0.1189 | 0.1004 | 0.1089 | 0.1101 | 0.1099 |
| New-thyroid |
| 0.0272 | 0.0385 | 0.0230 | 0.0365 | 0.0285 | 0.0351 |
| Heart-disease-c |
| 0.0479 | 0.0582 | 0.0357 | 0.0718 | 0.0466 | 0.0498 |
| Soybean-large |
| 0.1176 | 0.0982 | 0.0842 | 0.1373 | 0.0921 | 0.0947 |
| Ionosphere |
| 0.0401 | 0.0581 | 0.0385 | 0.0582 | 0.0344 | 0.0497 |
| Dermatology | 0.0216 | 0.0513 | 0.0684 |
| 0.0685 | 0.0746 | 0.0648 |
| House-votes-84 |
| 0.0170 | 0.0197 | 0.0094 | 0.0179 | 0.0164 | 0.0168 |
| Chess |
| 0.0486 | 0.0531 | 0.0415 | 0.0626 | 0.0423 | 0.0447 |
| Soybean |
| 0.0654 | 0.0439 | 0.0326 | 0.0606 | 0.0509 | 0.0406 |
| Breast-cancer-w |
| 0.0337 | 0.0504 | 0.0134 | 0.0101 | 0.0199 | 0.0425 |
| Tic-tac-toe |
| 0.0824 | 0.1125 | 0.0513 | 0.0590 | 0.0813 | 0.0951 |
| Vowel | 0.2542 | 0.2445 | 0.2325 | 0.2344 |
| 0.2337 | 0.2255 |
| Car | 0.0520 |
| 0.0434 | 0.0438 | 0.0456 | 0.0447 | 0.0379 |
| Mfeat-mor |
| 0.1020 | 0.1031 | 0.0677 | 0.1351 | 0.0882 | 0.0960 |
| Segment | 0.0259 | 0.0294 | 0.0381 | 0.0255 |
| 0.0291 | 0.0344 |
| Hypothyroid | 0.0031 | 0.0034 |
| 0.0034 | 0.0279 | 0.0034 |
|
| Kr-vs-kp | 0.0186 | 0.0152 | 0.0111 | 0.0186 | 0.0077 |
| 0.0077 |
| Dis | 0.0069 | 0.0005 | 0.0011 | 0.0071 | 0.0021 | 0.0005 |
|
| Hypo | 0.0051 | 0.0071 | 0.0069 | 0.0049 |
| 0.0078 | 0.0060 |
| Sick | 0.0047 | 0.0051 | 0.0043 | 0.0042 | 0.0082 | 0.0052 |
|
| Abalone |
| 0.1693 | 0.1769 | 0.1544 | 0.1865 | 0.1511 | 0.1633 |
| Waveform-5000 |
| 0.0690 | 0.0843 | 0.0410 | 0.0528 | 0.0625 | 0.0666 |
| Phoneme | 0.1215 | 0.1828 | 0.1064 | 0.1343 |
| 0.1850 | 0.1052 |
| Wall-following | 0.0211 | 0.0288 | 0.0294 | 0.0242 |
| 0.0266 | 0.0278 |
| Page-blocks | 0.0135 | 0.0143 | 0.0177 | 0.0124 |
| 0.0115 | 0.0146 |
| Satellite |
| 0.0367 | 0.0455 | 0.0363 | 0.0251 | 0.0388 | 0.0406 |
| Thyroid |
| 0.0257 | 0.0272 | 0.0235 | 0.0279 | 0.0220 | 0.0235 |
| Pendigits | 0.0157 | 0.0200 | 0.0236 |
| 0.0148 | 0.0157 | 0.0198 |
| Sign |
| 0.0386 | 0.0596 | 0.0378 | 0.0593 | 0.0572 | 0.0488 |
| Nursery |
| 0.0168 | 0.0195 | 0.0105 | 0.0193 | 0.0179 | 0.0168 |
| Magic |
| 0.0490 | 0.0491 | 0.0297 | 0.0512 | 0.0407 | 0.0440 |
| Letter-recog | 0.0471 | 0.0591 | 0.0709 | 0.0448 | 0.0492 |
| 0.0619 |
| Adult |
| 0.0165 | 0.0285 | 0.0116 | 0.0425 | 0.0141 | 0.0185 |
| Shuttle | 0.0009 | 0.0004 |
| 0.0004 | 0.0004 | 0.0004 |
|
| Connect-4 | 0.0156 |
| 0.0309 | 0.0222 | 0.0534 | 0.0215 | 0.0222 |
| Waveform |
| 0.0053 | 0.0037 | 0.0025 | 0.0068 | 0.0021 | 0.0035 |
| Localization |
| 0.0594 | 0.1099 | 0.0580 | 0.1106 | 0.0897 | 0.0955 |
W/D/L comparison results of bias on all datasets.
| NB | TAN | KDB | AODE | kdb | |
|---|---|---|---|---|---|
| TAN | 30/5/7 | - | - | - | - |
| KDB | 30/5/7 | 25/9/8 | - | - | - |
| AODE | 32/7/3 | 18/14/10 | 15/4/23 | - | - |
| kdb | 31/3/8 | 20/10/12 | 15/8/19 | 16/11/15 | - |
| KDB | 32/3/7 | 26/9/7 | 11/27/4 | 21/13/8 | 17/18/7 |
W/D/L comparison results of variance on all datasets.
| NB | TAN | KDB | AODE | kdb | |
|---|---|---|---|---|---|
| TAN | 4/3/35 | - | - | - | - |
| KDB | 8/1/33 | 9/7/26 | - | - | - |
| AODE | 9/8/25 | 30/8/4 | 34/3/5 | - | - |
| kdb | 7/1/34 | 19/13/10 | 30/4/8 | 6/10/26 | - |
| KDB | 8/2/32 | 16/12/14 | 34/8/0 | 7/4/31 | 12/9/21 |
Figure 6Training and classification time comparisons for BNCs.
W/D/L records between KDB and RF.
| All | Small | Medium | Large | |
|---|---|---|---|---|
| Zero-one loss | 20/10/12 | 10/4/3 | 7/4/4 | 3/2/5 |
| RMSE | 16/15/11 | 4/9/4 | 8/4/3 | 4/2/4 |
| Bias | 11/11/20 | 5/1/11 | 5/5/5 | 1/5/4 |
| Variance | 26/4/12 | 11/3/3 | 7/1/7 | 8/0/2 |
Figure 7Training and classification time comparisons between KDB and RF.
Description of Heart-disease-c dataset.
| Attribute | Description | Symbol |
|---|---|---|
| age | real value |
|
| sex | male or female, {0,1} |
|
| cp | chest pain type (angina, abnang, notang, asympt), {1,2,3,4} |
|
| trestbps | resting blood pressure, real value |
|
| chol | cholesterol, real value |
|
| fbs | fasting blood sugar < 120 (true or false), {0,1} |
|
| restecg | resting electrocardiographic results (norm, abn, hyper), {0,1,2} |
|
| thalach | maximum heart rate achieved, real value |
|
| exang | exercise induced angina (true or false), {0,1} |
|
| oldpeak | ST depression induced by exercise relative to rest, real value |
|
| slope | the slope of the peak exercise ST segment (up, flat, down), {1,2,3} |
|
| ca | number of vessels colored, real value |
|
| thal | thal (norm, fixed, rever), {3,6,7} |
|
| class | 0 for health, 1 for sick |
|
Figure 8The structure of KDB on Heart-disease-c dataset.
Figure 9The structure of submodels of kdb.
Figure 10The scatter plot of KDB and RF in terms of MCC. Dis dataset is annotated with red color, which is a notable case where KDB enjoys significant advantages.
Ranks in terms of zero-one loss of different learners.
| Dataset | NB | TAN | KDB | AODE | RF | kdb | KDB |
|---|---|---|---|---|---|---|---|
| Contact-lenses | 5.0 | 5.0 |
| 7.0 | 3.0 | 5.0 | 2.0 |
| Labor |
| 5.0 |
| 6.0 | 7.0 |
|
|
| Echocardiogram | 5.0 | 4.0 | 6.0 | 3.0 | 7.0 |
|
|
| Lymphography |
| 3.5 | 7.0 | 2.0 | 6.0 | 3.5 | 5.0 |
| Sonar | 4.5 | 2.0 | 6.5 | 3.0 |
| 6.5 | 4.5 |
| Glass-id | 7.0 | 3.5 | 3.5 | 6.0 |
| 5.0 | 2.0 |
| New-thyroid | 2.0 | 5.0 | 6.0 |
| 7.0 | 3.5 | 3.5 |
| Heart-disease-c |
| 4.5 | 7.0 | 3.0 | 6.0 | 2.0 | 4.5 |
| Soybean-large | 6.0 | 4.5 | 3.0 |
| 4.5 | 7.0 | 2.0 |
| Ionosphere | 7.0 | 2.0 | 3.0 | 4.0 | 5.0 | 6.0 |
|
| Dermatology | 2.0 | 3.0 | 7.0 |
| 4.0 | 6.0 | 5.0 |
| House-votes-84 | 7.0 | 5.0 | 3.0 | 4.0 |
| 6.0 | 2.0 |
| Chess | 7.0 | 2.0 | 4.0 | 5.0 | 6.0 |
|
|
| Soybean | 7.0 |
| 5.0 | 2.0 | 6.0 | 4.0 | 3.0 |
| Breast-cancer-w |
| 5.0 | 7.0 | 2.0 | 3.0 | 4.0 | 6.0 |
| Tic-tac-toe | 7.0 | 5.0 | 3.0 | 6.0 | 4.0 |
| 2.0 |
| Vowel | 7.0 |
| 6.0 | 2.0 | 4.0 | 5.0 | 3.0 |
| Car | 7.0 | 3.0 |
| 6.0 | 5.0 | 4.0 | 2.0 |
| Mfeat-mor | 6.0 |
| 5.0 | 7.0 | 2.0 | 3.0 | 4.0 |
| Segment | 7.0 | 3.0 | 6.0 |
| 4.0 | 2.0 | 5.0 |
| Hypothyroid | 7.0 | 3.0 | 4.0 | 6.0 | 5.0 |
| 2.0 |
| Kr-vs-kp | 7.0 | 5.0 | 3.0 | 6.0 |
| 4.0 | 2.0 |
| Dis | 6.5 | 6.5 | 5.0 | 3.0 | 4.0 | 2.0 |
|
| Hypo | 6.0 | 7.0 | 4.0 |
| 5.0 | 2.5 | 2.5 |
| Sick | 7.0 | 3.0 |
| 6.0 | 4.0 | 5.0 | 2.0 |
| Abalone | 6.0 | 5.0 | 4.0 | 2.0 | 7.0 | 3.0 |
|
| Waveform-5000 | 7.0 | 5.0 | 6.0 |
| 2.0 | 4.0 | 3.0 |
| Phoneme | 5.0 | 6.0 | 3.0 | 4.0 |
| 7.0 | 2.0 |
| Wall-following | 7.0 | 6.0 | 5.0 | 2.0 |
| 4.0 | 3.0 |
| Page-blocks | 7.0 | 6.0 | 5.0 | 4.0 |
| 3.0 | 2.0 |
| Satellite | 7.0 | 5.0 | 2.0 | 4.0 | 3.0 | 6.0 |
|
| Thyroid | 7.0 | 5.0 | 3.0 | 4.0 | 6.0 |
| 2.0 |
| Pendigits | 7.0 | 5.0 | 4.0 |
| 6.0 |
| 3.0 |
| Sign | 7.0 | 5.0 | 3.0 | 6.0 |
| 4.0 | 2.0 |
| Nursery | 7.0 | 5.0 | 2.0 | 6.0 |
| 4.0 | 3.0 |
| Magic | 7.0 | 4.0 | 2.0 | 6.0 | 3.0 | 5.0 |
|
| Letter-recog | 7.0 | 6.0 | 5.0 | 3.0 | 4.0 |
| 2.0 |
| Adult | 7.0 | 4.0 | 5.0 | 6.0 |
| 2.0 | 3.0 |
| Shuttle | 7.0 | 6.0 | 5.0 | 4.0 |
| 2.0 | 3.0 |
| Connect-4 | 7.0 | 5.0 | 3.0 | 6.0 |
| 4.0 | 2.0 |
| Waveform | 5.0 | 4.0 | 6.0 |
| 7.0 | 3.0 | 2.0 |
| Localization | 7.0 | 5.0 | 3.0 | 6.0 | 4.0 |
| 2.0 |
| Sum of ranks | 246.5 | 179.5 | 175.5 | 160.5 | 155.5 | 149.5 |
|
Ranks in terms of RMSE of different learners.
| Dataset | NB | TAN | KDB | AODE | RF | kdb | KDB |
|---|---|---|---|---|---|---|---|
| Contact-lenses | 2.0 | 7.0 |
| 4.0 | 6.0 | 5.0 | 3.0 |
| Labor |
| 4.0 | 2.0 | 3.0 | 6.0 | 7.0 | 5.0 |
| Echocardiogram | 5.0 | 4.0 | 7.0 | 6.0 |
| 2.0 | 3.0 |
| Lymphography |
| 4.0 | 7.0 | 2.0 | 5.0 | 6.0 | 3.0 |
| Sonar | 7.0 | 5.0 | 4.0 | 6.0 |
| 3.0 | 2.0 |
| Glass-id | 7.0 | 4.0 | 5.0 | 6.0 |
| 3.0 | 2.0 |
| New-thyroid |
| 6.0 | 7.0 | 3.0 | 2.0 | 4.0 | 5.0 |
| Heart-disease-c | 5.0 | 6.0 | 7.0 | 2.0 | 4.0 |
| 3.0 |
| Soybean-large | 5.0 | 4.0 | 3.0 | 2.0 | 7.0 | 6.0 |
|
| Ionosphere | 7.0 | 4.0 | 5.0 | 2.0 |
| 6.0 | 3.0 |
| Dermatology |
| 3.0 | 4.0 | 2.0 | 5.0 | 7.0 | 6.0 |
| House-votes-84 | 7.0 | 6.0 | 4.0 | 5.0 |
| 3.0 | 2.0 |
| Chess | 7.0 |
| 2.0 | 4.0 | 5.0 | 6.0 | 3.0 |
| Soybean | 7.0 |
| 4.0 | 3.0 | 6.0 | 5.0 | 2.0 |
| Breast-cancer-w |
| 4.0 | 7.0 | 3.0 | 2.0 | 6.0 | 5.0 |
| Tic-tac-toe | 7.0 | 6.0 | 3.0 | 5.0 |
| 4.0 | 2.0 |
| Vowel | 7.0 |
| 5.0 | 2.0 | 4.0 | 6.0 | 3.0 |
| Car | 7.0 | 3.0 |
| 6.0 | 5.0 | 4.0 | 2.0 |
| Mfeat-mor | 7.0 |
| 5.0 | 4.0 | 6.0 | 2.0 | 3.0 |
| Segment | 7.0 | 4.0 | 5.0 |
| 6.0 | 3.0 | 2.0 |
| Hypothyroid | 7.0 | 4.0 | 3.0 | 6.0 |
| 5.0 | 2.0 |
| Kr-vs-kp | 7.0 | 4.0 | 2.0 | 6.0 |
| 5.0 | 3.0 |
| Dis | 7.0 | 6.0 | 3.0 | 5.0 |
| 4.0 | 2.0 |
| Hypo | 7.0 | 6.0 | 3.0 | 2.0 | 4.0 | 5.0 |
|
| Sick | 7.0 | 3.0 | 2.0 | 6.0 | 4.0 | 5.0 |
|
| Abalone | 7.0 | 4.0 | 5.0 |
| 6.0 | 2.5 | 2.5 |
| Waveform-5000 | 7.0 | 3.0 | 6.0 |
| 5.0 | 4.0 | 2.0 |
| Phoneme | 4.0 | 6.0 | 3.0 | 5.0 |
| 7.0 | 2.0 |
| Wall-following | 7.0 | 6.0 | 5.0 | 3.0 |
| 4.0 | 2.0 |
| Page-blocks | 7.0 | 6.0 | 5.0 | 4.0 | 2.0 |
| 3.0 |
| Satellite | 7.0 | 5.0 | 3.0 | 4.0 | 2.0 | 6.0 |
|
| Thyroid | 7.0 | 4.0 | 5.0 | 3.0 | 6.0 |
| 2.0 |
| Pendigits | 7.0 | 4.0 | 3.0 |
| 6.0 | 5.0 | 2.0 |
| Sign | 7.0 | 5.0 | 3.0 | 6.0 |
| 4.0 | 2.0 |
| Nursery | 7.0 | 5.0 | 2.0 | 6.0 |
| 4.0 | 3.0 |
| Magic | 7.0 | 2.0 | 3.0 | 5.0 | 6.0 | 4.0 |
|
| Letter-recog | 7.0 | 5.0 | 4.0 | 2.0 | 6.0 | 3.0 |
|
| Adult | 7.0 | 3.0 | 4.0 | 5.0 | 6.0 | 2.0 |
|
| Shuttle | 7.0 | 6.0 | 5.0 | 3.0 | 4.0 |
| 2.0 |
| Connect-4 | 7.0 | 4.0 | 3.0 | 5.0 |
| 6.0 | 2.0 |
| Waveform | 7.0 | 4.0 | 6.0 | 2.0 |
| 5.0 | 3.0 |
| Localization | 7.0 | 5.5 | 4.0 | 5.5 | 3.0 |
| 2.0 |
| Sum of ranks | 250.0 | 178.5 | 170.0 | 157.5 | 144.0 | 173.5 |
|
Figure 11Average ranks in terms of zero-one loss and RMSE for all learners.
Figure 12Nemenyi test in terms of zero-one loss and RMSE for all learners.