| Literature DB >> 28753613 |
Jinyan Li1, Lian-Sheng Liu2, Simon Fong1, Raymond K Wong3, Sabah Mohammed4, Jinan Fiaidhi4, Yunsick Sung5, Kelvin K L Wong6.
Abstract
Clinical data analysis and forecasting have made substantial contributions to disease control, prevention and detection. However, such data usually suffer from highly imbalanced samples in class distributions. In this paper, we aim to formulate effective methods to rebalance binary imbalanced dataset, where the positive samples take up only the minority. We investigate two different meta-heuristic algorithms, particle swarm optimization and bat algorithm, and apply them to empower the effects of synthetic minority over-sampling technique (SMOTE) for pre-processing the datasets. One approach is to process the full dataset as a whole. The other is to split up the dataset and adaptively process it one segment at a time. The experimental results reported in this paper reveal that the performance improvements obtained by the former methods are not scalable to larger data scales. The latter methods, which we call Adaptive Swarm Balancing Algorithms, lead to significant efficiency and effectiveness improvements on large datasets while the first method is invalid. We also find it more consistent with the practice of the typical large imbalanced medical datasets. We further use the meta-heuristic algorithms to optimize two key parameters of SMOTE. The proposed methods lead to more credible performances of the classifier, and shortening the run time compared to brute-force method.Entities:
Mesh:
Year: 2017 PMID: 28753613 PMCID: PMC5533448 DOI: 10.1371/journal.pone.0180830
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
The environment and parmeters of PSO and BA.
| PSO | BA | ||
|---|---|---|---|
| Parameter | value | Parameter | value |
| Population | 20 | Populations | 20 |
| Iterations | 1000 | Iterations | 1000 |
| c1 | 1.5 | A(loudness) | 0.5 |
| c2 | 1.5 | R(pulse rate) | 1 |
| Qmin | 0 | ||
| Qmax | 2 | ||
Fig 1Flow chart of the Swarm Balancing Algorithms.
Characteristics of the highly imbalanced datasets used in experiment 2.
| Bioassay No. | Negative | Length of each window in | Positive | Length of each window in | Total instances | Imbalance ratio (Majority/Minority) |
|---|---|---|---|---|---|---|
| 362 | 3375 | 562(+1), 1124(+1), 1686(+1) | 48 | 8, 16, 24 | 3423 | 70.3125 |
| 1608 | 772 | 128(+1), 256(+1), 384(+2) | 55 | 9, 18, 27(+1) | 827 | 14.03636364 |
| 746 | 47538 | 7923, 15846, 23769 | 293 | 48(+1), 96(+2), 144(+2) | 47831 | 162.2457338 |
| 687 | 26378 | 4396, 8792(+1), 13188(+1) | 76 | 12(+1), 24(+1), 36(+2) | 26454 | 347.0789474 |
| 456 | 7964 | 1327, 2654(+1), 3981(+1) | 22 | 3(+1), 6(+1), 9(+2) | 7986 | 362 |
| 373 | 47781 | 7963(+1), 15926(+1), 23889(+1) | 50 | 8, 16(+1), 24(+1) | 47831 | 955.62 |
Fig 2Principle of the Adaptive Swarm Balancing Algorithms.
Results of surgery dataset in experiment 1.
| Data name: | Surgery Data | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Algorithms | positive | negative | Accuracy | Kappa | Imbalance ratio | Precision | Recall | F-Measure | ROC Area |
| PN | 70 | 400 | 83.19% | 0.00 | 0.18 | 0.75 | 0.83 | 0.78 | 0.64 |
| Class balancer-PN | 235 | 235 | 58.89% | 0.18 | 1.00 | 0.59 | 0.59 | 0.59 | 0.61 |
| SMOTE(complete balance, K = 5) | 399 | 400 | 72.09% | 0.442 | 1.00 | 0.734 | 0.721 | 0.717 | 0.774 |
| PSO-Balancing Algorithm-PN | 408 | 400 | 82.55% | 0.65 | 1.02 | 0.83 | 0.83 | 0.83 | 0.85 |
| BA-Balancing Algorithm-PN | 213 | 400 | 78.14% | 0.52 | 0.53 | 0.78 | 0.78 | 0.78 | 0.78 |
Results of bioassay 362 dataset in experiment 1.
| Data name: | AID362 | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Algorithms | positive | negative | Accuracy | Kappa | Imbalance ratio(Min/Maj) | Precision | Recall | F-Measure | ROC Area |
| PN | 48 | 3375 | 98.60% | 0.00 | 0.01 | 0.97 | 0.99 | 0.98 | 0.58 |
| Class balancer-PN | 1711.5 | 1711.5 | 57.92% | 0.16 | 1.00 | 0.63 | 0.58 | 0.53 | 0.58 |
| SMOTE(complete balance, K = 5) | 3374 | 3375 | 63.14% | 0.2628 | 1.00 | 0.786 | 0.631 | 0.574 | 0.641 |
| PSO-Balancing Algorithm-PN | 3393 | 3375 | 63.18% | 0.26 | 1.01 | 0.79 | 0.63 | 0.57 | 0.64 |
| BA-Balancing Algorithm-PN | 3344 | 3375 | 62.91% | 0.26 | 0.99 | 0.79 | 0.63 | 0.57 | 0.64 |
Results of bioassay 439 dataset in experiment 1.
| Data name: | AID439 | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Algorithms | positive | Negative | Accuracy | Kappa | Imbalance ratio | Precision | Recall | F-Measure | ROC Area |
| PN | 11 | 45 | 73.21% | 0.12 | 0.24 | 0.72 | 0.73 | 0.73 | 0.69 |
| Class balancer-PN | 28 | 28 | 49.29% | -0.01 | 1.00 | 0.49 | 0.49 | 0.48 | 0.46 |
| SMOTE(complete balance, K = 5) | 44 | 45 | 79.78% | 0.60 | 0.98 | 0.842 | 0.798 | 0.791 | 0.774 |
| PSO-Balancing Algorithm-PN | 34 | 45 | 82.28% | 0.65 | 0.76 | 0.86 | 0.82 | 0.82 | 0.87 |
| BA-Balancing Algorithm-PN | 40 | 45 | 78.82% | 0.58 | 0.89 | 0.81 | 0.79 | 0.79 | 0.80 |
Results of bioassay 721 dataset in experiment 1.
| Data name: | AID721 | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Algorithms | positive | Negative | Accuracy | Kappa | Imbalance ratio | Precision | Recall | F-Measure | ROC Area |
| PN | 17 | 59 | 78.95% | 0.09 | 0.29 | 0.83 | 0.79 | 0.71 | 0.41 |
| Class balancer-PN | 38 | 38 | 40.88% | -0.18 | 1.00 | 0.62 | 0.68 | 0.65 | 0.49 |
| SMOTE(complete balance, K = 5) | 58 | 59 | 65.81% | 0.32 | 0.98 | 0.775 | 0.658 | 0.619 | 0.682 |
| PSO-Balancing Algorithm-PN | 63 | 59 | 70.49% | 0.40 | 1.07 | 0.40 | 0.41 | 0.40 | 0.39 |
| BA-Balancing Algorithm-PN | 63 | 59 | 69.67% | 0.38 | 1.07 | 0.44 | 0.46 | 0.41 | 0.46 |
Results of bioassay 1284 dataset in experiment 1.
| Data name: | AID1284 | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Algorithms | positive | negative | Accuracy | Kappa | Imbalance ratio | Precision | Recall | F-Measure | ROC Area |
| PN | 46 | 244 | 84.14% | 0.00 | 0.19 | 0.71 | 0.84 | 0.77 | 0.52 |
| Class balancer-PN | 145 | 145 | 50.62% | 0.01 | 1.00 | 0.51 | 0.51 | 0.48 | 0.50 |
| SMOTE(complete balance, K = 5) | 243 | 244 | 64.07% | 0.28 | 1.00 | 0.76 | 0.641 | 0.594 | 0.691 |
| PSO-Balancing Algorithm-PN | 202 | 244 | 70.32% | 0.38 | 0.83 | 0.49 | 0.70 | 0.58 | 0.64 |
| BA-Balancing Algorithm-PN | 254 | 244 | 67.07% | 0.33 | 1.04 | 0.78 | 0.67 | 0.63 | 0.68 |
Fig 3Average performance results of each method in experiment 1.
Average values of Accuracy, Kappa and imbalance ratio (min/maj) for the two methods in experiment 2.
| Neural | Acc. | Kap. | Imb. Ratio | Neural | Acc. | Kap. | Imb. Ratio |
|---|---|---|---|---|---|---|---|
| Original data | 98.45% | 0.000 | 0.004 | Original data | 98.45% | 0.000 | 0.004 |
| SMOTE (complete balance, | 63.07% | 0.26140 | 1.00 | SMOTE (complete balance, | 63.07% | 0.26140 | 1.00 |
| PSO-Balancing Algorithm | 61.73% | 0.239 | 0.790 | BA-Balancing Algorithm | 62.56% | 0.252 | 0.877 |
| APBA-Window 1 | 98.34% | -0.002 | 0.004 | ABBA-Window 1 | 98.34% | -0.002 | 0.004 |
| APBA-Processed State 1 | 78.41% | 0.565 | 0.861 | ABBA-Processed State 1 | 79.20% | 0.590 | 0.816 |
| APBA-Window 2 | 98.41% | -0.001 | 0.004 | ABBA-Window 2 | 98.41% | -0.001 | 0.004 |
| APBA -Initial State = APBA -PS1 | 72.87% | 0.408 | 0.862 | ABBA-Initial State = ABSB-PS1 | 71.41% | 0.423 | 0.822 |
| APBA-Processed State 2 | 74.13% | 0.481 | 0.974 | ABBA-Processed State 2 | 75.58% | 0.505 | 0.979 |
| APBA-Window 3 | 98.43% | 0.000 | 0.004 | ABBA-Window 3 | 98.43% | 0.000 | 0.004 |
| APBA-Processed State 1 | 68.69% | 0.292 | 0.827 | ABBA-Processed State 1 | 67.22% | 0.315 | 0.783 |
| APBA-Processed State 2 | 69.70% | 0.386 | 0.972 | ABBA-Processed State 2 | 70.71% | 0.397 | 0.977 |
| APBA-Finally Average results | 74.08% | 0.477 | 0.936 | ABBA-Finally Average results | 75.17% | 0.497 | 0.924 |
Results of Bioassay 746 dataset in experiment 2.
| Data Name: | AID 746 | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Neural Network | Percentage | Nearest Neighbors | Accuracy | Kappa | Psize | Nsize | Searching Time (s) | TPR | FPR | Precision | Recall | F-Measure | ROC Area |
| 99.39% | 0.000 | 293.000 | 47831.000 | 0.994 | 0.994 | 0.988 | 0.994 | 0.991 | 0.543 | ||||
| 16124 | 5 | 57.40% | 0.148 | 47831.000 | 47831.000 | 0.763 | 0.574 | 0.481 | 0.577 | 57.40% | 0.148 | ||
| 13743 | 129 | 56.98% | 0.132 | 40559.000 | 47831.000 | 15176.904 | 0.570 | 0.438 | 0.569 | 0.570 | 0.569 | 0.607 | |
| APBA-Window 1 | 99.37% | 0.000 | 49.000 | 7923.000 | 0.994 | 0.994 | 0.987 | 0.994 | 0.991 | 0.585 | |||
| APBA-Processed State 1 | 15393 | 31 | 64.79% | 0.306 | 7591.000 | 7923.000 | 873.725 | 0.648 | 0.337 | 0.795 | 0.648 | 0.602 | 0.706 |
| APBA-Window 2 | 99.39% | 0.000 | 98.000 | 15846.000 | 0.994 | 0.994 | 0.988 | 0.994 | 0.991 | 0.589 | |||
| APBA-Initial State = APSB-PS1 | 60.25% | 0.218 | 15183.000 | 15846.000 | 0.602 | 0.381 | 0.780 | 0.602 | 0.533 | 0.630 | |||
| APBA-Processed State 2 | 16120 | 41 | 61.24% | 0.224 | 15895.000 | 15846.000 | 2813.100 | 0.612 | 0.389 | 0.781 | 0.612 | 0.544 | 0.622 |
| APBA-Window 3 | 99.39% | 0.000 | 147.000 | 23768.000 | 0.994 | 0.994 | 0.988 | 0.994 | 0.991 | 0.560 | |||
| APBA-Processed State 1 | 15393 | 31 | 58.89% | 0.178 | 22774.000 | 23768.000 | 0.589 | 0.411 | 0.589 | 0.589 | 0.589 | 0.632 | |
| APBA-Processed State 2 | 16120 | 41 | 60.38% | 0.208 | 23843.000 | 23768.000 | 0.604 | 0.396 | 0.604 | 0.604 | 0.604 | 0.638 | |
| APBA- | 62.14% | 0.246 | 3686.825 | 0.621 | 0.374 | 0.727 | 0.621 | 0.583 | 0.655 | ||||
| 15863 | 193 | 57.84% | 0.163 | 46771.000 | 47831.000 | 25927.423 | 0.578 | 0.415 | 0.754 | 0.578 | 0.493 | 0.603 | |
| ABBA-Window 1 | 99.37% | 0.000 | 50.000 | 7923.000 | 0.994 | 0.994 | 0.987 | 0.994 | 0.991 | 0.585 | |||
| ABBA-Processed State 1 | 12927 | 45 | 62.77% | 0.238 | 6383.000 | 7923.000 | 2393.697 | 0.628 | 0.393 | 0.625 | 0.628 | 0.624 | 0.691 |
| ABBA-Window 2 | 99.39% | 0.000 | 98.000 | 15846.000 | 0.994 | 0.994 | 0.988 | 0.994 | 0.991 | 0.589 | |||
| ABBA-Initial State = ABSB-PS1 | 57.48% | 0.138 | 12766.000 | 15846.000 | 0.575 | 0.437 | 0.574 | 0.575 | 0.574 | 0.622 | |||
| ABBA-Processed State 2 | 15895 | 55 | 60.94% | 0.222 | 15675.000 | 15846.000 | 5962.276 | 0.609 | 0.386 | 0.781 | 0.609 | 0.541 | 0.617 |
| ABBA-Window 3 | 99.39% | 0.000 | 147.000 | 23768.000 | 0.994 | 0.994 | 0.988 | 0.994 | 0.991 | 0.560 | |||
| ABBA-Processed State 1 | 12927 | 45 | 60.86% | 0.212 | 19149.000 | 23768.000 | 0.609 | 0.396 | 0.611 | 0.609 | 0.609 | 0.644 | |
| ABBA-Processed State 2 | 15895 | 55 | 61.29% | 0.226 | 23512.000 | 23768.000 | 0.613 | 0.387 | 0.613 | 0.613 | 0.613 | 0.643 | |
| ABBA- | 61.67% | 0.229 | 8355.974 | 0.617 | 0.389 | 0.673 | 0.617 | 0.593 | 0.650 | ||||
the grey part means there is no searching time in this step.
APBA means Adaptive PSO Balancing Algorithm; ABBA means Adaptive BA Balancing Algorithm.
Results of Bioassay 456 dataset in experiment 2.
| Data Name: | AID 456 | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Neural Network | Percentage | Nearest Neighbors | Accuracy | Kappa | Psize | Nsize | Searching Time(s) | TPR | FPR | Precision | Recall | F-Measure | ROC Area |
| 36ss1 | 5 | 69.64% | 0.393 | 7964.000 | 7964.000 | 0.696 | 0.304 | 0.811 | 0.696 | 0.666 | 0.710 | ||
| APBA-Window 1 | 99.70% | 0.000 | 4.000 | 1327.000 | 0.997 | 0.997 | 0.994 | 0.997 | 0.995 | 0.415 | |||
| APBA-Processed State 1 | 27046 | 3 | 97.64% | 0.953 | 1085.000 | 1327.000 | 121.900 | 0.976 | 0.019 | 0.978 | 0.976 | 0.976 | 0.986 |
| APBA-Window 2 | 99.74% | 0.000 | 7.000 | 2655.000 | 0.997 | 0.997 | 0.995 | 0.997 | 0.996 | 0.634 | |||
| APBA-Initial State = APSB-PS1 | 27046 | 3 | 92.27% | 0.845 | 1900.000 | 2655.000 | 0.923 | 0.055 | 0.935 | 0.923 | 0.923 | 0.960 | |
| APBA-Processed State 2 | 34165 | 4 | 93.03% | 0.861 | 2398.000 | 2655.000 | 119.471 | 0.930 | 0.063 | 0.939 | 0.930 | 0.930 | 0.947 |
| APBA-Window 3 | 99.72% | 0.000 | 11.000 | 3982.000 | 0.997 | 0.997 | 0.994 | 0.997 | 0.996 | 0.463 | |||
| APBA-Processed State 1 | 27046 | 3 | 77.90% | 0.576 | 2986.000 | 3982.000 | 0.779 | 0.166 | 0.854 | 0.779 | 0.775 | 0.815 | |
| APBA-Processed State 2 | 34165 | 4 | 79.96% | 0.603 | 3769.000 | 3982.000 | 0.800 | 0.190 | 0.858 | 0.800 | 0.792 | 0.831 | |
| APBA- | |||||||||||||
| ABBA-Window 1 | 99.70% | 0.000 | 4.000 | 1327.000 | 0.997 | 0.997 | 0.994 | 0.997 | 0.995 | 0.415 | |||
| ABBA-Processed State 1 | 29511 | 10 | 97.65% | 0.953 | 1184.000 | 1327.000 | 274.540 | 0.977 | 0.021 | 0.978 | 0.977 | 0.977 | 0.986 |
| ABBA-Window 2 | 99.74% | 0.000 | 7.000 | 2655.000 | 0.997 | 0.997 | 0.995 | 0.997 | 0.996 | 0.634 | |||
| ABBA-Initial State = ABSB-PS1 | 29511 | 10 | 92.66% | 0.854 | 2072.000 | 2655.000 | 0.927 | 0.057 | 0.937 | 0.927 | 0.927 | 0.946 | |
| ABBA-Processed State 2 | 33294 | 10 | 93.03% | 0.861 | 2337.000 | 2655.000 | 123.385 | 0.930 | 0.061 | 0.939 | 0.930 | 0.930 | 0.943 |
| ABBA-Window 3 | 99.72% | 0.000 | 11.000 | 3982.000 | 0.997 | 0.997 | 0.994 | 0.997 | 0.996 | 0.463 | |||
| ABBA-Processed State 1 | 29511 | 10 | 78.35% | 0.581 | 3257.000 | 3982.000 | 0.784 | 0.177 | 0.854 | 0.784 | 0.778 | 0.836 | |
| ABBA-Processed State 2 | 33294 | 10 | 79.53% | 0.597 | 3673.000 | 3982.000 | 0.795 | 0.189 | 0.857 | 0.795 | 0.788 | 0.837 | |
| ABBA- | |||||||||||||
the grey part means there is no searching time in this step.
APBA means Adaptive PSO Balancing Algorithm; ABBA means Adaptive BA Balancing Algorithm.
Fig 4Average Accuracy and Kappa of different methods in experiment 2.
Fig 5Average time of our four methods in experiment 2.
Results of Bioassay 362 dataset in experiment 2.
| Data Name: | AID 362 | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Neural Network | Percentage | Nearest Neighbors | Accuracy | Kappa | Psize | Nsize | Searching Time(s) | TPR | FPR | Precision | Recall | F-Measure | ROC Area |
| Original data | 98.60% | 0.000 | 48.000 | 3375.000 | 0.986 | 0.986 | 0.972 | 0.986 | 0.979 | 0.580 | |||
| 69.3125 | 5 | 63.14% | 0.263 | 3374.000 | 3375.000 | 0.631 | 0.369 | 0.786 | 0.631 | 0.574 | 0.641 | ||
| 6969 | 36 | 63.18% | 0.262 | 3393.000 | 3375.000 | 136.673 | 0.632 | 0.370 | 0.787 | 0.632 | 0.574 | 0.643 | |
| APBA-Window 1 | 98.60% | 0.000 | 8.000 | 563.000 | 0.986 | 0.986 | 0.972 | 0.986 | 0.979 | 0.398 | |||
| APBA-Processed State 1 | 4710 | 2 | 85.53% | 0.716 | 384.000 | 563.000 | 64.781 | 0.855 | 0.099 | 0.893 | 0.855 | 0.856 | 0.894 |
| APBA-Window 2 | 98.60% | 0.000 | 16.000 | 1125.000 | 0.986 | 0.986 | 0.972 | 0.986 | 0.979 | 0.555 | |||
| APBA-Initial State = APSB-PS1 | 75.40% | 0.528 | 769.000 | 1125.000 | 0.754 | 0.184 | 0.819 | 0.754 | 0.753 | 0.862 | |||
| APBA-Processed State 2 | 9700 | 610 | 82.92% | 0.627 | 1568.000 | 1125.000 | 26.575 | 0.829 | 0.238 | 0.867 | 0.829 | 0.818 | 0.853 |
| APBA-Window 3 | 98.60% | 0.000 | 24.000 | 1687.000 | 0.986 | 0.986 | 0.972 | 0.986 | 0.979 | 0.484 | |||
| APBA-Processed State 1 | 4710 | 2 | 61.99% | 0.237 | 1154.000 | 1687.000 | 0.620 | 0.374 | 0.636 | 0.620 | 0.623 | 0.694 | |
| APBA-Processed State 2 | 9700 | 610 | 73.30% | 0.396 | 2354.000 | 1687.000 | 0.733 | 0.373 | 0.817 | 0.733 | 0.695 | 0.727 | |
| APBA- | 80.58% | 0.580 | 91.356 | 0.806 | 0.237 | 0.859 | 0.806 | 0.790 | 0.825 | ||||
| 62.91% | 0.261 | 3344.000 | 3375.000 | 149.354 | 0.629 | 0.367 | 0.787 | 0.629 | 0.571 | 0.638 | |||
| ABBA-Window 1 | 98.60% | 0.000 | 8.000 | 563.000 | 0.986 | 0.986 | 0.972 | 0.986 | 0.979 | 0.398 | |||
| ABBA-Processed State 1 | 4710 | 2 | 85.53% | 0.716 | 384.000 | 563.000 | 75.465 | 0.855 | 0.099 | 0.893 | 0.855 | 0.856 | 0.894 |
| ABBA-Window 2 | 98.60% | 0.000 | 16.000 | 1125.000 | 0.986 | 0.986 | 0.972 | 0.986 | 0.979 | 0.555 | |||
| ABBA-Initial State = ABSB-PS1 | 75.40% | 0.528 | 769.000 | 1125.000 | 0.754 | 0.184 | 0.819 | 0.754 | 0.753 | 0.862 | |||
| ABBA-Processed State 2 | 9893 | 10 | 85.57% | 0.686 | 1599.000 | 1125.000 | 48.299 | 0.856 | 0.205 | 0.884 | 0.856 | 0.848 | 0.876 |
| ABBA-Window 3 | 98.60% | 0.000 | 24.000 | 1687.000 | 0.986 | 0.986 | 0.972 | 0.986 | 0.979 | 0.484 | |||
| ABBA-Processed State 1 | 4710 | 2 | 61.99% | 0.237 | 1154.000 | 1687.000 | 0.620 | 0.374 | 0.636 | 0.620 | 0.623 | 0.694 | |
| ABBA-Processed State 2 | 9893 | 10 | 72.91% | 0.381 | 2400.000 | 1687.000 | 0.729 | 0.385 | 0.815 | 0.729 | 0.688 | 0.696 | |
| ABBA- | 81.34% | 0.594 | 123.765 | 0.813 | 0.230 | 0.864 | 0.813 | 0.797 | 0.822 | ||||
the grey part means there is no searching time in this step.
APBA means Adaptive PSO Balancing Algorithm; ABBA means Adaptive BA Balancing Algorithm.
Results of Bioassay 1608 dataset in experiment 2.
| Data Name: | AID 1608 | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Neural Network | Percentage | Nearest Neighbors | Accuracy | Kappa | Psize | Nsize | Searching Time (s) | TPR | FPR | Precision | Recall | F-Measure | ROC Area |
| 93.35% | 0.000 | 55.000 | 772.000 | 0.933 | 0.933 | 0.871 | 0.933 | 0.901 | 0.410 | ||||
| 1304% | 5 | 59.33% | 0.187 | 55.000 | 772.000 | 0.593 | 0.407 | 0.748 | 0.593 | 0.518 | 0.601 | ||
| 1162 | 50 | 56.75% | 0.165 | 694.000 | 772.000 | 157.216 | 0.568 | 0.397 | 0.659 | 0.568 | 0.515 | 0.616 | |
| APBA-Window 1 | 92.75% | -0.013 | 9.000 | 129.000 | 0.928 | 0.935 | 0.873 | 0.928 | 0.900 | 0.462 | |||
| APBA-Processed State 1 | 386 | 3 | 75.00% | 0.419 | 43.000 | 129.000 | 64.004 | 0.750 | 0.269 | 0.793 | 0.750 | 0.763 | 0.866 |
| APBA-Window 2 | 93.09% | -0.007 | 18.000 | 257.000 | 0.931 | 0.935 | 0.873 | 0.931 | 0.901 | 0.372 | |||
| APBA-Initial State = APSB-PS1 | 75.29% | 0.165 | 87.000 | 257.000 | 0.753 | 0.623 | 0.713 | 0.753 | 0.705 | 0.753 | |||
| APBA-Processed State 2 | 1062 | 19 | 71.03% | 0.448 | 209.000 | 257.000 | 29.576 | 0.710 | 0.236 | 0.824 | 0.710 | 0.694 | 0.754 |
| APBA-Window 3 | 93.24% | 0.000 | 28.000 | 386.000 | 0.932 | 0.932 | 0.869 | 0.932 | 0.900 | 0.562 | |||
| APBA-Processed State 1 | 386 | 3 | 73.95% | 0.000 | 136.000 | 386.000 | 0.739 | 0.739 | 0.547 | 0.739 | 0.629 | 0.651 | |
| APBA-Processed State 2 | 1062 | 19 | 65.22% | 0.338 | 327.000 | 386.000 | 0.652 | 0.295 | 0.798 | 0.652 | 0.618 | 0.678 | |
| APBA- | 70.42% | 0.401 | 93.580 | 0.704 | 0.267 | 0.805 | 0.704 | 0.692 | 0.766 | ||||
| 1398 | 29 | 61.00% | 0.200 | 823.000 | 772.000 | 172.092 | 0.610 | 0.415 | 0.763 | 0.610 | 0.535 | 0.613 | |
| ABBA-Window 1 | 92.75% | -0.013 | 9.000 | 129.000 | 0.928 | 0.935 | 0.873 | 0.928 | 0.900 | 0.462 | |||
| ABBA-Processed State 1 | 685 | 2 | 81.41% | 0.636 | 70.000 | 129.000 | 76.199 | 0.814 | 0.101 | 0.878 | 0.814 | 0.818 | 0.866 |
| ABBA-Window 2 | 93.09% | -0.007 | 18.000 | 257.000 | 0.931 | 0.935 | 0.873 | 0.931 | 0.901 | 0.372 | |||
| ABBA-Initial State = ABSB-PS1 | 71.36% | 0.305 | 141.000 | 257.000 | 0.714 | 0.439 | 0.705 | 0.714 | 0.689 | 0.770 | |||
| ABBA-Processed State 2 | 1595 | 30 | 75.98% | 0.496 | 305.000 | 257.000 | 47.766 | 0.760 | 0.284 | 0.830 | 0.760 | 0.739 | 0.772 |
| ABBA-Window 3 | 93.24% | 0.000 | 28.000 | 386.000 | 0.932 | 0.932 | 0.869 | 0.932 | 0.900 | 0.562 | |||
| ABBA-Processed State 1 | 685 | 2 | 63.31% | 0.072 | 219.000 | 386.000 | 0.633 | 0.572 | 0.591 | 0.633 | 0.571 | 0.661 | |
| ABBA-Processed State 2 | 1595 | 30 | 70.96% | 0.375 | 475.000 | 386.000 | 0.651 | 0.421 | 0.773 | 0.651 | 0.587 | 0.620 | |
| ABBA- | 123.965 | ||||||||||||
the grey part means there is no searching time in this step.
APBA means Adaptive PSO Balancing Algorithm; ABBA means Adaptive BA Balancing Algorithm.
Results of Bioassay 373 dataset in experiment 2.
| Data Name: | AID 373 | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Neural Network | Percentage | Nearest Neighbors | Accuracy | Kappa | Psize | Nsize | Searching Time (s) | TPR | FPR | Precision | Recall | F-Measure | ROC Area |
| 99.90% | 0.000 | 50.000 | 47831.000 | 0.999 | 0.999 | 0.998 | 0.999 | 0.998 | 0.658 | ||||
| 95462 | 5 | 68.54% | 0.371 | 47831.000 | 47831.000 | 0.685 | 0.315 | 0.807 | 0.685 | 0.651 | 0.719 | ||
| 61214 | 43 | 65.27% | 0.251 | 30657.000 | 47831.000 | 538.195 | 0.653 | 0.409 | 0.645 | 0.653 | 0.647 | 0.720 | |
| APBA-Window 1 | 99.90% | 0.000 | 8.000 | 7964.000 | 0.999 | 0.999 | 0.998 | 0.999 | 0.998 | 0.592 | |||
| APBA-Processed State 1 | 81124 | 7 | 74.48% | 0.510 | 6497.000 | 7964.000 | 106.964 | 0.745 | 0.208 | 0.837 | 0.745 | 0.735 | 0.831 |
| APBA-Window 2 | 99.89% | 0.000 | 17.000 | 15927.000 | 0.999 | 0.999 | 0.998 | 0.999 | 0.998 | 0.455 | |||
| APBA-Initial State = APSB-PS1 | 67.12% | 0.381 | 13808.000 | 15927.000 | 0.671 | 0.258 | 0.784 | 0.671 | 0.655 | 0.462 | |||
| APBA-Processed State 2 | 91409 | 14 | 69.89% | 0.402 | 15556.000 | 15927.000 | 129.514 | 0.699 | 0.295 | 0.776 | 0.699 | 0.678 | 0.732 |
| APBA-Window 3 | 99.90% | 0.000 | 25.000 | 23890.000 | 0.999 | 0.999 | 0.998 | 0.999 | 0.998 | 0.639 | |||
| APBA-Processed State 1 | 81124 | 7 | 77.98% | 0.572 | 17793.000 | 23890.000 | 0.780 | 0.187 | 0.851 | 0.780 | 0.773 | 0.850 | |
| APBA-Processed State 2 | 91409 | 14 | 79.19% | 0.587 | 22877.000 | 23890.000 | 0.792 | 0.199 | 0.854 | 0.792 | 0.784 | 0.836 | |
| APBA- | 74.52% | 0.500 | 236.478 | 0.745 | 0.234 | 0.822 | 0.745 | 0.732 | 0.800 | ||||
| 71076 | 44 | 64.29% | 0.256 | 35588.000 | 47831.000 | 541.370 | 0.643 | 0.392 | 0.638 | 0.643 | 0.638 | 0.718 | |
| ABBA-Window 1 | 99.90% | 0.000 | 8.000 | 7964.000 | 0.999 | 0.999 | 0.998 | 0.999 | 0.998 | 0.592 | |||
| ABBA-Processed State 1 | 81124 | 7 | 74.48% | 0.510 | 6497.000 | 7964.000 | 106.964 | 0.745 | 0.208 | 0.837 | 0.745 | 0.735 | 0.831 |
| ABBA-Window 2 | 99.89% | 0.000 | 17.000 | 15927.000 | 0.999 | 0.999 | 0.998 | 0.999 | 0.998 | 0.455 | |||
| ABBA-Initial State = ABSB-PS1 | 67.12% | 0.381 | 13808.000 | 15927.000 | 0.671 | 0.258 | 0.784 | 0.671 | 0.655 | 0.462 | |||
| ABBA-Processed State 2 | 89403 | 10 | 70.12% | 0.401 | 15215.000 | 15927.000 | 167.194 | 0.701 | 0.301 | 0.702 | 0.701 | 0.700 | 0.772 |
| ABBA-Window 3 | 99.90% | 0.000 | 25.000 | 23890.000 | 0.999 | 0.999 | 0.998 | 0.999 | 0.998 | 0.639 | |||
| ABBA-Processed State 1 | 81124 | 7 | 77.98% | 0.572 | 17793.000 | 23890.000 | 0.780 | 0.187 | 0.851 | 0.780 | 0.773 | 0.850 | |
| ABBA-Processed State 2 | 89403 | 10 | 79.05% | 0.586 | 22375.000 | 23890.000 | 0.791 | 0.196 | 0.854 | 0.791 | 0.783 | 0.844 | |
| ABBA- | 74.55% | 0.499 | 274.159 | 0.746 | 0.235 | 0.798 | 0.746 | 0.739 | 0.816 | ||||
the grey part means there is no searching time in this step.
APBA means Adaptive PSO Balancing Algorithm; ABBA means Adaptive BA Balancing Algorithm.
Results of Bioassay 687 dataset in experiment 2.
| Data Name: | AID 687 | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Neural Network | Percentage | Nearest Neighbors | Accuracy | Kappa | Psize | Nsize | Searching Time (s) | TPR | FPR | Precision | Recall | F-Measure | ROC Area |
| 99.71% | 0.000 | 76.000 | 26378.000 | 0.997 | 0.997 | 0.994 | 0.997 | 0.996 | 0.577 | ||||
| 34608 | 5 | 60.37% | 0.207 | 26378.000 | 26378.000 | 0.604 | 0.396 | 0.779 | 0.604 | 0.530 | 0.641 | ||
| 32368 | 75 | 63.30% | 0.268 | 24675.000 | 4396.000 | 6813.241 | 0.633 | 0.364 | 0.636 | 0.633 | 0.633 | 0.673 | |
| APBA-Window 1 | 99.71% | 0.000 | 13.000 | 4396.000 | 0.997 | 0.997 | 0.994 | 0.997 | 0.996 | 0.533 | |||
| APBA-Processed State 1 | 27568 | 11 | 73.02% | 0.483 | 3596.000 | 4396.000 | 146.408 | 0.730 | 0.221 | 0.831 | 0.730 | 0.718 | 0.799 |
| APBA-Window 2 | 99.72% | 0.000 | 25.000 | 8793.000 | 0.997 | 0.997 | 0.994 | 0.997 | 0.996 | 0.535 | |||
| APBA-Initial State = APSB-PS1 | 66.91% | 0.312 | 6719.000 | 8793.000 | 0.669 | 0.365 | 0.668 | 0.669 | 0.661 | 0.743 | |||
| APBA-Processed State 2 | 31180 | 12 | 66.69% | 0.324 | 7820.000 | 8793.000 | 926.972 | 0.667 | 0.347 | 0.670 | 0.667 | 0.661 | 0.747 |
| APBA-Window 3 | 99.71% | 0.000 | 38.000 | 13189.000 | 0.997 | 0.997 | 0.994 | 0.997 | 0.996 | 0.513 | |||
| APBA-Processed State 1 | 27568 | 11 | 61.44% | 0.186 | 10513.000 | 13189.000 | 0.614 | 0.436 | 0.613 | 0.614 | 0.593 | 0.659 | |
| APBA-Processed State 2 | 31180 | 12 | 60.14% | 0.185 | 11886.000 | 13189.000 | 0.601 | 0.420 | 0.609 | 0.601 | 0.583 | 0.662 | |
| APBA- | 66.62% | 0.331 | 1073.380 | 0.666 | 0.329 | 0.703 | 0.666 | 0.654 | 0.736 | ||||
| 32368 | 75 | 63.30% | 0.268 | 24675.000 | 26378.000 | 5139.300 | 0.633 | 0.364 | 0.636 | 0.633 | 0.633 | 0.673 | |
| ABBA-Window 1 | 99.71% | 0.000 | 13.000 | 4396.000 | 0.997 | 0.997 | 0.994 | 0.997 | 0.996 | 0.533 | |||
| ABBA-Processed State 1 | 28295 | 7 | 73.35% | 0.487 | 3691.350 | 4396.000 | 262.371 | 0.734 | 0.224 | 0.832 | 0.734 | 0.720 | 0.790 |
| ABBA-Window 2 | 99.72% | 0.000 | 25.000 | 8793.000 | 0.997 | 0.997 | 0.994 | 0.997 | 0.996 | 0.535 | |||
| ABBA-Initial State = ABSB-PS1 | 64.42% | 0.332 | 7098.000 | 8793.000 | 0.644 | 0.287 | 0.802 | 0.644 | 0.611 | 0.702 | |||
| ABBA-Processed State 2 | 34008 | 8 | 67.85% | 0.363 | 8526.000 | 8793.000 | 1000.736 | 0.678 | 0.312 | 0.806 | 0.678 | 0.644 | 0.732 |
| ABBA-Window 3 | 99.71% | 0.000 | 38.000 | 13189.000 | 0.997 | 0.997 | 0.994 | 0.997 | 0.996 | 0.513 | |||
| ABBA-Processed State 1 | 28295 | 7 | 60.82% | 0.213 | 10790.000 | 13189.000 | 0.608 | 0.393 | 0.611 | 0.608 | 0.609 | 0.657 | |
| ABBA-Processed State 2 | 34008 | 8 | 60.51% | 0.216 | 12961.000 | 13189.000 | 0.605 | 0.388 | 0.780 | 0.605 | 0.534 | 0.669 | |
| ABBA- | 67.24% | 0.355 | 1263.107 | 0.672 | 0.308 | 0.806 | 0.672 | 0.633 | 0.730 | ||||
the grey part means there is no searching time in this step.
APBA means Adaptive PSO Balancing Algorithm; ABBA means Adaptive BA Balancing Algorithm.