| Literature DB >> 32193487 |
Ahmed A Hemedan1, Mohamed Abd Elaziz2,3, Pengcheng Jiao4, Amir H Alavi5,6,7, Mahmoud Bahgat8,9, Marek Ostaszewski1, Reinhard Schneider1, Haneen A Ghazy10, Ahmed A Ewees11, Songfeng Lu12.
Abstract
Recently, significant attention has been devoted to vaccine-derived poliovirus (VDPV) surveillance due to its severe consequences. Prediction of the outbreak incidence of VDPF requires an accurate analysis of the alarming data. The overarching aim to this study is to develop a novel hybrid machine learning approach to identify the key parameters that dominate the outbreak incidence of VDPV. The proposed method is based on the integration of random vector functional link (RVFL) networks with a robust optimization algorithm called whale optimization algorithm (WOA). WOA is applied to improve the accuracy of the RVFL network by finding the suitable parameter configurations for the algorithm. The classification performance of the WOA-RVFL method is successfully validated using a number of datasets from the UCI machine learning repository. Thereafter, the method is implemented to track the VDPV outbreak incidences recently occurred in several provinces in Lao People's Democratic Republic. The results demonstrate the accuracy and efficiency of the WOA-RVFL algorithm in detecting the VDPV outbreak incidences, as well as its superior performance to the traditional RVFL method.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32193487 PMCID: PMC7081356 DOI: 10.1038/s41598-020-61853-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Structure of the RVFL network.
Figure 2The WOA-RVFL classification process.
The UCI datasets.
| NO | Dataset | Features | Sample | No of class | Subject |
|---|---|---|---|---|---|
| 1 | Clean 1 | 168 | 476 | 2 | Physical |
| 2 | Clean 2 | 168 | 6598 | 2 | Physical |
| 3 | Hayes-roth | 5 | 160 | 3 | Social |
| 4 | IonoSphere | 34 | 351 | 2 | Physical |
| 5 | House-votes | 16 | 435 | 2 | Social |
| 6 | Madelon | 500 | 4400 | 2 | N/A |
| 7 | PCMAC | 3289 | 1943 | 2 | N/A |
| 8 | Soybean | 35 | 307 | 19 | Life |
| 9 | WaveForme | 40 | 5000 | 3 | Physical |
| 10 | Wine | 13 | 178 | 3 | Physical |
| 11 | Zoo | 17 | 101 | 7 | Life |
Performance statistics of the WOA-RVFL and RVFL methods over different UCI datasets.
| Dataset | Set | WOA-RVFL | RVFL | ||||
|---|---|---|---|---|---|---|---|
| Pre | Rec | Acc | Pre | Rec | Acc | ||
| Zoo | Train | 100 | 100 | 100 | 100 | 100 | 100 |
| Validation | 100 | 100 | 77.80 | 100 | 100 | 66.76 | |
| Test | 85.71 | 85.71 | 100 | 71.43 | 64.29 | 90.00 | |
| Wine | Train | 100 | 100 | 100 | 97.47 | 97.71 | 97.52 |
| Validation | 100 | 100 | 100 | 97.24 | 95.91 | 100 | |
| Test | 100 | 100 | 100 | 100 | 100 | 100 | |
| Soybean | Train | 100 | 100 | 100 | 100 | 100 | 100 |
| Validation | 100 | 100 | 100 | 100 | 100 | 100 | |
| Test | 100 | 100 | 100 | 100 | 100 | 100 | |
| PCMAC1 | Train | 100 | 100 | 100 | 100 | 100 | 100 |
| Validation | 95.36 | 96.87 | 100 | 91.23 | 89.79 | 92.70 | |
| Test | 91.21 | 91.36 | 91.24 | 87.63 | 87.63 | 87.63 | |
| Madelon | Train | 100 | 100 | 100 | 74.32 | 74.32 | 74.32 |
| Validation | 60.48 | 67.81 | 61.32 | 48.87 | 46.16 | 43.59 | |
| Test | 68.46 | 68.47 | 68.46 | 55.00 | 55.00 | 55.00 | |
| Ionosphere | Train | 100 | 100 | 100 | 100 | 100 | 98.10 |
| Validation | 100 | 100 | 100 | 99.29 | 100 | 100 | |
| Test | 100 | 100 | 95.71 | 100 | 100 | 94.29 | |
| House-Vote | Train | 100 | 100 | 100 | 97.47 | 96.74 | 97.19 |
| Validation | 94.87 | 100 | 91.67 | 94.87 | 97.79 | 92.23 | |
| Test | 94.19 | 93.70 | 94.25 | 93.17 | 92.19 | 93.02 | |
| Hayesroth | Train | 87.68 | 91.01 | 85.71 | 92.03 | 91.31 | 90.76 |
| Validation | 94.67 | 100 | 91.69 | 47.61 | 63.52 | 61.53 | |
| Test | 93.33 | 94.44 | 92.31 | 57.78 | 72.38 | 61.54 | |
| Clean2 | Train | 100 | 100 | 100 | 100 | 100 | 95.23 |
| Validation | 100 | 100 | 100 | 100 | 100 | 97.25 | |
| Test | 100 | 100 | 100 | 100 | 100 | 94.39 | |
| Clean1 | Train | 100 | 100 | 100 | 100 | 100 | 95.12 |
| Validation | 100 | 100 | 100 | 100 | 100 | 100 | |
| Test | 100 | 100 | 100 | 100 | 100 | 95.30 | |
Figure 3Accuracy, Precision and Recall rates of the WOA-RVFL and RVFL methods for the training set (UCI data).
Figure 5Accuracy, Precision, and Recall of the WOA-RVFL and RVFL methods for the testing set (UCI data).
The mean rank and p-value of Friedman test to compare between WOA-RVFL and RVFL.
| Pre | Rec | Acc | |||||
|---|---|---|---|---|---|---|---|
| WOA-RVFL | RVFL | WOA-RVFL | RVFL | WOA-RVFL | RVFL | ||
| Training set | Mean Rank | 1.6000 | 1.4000 | 1.5500 | 1.4500 | 1.7500 | 1.2500 |
| p-value | 0.3173 | 0.5637 | 0. 0588 | ||||
| Validation | Mean Rank | 2.8000 | 2.2000 | 2.9000 | 2.10 | 2.8000 | 2.2 |
| p-value | 0.2076 | 0.0881 | 0.2453 | ||||
| Test | Mean Rank | 1.75 | 1.25 | 2.8500 | 2.15 | 3.0500 | 1.950 |
| p-value | 0.0253 | 0.1587 | 0. 0423 | ||||
| Average | Mean Rank | 1.7 | 1.3 | 2.8167 | 2.1833 | 2.9333 | 2.0667 |
| p-value | 0.0013 | 0.0068 | 0.0025 | ||||
Descriptive statistics of the variables included in the VDPV outbreak model development.
| Age | Titers | |
|---|---|---|
| Min | 5 | 0.8933 |
| Max | 69 | 203.400 |
| Average | 25.67 | 54.472 |
Figure 6The correlation between the parameters included in the model development.
Prediction performance of the WOA-RVFL and RVFL methods for classifying the VDPV outbreak incidences.
| Method | Set | Pre | Rec | Acc |
|---|---|---|---|---|
| RVFL | Train | 77.19 | 76.29 | 91.24 |
| Validation | 72.38 | 75.93 | 92.78 | |
| Test | 74.90 | 71.45 | 89.36 | |
| WOA-RVFL | Train | 100 | 100 | 100 |
| Validation | 93.31 | 94.41 | 96.28 | |
| Test | 94.13 | 95.12 | 98.30 |
The Comparison with other Meta-heuristic methods.
| Method | Set | Pre | Rec | Acc | Method | Set | Pre | Rec | Acc |
|---|---|---|---|---|---|---|---|---|---|
| WOA-RVFL | Train | 100 | 100 | 100 | SCA-RVFL | Train | 100 | 100 | 100 |
| Test | 96.12 | 96.74 | 98.91 | Test | 95.16 | 94.45 | 97.25 | ||
| ALO -RVFL | Train | 100 | 100 | 100 | MFO-RVFL | Train | 100 | 100 | 100 |
| Test | 92.83 | 93.24 | 95.23 | Test | 92.61 | 92.65 | 92.41 |