| Literature DB >> 36091345 |
Suzilah Ismail1, Robert Fildes2, Rohani Ahmad3, Wan Najdah Wan Mohamad Ali3, Topek Omar4.
Abstract
Dengue is a harmful tropical disease that causes death to many people. Currently, the dengue vaccine development is still at an early stage, and only intervention methods exist after dengue cases increase. Thus, previously, two scientific experimental field studies were conducted in producing a dengue outbreak forecasting model as an early warning system. Successfully, an Autoregressive Distributed Lag (ADL) Model was developed using three factors: the epidemiological, entomological, and environmental with an accuracy of 85%; but a higher percentage is required in minimizing the error for the model to be useful. Hence, this study aimed to develop a practical and cost-effective dengue outbreak forecasting model with at least 90% accuracy to be embedded in an early warning computer system using the Internet of Things (IoT) approach. Eighty-one weeks of time series data of the three factors were used in six forecasting models, which were Autoregressive Distributed Lag (ADL), Hierarchical Forecasting (Bottom-up and Optimal combination) and three Machine Learning methods: (Artificial Neural Network (ANN), Support Vector Machine (SVM) and Random Forest). Five error measures were used to evaluate the consistency performance of the models in order to ensure model performance. The findings indicated Random Forest outperformed the other models with an accuracy of 95% when including all three factors. But practically, collecting mosquito related data (the entomological factor) was very costly and time consuming. Thus, it was removed from the model, and the accuracy dropped to 92% but still high enough to be of practical use, i.e., beyond 90%. However, the practical ground operationalization of the early warning system also requires several rain gauges to be located at the dengue hot spots due to localized rainfall. Hence, further analysis was conducted in determining the location of the rain gauges. This has led to the recommendation that the rain gauges should be located about 3-4 km apart at the dengue hot spots to ensure the accuracy of the rainfall data to be included in the dengue outbreak forecasting model so that it can be embedded in the early warning system. Therefore, this early warning system can save lives, and prevention is better than cure.Entities:
Keywords: Early warning system; IoT; Machine learning; dengue; rainfall
Year: 2022 PMID: 36091345 PMCID: PMC9418377 DOI: 10.1016/j.idm.2022.07.008
Source DB: PubMed Journal: Infect Dis Model ISSN: 2468-0427
Fig. 1Hierarchical structure.
Fig. 2Notified dengue cases for Selayang, Bandar Baru Bangi and pool data.
Correlations between notified dengue cases (target variable) and predictors.
| Area | Notified Dengue Cases (Target Variable) | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Predictors | ||||||||||
| Onset | Onset1 | API4 | Larvae3 | PCR3 | Rainfall4 | Min | Max | Max | Inter ventionLD1 | |
| Selayang | 0.782 | 0.814 | −0.637 | 0.847 | 0.537 | 0.678 | −0.453 | −0.452 | 0.674 | 0.533 |
| BBBangi | 0.819 | 0.811 | −0.393 | 0.747 | 0.407 | 0.678 | −0.379 | −0.474 | 0.656 | 0.304 |
| Pool | 0.837 | 0.871 | −0.415 | 0.797 | 0.465 | 0.665 | −0.485 | −0.518 | 0.649 | 0.260∗ |
Significant at 1% ∗ Significant at 5%.
Fig. 3Conceptual relationship (target variable & predictors): Epidemiological, entomological and environmental factors based on weeks.
Estimated autoregressive distributed lag (ADL) model.
| Variables (Predictors) | Selayang | Bandar Baru Bangi | Pool |
|---|---|---|---|
| Intercept | −18.400 | 8.138 | −80.4545 |
| Onset | 0.372 | 0.393 | 0.490 |
| Onset1 | 0.381 | 0.399 | 0.137 |
| Larvae3 | 0.007 | 0.005 | 0.011 |
| Rainfall4 | −0.021 | 0.017 | 0.055 |
| MinTemp3 | −0.272 | 1.352 | 1.282 |
| MaxTemp3 | 0.705 | −1.323 | 1.305 |
| MaxHumidity3 | 0.039 | 0.051 | −0.181 |
| PCR3 | 0.624 | 0.727 | 0.262 |
| API4 | −0.016 | 0.007 | 0.002 |
| InterventionLD1 | 0.061 | 0.058 | 0.113 |
| Adjusted R2 | 0.8465 | 0.8388 | 0.8506 |
| Information Criterion | 135.039 | 268.466 | 206.447 |
| Accuracy | 84.9% (∼85%) | 84.1% (∼84%) | 85.7% (∼86%) |
Target Variable: Notified Dengue Cases.
Significant at 1%.
Significant at 5%.
Error measures based on RelMAE, MASE, GMRAE and UMBRAE.
| Forecasting Methods | Lead | RelMAE | MASE | GMRAE | UMBRAE |
|---|---|---|---|---|---|
| Autoregressive Distributed Lag (ADL) | 1 | 0.9195 | 0.5672 | 0.9916 | 0.7497 |
| 2 | 0.7535 | 0.5142 | 0.9690 | 0.6261 | |
| 3 | 0.7473 | 0.548 | 0.9642 | 0.5938 | |
| 4 | 0.5636 | 0.4496 | 0.9214 | 0.4330 | |
| Median | 0.7504 (5) | 0.5311 (5) | 0.9666 (5) | 0.6100 (4) | |
| Hierarchical Forecasting: | 1 | 0.9434 | 0.5819 | 0.9942 | 0.9333 |
| 2 | 0.7748 | 0.5288 | 0.9721 | 0.7618 | |
| 3 | 0.7338 | 0.5381 | 0.962 | 0.6955 | |
| 4 | 0.5824 | 0.4645 | 0.9257 | 0.5834 | |
| Median | 0.7543 (6) | 0.5335 (6) | 0.9670 (6) | 0.7287 (6) | |
| Hierarchical Forecasting: | 1 | 0.912 | 0.5626 | 0.9908 | 0.8221 |
| 2 | 0.7442 | 0.5079 | 0.9677 | 0.6559 | |
| 3 | 0.7247 | 0.5314 | 0.9605 | 0.6024 | |
| 4 | 0.565 | 0.4507 | 0.9217 | 0.5049 | |
| Median | 0.7344 (4) | 0.5197 (4) | 0.9641(4) | 0.5317 (5) | |
| Artificial Neural Network (ANN) | 1 | 0.6798 | 0.4194 | 0.4208 | 0.6339 |
| 2 | 0.6876 | 0.4693 | 0.5841 | 0.5735 | |
| 3 | 0.6693 | 0.4908 | 0.5093 | 0.6956 | |
| 4 | 0.866 | 0.6907 | 0.6316 | 0.6037 | |
| Median | 0.6837 (3) | 0.4800 (3) | 0.5467 (3) | 0.5317 (3) | |
| Support Vector Machine (SVM) | 1 | 0.5066 | 0.3125 | 0.3331 | 0.4947 |
| 2 | 0.4423 | 0.3019 | 0.2557 | 0.394 | |
| 3 | 0.3579 | 0.2625 | 0.2215 | 0.3282 | |
| 4 | 0.303 | 0.2417 | 0.1819 | 0.2641 | |
| Median | 0.4001 (2) | 0.2822 (2) | 0.2386 (2) | 0.3611 (2) | |
| Random Forest | 1 | 0.3881 | 0.2394 | 0.2070 | 0.4048 |
| 2 | 0.3376 | 0.2304 | 0.2463 | 0.3191 | |
| 3 | 0.2428 | 0.1781 | 0.1600 | 0.2233 | |
| 4 | 0.2315 | 0.1847 | 0.1351 | 0.2118 | |
| Median | 0.2902 (1) | 0.2076 (1) | 0.1835 (1) | 0.2712 (1) |
() - Rank.
Model accuracy based on MAPE.
| MAPE | Lead 1 | Lead 2 | Lead 3 | Lead 4 | Median | Rank | Accuracy (%) |
|---|---|---|---|---|---|---|---|
| Autoregressive Distributed Lag (ADL) | 15.9573 | 13.8988 | 14.7043 | 11.2449 | 14.3016 | 4 | 85.70 |
| Hierarchical Forecasting: | 17.0057 | 15.0344 | 15.2610 | 12.7246 | 15.1477 | 6 | 84.85 |
| Hierarchical Forecasting: | 16.1123 | 14.0466 | 14.6186 | 11.7883 | 14.3326 | 5 | 85.67 |
| Artificial Neural Network (ANN) | 11.6450 | 12.5917 | 15.1991 | 20.7552 | 13.8954 | 3 | 86.10 |
| Support Vector Machine (SVM) | 8.5833 | 8.1001 | 6.9631 | 6.1788 | 7.5316 | 2 | 92.47 |
| Random Forest | 6.7609 | 6.2222 | 4.6470 | 4.6981 | 5.4602 | 1 | 94.54 |
| Random Forest Less 2 variables | 7.8956 | 8.7415 | 5.4089 | 9.1792 | 8.3186 | 91.68 |
Fig. 4Total larvae, amount of rainfall and notified dengue cases.
Fig. 5Location of Rain gauge (Mobile Weather Station) at Selayang.
Fig. 6Location of Rain gauge (Mobile Weather Station) at Bandar Baru Bangi.
Rain gauge distance in kilometre (KM) and correlation (Selayang).
| Distance (KM) | RG1 | RG2 | RG3 | RG4 | RG5 | RG6 | RG7 | RG8 | RG9 | RG10 | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Correlation | |||||||||||
| _ | 0.97 | 3.88 | 4.37 | 4.85 | 6.14 | 4.04 | 2.59 | 1.62 | 2.91 | ||
| .893 | _ | 4.20 | 4.69 | 5.34 | 6.79 | 5.01 | 3.56 | 2.59 | 3.72 | ||
| .483 | .484 | _ | 0.89 | 2.26 | 3.23 | 3.49 | 4.38 | 3.56 | 1.78 | ||
| .481 | .485 | .773 | _ | 1.46 | 2.26 | 2.91 | 4.21 | 3.57 | 1.63 | ||
| .411 | .423 | .477 | .476 | _ | 1.45 | 1.78 | 3.72 | 3.55 | 1.77 | ||
| .162∗ | .188∗ | .269 | .306 | .375 | _ | 2.91 | 5.02 | 5.01 | 3.22 | ||
| .314 | .321 | .351 | .388 | .267 | .185∗ | _ | 2.27 | 2.59 | 1.94 | ||
| .508 | .512 | .492 | .519 | .363 | .286 | .431 | _ | 1.13 | 2.58 | ||
| .521 | .532 | .493 | .535 | .396 | .311 | .408 | .886 | _ | 2.1 | ||
| .392 | .390 | .608 | .634 | .410 | .221 | .344 | .520 | .685 | _ | ||
Sig. at 1%, ∗ Sig. at 5%.
Rain gauge distance in kilometre (KM) and correlation (Bandar Baru Bangi).
| Distance (KM) | RG11 | RG12 | RG13 | RG14 | RG15 | RG16 | RG17 | RG18 | RG19 | RG20 | RG21 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Correlation | ||||||||||||
| _ | 1.96 | 1.31 | 2.44 | 4.73 | 1.79 | 1.41 | 2.76 | 3.25 | 3.57 | 3.26 | ||
| .797 | _ | 2.12 | 3.41 | 5.03 | 1.47 | 2.44 | 1.79 | 4.54 | 5.03 | 3.91 | ||
| .871 | .756 | _ | 1.95 | 4.58 | 2.46 | 2.11 | 3.09 | 3.73 | 3.91 | 2.45 | ||
| .744 | .720 | .763 | _ | 3.28 | 3.71 | 2.93 | 4.55 | 3.43 | 3.11 | 1.63 | ||
| 0.078 | 0.108 | 0.074 | 0.149 | _ | 5.05 | 4.82 | 5.84 | 3.62 | 3.43 | 3.94 | ||
| .776 | .741 | .770 | .720 | 0.078 | _ | 1.78 | 1.30 | 4.06 | 4.54 | 4.21 | ||
| .810 | .746 | .804 | .711 | 0.067 | .815 | _ | 2.62 | 2.95 | 3.24 | 3.74 | ||
| .723 | .654 | .721 | .697 | 0.038 | .820 | .754 | _ | 4.70 | 5.35 | 5.02 | ||
| .415 | .395 | .433 | .515 | .228 | .465 | .461 | .433 | _ | 1.32 | 4.38 | ||
| .372 | .352 | .398 | .488 | .238 | .441 | .431 | .422 | .867 | _ | 4.05 | ||
| .542 | .537 | .583 | .680 | 0.119 | .576 | .538 | .573 | .436 | .430 | _ | ||
Sig. at 1%, ∗ Sig. at 5%.
Fig. 7Rainfall amount pattern of near and far rain gauge (Selayang).
Fig. 8Rainfall amount pattern of near and far rain gauge (Bandar Baru Bangi).
Moderate correlation & distance (KM).
| Area | Rain Gauge | Moderate Correlation | Distance (KM) |
|---|---|---|---|
| Selayang | RG1 & RG8 | 0.508 | 2.59 |
| RG1 & RG9 | 0.521 | 1.62 | |
| RG2 & RG8 | 0.512 | 3.56 | |
| RG2 & RG9 | 0.532 | 2.59 | |
| RG4 & RG8 | 0.519 | 4.21 | |
| RG4 & RG9 | 0.535 | 3.57 | |
| RG8 & RG10 | 0.520 | 2.58 | |
| Bandar Baru Bangi | RG19 & RG14 | 0.515 | 3.43 |
| RG21 & RG11 | 0.542 | 3.91 | |
| RG21 & RG12 | 0.537 | 2.45 | |
| RG21& RG13 | 0.583 | 4.21 | |
| RG21& RG16 | 0.576 | 3.74 | |
| RG21& RG17 | 0.538 | 5.02 | |
| RG21& RG18 | 0.573 | 3.26 | |
| Overall | Mean | 3.34 | |
| Standard Deviation | 0.89 | ||
| 95% Confidence Interval for Mean (Lower & Upper Bound) | |||
Fig. 9e-dengue early warning system conceptual framework.