Robert Kurniawan, I Nyoman Setiawan, Rezzy Eko Caraka, Bahrul Ilmi Nasution.
Abstract
JABODETABEK (Jakarta, Bogor, Depok, Tangerang, and Bekasi) experiences air pollution, in particular ozone concentrations that often exceed the healthy threshold, and seeks to prevent and control pollution and restore air quality. This study therefore builds a predictive model of ozone concentration using Harris hawks optimization-support vector regression (HHO-SVR) for 14 sub-districts in JABODETABEK. Ozone concentration (the response variable) and meteorological factors (the predictor variables) were collected from a website that provides the data. Time and the significant lags of ozone concentration, detected with the partial autocorrelation function, were used as additional predictor variables. The variables were then selected with recursive feature elimination-support vector regression (RFE-SVR) to obtain the predictors that significantly affect ozone concentration. The prediction model was built with HHO-SVR, that is, support vector regression (SVR) whose parameter values are optimized by the Harris hawks optimization (HHO) algorithm. The models were evaluated with mean absolute error (MAE), root mean square error (RMSE), mean absolute percentage error (MAPE), coefficient of determination (R²), variance ratio (VR), and the Diebold-Mariano test. The results indicate that RFE-SVR selected lag 1, lag 2, air temperature, humidity, and UV index as significant predictor variables for most sub-districts. In general, the HHO process takes longer than the other metaheuristic algorithms. In 7 of the 14 sub-districts the HHO-SVR model yielded the best predictions, with, on average, MAE below 10, RMSE and MAPE below 20, R² around 0.97, and VR around 0.98. The Diebold-Mariano test also shows that the HHO-SVR model predicts more accurately and performs more stably, especially for the Ciputat and South Bekasi sub-districts, which makes HHO-SVR particularly suitable for predicting ozone concentrations in these two sub-districts.
Keywords: HHO; JABODETABEK; Ozone; RFE; SVR
Year: 2022 PMID: 35125958 PMCID: PMC8801044 DOI: 10.1007/s00477-022-02178-2
Source DB: PubMed Journal: Stoch Environ Res Risk Assess ISSN: 1436-3240 Impact factor: 3.379
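The abstract uses lagged ozone values (lag 1 and lag 2 for most sub-districts) as predictor variables. The sketch below shows one way such lagged predictors could be built from an hourly series. The function name `make_lagged_rows` and the sample values are hypothetical and are not taken from the study.

```python
def make_lagged_rows(series, lags=(1, 2)):
    """Build predictor rows from past values of a series.

    Each row of X holds the values at the requested lags,
    and y holds the value being predicted at the same time step.
    """
    max_lag = max(lags)
    X, y = [], []
    # Start at max_lag so every requested lag exists for the first row.
    for t in range(max_lag, len(series)):
        X.append([series[t - k] for k in lags])
        y.append(series[t])
    return X, y


# Hypothetical hourly ozone values (µg/m³), for illustration only.
ozone = [40, 55, 70, 90, 85]
X, y = make_lagged_rows(ozone)
print(X)  # each row is [lag 1, lag 2]
print(y)
```

The first `max(lags)` observations are dropped because they have no complete lag history. In a fuller pipeline the meteorological predictors would be appended to each row before variable selection.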
Fig. 1 Research locus map
Research data
| No. | Variable | Description | Unit | Related research |
|---|---|---|---|---|
| 1. | Ozone concentration | One of the secondary pollutants formed from chemical reactions induced by direct or indirect light (photochemical) which has a negative impact on the environment, nature and humans | µg/m³ | Masmoudi et al. ( |
| 2. | Air temperature | The degree of heat from a place on the earth’s surface that is affected by the sun’s heat | °C | Masmoudi et al. ( |
| 3. | Wind speed | The rate of air flow caused by the difference in air pressure from one place to another | km/h | Masmoudi et al. ( |
| 4. | Air humidity | The amount of water vapor contained in the air | % | Masmoudi et al. ( |
| 5. | Dew point | The temperature at which water vapor condenses into water droplets because the air can no longer hold additional water vapor | °C | – |
| 6. | Ultraviolet (UV) Index | Index that shows the amount of ultraviolet light exposure to the earth’s surface | – | Masmoudi et al. ( |
| 7. | Air pressure | The buildup of air masses on the earth’s surface due to the air above pressing the air below | Millibar | Souza et al. ( |
| 8. | Visibility | The maximum distance that an observer can see from an observation point | km | Permadi and Kim Oanh ( |
| 9. | Time | The point in time at which an event, change, or state of an object occurs, or the length of time taken to do something | – | – |
Descriptive statistics of research data
| Variable | Training Max | Training Min | Training Mean | Training SD | Testing Max | Testing Min | Testing Mean | Testing SD |
|---|---|---|---|---|---|---|---|---|
| Ozone Concentration | 573 | 1 | 106.46 | 95.86 | 386 | 1 | 76.33 | 74.65 |
| Air Temperature | 37 | 22 | 28.32 | 2.84 | 33 | 23 | 27.47 | 2.34 |
| Wind Speed | 37 | 0 | 8.55 | 5.44 | 34 | 0 | 7.95 | 5.29 |
| Air Humidity | 98 | 33 | 74.13 | 11.90 | 98 | 55 | 81.28 | 8.85 |
| Dew Point | 26 | 17 | 22.93 | 1.03 | 26 | 21 | 23.69 | 0.74 |
| UV Index | 11 | 0 | 2.89 | 4.18 | 11 | 0 | 2.80 | 4.12 |
| Air Pressure | 1013.9 | 1005.1 | 1009.67 | 1.50 | 1013.6 | 1006.1 | 1009.86 | 1.33 |
| Visibility | 16.1 | 0.2 | 8.66 | 3.13 | 16.1 | 0.8 | 7.59 | 3.28 |
| Time | 23 | 0 | 11.44 | 6.92 | 23 | 0 | 11.76 | 6.94 |
Fig. 2 Web scraping flowchart
Fig. 3 SVM (left) and SVR (right)
Fig. 4 SVR with slack variable
Fig. 5 Pseudocode RFE-SVR
Fig. 6 HHO flowchart
Fig. 7 HHO-SVR flowchart
Fig. 8 Research flow
Fig. 9 PACF correlogram of ozone data
Fig. 10 The results of variable selection
Optimization results for SVR parameters
| Sub-district | Metaheuristic algorithm | Iteration | Time (hh:mm:ss) | Average time per iteration (hh:mm:ss) | Cost | Gamma | Epsilon |
|---|---|---|---|---|---|---|---|
| (1) | (2) | (3) | (4) | (5) | (6) | (7) | (8) |
| Kembangan | ALO | 5 | 00:03:16 | 00:00:39 | 423.2400 | 0.3422 | 0.0104 |
| | DA | 18 | 00:04:27 | 00:00:15 | 1024.0000 | 0.0308 | 0.0039 |
| | GWO | 15 | 00:06:47 | 00:00:27 | 915.8583 | 0.0316 | 0.0039 |
| | HHO | 12 | 00:03:37 | 00:00:18 | 44.5637 | 0.0594 | 0.0039 |
| Menteng | ALO | 5 | 00:06:00 | 00:01:12 | 425.2400 | 0.3500 | 0.0039 |
| | DA | 21 | 00:09:02 | 00:00:26 | 504.8412 | 0.0717 | 0.0039 |
| | GWO | 11 | 00:06:35 | 00:00:36 | 294.9462 | 0.0821 | 0.0039 |
| | HHO | 12 | 00:07:06 | 00:00:36 | 148.6889 | 0.0945 | 0.0039 |
| Kebayoran Baru | ALO | 5 | 00:04:47 | 00:00:57 | 424.2400 | 0.3461 | 0.0143 |
| | DA | 5 | 00:01:07 | 00:00:13 | 424.2400 | 0.3461 | 0.0143 |
| | GWO | 8 | 00:08:30 | 00:01:04 | 270.9313 | 0.1831 | 0.0039 |
| | HHO | 19 | 00:49:27 | 00:02:36 | 383.1371 | 0.1676 | 0.0047 |
| Cakung | ALO | 6 | 00:04:43 | 00:00:47 | 425.2400 | 0.0078 | 0.0039 |
| | DA | 16 | 00:04:58 | 00:00:19 | 65.3659 | 0.1000 | 0.0039 |
| | GWO | 7 | 00:02:07 | 00:00:18 | 27.5240 | 0.1099 | 0.0051 |
| | HHO | 9 | 00:12:03 | 00:01:20 | 93.5102 | 0.0774 | 0.0039 |
| Koja | ALO | 5 | 00:03:15 | 00:00:39 | 423.2400 | 0.3422 | 0.0104 |
| | DA | 9 | 00:02:05 | 00:00:14 | 436.9753 | 0.3380 | 0.0101 |
| | GWO | 12 | 00:03:01 | 00:00:15 | 26.7147 | 0.1578 | 0.0123 |
| | HHO | 9 | 00:09:11 | 00:01:01 | 27.4987 | 0.1568 | 0.0039 |
| North Thousand Islands | ALO | 5 | 00:01:58 | 00:00:24 | 424.2400 | 0.8480 | 0.0039 |
| | DA | 13 | 00:02:09 | 00:00:10 | 613.4428 | 0.7821 | 0.0039 |
| | GWO | 6 | 00:03:29 | 00:00:35 | 623.6411 | 0.7765 | 0.0039 |
| | HHO | 8 | 00:11:47 | 00:01:28 | 827.2026 | 0.7257 | 0.0039 |
| Central Bogor | ALO | 5 | 00:02:58 | 00:00:36 | 506.6380 | 0.0084 | 0.0039 |
| | DA | 7 | 00:01:41 | 00:00:14 | 511.6969 | 0.0376 | 0.0039 |
| | GWO | 6 | 00:11:05 | 00:01:51 | 981.0511 | 0.0492 | 0.0039 |
| | HHO | 9 | 00:07:58 | 00:00:53 | 1024.0000 | 0.0071 | 0.0039 |
| Cibinong | ALO | 6 | 00:03:36 | 00:00:36 | 1024.0000 | 0.0971 | 0.0039 |
| | DA | 11 | 00:02:36 | 00:00:14 | 733.0358 | 0.1068 | 0.0039 |
| | GWO | 19 | 00:12:37 | 00:00:40 | 625.2154 | 0.1143 | 0.0039 |
| | HHO | 9 | 00:04:40 | 00:00:31 | 348.8484 | 0.1239 | 0.0039 |
| Pancoran Mas | ALO | 6 | 00:04:58 | 00:00:50 | 425.2400 | 0.8519 | 0.0039 |
| | DA | 13 | 00:04:29 | 00:00:21 | 755.6371 | 0.7260 | 0.0039 |
| | GWO | 9 | 00:08:52 | 00:00:59 | 344.2201 | 1.0000 | 0.0049 |
| | HHO | 10 | 00:21:04 | 00:02:06 | 880.3684 | 0.7510 | 0.0039 |
| Tangerang | ALO | 43 | 00:12:11 | 00:00:17 | 489.8787 | 0.0619 | 0.0946 |
| | DA | 5 | 00:00:56 | 00:00:11 | 424.2400 | 0.3461 | 0.0143 |
| | GWO | 19 | 00:07:52 | 00:00:25 | 964.0312 | 0.0406 | 0.0039 |
| | HHO | 6 | 00:01:52 | 00:00:19 | 77.6844 | 0.0319 | 0.0039 |
| Tigaraksa | ALO | 9 | 00:07:39 | 00:00:51 | 512.5000 | 0.0117 | 0.0039 |
| | DA | 5 | 00:01:14 | 00:00:15 | 424.2400 | 0.3461 | 0.0143 |
| | GWO | 12 | 00:04:55 | 00:00:25 | 1024.0000 | 0.0109 | 0.0045 |
| | HHO | 9 | 00:03:27 | 00:00:23 | 183.6585 | 0.0133 | 0.0039 |
| Ciputat | ALO | 6 | 00:09:44 | 00:01:37 | 937.7400 | 0.3500 | 0.0039 |
| | DA | 24 | 00:08:37 | 00:00:22 | 892.3696 | 0.0269 | 0.0039 |
| | GWO | 14 | 00:07:54 | 00:00:34 | 1024.0000 | 0.0253 | 0.0039 |
| | HHO | 12 | 00:06:16 | 00:00:31 | 75.4711 | 0.0724 | 0.0040 |
| South Bekasi | ALO | 5 | 00:03:08 | 00:00:38 | 423.2400 | 0.3422 | 0.0104 |
| | DA | 9 | 00:01:45 | 00:00:12 | 436.9753 | 0.3380 | 0.0101 |
| | GWO | 13 | 00:04:44 | 00:00:22 | 26.8120 | 0.5036 | 0.0051 |
| | HHO | 13 | 00:05:47 | 00:00:27 | 111.7048 | 0.0924 | 0.0039 |
| Central Cikarang | ALO | 5 | 00:03:39 | 00:00:44 | 1019.1380 | 0.0084 | 0.0039 |
| | DA | 7 | 00:01:38 | 00:00:14 | 511.6969 | 0.0376 | 0.0039 |
| | GWO | 19 | 00:20:10 | 00:01:04 | 1024.0000 | 0.0297 | 0.0039 |
| | HHO | 18 | 00:06:53 | 00:00:23 | 85.3265 | 0.0504 | 0.0161 |
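Column (5) of the table above is consistent with dividing the total run time in column (4) by the iteration count in column (3) and rounding to the nearest second. The following sketch reproduces that arithmetic; the helper names are hypothetical, and the rounding rule is an assumption inferred from the tabulated values rather than stated in the source.

```python
def to_seconds(hms):
    """Convert an 'hh:mm:ss' string to a number of seconds."""
    h, m, s = (int(part) for part in hms.split(":"))
    return h * 3600 + m * 60 + s


def to_hms(seconds):
    """Convert a whole number of seconds to an 'hh:mm:ss' string."""
    h, rem = divmod(seconds, 3600)
    m, s = divmod(rem, 60)
    return f"{h:02d}:{m:02d}:{s:02d}"


def average_per_iteration(total_hms, iterations):
    """Average time per iteration, rounded to the nearest second (assumed rule)."""
    avg = to_seconds(total_hms) / iterations
    return to_hms(int(avg + 0.5))


# Kembangan, HHO: 12 iterations in 00:03:37.
print(average_per_iteration("00:03:37", 12))
# Kebayoran Baru, HHO: 19 iterations in 00:49:27.
print(average_per_iteration("00:49:27", 19))
```

Comparing algorithms by column (5) rather than column (4) separates the cost of a single iteration from the number of iterations each algorithm needed before stopping.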
The results of the evaluation using MAE, RMSE, and MAPE
| Sub-district | Model | Training MAE | Training RMSE | Training MAPE | Test MAE | Test RMSE | Test MAPE |
|---|---|---|---|---|---|---|---|
| (1) | (2) | (3) | (4) | (5) | (6) | (7) | (8) |
| Kembangan | ALO-SVR | 5.2914 | 10.7730 | 8.1027 | 9.3175 | 17.2796 | 15.8704 |
| | DA-SVR | 7.2557 | 13.8478 | 9.5405 | 7.7864 | 12.7201 | 14.0976 |
| | GWO-SVR | 7.2543 | 13.8461 | 9.5432 | 7.7866 | 12.6953 | 14.0948 |
| | HHO-SVR | 7.3221 | 13.8446 | 9.6312 | 7.7494 | 12.5356 | 14.1448 |
| Menteng | ALO-SVR | 4.0348 | 10.4083 | 8.8457 | 14.6883 | 30.1129 | 22.2783 |
| | DA-SVR | 6.4475 | 13.6066 | 11.0240 | 9.7110 | 17.6528 | 16.5291 |
| | GWO-SVR | 6.4216 | 13.5805 | 10.9852 | 9.7580 | 17.7086 | 16.5053 |
| | HHO-SVR | 6.4429 | 13.6415 | 11.0006 | 9.6904 | 17.4843 | 16.4469 |
| Kebayoran Baru | ALO-SVR | 4.3023 | 9.6932 | 9.5891 | 12.5415 | 22.4830 | 22.0317 |
| | DA-SVR | 4.3023 | 9.6932 | 9.5891 | 12.5415 | 22.4830 | 22.0317 |
| | GWO-SVR | 5.2632 | 11.6757 | 9.6654 | 9.4754 | 16.2822 | 16.7167 |
| | HHO-SVR | 5.2997 | 11.6945 | 9.7375 | 9.4141 | 16.2255 | 16.6123 |
| Cakung | ALO-SVR | 7.3064 | 13.9228 | 8.7495 | 8.0526 | 12.2118 | 13.8328 |
| | DA-SVR | 5.5125 | 10.7260 | 7.1123 | 8.4629 | 13.2993 | 14.2874 |
| | GWO-SVR | 5.6827 | 10.8655 | 7.2478 | 8.3150 | 13.0361 | 13.9572 |
| | HHO-SVR | 5.7532 | 10.9907 | 7.3096 | 8.2574 | 13.0409 | 13.4381 |
| Koja | ALO-SVR | 4.8255 | 10.5008 | 6.2644 | 8.2608 | 17.0927 | 12.1320 |
| | DA-SVR | 4.8323 | 10.5241 | 6.2785 | 8.2626 | 17.1066 | 12.2086 |
| | GWO-SVR | 6.2582 | 12.2373 | 7.6996 | 7.9440 | 13.0928 | 12.3278 |
| | HHO-SVR | 6.1894 | 12.2523 | 7.6285 | 7.9641 | 13.2332 | 12.2090 |
| North Thousand Islands | ALO-SVR | 4.8075 | 11.2435 | 4.3916 | 3.0032 | 5.7569 | 4.3524 |
| | DA-SVR | 4.8106 | 11.2581 | 4.3886 | 2.9592 | 5.7113 | 4.2635 |
| | GWO-SVR | 4.8113 | 11.2569 | 4.3905 | 2.9742 | 5.7196 | 4.2997 |
| | HHO-SVR | 4.8155 | 11.2681 | 4.3913 | 2.9592 | 5.6995 | 4.2700 |
| Central Bogor | ALO-SVR | 5.7810 | 10.7041 | 12.1166 | 6.2478 | 10.2258 | 15.8929 |
| | DA-SVR | 4.3874 | 8.9101 | 10.7906 | 6.6115 | 11.1947 | 15.0112 |
| | GWO-SVR | 3.7080 | 8.0739 | 9.8102 | 7.3733 | 12.3152 | 15.6830 |
| | HHO-SVR | 5.7481 | 10.6587 | 12.1076 | 6.2669 | 10.2410 | 15.7519 |
| Cibinong | ALO-SVR | 8.3231 | 16.0574 | 21.3344 | 8.0667 | 14.3722 | 20.6413 |
| | DA-SVR | 8.3122 | 16.0201 | 21.1645 | 8.0651 | 14.3961 | 20.5157 |
| | GWO-SVR | 8.2839 | 15.9428 | 20.9129 | 8.0332 | 14.3770 | 20.1301 |
| | HHO-SVR | 8.3287 | 16.0903 | 21.1599 | 8.1089 | 14.4361 | 20.5762 |
| Pancoran Mas | ALO-SVR | 6.0867 | 13.5747 | 23.8343 | 10.4548 | 28.1467 | 23.6148 |
| | DA-SVR | 6.2118 | 13.6298 | 23.6386 | 10.5847 | 29.1693 | 23.6061 |
| | GWO-SVR | 5.8503 | 13.2998 | 23.3867 | 10.7510 | 29.1245 | 23.9000 |
| | HHO-SVR | 6.1109 | 13.5202 | 23.4747 | 10.7361 | 30.1391 | 23.6922 |
| Tangerang | ALO-SVR | 6.7808 | 11.9114 | 8.7512 | 7.4428 | 12.0701 | 13.6102 |
| | DA-SVR | 4.8350 | 9.7954 | 7.1156 | 8.8736 | 14.9054 | 15.7767 |
| | GWO-SVR | 6.4645 | 12.3794 | 8.5280 | 6.4189 | 10.1794 | 12.4010 |
| | HHO-SVR | 7.1660 | 13.4689 | 8.8960 | 6.4355 | 10.5223 | 12.3587 |
| Tigaraksa | ALO-SVR | 6.3821 | 11.6606 | 8.9372 | 5.2927 | 8.3243 | 13.5201 |
| | DA-SVR | 3.3725 | 6.6176 | 6.3227 | 11.5291 | 20.9583 | 24.2082 |
| | GWO-SVR | 6.3266 | 11.5929 | 8.8641 | 5.2129 | 8.2879 | 13.2639 |
| | HHO-SVR | 6.4577 | 11.8330 | 8.9629 | 5.3653 | 8.3445 | 13.7711 |
| Ciputat | ALO-SVR | 3.8172 | 9.7248 | 10.6004 | 12.1429 | 22.0601 | 24.4188 |
| | DA-SVR | 7.7853 | 14.4656 | 14.0519 | 7.4201 | 12.7418 | 17.1620 |
| | GWO-SVR | 7.8086 | 14.5078 | 14.0274 | 7.3996 | 12.7125 | 16.8982 |
| | HHO-SVR | 7.4060 | 13.9598 | 13.6100 | 7.1641 | 12.3950 | 17.8710 |
| South Bekasi | ALO-SVR | 6.7232 | 12.9864 | 8.9698 | 7.7912 | 13.7529 | 12.3903 |
| | DA-SVR | 6.7256 | 12.9968 | 8.9643 | 7.7794 | 13.7241 | 12.3614 |
| | GWO-SVR | 6.7894 | 13.0630 | 8.9943 | 7.7574 | 13.4720 | 12.7105 |
| | HHO-SVR | 7.9507 | 14.6241 | 10.2255 | 7.6057 | 12.6481 | 12.1553 |
| Central Cikarang | ALO-SVR | 5.8243 | 10.3448 | 11.8884 | 7.0909 | 10.2859 | 18.1418 |
| | DA-SVR | 4.7988 | 9.3083 | 10.6533 | 7.4732 | 11.1613 | 19.3587 |
| | GWO-SVR | 4.8574 | 9.3812 | 10.5582 | 7.3663 | 11.0528 | 18.8495 |
| | HHO-SVR | 5.0870 | 9.4838 | 11.2709 | 7.2826 | 10.7823 | 18.7812 |
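The three error metrics reported above follow their standard definitions. A minimal sketch in plain Python is given below; the function names and the sample values are illustrative and do not come from the study. MAPE is expressed as a percentage, which assumes no observed value is zero.

```python
import math


def mae(actual, predicted):
    """Mean absolute error."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def rmse(actual, predicted):
    """Root mean square error."""
    mse = sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)
    return math.sqrt(mse)


def mape(actual, predicted):
    """Mean absolute percentage error, in percent (actual values must be nonzero)."""
    ratios = (abs((a - p) / a) for a, p in zip(actual, predicted))
    return 100.0 * sum(ratios) / len(actual)


# Hypothetical observed and predicted ozone values (µg/m³).
observed = [100, 200, 50]
forecast = [110, 190, 55]
print(mae(observed, forecast), rmse(observed, forecast), mape(observed, forecast))
```

RMSE is never smaller than MAE on the same data because squaring weights large errors more heavily, which matches the pattern in every row of the table.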
Model evaluation with R² and VR
| Sub-district | R² ALO-SVR | R² DA-SVR | R² GWO-SVR | R² HHO-SVR | VR ALO-SVR | VR DA-SVR | VR GWO-SVR | VR HHO-SVR |
|---|---|---|---|---|---|---|---|---|
| (1) | (2) | (3) | (4) | (5) | (6) | (7) | (8) | (9) |
| Kembangan | 0.9507 | 0.9728 | 0.9729 | 0.9735 | 0.9902 | 0.9951 | 0.9955 | 0.9988 |
| Menteng | 0.8883 | 0.9616 | 0.9613 | 0.9624 | 0.9855 | 0.9720 | 0.9735 | 0.9742 |
| Kebayoran Baru | 0.9332 | 0.9332 | 0.9637 | 0.9641 | 0.9987 | 0.9987 | 0.9630 | 0.9640 |
| Cakung | 0.9742 | 0.9693 | 0.9706 | 0.9704 | 0.9420 | 0.9510 | 0.9495 | 0.9512 |
| Koja | 0.9540 | 0.9536 | 0.9722 | 0.9716 | 0.9720 | 0.9720 | 0.9736 | 0.9772 |
| North Thousand Islands | 0.9708 | 0.9712 | 0.9711 | 0.9713 | 0.9854 | 0.9873 | 0.9857 | 0.9861 |
| Central Bogor | 0.9763 | 0.9717 | 0.9658 | 0.9762 | 0.9961 | 0.9989 | 0.9935 | 0.9947 |
| Cibinong | 0.9705 | 0.9705 | 0.9705 | 0.9703 | 0.9946 | 0.9963 | 0.9963 | 0.9990 |
| Pancoran Mas | 0.9196 | 0.9149 | 0.9150 | 0.9102 | 0.8373 | 0.8272 | 0.8293 | 0.8194 |
| Tangerang | 0.9699 | 0.9552 | 0.9782 | 0.9766 | 0.9912 | 0.9670 | 0.9894 | 0.9974 |
| Tigaraksa | 0.9796 | 0.8789 | 0.9796 | 0.9797 | 0.9890 | 0.9809 | 0.9911 | 0.9874 |
| Ciputat | 0.9283 | 0.9761 | 0.9762 | 0.9776 | 0.9204 | 0.9762 | 0.9774 | 0.9759 |
| South Bekasi | 0.9680 | 0.9682 | 0.9694 | 0.9729 | 0.9956 | 0.9953 | 0.9973 | 0.9792 |
| Central Cikarang | 0.9756 | 0.9715 | 0.9720 | 0.9732 | 0.9534 | 0.9858 | 0.9855 | 0.9930 |
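The sketch below computes the two quantities in the table above on toy data. R² uses the standard definition, one minus the ratio of residual to total sum of squares. The source page does not define VR, so the code assumes it is the variance of the predictions divided by the variance of the observations. That definition, like the function names and sample values, is an assumption made for illustration.

```python
def r_squared(actual, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_actual = sum(actual) / len(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)
    return 1.0 - ss_res / ss_tot


def population_variance(values):
    """Population variance (divides by n)."""
    m = sum(values) / len(values)
    return sum((v - m) ** 2 for v in values) / len(values)


def variance_ratio(actual, predicted):
    """ASSUMED definition of VR: var(predicted) / var(actual)."""
    return population_variance(predicted) / population_variance(actual)


# Hypothetical observed and predicted values.
observed = [1, 2, 3, 4]
forecast = [1.5, 2, 3, 3.5]
print(r_squared(observed, forecast))
print(variance_ratio(observed, forecast))
```

Under this assumed definition, a VR near 1 means the predictions reproduce the spread of the observations, while a lower value means the model smooths out peaks.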
The test results using DM test
| Sub-district | Diebold–Mariano value (p-value): ALO-SVR | Diebold–Mariano value (p-value): DA-SVR | Diebold–Mariano value (p-value): GWO-SVR |
|---|---|---|---|
| (1) | (2) | (3) | (4) |
| Kembangan | 2.4560 (0.0073)* | 1.1126 (0.1334) | 1.1510 (0.1254) |
| Menteng | 2.7892 (0.0028)* | 0.6047 (0.2730) | 2.1304 (0.0170)* |
| Kebayoran Baru | 3.7392 (0.0001)* | 3.7392 (0.0001)* | 1.1548 (0.1246) |
| Cakung | −1.2629 (0.8962) | 1.6765 (0.0474)* | −0.1412 (0.5561) |
| Koja | 1.6350 (0.0516)* | 1.6407 (0.0510)* | −1.8830 (0.9696) |
| North Thousand Islands | 1.6907 (0.0460)* | 1.1254 (0.1307) | 1.4833 (0.0696)* |
| Central Bogor | −0.6645 (0.7465) | 2.2871 (0.0115)* | 3.4096 (0.0004)* |
| Cibinong | −0.8258 (0.7952) | −0.9871 (0.8378) | −1.6719 (0.9522) |
| Pancoran Mas | −1.4474 (0.9255) | −1.8925 (0.9703) | −0.7039 (0.7590) |
| Tangerang | 3.0974 (0.0011)* | 2.7948 (0.0028)* | −0.5093 (0.6945) |
| Tigaraksa | −0.3301 (0.6292) | 4.7196 (0.0000)* | −0.5793 (0.7186) |
| Ciputat | 3.5289 (0.0002)* | 1.5488 (0.0613)* | 1.3353 (0.0914)* |
| South Bekasi | 1.4808 (0.0699)* | 1.4768 (0.0704)* | 1.6556 (0.0495)* |
| Central Cikarang | −1.7172 (0.9565) | 2.3443 (0.0099)* | 1.6605 (0.0490)* |
*Significant at the 10% level
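The tabulated p-values fall below 0.5 for positive statistics and above 0.5 for negative ones, which is consistent with a one-sided, upper-tail test. The sketch below shows a simplified Diebold-Mariano statistic under three stated assumptions: squared-error loss, a one-step forecast horizon (no autocovariance correction), and a standard normal reference distribution. The study may have used different settings, so this code is not expected to reproduce the published values exactly. All names and sample errors are hypothetical.

```python
import math
from statistics import mean


def dm_test(errors_a, errors_b):
    """Simplified Diebold-Mariano test with squared-error loss.

    The loss differential is d_t = e_a,t^2 - e_b,t^2, so a positive
    statistic indicates that model B has the smaller squared errors.
    Returns (statistic, one-sided upper-tail p-value).
    """
    d = [ea ** 2 - eb ** 2 for ea, eb in zip(errors_a, errors_b)]
    n = len(d)
    d_bar = mean(d)
    # Variance of the loss differential; horizon 1 assumed, so no lag terms.
    var_d = sum((x - d_bar) ** 2 for x in d) / n
    stat = d_bar / math.sqrt(var_d / n)
    # Upper-tail probability under the standard normal (assumed reference).
    p_upper = 0.5 * math.erfc(stat / math.sqrt(2))
    return stat, p_upper


# Hypothetical forecast errors: model B is consistently closer to the truth.
errors_a = [3, -2, 4, -3, 2, 3]
errors_b = [1, -1, 2, -2, 1, 1]
stat, p_value = dm_test(errors_a, errors_b)
print(stat, p_value)
```

Swapping the two error series flips the sign of the statistic and turns the p-value into its complement, which mirrors how the negative entries in the table pair with p-values close to 1.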