| Literature DB >> 31771273 |
Dong-Her Shih1, Ting-Wei Wu1, Wen-Xuan Liu1, Po-Yuan Shih2.
Abstract
With the development of industrialization and urbanization, air pollution in many countries has become more serious and has affected people's health. The air quality has been continuously concerned by environmental managers and the public. Therefore, accurate air quality deterioration warning system can avoid health hazards. In this study, an air quality index (AQI) warning system based on Azure cloud computing platform is proposed. The prediction model is based on DFR (Decision Forest Regression), NNR (Neural Network Regression), and LR (Linear Regression) machine learning algorithms. The best algorithm was selected to calculate the 6 pollutants required for the AQI calculation of the air quality monitoring in real time. The experimental results show that the LR algorithm has the best performance, and the method of this study has a good prediction on the AQI index warning for the next one to three hours. Based on the ACES system proposed, it is hoped that it can prevent personal health hazards and help to reduce medical costs in public.Entities:
Keywords: AQI; Azure; air pollution; cloud computing; machine learning
Mesh:
Substances:
Year: 2019 PMID: 31771273 PMCID: PMC6926579 DOI: 10.3390/ijerph16234679
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Table for individualized air quality index (IAQI) and AQI formulas.
| Calculation | Symbol | Explanation |
|---|---|---|
| IAQI | IAQIP | Individual air quality index of pollutant item P. |
| CP | Concentration value of pollutant item P. | |
| BPHi | The upper limit for classification of pollutant items and CPs. | |
| BPLo | The lower limit for classification of pollutant and CPs | |
| IAQIHi | The upper limit of AQI classification corresponding to BPHi for pollutant items. | |
| IAQILo | The lower grading limit of AQI value corresponding to BPLo for pollutant items. | |
| AQI | IAQI | Individual air quality index. |
| n | Pollutant projects. |
Research on Air Pollution in Recent Years.
| Research Category | Method | Pollution Index | Author |
|---|---|---|---|
| Discussion on the Influences | Gauss Distribution | CO | Pan et al., [ |
| Statistical Analysis | PM2.5, NO2, SO2 | Ng et al., [ | |
| NO2, NOx | Hjortebjerg et al., [ | ||
| PM10, NO2, SO2 | Deng et al., [ | ||
| PM2.5, PM10, NO2, SO2, CO, O3 | Lee et al., [ | ||
| PM10, O3 | Lichter et al., [ | ||
| PM2.5, BC | Kingsley et al., [ | ||
| Literature Review | PM2.5, PM10, NO2, SO2, CO, O3 | Vizcaino et al., [ | |
| PM2.5, PM10 | Chen et al., [ | ||
| Santibáñez-Andrade et al., [ | |||
| Time Series | PM10, NO2, SO2 | Ma et al., [ | |
| PM2.5 | Li et al., [ | ||
| Prediction of Air Quality Index | Machine Learning | PM2.5 | Perez & Gramsch, [ |
| NO2, SO2, O3 | Shaban et al., [ | ||
| PM2.5 | Zhan et al., [ | ||
| AQI | Wang et al., [ | ||
| Chen et al., [ | |||
| Statistical Model | PM2.5 | Dong et al., [ | |
| Xu & Wang, [ | |||
| AQI | Zhu et al., [ | ||
| Numerical Analysis | NO, NO2, SO2, CO, O3 | Feng et al., [ | |
| IoT Monitoring | PM2.5 | Chen et al., [ |
Figure 1System Architecture Diagram.
Figure 2Data Collection and Preprocessing Module Diagram.
Figure 3Prediction Model Constructing and Application Module Diagram.
Figure 4Decision Module Diagram.
Figure 5Algorithm for Early Warning Alert Decision.
Figure 6Early Warning Module Diagram.
Azure adopt function.
| Item | Use of Efficiency Layer |
|---|---|
| App Service | B1 (Cores: 1, RAM: 1.75 GB, Storage: 10 GB, Disk Space: 10 GB) |
| SQL Database | S0 (DTUs: 10, Included Storage: 250 GB) |
| Machine Learning Studio | S1 (Included transactions: 100,000, Included compute hours: 25, Total number of web services: 10) |
| Storage | Standard |
Figure 7Prediction Model Establishment.
Figure 8Time history chart of experimental data.
Figure 9Overall experimental flow chart.
Figure 10Window Size test schematic.
Testing results of different window size.
| Window Size | 1 | 2 | 3 | 4 | |
|---|---|---|---|---|---|
| Performance | |||||
| MAE | 3.833 | 3.868 | 3.824 | 3.633 | |
| RMSE | 5.302 | 5.671 | 5.341 | 4.969 | |
| R2 | 0.881 | 0.868 | 0.882 | 0.898 | |
Figure 11Genetic representation of model training.
Figure 12Flow chart of model prediction.
Data Source Content and variables description.
| Data Source | Variables | Data Field | Measurement/Units | Related Study |
|---|---|---|---|---|
| EPA | NA | Date Time | yyyy/MM/ddHH:mm:ss | Lee et al., [ |
| NA | Observatory Name | Station name/NA | ||
| SO2(t + 1),…,SO2(t + 6) | Sulfur dioxide/ppb | |||
| CO(t + 1),…,CO(t + 6) | Carbon monoxide/ppm | |||
| O3(t + 1),…,O3(t + 6) | Ozone/ppb | |||
| PM10(t + 1),…,PM10(t + 6) | Suspended particulates/μg/m3 | |||
| PM2.5(t + 1),…,PM2.5(t + 6) | Particulate matter/μg/m3 | |||
| NO2(t + 1),…,NO2(t + 6) | Nitrogen dioxide/ppb | |||
| SO2(t + 1),…,SO2(t + 5) | Sulfur dioxide/ppb | |||
| CO(t + 1),…,CO(t + 5) | Carbon monoxide/ppm | |||
| O3(t + 1),…,O3(t + 5) | Ozone/ppb | |||
| PM10(t + 1),…,PM10(t + 5) | Suspended particulates/μg/m3 | |||
| PM2.5(t + 1),…,PM2.5(t + 5) | Particulate matter/μg/m3 | |||
| NO2(t + 1),…,NO2(t + 5) | Nitrogen dioxide/ppb | |||
| SO2(t − 3),…,SO2(t) | Sulfur dioxide/ppb | |||
| CO(t − 3),…,CO(t) | Carbon monoxide/ppm | |||
| O3(t − 3),…,O3(t) | Ozone/ppb | |||
| PM10(t − 3),…,PM10(t) | Suspended particulates/μg/m3 | |||
| PM2.5(t − 3),…,PM2.5(t) | Particulate matter/μg/m3 | |||
| NO2(t − 3),…,NO2(t) | Nitrogen dioxide/ppb | |||
| NOX(t − 3),…,NOX(t) | Nitrogen oxide/ppb | Hjortebjerg et., [ | ||
| NO(t − 3),…,NO(t) | Nitric oxide/ppb | Feng et al., [ | ||
| AMB_TEMP(t − 3),…,AMB_TEMP(t) | Atmospheric temperature/°C | Voukantsis et al., [ | ||
| RAINFALL(t − 3),…,RAINFALL(t) | Rainfall/mm | Sun et al., [ | ||
| RH(t − 3),…,RH(t) | Relative humidity/% | Voukantsis et al., [ | ||
| WIND_SPEED(t − 3),…,WIND_SPEED(t) | Wind speed/m/sec | Heyes et al., [ | ||
| WIND_DIREC(t − 3),…,WIND_DIREC(t) | Wind direction/degress | |||
| WS_HR(t − 3),…,WS_HR(t) | Wind speed per hour/m/sec | Voukantsis et al., [ | ||
| WD_HR(t − 3),…,WS_HR(t) | Wind direction per hour/degress | Heyes et al., [ |
Notes: y = Output variables; x = Input variables; x = Input Variables Predicted value of y.
Comparison Table of AQI and classification.
| AQI Value | Health Effects | Status in Color |
|---|---|---|
| 0–50 | good | green |
| 51–100 | ordinary | yellow |
| 101–150 | Poor to sensitive | orange |
| 151–200 | Bad | red |
| 201–300 | Very bad | purple |
| 301–500 | Harmful | maroon |
Primitive Structure of Historical Data.
| Data Field | Content |
|---|---|
| Date | (yyyy/MM/dd) |
| Station | Name of station (example: DouLiu, LunBei etc.) |
| Items | Monitoring items (example: SO2, CO, O3 etc.) |
| Hour | Hourly monitoring item values, 00~23 (24 h) |
Real-time data original structure.
| Data Field | Items/Unit | Notes |
|---|---|---|
| Observatory_Name | Station name/NA | Non-input variable |
| DateTime | yyyy/MM/dd HH:mm:ss | |
| SO2 | Sulfur dioxide/ppb | |
| CO | Carbon monoxide/ppm | |
| O3 | Ozone/ppb | |
| PM10 | Suspended particulates/μg/m3 | |
| PM2.5 | Particulate matter/μg/m3 | |
| NO2 | Nitrogen dioxide/ppb | |
| NOX | Nitrogen oxide/ppb | |
| NO | Nitric oxide/ppb | |
| THC | Total hydrocarbon/ppm | Delete in subsequent processing |
| NMHC | Non-methane hydrocarbons/ppm | |
| CH4 | Methane/ppm | |
| UVB | UV index/UVI | |
| AMB_TEMP | Atmospheric temperature/°C | |
| RAINFALL | Rainfall/mm | |
| RH | Relative humidity/% | |
| WIND_SPEED | Wind speed/m/sec | |
| WIND_DIREC | Wind direction/degress | |
| WS_HR | Wind speed per hour/m/sec | |
| WD_HR | Wind direction per hour/degress | |
| PH_RAIN | PH (acid rain)/pH | Delete in subsequent processing |
| RAIN_COND | Conductivity (acid rain)/μS/cm |
Performance comparison of algorithms and pollutant.
| Pollutant | Algorithms | MAE | RMSE | R2 |
|---|---|---|---|---|
| SO2 | DFR | 0.778 | 1.642 | 0.556 |
| LR | 0.747 | 1.592 | 0.583 | |
| NNR | 0.793 | 1.624 | 0.566 | |
| CO | DFR | 0.061 | 0.115 | 0.808 |
| LR | 0.061 | 0.117 | 0.802 | |
| NNR | 0.059 | 0.112 | 0.817 | |
| O3 | DFR | 3.867 | 5.596 | 0.917 |
| LR | 3.852 | 5.557 | 0.918 | |
| NNR | 3.967 | 5.611 | 0.916 | |
| PM10 | DFR | 5.144 | 7.758 | 0.926 |
| LR | 4.849 | 7.452 | 0.932 | |
| NNR | 12.894 | 16.826 | 0.656 | |
| PM2.5 | DFR | 3.573 | 4.961 | 0.904 |
| LR | 3.363 | 4.675 | 0.914 | |
| NNR | 4.784 | 6.201 | 0.850 | |
| NO2 | DFR | 2.539 | 3.756 | 0.839 |
| LR | 2.461 | 3.641 | 0.848 | |
| NNR | 2.458 | 3.679 | 0.845 |
Overall Performance in Training Stage.
| Pollutant | Performance | ||||||
|---|---|---|---|---|---|---|---|
| AQI | MAE | 5.051 | 2.930 | 2.974 | 3.004 | 3.066 | 3.133 |
| RMSE | 11.458 | 4.324 | 4.562 | 4.668 | 5.261 | 5.287 | |
| R2 | 0.897 | 0.986 | 0.984 | 0.983 | 0.978 | 0.978 | |
| SO2 | MAE | 0.832 | 0.751 | 0.751 | 0.755 | 0.757 | 0.760 |
| RMSE | 1.784 | 1.621 | 1.625 | 1.620 | 1.629 | 1.644 | |
| R2 | 0.483 | 0.5740 | 0.569 | 0.571 | 0.568 | 0.562 | |
| CO | MAE | 0.074 | 0.061 | 0.061 | 0.061 | 0.061 | 0.062 |
| RMSE | 0.144 | 0.116 | 0.117 | 0.117 | 0.119 | 0.120 | |
| R2 | 0.699 | 0.801 | 0.800 | 0.799 | 0.795 | 0.790 | |
| O3 | MAE | 5.235 | 3.909 | 3.953 | 3.976 | 4.008 | 4.091 |
| RMSE | 9.335 | 5.683 | 5.787 | 5.849 | 5.964 | 6.160 | |
| R2 | 0.773 | 0.913 | 0.910 | 0.908 | 0.905 | 0.899 | |
| PM10 | MAE | 6.263 | 4.861 | 4.875 | 4.877 | 4.948 | 4.955 |
| RMSE | 11.071 | 7.609 | 7.644 | 7.641 | 7.977 | 7.860 | |
| R2 | 0.853 | 0.929 | 0.928 | 0.928 | 0.922 | 0.924 | |
| PM2.5 | MAE | 4.134 | 3.373 | 3.388 | 3.390 | 3.422 | 3.440 |
| RMSE | 6.453 | 4.747 | 4.789 | 4.802 | 4.951 | 4.921 | |
| R2 | 0.839 | 0.912 | 0.911 | 0.910 | 0.905 | 0.906 | |
| NO2 | MAE | 3.000 | 2.479 | 2.482 | 2.478 | 2.478 | 2.500 |
| RMSE | 4.942 | 3.678 | 3.695 | 3.687 | 3.714 | 3.781 | |
| R2 | 0.725 | 0.844 | 0.842 | 0.842 | 0.839 | 0.833 |
Overall Performance in Test Stage.
| Pollutant | Performance | ||||||
|---|---|---|---|---|---|---|---|
| AQI | MAE | 3.124 | 6.001 | 8.649 | 12.843 | 12.069 | 13.420 |
| RMSE | 4.516 | 8.319 | 11.762 | 18.080 | 15.984 | 17.613 | |
| R2 | 0.981 | 0.936 | 0.870 | 0.683 | 0.758 | 0.705 | |
| SO2 | MAE | 0.674 | 0.921 | 1.046 | 1.196 | 1.184 | 1.223 |
| RMSE | 1.277 | 1.600 | 1.744 | 1.922 | 1.894 | 1.934 | |
| R2 | 0.635 | 0.426 | 0.316 | 0.174 | 0.196 | 0.161 | |
| CO | MAE | 0.058 | 0.089 | 0.108 | 0.128 | 0.126 | 0.128 |
| RMSE | 0.104 | 0.150 | 0.174 | 0.205 | 0.194 | 0.196 | |
| R2 | 0.805 | 0.592 | 0.451 | 0.254 | 0.319 | 0.307 | |
| O3 | MAE | 3.765 | 6.268 | 8.227 | 11.182 | 11.183 | 12.223 |
| RMSE | 5.310 | 8.490 | 10.930 | 14.958 | 14.569 | 15.821 | |
| R2 | 0.922 | 0.801 | 0.671 | 0.395 | 0.418 | 0.316 | |
| PM10 | MAE | 5.107 | 8.382 | 10.832 | 14.046 | 13.473 | 14.389 |
| RMSE | 7.746 | 12.428 | 15.813 | 20.445 | 19.210 | 20.758 | |
| R2 | 0.926 | 0.810 | 0.393 | 0.489 | 0.544 | 0.490 | |
| PM2.5 | MAE | 3.456 | 5.010 | 6.209 | 7.654 | 7.423 | 7.889 |
| RMSE | 4.785 | 6.954 | 8.625 | 10.576 | 10.204 | 10.773 | |
| R2 | 0.896 | 7.781 | 0.664 | 0.502 | 0.531 | 0.479 | |
| NO2 | MAE | 2.556 | 3.848 | 4.658 | 5.560 | 5.521 | 5.690 |
| RMSE | 3.759 | 5.392 | 6.356 | 7.628 | 7.328 | 7.505 | |
| R2 | 0.841 | 0.671 | 0.541 | 0.344 | 0.384 | 0.352 |
Figure 13Regression analysis chart of AQI prediction in Douliu City (May 2018).
Overall Performance for time t + 1 to t + 6.
| Pollutant | Performance | ||||||
|---|---|---|---|---|---|---|---|
| AQI | MAE | 3.246 | 5.936 | 8.076 | 9.283 | 10.430 | 11.242 |
| RMSE | 5.983 | 10.110 | 13.426 | 15.140 | 16.466 | 17.262 | |
| R2 | 0.947 | 0.853 | 0.728 | 0.638 | 0.555 | 0.506 | |
| SO2 | MAE | 0.766 | 1.026 | 1.146 | 1.221 | 1.269 | 1.301 |
| RMSE | 1.442 | 1.787 | 1.932 | 2.026 | 2.084 | 2.111 | |
| R2 | 0.592 | 0.380 | 0.282 | 0.217 | 0.171 | 0.146 | |
| CO | MAE | 0.049 | 0.073 | 0.087 | 0.096 | 0.102 | 0.103 |
| RMSE | 0.091 | 0.121 | 0.138 | 0.148 | 0.154 | 0.155 | |
| R2 | 0.735 | 0.528 | 0.391 | 0.302 | 0.249 | 0.243 | |
| O3 | MAE | 1.239 | 6.750 | 8.758 | 10.375 | 12.013 | 13.136 |
| RMSE | 6.857 | 9.611 | 12.035 | 13.946 | 16.103 | 17.554 | |
| R2 | 0.895 | 0.795 | 0.682 | 0.579 | 0.422 | 0.344 | |
| PM10 | MAE | 4.513 | 6.870 | 8.380 | 9.070 | 9.775 | 10.265 |
| RMSE | 7.451 | 10.927 | 12.856 | 13.797 | 14.748 | 15.267 | |
| R2 | 0.852 | 0.674 | 0.530 | 0.444 | 0.361 | 0.131 | |
| PM2.5 | MAE | 2.923 | 3.925 | 4.565 | 4.792 | 5.116 | 5.350 |
| RMSE | 3.991 | 5.276 | 6.108 | 6.368 | 6.785 | 7.036 | |
| R2 | 0.767 | 0.601 | 0.470 | 0.421 | 0.351 | 0.314 | |
| NO2 | MAE | 2.072 | 2.966 | 3.520 | 3.917 | 4.200 | 4.369 |
| RMSE | 3.046 | 4.059 | 4.661 | 5.099 | 5.425 | 5.603 | |
| R2 | 0.786 | 0.622 | 0.503 | 0.406 | 0.328 | 0.281 |
Figure 14Regression analysis chart of AQI prediction in Douliu City (June 2018).
Figure 15Model retraining process.
Comparison of Studies.
| Shaban et al. [ | Chen et al. [ | Zhu et al. [ | This Study | |
|---|---|---|---|---|
| Computing platform | Local | Local | Local | Cloud |
| Prediction interval | 1–24 h in the future | 1 day in the future | 1 h in the future | 1–6 h in the future |
| Prediction target | SO2, O3, NO2 | AQI, PM2.5, PHI, SSI | AQI | AQI, SO2, CO, O3, PM10, PM2.5, NO2 |
| Research method | Machine Learning | Data Mining, Machine Learning | Machine Learning | Machine Learning |
| Algorithm | SVM, M5P, ANN | ANN | SVR | DFR, LR, NNR |
| Early Warning notice | N | N | N | Y |
| Visualization | N | N | N | Y |