| Literature DB >> 32288172 |
Chao-Tung Yang1, Cai-Jin Chen1, Yu-Tse Tsan2,3, Po-Yu Liu4, Yu-Wei Chan5, Wei-Chen Chan2.
Abstract
Recently, air pollution has become the primary concern in Taiwan as it significantly affected people's health. Some air pollution monitoring, analysis, and prediction systems were proposed to solve the problem. However, there is very little research to see whether the air quality is associated with the Influenza-Like Illness (ILI) disease or not. In this study, a system is needed, in which the air quality data and the influenza-like illness data can be analyzed together to determine their associations accurately and effectively. In this work, a novel integrated platform was implemented by building a cluster environment based on Hadoop, Spark and a visualization environment with ELK Stack as well as a backup storage system based on Ceph object storage architecture. Also, Sqoop and Alluxio were used to solve the inefficiency problem in processing vast amounts of data. The experimental results showed the visualization of air quality and influenza-like illness data collected from 2016 to 2017 in Taichung, Taiwan. Besides, the association analyses and discussion between air quality and influenza-like illness were also presented.Entities:
Keywords: Alluxio; Association analysis; Ceph; Influenza-like illness; air pollution
Year: 2018 PMID: 32288172 PMCID: PMC7126110 DOI: 10.1016/j.chb.2018.10.009
Source DB: PubMed Journal: Comput Human Behav ISSN: 0747-5632
Fig. 1The HDFS architecture.
Fig. 2The Ceph storage architecture.
Fig. 3The ILI statistics data in Taiwan.
Fig. 4The proposed system architecture.
Fig. 5The data processing architecture.
Fig. 6The architecture of Alluxio with Spark.
Fig. 7The Ceph object storage architecture.
The concentration of pollutants and air quality index values.
| Air Poll. | O3, 8 h | O3 | PM2.5 | PM10 | CO | SO2 | NO2 |
|---|---|---|---|---|---|---|---|
| Unit | Ppm | Ppm | g/m3 | g/m3 | Ppm | Ppb | Ppb |
| 050 | 0.0000.054 | – | 0.0 15.4 | 054 | 04.4 | 035 | 053 |
| 51100 | 0.0550.070 | – | 15.535.4 | 55125 | 4.5 9.4 | 3675 | 54100 |
| 101150 | 0.0710.085 | 0.1250.164 | 35.554.4 | 126–254 | 9.512.4 | 76–185 | 101–360 |
| 151200 | 0.086–0.105 | 0.165 0.204 | 54.5150.4 | 255–354 | 12.515.4 | 186 - 304(3) | 361–649 |
| 201300 | 0.1060.200 | 0.205 0.404 | 150.5250.4 | 355 424 | 15.5 30.4 | 305 604(3) | 650–1249 |
| 301400(2) | – | 0.4050.504 | 250.5350.4 | 425 504 | 30.5 40.4 | 605 804 | 1250 1649 |
| 401500(2) | – | 0.5050.604 | 350.5500.4 | 505–604 | 40.5–50.4 | 805–1004 | 1650–2049 |
Fig. 8The flowchart of Data processing.
Fig. 9The flowchart of data backup processing.
Fig. 10The collected data of air quality and influenza-like illness from 2016 to 2017.
Fig. 11Visualization of the weekly data of air quality and influenza-like illness collected in 2016.
Fig. 12Visualization of the weekly data of air quality and influenza-like illness collected in 2017.
Fig. 13Higher and lower values of AQI and ILI.
Fig. 14Visualization of three weeks delay of ILI.