| Literature DB >> 31561489 |
Abstract
The study collected particulate matter (PM)-related documents in Korea and classified main keywords related to particulate matter, health, and social problems using text and opinion mining. The study attempted to present a prediction model for important causes related to particulate matter by using social big-data analysis. Topics related to particulate matter were collected from online (online news sites, blogs, cafés, social network services, and bulletin boards) from 1 January 2015, to 31 May 2016, and 226,977 text documents were included in the analysis. The present study applied machine-learning analysis technique to forecast the risk of particulate matter. Emotions related to particulate matter were found to be 65.4% negative, 7.7% neutral, and 27.0% positive. Intelligent services that can detect early and prevent unknown crisis situations of particulate matter may be possible if risk factors of particulate matter are predicted through the linkage of the machine-learning prediction model.Entities:
Keywords: Particulate Matter; Social Big-Data Analysis; South Korea; health
Mesh:
Substances:
Year: 2019 PMID: 31561489 PMCID: PMC6801971 DOI: 10.3390/ijerph16193607
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Status of online documents related to particulate matter.
| Factors | Variables | Factors | Variables | ||||
|---|---|---|---|---|---|---|---|
| Emotion | Negative | 89,084 | (65.4) | Cause | Dust | 29,258 | (7.5) |
| Neutral | 10,455 | (7.7) | Yellow sand | 100,545 | (25.9) | ||
| Positive | 36,745 | (27.0) | PM10(PM10) | 5260 | (1.4) | ||
| Sub-Total | 136,284 | Powder | 1985 | (0.5) | |||
| No expressions of emotion | 90,693 | Tobacco | 6977 | (1.8) | |||
| Total | 226,977 | Grilling | 581 | (0.1) | |||
| Disease | Common cold | 31,205 | (45.9) | Chinese influence | 11,317 | (2.9) | |
| Lung disease | 12,650 | (18.6) | PM2.5(PM2.5) | 37,529 | (9.7) | ||
| Cardiac disorder | 7082 | (10.4) | Air pollution | 35,386 | (9.1) | ||
| Cerebrovascular Disease | 5419 | (8.0) | Ozone | 7948 | (2.0) | ||
| Hypertension | 3166 | (4.7) | Smog | 46,316 | (11.9) | ||
| Depression | 3413 | (5.0) | Pollutants | 24,427 | (6.3) | ||
| Death disease | 5010 | (7.4) | Carcinogens | 19,628 | (5.0) | ||
| Total | 67,945 | Fossil fuel | 11,081 | (2.8) | |||
| Bacteria | 14,248 | (3.7) | |||||
| Exhaust gas | 19,989 | (5.1) | |||||
| Chemical substances | 16,344 | (4.2) | |||||
| Total | 388,819 | ||||||
Figure 1Random Forest Model of Cause-and-Disease Factor.
Figure 2Decision-Tree Model of Cause-and-Disease Factor.
Figure 3Random Forest Model of Cause-and-Disease Factor.
Figure 4Particulate-Matter Cause-and-Disease Risk Multilayer Neural Network Prediction Model.
Figure 5Disease Multilayer Neural Network Prediction Model for Causes of Particulate Matter.
Association Rule between Cause-and-Disease Factor.
| Rules | Support | Confidence | Lift | Count |
|---|---|---|---|---|
| {Pollutants,Carcinogens,Common_Cold} ≥ {Lung_Disease} | 0.010886566 | 0.646689348 | 11.60344729 | 2471 |
| {Common_Cold,Cardiac_Disorder} ≥ {Lung_Disease} | 0.010269763 | 0.624932976 | 11.21307605 | 2331 |
| {Carcinogens,Exhaust_Gas} ≥ {Chemical_Substances} | 0.010838103 | 0.67937034 | 9.434743122 | 2460 |
| {Fossil_Fuel,Chemical_Substances} ≥ {Exhaust_Gas} | 0.010014231 | 0.72342457 | 8.21455494 | 2273 |
| {Chemical_Substances,Lung_Disease} ≥ {Carcinogens} | 0.010040665 | 0.630254425 | 7.288223893 | 2279 |
| {Pollutants,Common_Cold,Lung_Disease} ≥ {Carcinogens} | 0.010886566 | 0.616055846 | 7.124032395 | 2471 |
| {Dust,Carcinogens,Common_Cold} ≥ {Pollutants} | 0.01085132 | 0.738530735 | 6.862467374 | 2463 |
| {Carcinogens,Common_Cold,Lung_Disease} ≥ {Pollutants} | 0.010886566 | 0.705798343 | 6.558316231 | 2471 |
| {Dust,Pollutants,Lung_Disease} => {Common_Cold} | 0.01085132 | 0.894985465 | 6.509889951 | 2463 |
| {Chemical_Substances,Common_Cold} ≥ {Pollutants} | 0.013538817 | 0.696351688 | 6.470537403 | 3073 |
| {Pollutants,Carcinogens,Lung_Disease} ≥ {Common_Cold} | 0.010886566 | 0.888529306 | 6.46292954 | 2471 |
| {Dust,Yellow_Sand,Lung_Disease} ≥ {Common_Cold} | 0.010410746 | 0.886679174 | 6.449472168 | 2363 |
| {Dust,Carcinogens} ≥ {Pollutants} | 0.022284196 | 0.691739606 | 6.427681687 | 5058 |
| {Lung_Disease,Cardiac_Disorder} ≥ {Common_Cold} | 0.010269763 | 0.865256125 | 6.293646512 | 2331 |
| {Dust,Lung_Disease} ≥ {Common_Cold} | 0.017213198 | 0.850457118 | 6.186002412 | 3907 |
| {Dust,Chemical_Substances} ≥ {Pollutants} | 0.013847218 | 0.663080169 | 6.161376652 | 3143 |
| {Pollutants,Lung_Disease} ≥ {Common_Cold} | 0.017671394 | 0.844065657 | 6.139512595 | 4011 |
| {Bacteria,Chemical_Substances} ≥ {Pollutants} | 0.010829291 | 0.656166578 | 6.097135191 | 2458 |
| {Yellow_Sand,Carcinogens,Common_Cold} ≥ {Pollutants} | 0.010062694 | 0.645562465 | 5.998601201 | 2284 |
| {Bacteria,Lung_Disease} ≥ {Common_Cold} | 0.013640149 | 0.815595364 | 5.932427138 | 3096 |
| {Dust,Yellow_Sand,Carcinogens} ≥ {Pollutants} | 0.010331443 | 0.636536374 | 5.914730276 | 2345 |
| {Air_Pollution,Lung_Disease} ≥ {Pollutants} | 0.010556136 | 0.632690784 | 5.878996853 | 2396 |
| {Dust,Common_Cold,Lung_Disease} ≥ {Pollutants} | 0.01085132 | 0.630406962 | 5.857775453 | 2463 |
| {Air_Pollution,Lung_Disease} ≥ {Common_Cold} | 0.013234821 | 0.793240032 | 5.769820307 | 3004 |
| {Carcinogens,Common_Cold} ≥ {Pollutants} | 0.016834305 | 0.607279085 | 5.642869971 | 3821 |
| {Yellow_Sand,Lung_Disease} ≥ {Common_Cold} | 0.018208893 | 0.739620616 | 5.379806713 | 4133 |
| {Carcinogens,Lung_Disease} ≥ {Common_Cold} | 0.01542447 | 0.722451506 | 5.254923107 | 3501 |
| {Yellow_Sand,Pollutants,Common_Cold} ≥ {Dust} | 0.011657569 | 0.665995469 | 5.166643436 | 2646 |
| {Yellow_Sand,Carcinogens,Common_Cold} ≥ {Dust} | 0.010124374 | 0.649519503 | 5.038826582 | 2298 |
| {Chemical_Substances,Lung_Disease} ≥ {Common_Cold} | 0.011001115 | 0.690542035 | 5.02282197 | 2497 |
| {Pollutants,Carcinogens,Common_Cold} ≥ {Dust} | 0.01085132 | 0.644595656 | 5.000628482 | 2463 |
| {PM2.5, Lung_Disease} ≥ {Common_Cold} | 0.010732365 | 0.664303245 | 4.831967879 | 2436 |
| {Lung_Disease} ≥ {Common_Cold} | 0.036805491 | 0.660395257 | 4.803542196 | 8354 |
| {Pollutants,Common_Cold,Lung_Disease} ≥ {Dust} | 0.01085132 | 0.614061331 | 4.763750045 | 2463 |
| {Yellow_Sand,Pollutants,Carcinogens} ≥ {Dust} | 0.010331443 | 0.609724389 | 4.730105019 | 2345 |
| {Dust,Yellow_Sand,Carcinogens} ≥ {Common_Cold} | 0.010124374 | 0.623778502 | 4.537201505 | 2298 |
| {Pollutants,Exhaust_Gas} ≥ {Air_Pollution} | 0.014107156 | 0.638994213 | 4.098711056 | 3202 |
| {PM10} ≥ {PM2.5} | 0.014446398 | 0.62338403 | 3.770253326 | 3279 |
| {Dust,Carcinogens,Common_Cold} ≥ {Yellow_Sand} | 0.010124374 | 0.689055472 | 1.555519856 | 2298 |
| {Dust,Pollutants,Common_Cold} ≥ {Yellow_Sand} | 0.011657569 | 0.65140325 | 1.470521213 | 2646 |
Figure 6Performance of Machine Learning for Predicting Disease Causes of Particulate Matter.