| Literature DB >> 27649530 |
Zhu Tao1, Aynne Kokas2, Rui Zhang3, Daniel S Cohan3, Dan Wallach1.
Abstract
Although studies have increasingly linked air pollution to specific health outcomes, less well understood is how public perceptions of air quality respond to changing pollutant levels. The growing availability of air pollution measurements and the proliferation of social media provide an opportunity to gauge public discussion of air quality conditions. In this paper, we consider particulate matter (PM) measurements from four Chinese megacities (Beijing, Shanghai, Guangzhou, and Chengdu) together with 112 million posts on Weibo (a popular Chinese microblogging system) from corresponding days in 2011-2013 to identify terms whose frequency was most correlated with PM levels. These correlations are used to construct an Air Discussion Index (ADI) for estimating daily PM based on the content of Weibo posts. In Beijing, the Chinese city with the most PM as measured by U.S. Embassy monitor stations, we found a strong correlation (R = 0.88) between the ADI and measured PM. In other Chinese cities with lower pollution levels, the correlation was weaker. Nonetheless, our results show that social media may be a useful proxy measurement for pollution, particularly when traditional measurement stations are unavailable, censored or misreported.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27649530 PMCID: PMC5029919 DOI: 10.1371/journal.pone.0161389
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Information of Weibo posts and air quality condition in four mega cities of China for this study.
| City | Twitter site ( | Twitter reports starting date | Number of valid days | Average daily Weibo posts | Average daily PM2.5 concentration (μg/ |
|---|---|---|---|---|---|
| Beijing | ../BeijingAir | Feb 19, 2009 | 528 | 89000 | 100.1 |
| Shanghai | ../CGShanghaiAir | May 15, 2012 | 281 | 83000 | 53.4 |
| Guangzhou | ../Guangzhou_Air | Jun 15, 2011 | 462 | 81000 | 54.5 |
| Chengdu | ../CGChengduAir | Jun 28, 2012 | 207 | 21000 | 93.4 |
Fig 1Fraction of Weibo posts containing terms strongly correlated with PM2.5.
From left to right, positively correlated terms: “dust-haze,” “haze,” “misty,” “dusky,” “air pollution,” “degree of pollution”; and negatively correlated terms: “very blue,” “bright,” “good day,” and “sunny.”
Performance of Air Discussion Index in estimating observed PM2.5.
| City | Number of terms in FTS | Learning Period (valid days) | R | Validation Period (valid days) | R |
|---|---|---|---|---|---|
| Beijing | 20 | Jul 23, 2011 –Dec 31, 2012 (438) | 0.805 | Jan 1, 2013 –May 15, 2013 (438) | 0.882 |
| Shanghai | 12 | May 12, 2012 –Dec 31, 2012 (189) | 0.737 | Jan 1, 2013 –May 15, 2013 (51) | 0.633 |
| Guangzhou | 42 | Jul 26, 2011 –Jan 31, 2013 (349) | 0.649 | Feb 1, 2013 –May 15, 2013 (113) | 0.425 |
| Chengdu | 18 | Jul 2, 2012 –Jan 28, 2013 (136) | 0.853 | Feb 1, 2013 –May 15, 2013 (51) | 0.361 |
aFTS: Final Term Set
bR: Correlation Coefficient
Fig 2Correlation between observed and estimated PM2.5 for Beijing during learning (black) and testing (red) period.
Fig 3Comparison of ADI model estimates against U.S. Embassy reported PM2.5 concentration for Beijing.
Fig 4Correlation of daily PM2.5 concentrations between U.S. Embassy and 12 BJ-BEP sites.
The location of the sites is provided in S1 Fig. The color bar is the annual mean concentration.
Fig 5Box plot of daily PM2.5 concentration (μg/m3) reported in different U.S. Embassy (Consulate) in China.