| Literature DB >> 29393869 |
Xiaodong Cao1, Piers MacNaughton2, Zhengyi Deng3, Jie Yin4, Xi Zhang5,6, Joseph G Allen7.
Abstract
Twitter provides a rich database of spatiotemporal information about users who broadcast their real-time opinions, sentiment, and activities. In this paper, we sought to investigate the holistic influence of land use and time period on public sentiment. A total of 880,937 tweets posted by 26,060 active users were collected across Massachusetts (MA), USA, through 31 November 2012 to 3 June 2013. The IBM Watson Alchemy API (application program interface) was employed to quantify the sentiment scores conveyed by tweets on a large scale. Then we statistically analyzed the sentiment scores across different spaces and times. A multivariate linear mixed-effects model was used to quantify the fixed effects of land use and the time period on the variations in sentiment scores, considering the clustering effect of users. The results exposed clear spatiotemporal patterns of users' sentiment. Higher sentiment scores were mainly observed in the commercial and public areas, during the noon/evening and on weekends. Our findings suggest that social media outputs can be used to better understand the spatial and temporal patterns of public happiness and well-being in cities and regions.Entities:
Keywords: Twitter; happiness; land use; sentiment analysis; social media
Mesh:
Year: 2018 PMID: 29393869 PMCID: PMC5858319 DOI: 10.3390/ijerph15020250
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Figure 1Geographical distribution of 33 land use types within MA (mapping based on the MassGIS land use data [44]).
Classification of land use in MA.
| Category | Land Use Type | Number of Tweets |
|---|---|---|
| Commercial | Commercial | 35,610 |
| Farmland | Cranberry Bog, Cropland, Orchard, Pasture | 14,951 |
| Industrial | Industrial, Junkyard, Mining, Powerline/Utility, Waste Disposal | 14,292 |
| Nature | Brushland, Forest, Forested Wetland, Non-Forested Wetland, Open Land, Saltwater, Sandy Beach, Saltwater Wetland, Water | 567,086 |
| Public | Urban Public/Institutional | 27,786 |
| Recreation | Golf Course, Marina, Participation Recreation, Spectator Recreation, Water-based Recreation | 21,365 |
| Residential | High Density Residential, Low Density Residential, Medium Density Residential, Multi-Family Residential, Very Low Density Residential | 181,577 |
| Transportation | Transportation | 17,981 |
| Excluded | Cemetery, Nursery, Transitional | 289 |
Figure 2Geographical distribution of tweets with quartiles of sentiment scores in (a) Massachusetts and (b) Greater Boston area. First quartile (−1.000 to −0.410), second quartile (−0.410 to 0.000), third quartile (0.000 to 0.514), and fourth quartile (0.514 to 1.000).
Figure 3Average sentiment scores by land use categories.
Figure 4NSRs by land use categories. NSR: net sentiment rate.
Figure 5Average sentiment scores by (a) hours of the day and (b) days of the week.
Figure 6NSRs by (a) hours of the day and (b) days of the week.
Figure 7NSRs by (a) land uses and times of the day; (b) land uses and days of the week; and (c) times of the day and days of the week.
Summary of fixed effect estimates on sentiment scores.
| Variables | Coefficients of Fixed Effects | Standard Error | ||
|---|---|---|---|---|
| Intercept | −0.111 | 0.007 | <0.0001 | |
| Land use | Commercial | 0.064 | 0.006 | <0.0001 |
| Farmland | 0.000 (referent) | - | - | |
| Industrial | 0.021 | 0.008 | 0.0082 | |
| Nature | 0.026 | 0.006 | <0.0001 | |
| Public | 0.037 | 0.007 | <0.0001 | |
| Recreation | 0.022 | 0.007 | 0.0022 | |
| Residential | 0.026 | 0.006 | <0.0001 | |
| Transportation | 0.016 | 0.007 | 0.0302 | |
| Days of the week | Mon. | 0.004 | 0.002 | 0.0345 |
| Tue. | 0.007 | 0.002 | 0.0004 | |
| Wed. | 0.000 (referent) | - | - | |
| Thurs. | 0.006 | 0.002 | 0.0042 | |
| Fri. | 0.014 | 0.002 | <0.0001 | |
| Sat. | 0.030 | 0.002 | <0.0001 | |
| Sun. | 0.030 | 0.002 | <0.0001 | |
| Times of the day | Late night | 0.012 | 0.004 | 0.0044 |
| Before dawn | 0.000 (referent) | - | - | |
| Morning | 0.034 | 0.003 | <0.0001 | |
| Noon | 0.044 | 0.003 | <0.0001 | |
| Afternoon | 0.046 | 0.003 | <0.0001 | |
| Evening | 0.054 | 0.003 | <0.0001 | |
| Night | 0.040 | 0.003 | <0.0001 | |
Figure 8Temporal frequency of active users by land use categories.
Figure 9Variations of NSRs during the investigated date range.