| Literature DB >> 27751984 |
Quynh C Nguyen1, Dapeng Li, Hsien-Wen Meng, Suraj Kath, Elaine Nsoesie, Feifei Li, Ming Wen.
Abstract
BACKGROUND: Studies suggest that where people live, play, and work can influence health and well-being. However, the dearth of neighborhood data, especially data that is timely and consistent across geographies, hinders understanding of the effects of neighborhoods on health. Social media data represents a possible new data resource for neighborhood research.Entities:
Keywords: Twitter messaging; food; happiness; health behavior; physical activity; social media
Year: 2016 PMID: 27751984 PMCID: PMC5088343 DOI: 10.2196/publichealth.5869
Source DB: PubMed Journal: JMIR Public Health Surveill ISSN: 2369-2960
Descriptive statistics of our national Twitter database, April 2015 to March 2016 (N=79,848,992).
| Mean (SD) | ||
| % Tweets that are happy | 19.9 (6.7) | |
| % Tweets about food | 5.1 (22.0) | |
| % Food tweets about healthy foods | 15.9 (36.6) | |
| % Food tweets about fast food | 9.2 (29.0) | |
| Caloric density of food tweets (per 100 grams) | 238.5 (219.8) | |
| % Food tweets that are happy | 27.0 (44.4) | |
| % Healthy food tweets that are happy | 28.3 (45.0) | |
| % Fast food tweets that are happy | 14.5 (35.2) | |
| % Tweets about physical activity | 1.8 (13.3) | |
| Exercise intensity (per 30 minutes) | 199.1 (117.5) | |
| % Physical activity tweets that are happy | 28.2 (45.0) | |
Figure 1National distribution of happy tweets, by census tract. Geotagged tweets were spatially joined to their 2010 census tract locations and sentiment scores were computed. This color coded map presents the proportion of happy tweets in each census tract, with darker colors signifying higher proportions of happy tweets.
Demographic and economic predictors of happy, food, and physical activity tweets from 70,515 census tracts (data source: 2010 US Census data).
| Tract characteristics | % happy tweets | % healthy food tweets | % fast food tweets | % physical activity tweets | ||||
| Urban (yes) | −.01 | .79 | .01 | .54 | .29 | <.001 | −.02 | <.001 |
| Population density | .06 | <.001 | .04 | .001 | −.03 | <.001 | .00 | .82 |
| % 65 years and older | .02 | .09 | −.03 | <.001 | −.03 | <.001 | .02 | <.001 |
| % 10-24 years | −.02 | .01 | −.05 | <.001 | .00 | .49 | .00 | .14 |
| % Male | .04 | <.001 | .01 | .21 | −.05 | <.001 | .01 | <.001 |
| % African American | −.11 | <.001 | −.03 | <.001 | −.03 | <.001 | −.01 | <.001 |
| % Hispanic | −.04 | .05 | .02 | .00 | .07 | <.001 | .00 | .77 |
| Household size | −.18 | <.001 | −.11 | <.001 | −.07 | <.001 | −.01 | <.001 |
| Economic disadvantageb | −.19 | <.001 | −.09 | <.001 | −.09 | <.001 | −.03 | <.001 |
aAdjusted linear regression included all tract demographic and economic predictors simultaneously. Standard errors accounted for clustering at the county level.
bEconomic disadvantage factor score derived from the following census tract characteristics: percent female-headed households, percent families living in poverty, unemployment rate, percent college graduates (reverse coded), and median family income (reverse coded).
Figure 2Items in the top 50% of food tweets.
Figure 3Items in the top 50% of healthy food tweets.
Zip code and business characteristics as predictors of food tweets and happiness (data sources: 2013 zip code business patterns and 2010 US Census data).
| Zip code characteristics | Average caloric density of food tweets | % fast food tweets | % happy tweets | |||
| Urban (yes) | .08 (.05 to .11) | <.001 | .16 (.12 to .20) | <.001 | −.02 (−.06 to .02) | .29 |
| Population density | .00 (.00 to .01) | .24 | .00 (−.01 to .01) | .86 | .01 (.00 to .03) | .18 |
| Number of businesses | −.01 (−.02 to .01) | .34 | .02 (.00 to .04) | .04 | .11 (.08 to .15) | <.001 |
| Businesses that sell alcohol | −.03 (−.04 to −.02) | <.001 | −.04 (−.05 to −.04) | <.001 | −.01 (−.02 to .00) | .02 |
| Full service restaurants | −.04 (−.06 to −.02) | <.001 | .01 (−.01 to .03) | .43 | .16 (.13 to .20) | <.001 |
| Fast food restaurants | .08 (.06 to .10) | <.001 | .15 (.13 to .17) | <.001 | −.16 (−.20 to −.12) | <.001 |
| Grocery stores | .01 (.00 to .01) | .28 | −.04 (−.05 to −.03) | <.001 | −.02 (−.04 to .00) | .05 |
| Convenience stores | .02 (.01 to .02) | <.001 | −.03 (−.04 to −.02) | <.001 | −.07 (−.08 to −.05) | <.001 |
aAdjusted linear regression included all zip code and business characteristics simultaneously. Standard errors accounted for clustering at the county level.
Figure 4Items in the top 75% of physical activity tweets.
Zip code and business characteristics as predictors of physical activity tweets and happiness (data sources: 2013 zip code business patterns and 2010 US Census data).
| Zip code characteristics | % physical activity tweets | Exercise intensity | % happy tweets | |||
| Urban (yes) | −.09 (−.11 to −.07) | <.001 | .07 (.04 to .11) | <.001 | −.08 (−.12 to −.04) | <.001 |
| Population density | −.01 (−.02 to .00) | .01 | −.01 (−.01 to .00) | .03 | .01 (.00 to .02) | .08 |
| Fitness/recreational centers | .01 (.00 to .02) | .003 | .05 (.04 to .06) | <.001 | .07 (.06 to .08) | <.001 |
| Nature parks | .01 (.00 to .02) | .05 | −.01 (−.01 to .00) | .21 | .03 (.02 to .04) | <.001 |
| Zoos/botanical gardens | .00 (.00 to .01) | .19 | .00 (−.01 to .00) | .35 | .02 (.01 to .03) | <.001 |
| Golf/country clubs | .03 (.02 to .03) | <.001 | −.05 (−.06 to −.04) | <.001 | .03 (.02 to .04) | <.001 |
| Skiing facilities | .04 (.04 to .05) | <.001 | .02 (.02 to .03) | <.001 | .03 (.02 to .03) | <.001 |
| Bowling centers | −.01 (−.02 to −.01) | <.001 | −.01 (−.02 to .00) | .01 | −.02 (−.03 to −.01) | <.001 |
aAdjusted linear regression included all zip code and business characteristics simultaneously. Standard errors accounted for clustering at county level.
Twitter happiness as a predictor of health outcomes in 232 zip codes in Utah (data source: Utah Behavioral Risk Factor Surveillance System [BRFSS] survey 2009-2014. BRFSS underwent design feature changes. Life dissatisfaction values were only available for 2009 and 2010. All other variables were averages from available data from 2011-2014).
| Zip code health outcomes | Beta (95% CI)a | |
| Life dissatisfaction | .01 (−.13 to .15) | .91 |
| Self-rated health (higher score=worse health) | −.08 (−.21 to .05) | .21 |
| Any past month physical activity/exercise | .13 (.00 to .26) | .05 |
| Body mass index (kg/m2) | −.13 (−.26 to −.01) | .04 |
aSeparate linear regression models for each zip code health outcome.
State level Twitter sentiment predictors of health outcomes (N=49 states in the contiguous United States plus District of Columbia. Data sources: 2013 National Vital Statistics Reports and 2013 Behavioral Risk Factor Surveillance System [BRFSS] survey on adults).
| Twitter predictor variables | ||||||
| State-level adult health outcomes | Happiness | Positive sentiment toward healthy foods | Positive sentiment toward physical activity | |||
| All-cause mortality per 100,000 | −32.34 | .03 | −23.51 | .01 | −25.37 | .004 |
| Homicide per 100,000 | −1.02 (−1.98 to −.06) | .03 | −.76 (−1.28 to −.25) | .01 | −.75 (−1.28 to −.23) | .01 |
| % With diabetes | −.58 (−1.05 to −.12) | .02 | −.52 (−.78 to −.27) | <.001 | −.41 (−.68 to −.14) | .004 |
| % With obesity | −2.27 (−3.35 to −1.18) | <.001 | −1.67 (−2.25 to −1.09) | <.001 | −1.43 (−2.05 to −.80) | <.001 |
| % Poor/fair self-rated health | −1.13 (−2.13 to −.13) | .03 | −.77 (−1.36 to −.19) | .01 | −.61 (−1.21 to −.02) | .05 |
| % With high cholesterol | −.78 (−1.66 to .11) | .08 | −.51 (−1.04 to .01) | .06 | −.75 (−1.25 to −.26) | .003 |
| % Physical inactivity | −2.46 (−4.80 to −.12) | .04 | −2.32 (−3.61 to −1.03) | .001 | −1.59 (−2.97 to −.22) | .02 |
| % Current smoking | −1.47 (−2.68 to −.27) | .02 | −1.20 (−1.88 to −.52) | .001 | −1.14 (−1.82 to −.45) | .002 |
aEach cell in the table represents the coefficient estimate of the predictor variable (given by the column) on the state-level health outcome (given by the row). Adjusted linear regression models controlled for state-level demographics: median age, % non-Hispanic white, median household income.
State level Twitter food and physical activity characteristics as predictors of health outcomes (N=49 states in the contiguous United States plus District of Columbia. Data sources: 2013 National Vital Statistics Reports and 2013 Behavioral Risk Factor Surveillance System [BRFSS] survey on adults).
| State-level adult health outcomes | ||||||
| Twitter predictors | All-cause mortality per 100,000 | % with obesity | % poor/fair self-rated health | |||
| % Of food tweets about healthy food | 11.74 (−6.48 to 29.96) | .20 | −.09 (−.64 to .45) | .73 | .11 (−.48 to .70) | .71 |
| % Of food tweets about fast food | 9.84 (−8.56 to 28.25) | .29 | .68 (.13 to 1.23) | .02 | .77 (.18 to 1.37) | .01 |
| % Of tweets about physical activity | −28.17 (−46.68 to −9.65) | .004 | −1.86 (−2.41 to −1.31) | <.001 | −.89 (−1.49 to −.29) | .01 |
aAdjusted linear regression models were run separately for each state-level health outcome (column) and included all three predictors (row) simultaneously in addition to the following state-level control variables: median age, % non-Hispanic white, median household income. Beta coefficient represents a change in the outcome for every standard deviation change in the predictor (row variable).