| Literature DB >> 29107961 |
Hsien-Wen Meng1, Suraj Kath2, Dapeng Li3, Quynh C Nguyen4.
Abstract
PURPOSE: We examined openly shared substance-related tweets to estimate prevalent sentiment around substance use and identify popular substance use activities. Additionally, we investigated associations between substance-related tweets and business characteristics and demographics at the zip code level.Entities:
Mesh:
Year: 2017 PMID: 29107961 PMCID: PMC5673183 DOI: 10.1371/journal.pone.0187691
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Descriptive statistics of substance-related tweets.
| Happiness score | N | Mean (Standard Deviation) | Percent of tweets that are happy |
|---|---|---|---|
| All substance related tweets | 687,495 | 0.69 (0.20) | 34.1% |
| Alcohol tweets | 638,347 | 0.70 (0.20) | 35.5% |
| Smoking tweets | 14,256 | 0.67 (0.19) | 26.4% |
| Drug tweets | 36,284 | 0.56 (0.22) | 12.3% |
| Underage engagement tweets | 509 | 0.64 (0.23) | 28.1% |
* Happiness scores ranged from 0 (sad) to1 (happy). Tweets with scores ≥ 0.80 were classified as “happy.”
** Includes both controlled and recreational substances.
Popular items for alcohol, smoking, and substance use tweets in descending order.
| Alcohol (n = 608809) | Smoking (n = 14126) | Drug (n = 34437) | All substance-related tweets (n = 654201) | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| beer | 287,201 | 47.2% | cigars | 8431 | 59.7% | get high | 7668 | 22.3% | beer | 317,706 | 48.6% |
| drunk | 77,562 | 12.7% | tobacco | 4386 | 31.1% | smok*weed | 6414 | 18.6% | drunk | 77,562 | 11.9% |
| winery | 32,057 | 5.3% | smok* cigarette* | 999 | 7.1% | Stoner | 6259 | 18.2% | cocktail | 44,945 | 6.9% |
| beers | 30,505 | 5.0% | beer | 310 | 2.2% | Cocaine | 5625 | 16.3% | winery | 32,057 | 4.9% |
| cocktail | 23,343 | 3.8% | Heroin | 2654 | 7.7% | tequila | 21,379 | 3.3% | |||
| cocktails | 21,602 | 3.6% | crack head / crackhead | 1653 | 4.8% | alcohol | 20,534 | 3.1% | |||
| tequila | 21,379 | 3.5% | Shrooms | 1352 | 3.9% | champagne | 18,319 | 2.8% | |||
| alcohol | 20,534 | 3.4% | got high | 1139 | 3.3% | margarita | 14,877 | 2.3% | |||
| champagne | 18,319 | 3.0% | smok* pot | 908 | 2.6% | vodka | 13,228 | 2.0% | |||
| margarita | 14,877 | 2.4% | on crack | 765 | 2.2% | martini | 10,836 | 1.7% | |||
| vodka | 13,228 | 2.2% | margaritas | 10,564 | 1.6% | ||||||
| martini | 10,836 | 1.8% | rum | 10,151 | 1.6% | ||||||
| margaritas | 10,564 | 1.7% | booze | 8,634 | 1.3% | ||||||
| rum | 10,151 | 1.7% | cigars | 8,431 | 1.3% | ||||||
| booze | 8,634 | 1.4% | get drunk | 8,017 | 1.2% | ||||||
| get drunk | 8,017 | 1.3% | get high | 7,668 | 1.2% | ||||||
| gin | 7,509 | 1.2% | |||||||||
| smok* weed | 6,414 | 1.0% | |||||||||
| cocaine | 5,625 | 0.9% | |||||||||
| bloody mary | 5,195 | 0.8% | |||||||||
| whisky | 4,550 | 0.7% | |||||||||
Keywords extracted from a total of 687495 tweets. These terms represent 90% of tweets in their respective groups. Items with an asterisk include any term stemming from that term. For example, “smok*” includes the following terms: smoked, smoker, smoking. A tweet could have more than one substance keyword.
Most commonly tweeted substance terms among adolescents and young adults.
| drunk | 194 | 34% |
| beer | 58 | 10% |
| alcohol | 57 | 10% |
| prom + weed | 26 | 5% |
| get drunk | 24 | 4% |
| champagne | 20 | 3% |
| winery | 17 | 3% |
| vodka | 16 | 3% |
| booze | 13 | 2% |
| smok* weed | 12 | 2% |
| got drunk | 11 | 2% |
| prom + booze | 11 | 2% |
| cocaine | 10 | 2% |
| beers | 9 | 2% |
| tequila | 9 | 2% |
| brandy | 8 | 1% |
| alcoholic | 7 | 1% |
| cocktail | 6 | 1% |
| tobacco | 6 | 1% |
These terms composed 90% of underage substance tweets. They are listed in descending order of popularity. Underage tweets (n = 509) can mention more than one substance. In total, there were 572 substance mentions among underage tweets.
Business and compositional predictors of alcohol, smoking, substance, and underage engagement tweets.
| Zip code characteristics | % of alcohol tweets (R-squared: 0.04) | % of smoking tweets (R-squared: 0.02) | % of drug tweets (R-squared: 0.05) | % of underage tweets (R-squared: 0.44) |
|---|---|---|---|---|
| Beta (95% CI) | Beta (95% CI) | Beta (95% CI) | Beta (95% CI) | |
| Total number of businesses | -0.05 (-0.08, -0.02) | 0.00 (-0.04, 0.04) | -0.03 (-0.06, 0.00) | -0.04 (-0.17, 0.08) |
| Alcohol places | 0.01 (0.00, 0.03) | 0.00 (-0.02, 0.01) | 0.00 (-0.02, 0.01) | 0.00 (-0.04, 0.03) |
| Full service restaurants | 0.03 (0.01, 0.06) | 0.01 (-0.02, 0.05) | -0.02 (-0.05, 0.01) | 0.00 (-0.09, 0.09) |
| Fast food restaurants | 0.02 (-0.02, 0.05) | -0.02 (-0.07, 0.03) | 0.03 (0.00, 0.07) | 0.05 (-0.07, 0.18) |
| Grocery stores and supermarkets | 0.01 (-0.01, 0.03) | 0.00 (-0.02, 0.03) | 0.00 (-0.01, 0.02) | 0.01 (-0.08, 0.09) |
| Convenience stores | 0.06 (0.05, 0.08) | 0.00 (-0.02, 0.02) | -0.01 (-0.02, 0.01) | -0.01 (-0.06, 0.04) |
| Urban (yes/no) | -0.11 (-0.15, -0.07) | 0.02 (-0.08, 0.12) | -0.22 (-0.27, -0.16) | -0.17 (-0.41, 0.07) |
| Population size | -0.11 (-0.13, -0.09) | -0.06 (-0.10, -0.02) | -0.08 (-0.11, -0.05) | -0.17 (-0.28, -0.06) |
| Population density | 0.01 (-0.01, 0.02) | 0.01 (-0.01, 0.04) | 0.01 (0.00, 0.03) | 0.04 (-0.03, 0.11) |
| Percent 65 years+ | 0.02 (-0.01, 0.04) | 0.03 (-0.02, 0.09) | 0.01 (-0.03, 0.04) | 0.03 (-0.13, 0.19) |
| Percent 10–24 years | -0.06 (-0.08, -0.03) | 0.02 (-0.02, 0.07) | -0.06 (-0.09, -0.03) | 0.14 (0.06, 0.23) |
| Percent male | 0.07 (0.04, 0.09) | 0.13 (0.08, 0.19) | 0.01 (-0.03, 0.05) | 0.02 (-0.17, 0.21) |
| Percent African American | -0.05 (-0.06, -0.03) | 0.01 (-0.03, 0.05) | -0.06 (-0.08, -0.04) | -0.17 (-0.27, -0.08) |
| Percent Hispanic | -0.01 (-0.03, 0.01) | -0.05 (-0.09, -0.01) | -0.06 (-0.08, -0.03) | -0.17 (-0.26, -0.08) |
| Household size | -0.03 (-0.06, 0.00) | 0.13 (0.08, 0.19) | 0.06 (0.03, 0.10) | 0.08 (-0.06, 0.21) |
| Unemployment rate | -0.05 (-0.08, -0.02) | -0.06 (-0.14, 0.02) | 0.08 (0.03, 0.12) | -0.16 (-0.34, 0.02) |
| Percent less than high school graduate | 0.01 (-0.02, 0.03) | -0.07 (-0.15, 0.00) | -0.03 (-0.07, 0.02) | 1.01 (0.85, 1.16) |
| Median family income, 4th quartile (highest) | 0.08 (0.02, 0.13) | -0.11 (-0.25, 0.03) | -0.20 (-0.28, -0.12) | -0.02 (-0.33, 0.28) |
| Median family income, 3rd quartile | 0.02 (-0.03, 0.07) | -0.07 (-0.20, 0.05) | -0.15 (-0.22, -0.07) | -0.03 (-0.30, 0.24) |
| Median family income, 2nd quartile | 0.00 (-0.05, 0.04) | 0.07 (-0.05, 0.18) | -0.10 (-0.17, -0.03) | -0.04 (-0.31, 0.23) |
| Sample size | 17377 | 3924 | 9151 | 425 |
All variables are standardized to have a mean of 0 and standard deviation of 1.
*p<0.05.
Distribution of substance-related tweets, by month.
| January | 5.1% | 5.7% | 2.1% |
| February | 6.2% | 7.0% | 2.0% |
| March | 2.7% | 3.2% | 0.8% |
| April | 20.1% | 18.5% | 56.7% |
| May | 11.6% | 9.8% | 10.5% |
| June | 6.0% | 5.7% | 4.6% |
| July | 10.9% | 10.0% | 6.3% |
| August | 9.0% | 9.0% | 4.9% |
| September | 6.5% | 6.5% | 3.3% |
| October | 8.2% | 8.5% | 3.2% |
| November | 7.2% | 8.5% | 3.0% |
| December | 6.7% | 7.7% | 2.6% |
Tweets were collected from the continental United States from April 2015 to March 2016. Each entry denotes the percent of tweets for the column (alcohol, smoking, drug) in a month. For instance, 20.1% of all alcohol-related tweets were captured in April.
Fig 1Percent of alcohol-related tweets that are happy, by state.