| Literature DB >> 28363883 |
Brianna A Lienemann1, Jennifer B Unger1, Tess Boley Cruz1, Kar-Hai Chu1.
Abstract
BACKGROUND: As Twitter has grown in popularity to 313 million monthly active users, researchers have increasingly been using it as a data source for tobacco-related research.Entities:
Keywords: Internet; review; social marketing; tobacco
Mesh:
Year: 2017 PMID: 28363883 PMCID: PMC5392207 DOI: 10.2196/jmir.7022
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
Tobacco search terms.
| Search terma | Tobacco products covered by search term |
| tobacco | Tobacco, smokeless tobacco, chewing tobacco, dissolvable tobacco |
| nicotine | Nicotine, electronic nicotine delivery system |
| cig* | Cigarette, cigar, little cigar, large cigar, cigarillo, electronic cigarette, e-cigarette, e-cig |
| pipe | Pipe, waterpipe |
| bidi | Bidi |
| kretek | Kretek |
| shisha | Shisha |
| hookah | Hookah, e-hookah, hookah pen |
| narghile | Narghile |
| argileh | Argileh |
| cheroot | Cheroot |
| smok* | Smoke, smokeless tobacco, smoking, smoker |
| chew | Chew, chewing tobacco |
| snuff | Snuff, dry snuff, moist snuff |
| snus | Snus |
| betel quid | Betel quid |
| gutkha | Gutkha |
| zarda | Zarda |
| toombak | Toombak |
| dissolvable | Dissolvable, dissolvable tobacco |
| ENDS | ENDS (electronic nicotine delivery system) |
| vap* | Vape, vaper, vape pen, vaping, vapor |
aAsterisk (*) represents stemmed words; for example, cig* would capture all words beginning with cig.
Figure 1PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) diagram of articles included in the systematic review.
Data collection methods.
| Article | Date collected | Type of tweets or accounts | Keyword selectiona,b | Data source | Retrieval precision | Retrieval recall |
| [ | March 22 to June 27, 2015 | Tweets about the California Department of Public Health “Still Blowing Smoke” | #stillblowingsmoke, stillblowingsmoke, “still | Gnip | 97.5% | NRc |
| [ | May 1, 2013, to May 1, 2014 | E-cigarettes | vaping, vape, vaper, vapers, vapin, vaped, evape, vaporing, e-cig*, ecig*, e-pen, epen, e-juice, ejuice, e-liquid, eliquid, cloud chasing, cloudchasing, deeming AND regulation, deeming AND FDA, deemed AND FDA, deem* AND FDA | Gnip | 59.23% | NR |
| [ | July 1, 2008, to February 28, 2013 | E-cigarettes | Radian6 | 91% | 93% | |
| [ | December 5, 2011, to July 17, 2012 (15-day intervals) | Tobacco | cig*, nicotine, smok*, tobacco; hookah, shisha, | Twitter’s | 57.25% | 95%-99% |
| [ | January 1 to December 31, 2014 | E-cigarettes and smoking cessation | Sysomos | NR | NR | |
| [ | February 1 to April 30, 2014 | Blu e-cigarettes’ tweets and retweets | @blucigs | Twitter REST API | NR | NR |
| [ | April 12 to May 10, 2014 | Hookah or shisha | hookah, #hookah, shisha, #shisha, hooka, #hooka, sheesha, #sheesha | Simply | 99.56% | NR |
| [ | November 1, 2011, to August 31, 2013 | Hookah, cigarettes, and cigars | cigar, cigars, cigarette, cigarettes, hookah, waterpipe, water pipe, shisha, sheesha | Twitter’s | NR | NR |
| [ | October 4 to November 3, 2010 | Tobacco | Smoking, tobacco, cigarette, cigar, hookah, hooka | Twitter’s | NR | NR |
| [ | May 1, 2012, to June 30, 2012 | E-cigarettes | Gnip | >99% of a random sample of 500 tweets | NR | |
| [ | December 6, 2012, to June 20, 2013 | Tobacco or cessation price promotion | Gnip | 56.94% | NR | |
| [ | July 2014 | Slogans for the Dutch health campaign “Smoking is so outdated” (Roken kan echt niet meer) | #rokenkanechtnietmeer [#smokingissooutdated] | Twitter’s | NR | NR |
| [ | December 2013 | Little cigars | Swisher Sweets, Black & Milds | Twitonomy | 67.50% | NR |
| [ | September 2012 and January to May 2013 | Genetic information on smoking | genetic, smoking | NR | 49.1% | NR |
| [ | August 2010 | Smoking cessation accounts | NR | NR | NR | |
| [ | January 8-15, 2014 | Tweets about Chicago | @ChiPublicHealth | twitteR package for R and NodeXL | NR | NR |
| [ | January 2010 to January 2015 | E-cigarettes | vape, ecig, ecigarette, vaping, ejuice, vapers, drip AND tip, dripping, eliquid AND flavor, e AND juice, e AND liquid, smoke AND free, off AND cigarettes, ex AND smoker, no AND analogs, I AND quit | NR | NR | NR |
| [ | January 2012 to December 2014 | E-cigarettes | e(-)cig, e(-)cigarette, electronic cigarette, | Twitter’s Streaming API | 81% to 90.8% for 4 groups | NR |
| [ | September to December 2013 and March 2015 | E-cigarettes | Electronic-cigarette, e-cig, e-cigarette, e-juice, | Twitter’s Streaming API and Twitter’s Firehose | 97.21% | 86.63% |
| [ | April 21 to October 20, 2014 | Blu and V2 e-cigarettes’ tweets and retweets | @blucigs, @v2cigs | NR | 100% | NR |
| [ | July 7 to 21, 2014 | Tweets about the Centers for Disease Control and | #cdctips, CDC AND smoking | Social | 81.70% | NR |
| [ | May 1, 2013, to May 1, 2014 | E-cigarettes | vaping, vape, vaper, vapers, vapin, vaped, evape, vaporing, e-cig*, ecig*, e-pen, epen, e-juice, ejuice, e-liquid, eliquid, cloud chasing, cloudchasing, deeming AND regulation, deeming AND FDA, deemed AND FDA, deem* AND FDA | Gnip | 59.23% | NR |
| [ | March to June 2013 | Tobacco control program tweets during the months that the national CDC Tips smoking cessation campaign aired | Radian6 | NR | NR | |
| [ | March 15 to June 9, 2012 | Tweets about the CDC’s Tips campaign | Gnip | 78.87% | 94% | |
| [ | February 5-12, 2014 | CVS Health-related tweets surrounding the | #cvs, #cvsquits | Twitter’s Streaming API | 72.38% | NR |
| [ | 50 most recent tweets from July 18, 2012 | Smoking cessation accounts | NR | NR | NR | |
| [ | February 23 to April 9, 2015 | Exposure to secondhand e-cigarette aerosol | “secondhand vape” OR “secondhand vaping” OR “second-hand vape” OR “second-hand vaping” OR “vape smoke” OR “ecig smoke” OR “e-cig smoke” OR “e-cigarette smoke” OR “vape shs” OR “ecig shs” OR “vape | NCapture | NR | NR |
aAsterisk (*) represents stemmed words; for example, cig* would capture all words beginning with cig.
bWords in italics were not keywords used for searches.
cNR: not reported.
dAPI: application programming interface.
Coding methods.
| Article | Coding method | No. of coders | No. of tweets coded | Coded retweets | No. of Twitter accounts | Followed URLs | Coding |
| [ | Hand-coded by researchers | 1: all tweets; | 2248: relevance; | Yes | NRa | No | 91%: |
| [ | Hand-coded by researchers | 6: for a subset of 250 tweets; | 17,098: relevance; | Yes, if additional context | NR | Yes | κ=.64 to .70 |
| [ | Machine learning with initial hand-coding; Python Scikit-Learn | NR | 1,669,123 | Yes | NR | Yes | NR |
| [ | Machine learning and hand-coding; naïve Bayes, | 2: pilot of 1000; | 7362: relevance; | Retweeted posts were only | NR | NR | κ>.70 for the random subset of 150 |
| [ | Hand-coded by researchers | 1: all tweets; | 300: complete sample; | Yes | 148: complete sample; | Yes | κ=.74 |
| [ | Hand-coded by researchers | 2 | NR | Yes | Approximately 3400 | NR | NR |
| [ | Crowdsourcing with initial hand-coding | 3 | 5000: relevance; | NR | 3804 | NR | κ=.66 to .85 among a subset coded by researchers |
| [ | Topic modeling with machine learning; | NR | 319,315: total; | NR | NR | NR | NR |
| [ | Topic modeling (LDA) with | NR | 4962 | NR | NR | NR | NR |
| [ | Machine learning and hand-coding; DiscoverText | 2: for a subset of 500 for relevance, 4500 for commercial versus | 73,672 | Yes | 23,700 | Yes, hand-coded tweets with URLs | κ=.87 to .93 |
| [ | Hand-coded by researchers | 1: all; | 5000: relevance; | NR | NR | Yes | κ=.64 to 1.00 |
| [ | Hand-coded by researchers | 1: all tweets; | 133 | No | NR | NR | alpha = .61 to 1.00 |
| [ | Hand-coded by researchers | 3 | 3935: relevance, | No | 346 | Yes | κ=.64 to .91 |
| [ | Hand-coded by researchers; wordcloud R package | NR | 171: relevance; | NR | 84 | NR | NR |
| [ | Hand-coded by researchers | 1: all tweets; | 143,287: identified; | NR | 153 | Yes | >90% |
| [ | Hand-coded by researchers | 2 | 684 | Yes | 306 | Yes | NR |
| [ | Machine learning and hand-coding; naïve Bayes, | 1: all tweets; | 13,146 | NR | 2147 | No, removed URLs | κ=.87 for subsample |
| [ | Machine learning and hand-coding; human detection algorithm; | 2: for all tweets from 500 automated accounts and 500 organic | 850,000 | Yes | 131,622: | No, but the | 94.6% true- |
| [ | Machine learning with initial hand-coding; Python Scikit-Learn; | 2: for a subset of 1000 profiles | 224,000 in 2013 sample; | Yes | 34,000 in 2013 sample; | No; metadata on the presence of URL links | κ=.88 |
| [ | Hand-coded by researchers and MySQL pattern matcher | NR | 1180 | Yes | 2: Blu and V2; | NR | NR |
| [ | Hand-coded by researchers | 1: all tweets; | 2191: relevance; | Yes | NR (>21) | NR | κ=.95 for 20% |
| [ | Machine learning with initial hand-coding; naïve Bayes classifier, k-nearest | 6: for a subset of 250 tweets; | 17,098: relevance; | Yes, if additional context | NR | NR | κ=.64 to .70 |
| [ | Hand-coded by researchers | 3 | 1776 | No | 16 | Yes | For 5% of data, 95.7%; |
| [ | Machine learning with initial hand-coding; naïve Bayes classifier | 2: subset of 450 tweets for relevance; | 245,319: relevance; | NR | 166,857 | NR; metadata on the presence of URL links | κ=.93 |
| [ | Hand-coded by researchers | 1: all tweets; | 8645: relevance; | Yes | NR | Yes | 90% for a 1% sample of tweets |
| [ | Hand-coded by researchers | 2 | 900, with 50 tweets per account | Yes | 18 | NR | 84% |
| [ | Hand-coded by researchers | 2 | 1519 | No | 1321 | Yes | κ=.84 |
aNR: not reported.
Coded categories.
| Category type | Category | Number of articles and percent of totala
| Articles | |
| Relevance | Relevant versus nonrelevant | 16 (59) | [ | |
| Sentiment | 9 (33) | [ | ||
| Positive or negative (ie, supportive or against) | 6 (22) | [ | ||
| Positive or negative (ie, emotional tone) | 2 (7) | [ | ||
| Positive or negative valence | 1 (4) | [ | ||
| Neutral or unknown | 6 (22) | [ | ||
| Message attitude | Pro or con | 1 (4) | [ | |
| Type of utterance | Comparison versus attribution versus metonymy | 1 (4) | [ | |
| Topics, themes, or genres | 21 (78) | [ | ||
| Joke or humorous | 3 (11) | [ | ||
| Song or music | 2 (7) | [ | ||
| Profanity | 1 (4) | [ | ||
| Social relationships | 2 (7) | [ | ||
| Sex or romance | 1 (4) | [ | ||
| Image or stereotype | 1 (4) | [ | ||
| Risky behaviors or other substances | 6 (22) | [ | ||
| Illicit substance use in e-cigarettes | 2 (7) | [ | ||
| Preference for another substance | 1 (4) | [ | ||
| Affiliation and preference | 1 (4) | [ | ||
| Flavors | 7 (26) | [ | ||
| Pleasure | 1 (4) | [ | ||
| Tastes good | 1 (4) | [ | ||
| Craving, desire, and need | 5 (19) | [ | ||
| Addiction | 1 (4) | [ | ||
| Type of tobacco product | 4 (15) | [ | ||
| Type of tobacco product brand | 1 (4) | [ | ||
| E-cigarettes’ smoke-free aspect | 1 (4) | [ | ||
| Health, safety, harms | 9 (33) | [ | ||
| Downplayed or refuted harms, harm reduction | 2 (7) | [ | ||
| E-cigarettes for smoking cessation | 7 (26) | [ | ||
| Cessation | 5 (19) | [ | ||
| Cessation product | 2 (7) | [ | ||
| Socioemotional support tweets regarding quitting smoking | 1 (4) | [ | ||
| Encouraging or engaging tweets regarding quitting smoking | 1 (4) | [ | ||
| Clinical practice guidelines for treating tobacco dependence | 1 (4) | [ | ||
| Demonstration | 1 (4) | [ | ||
| 8 (30) | [ | |||
| Use: general | 2 (7) | [ | ||
| First-person use or intent | 5 (19) | [ | ||
| Second- or third-person experience | 4 (15) | [ | ||
| Starting use or smoking initiation | 3 (11) | [ | ||
| Recent use | 1 (4) | [ | ||
| Underage use | 3 (11) | [ | ||
| Parental use | 2 (7) | [ | ||
| Does not use or does not want to use | 1 (4) | [ | ||
| Secondhand smoke | 1 (4) | [ | ||
| Rejection and prevention | 1 (4) | [ | ||
| Disgust, unattractive, or uncool | 2 (7) | [ | ||
| Policy, government, regulation, activism, politics | 7 (26) | [ | ||
| Normalization versus discouragement | 1 (4) | [ | ||
| Getting others started or advocating use | 1 (4) | [ | ||
| Attempt to engage other Twitter users | 1 (4) | [ | ||
| Fear appeals | 1 (4) | [ | ||
| Lies or propaganda | 2 (7) | [ | ||
| Advertisement, promotion, marketing, industry, commercial | 12 (44) | [ | ||
| Offering advice | 1 (4) | [ | ||
| Personal opinion or communication | 6 (22) | [ | ||
| News or update | 4 (15) | [ | ||
| Information | 5 (19) | [ | ||
| Science or scientific publication | 2 (7) | [ | ||
| Cultural reference | 1 (4) | [ | ||
| Issue salience | 1 (4) | [ | ||
| Commodity | 1 (4) | [ | ||
| Connoisseurship | 1 (4) | [ | ||
| Cheaper than smoking | 1 (4) | [ | ||
| Money | 1 (4) | [ | ||
| Price promotion, discount, coupon | 4 (15) | [ | ||
| Backgrounded | 1 (4) | [ | ||
| Other or undetermined | 2 (7) | [ | ||
| Domains smoking was compared with for | Personal features; hobby or hype; person or group; social norm; big event; technology and innovation; sex or relation; eating, drinking, and stimulants; school; transport; and campaign | 1 (4) | [ | |
| Links (URLs) | Most common links | 1 (4) | [ | |
| Location of use | 1 (4) | [ | ||
| Class | 1 (4) | [ | ||
| House, room, bed | 1 (4) | [ | ||
| School | 1 (4) | [ | ||
| Public | 1 (4) | [ | ||
| Bathroom | 1 (4) | [ | ||
| Work | 1 (4) | [ | ||
| In front of someone | 1 (4) | [ | ||
| Car | 1 (4) | [ | ||
| Restaurant | 1 (4) | [ | ||
| Movie theater | 1 (4) | [ | ||
| Airplanes or airport | 1 (4) | [ | ||
| Store | 1 (4) | [ | ||
| Bars or clubs | 1 (4) | [ | ||
| Dormitory | 1 (4) | [ | ||
| Library | 1 (4) | [ | ||
| Mall | 1 (4) | [ | ||
| Bowling alley | 1 (4) | [ | ||
| Café or coffee shop | 1 (4) | [ | ||
| Hospital | 1 (4) | [ | ||
| Locker room | 1 (4) | [ | ||
| Topic modeling | 2 (7) | [ | ||
| Hookah topic 1: social locations, leisure time, and positive affect | 1 (4) | [ | ||
| Hookah topic 2: fun, leisure time, and sociability | 1 (4) | [ | ||
| Cigarette topic 1: death and unpleasant smell | 1 (4) | [ | ||
| Cigar topic 1: positive affect and enjoyment | 1 (4) | [ | ||
| Cigar topic 2: luxury alcohol products | 1 (4) | [ | ||
| Tobacco topic 1: tobacco use and substance use | 1 (4) | [ | ||
| Tobacco topic 2: addiction recovery | 1 (4) | [ | ||
| Tobacco topic 3: addiction recovery and tobacco promotion by clubs or bars | 1 (4) | [ | ||
| Tobacco topic 4: tobacco promotion by bars or clubs and | 1 (4) | [ | ||
| Tobacco topic 5: antismoking and addiction recovery | 1 (4) | [ | ||
| User or account | 10 (37) | [ | ||
| Government | 3 (11) | [ | ||
| Foundations or nonprofit organizations | 4 (15) | [ | ||
| Public health and health care | 1 (4) | [ | ||
| Researcher or research center | 2 (7) | [ | ||
| 5 (19) | [ | |||
| Reputable news source | 2 (7) | [ | ||
| Press, media, or news | 3 (11) | [ | ||
| Medical news source | 1 (4) | [ | ||
| 7 (26) | [ | |||
| Personal accounts, everyday people, individuals | 6 (22) | [ | ||
| Personal accounts with industry ties | 1 (4) | [ | ||
| Person: supporter | 1 (4) | [ | ||
| Person: basic profile (no mention of e-cigarettes) | 1 (4) | [ | ||
| Celebrity, public figures | 3 (11) | [ | ||
| Organic (human) | 1 (4) | [ | ||
| E-cigarette community movement | 2 (7) | [ | ||
| 5 (19) | [ | |||
| Industry: retailer or manufacturer | 2 (7) | [ | ||
| Retailer or vendor | 3 (11) | [ | ||
| Tobacco company | 2 (7) | [ | ||
| Industry: other (eg, vaping magazine, Web marketer) | 1 (4) | [ | ||
| For-profit organization | 1 (4) | [ | ||
| Entity: general (eg, company, store, advocacy group) | 1 (4) | [ | ||
| Nonperson (eg, musical band) | 1 (4) | [ | ||
| Bots, automatic, fake | 4 (15) | [ | ||
| Unclassified or other | 5 (19) | [ | ||
| Profile photo | 1 (4) | [ | ||
| Single person versus multiple people | 1 (4) | [ | ||
| Gender (male, female, mixed group) | 1 (4) | [ | ||
| Age (babies or children, high school or college, adult) | 1 (4) | [ | ||
| Race (African American, white, Hispanic, Asian, undetermined) | 1 (4) | [ | ||
| Location of user | 4 (15) | [ | ||
| City, state, and country | 1 (4) | [ | ||
| State | 1 (4) | [ | ||
| Country | 1 (4) | [ | ||
| Continent | 1 (4) | [ | ||
aPercentages are rounded to the nearest whole percent.