| Literature DB >> 29685871 |
Sunir Gohil1, Sabine Vuik1, Ara Darzi1.
Abstract
BACKGROUND: Twitter is a microblogging service where users can send and read short 140-character messages called "tweets." There are several unstructured, free-text tweets relating to health care being shared on Twitter, which is becoming a popular area for health care research. Sentiment is a metric commonly used to investigate the positive or negative opinion within these messages. Exploring the methods used for sentiment analysis in Twitter health care research may allow us to better understand the options available for future research in this growing field.Entities:
Keywords: Twitter; social media
Year: 2018 PMID: 29685871 PMCID: PMC5938573 DOI: 10.2196/publichealth.5789
Source DB: PubMed Journal: JMIR Public Health Surveill ISSN: 2369-2960
Tools used for sentiment analysis.
| Author | Year | Location | Subject area | Sentiment toward | Type of method |
| Bhattacharya et al [ | 2012 | United States | Public health | 25 Federal health agencies | Open source |
| Black et al [ | 2011 | United States | Emergency medicine | 2011 Japanese earthquake and tsunami | Commercial |
| Cole-Lewis et al [ | 2013-2014 | United States | Public health | Electronic cigarette | Produced for study |
| Daniulaityte et al [ | 2016 | United States | Public health | Sentiment toward drug-related tweets | Produced for study |
| Desai et al [ | 2011 | United States | Medical conference | Twitter activity at Kidney Week 2011 | Produced for study |
| Greaves et al [ | 2012 | United Kingdom | Public health | Hospital quality | Commercial |
| Hawkins et al [ | 2015 | United States | Public health | Hospital quality | Open source |
| Myslin et al [ | 2012 | Global | Public health | Tobacco | Produced for study |
| Nwosu et al [ | 2015 | Global | Disease specific | Palliative medicine | Open source |
| Ramagopalan et al [ | 2006-2014 | United States | Disease treatment | Multiple sclerosis treatments | Open source |
| Sofean and Smith [ | 2013 | United States | Public health | Tobacco | Produced for study |
| Tighe et al [ | 2015 | United States | Disease symptoms | Pain | Produced for study |
Sentiment tools based on type of tool: KNN: k-nearest-neighbors; N/A: not applicable; NB: Naïve Bayes; SVM; support vector machines.
| Author | Tool | Annotators | Kappa | Manually annotated sample | Sample size | Manually annotated compared with total sample, n (%) |
| Cole-Lewis et al [ | Produced for study: machine learning classifiers based on 5 categories (NB, KNN, and SVM) | 6 | .64 | 250 | 17,098 | 250 (1.46) |
| Desai et al [ | Produced for study: rule based using AFINN (Named after the author, Finn Arup Neilsen) | N/A | N/A | N/A | 993 | N/A |
| Daniulaityte et al [ | Produced for study: logistic regression, NB, SVM | 2 | .68 | 3000 | N/A | N/A |
| Myslin et al [ | Produced for study: machine learning (NB, KNN, SVM) | 2 | >.7 | 1000 | 7362 | 1000 (13.58) |
| Sofean and Smith [ | Produced for study: 5-fold validation using support vector machines (SVM’s) model using Waikato Environment for Knowledge Analysis toolkit toolkit | N/A | N/A | 500 | N/A | N/A |
| Tighe et al [ | Produced for study: rule based using AFINN | N/A | N/A | N/A | 65,000 | N/A |
| Bhattacharya et al [ | Open source: SentiStrength | 3 | N/A | N/A | 164,104 | N/A |
| Hawkins et al [ | Open source: machine learning classifier using Python library TextBlob | 2+Amazon Mechanical Turk | >.79 | 2216 | 404,065 | 2216 (0.55) |
| Ramagopalan et al [ | Open source: TwitteR R package + Jeffrey Breen’s sentiment analysis code | N/A | N/A | N/A | 60,037 | N/A |
| Black et al [ | Commercial: radian6 | N/A | N/A | N/A | N/A | N/A |
| Greaves et al [ | Commercial: TheySay | N/A | N/A | 250 | 198,499 | 250 (0.13) |
| Nwosu et al [ | Open source: TopsyPro | N/A | N/A | N/A | 683,500 | N/A |