| Literature DB >> 31016215 |
Joshua J Vertalka1, Eva Kassens-Noor2, Mark Wilson2.
Abstract
Two code files and one dataset related to Olympic Twitter activity are the foundation for this article. Through Twitter's Spritzer streaming API (Application Programming Interface), we collected over 430 million tweets from May 12th, 2016 to September 12th, 2016 windowing the Rio de Janeiro Olympics and Paralympics. We cleaned and filtered these tweets to contain Olympic-related content. We then analyzed the raw data of 21,218,652 tweets including location data, language, and tweet content to distill the sentiment and emotions of Twitter users pertaining to the Olympic Games Kassens-Noor E. et al., 2019. We generalized the original data set to comply with the Twitter's Terms of Service and Developer agreement, 2018. We present the modified dataset and accompanying code files in this article to suggest using both for further analysis on sentiment and emotions related to the Rio de Janeiro Olympics and for comparative research on imagery and perceptions of other Olympic Games.Entities:
Keywords: Emotion lexicon; Olympic; R rdata; Twitter
Year: 2019 PMID: 31016215 PMCID: PMC6468189 DOI: 10.1016/j.dib.2019.103869
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. 1Number of Tweets by Month in 2016. Note: Rio de Janeiro Olympic Games (5-21 August 2016).
Specifications table
| Subject area | |
| More specific subject area | |
| Type of data | |
| How data was acquired | |
| Data format | |
| Experimental factors | |
| Experimental features | |
| Data source location | |
| Data accessibility | |
| Related research article | Kassens-Noor E., Vertalka J, Wilson M. (2019). “Good Games, bad host? Using big data to measure public attention and imagery of the Olympic Games” |
Code files can be easily adopted to extract similar raw data sets from life streaming Twitter APIs Data can be used to measure the magnitude and sentiments of Olympic related Twitter activity before during and after the Rio de Janeiro Olympic Games Code files can be used for scoring of sentiments and emotions of Olympic themed tweets Data can be used for comparisons related to Olympic Games' imagery between Twitter and other Social Media |