| Literature DB >> 33996120 |
Salud María Jiménez-Zafra1, Antonio José Sáez-Castillo2, Antonio Conde-Sánchez2, María Teresa Martín-Valdivia1.
Abstract
Virality on Twitter is catching the attention of researchers, trying to identify factors which increase or decrease the probability of retweeting. We study how terms expressing sentiments affect retweeting frequencies by means of a regression model on the number of retweets, which is specially accurate to deal with virality. We focus on the Spanish political situation during the pseudo-referendum held in Catalonia on 1 October 2017. We have found that the use of negativity in a tweet increases the probability of retweeting and that iSOL lexicon is the one that better determines the relationship between polarity and virality.Entities:
Keywords: Twitter; generalized Waring regression; sentiment analysis; virality
Year: 2021 PMID: 33996120 PMCID: PMC8059576 DOI: 10.1098/rsos.201756
Source DB: PubMed Journal: R Soc Open Sci ISSN: 2054-5703 Impact factor: 2.963
Examined features.
| retweet | # of retweets recorded for a given tweet |
| favourite | # of favourites recorded for a given tweet |
| quote | whether a tweet includes a quote of other tweet |
| reply | # whether a tweet is in reply to other tweet |
| hashtags | # of hashtags in a tweet |
| urls | # of URLs in a tweet |
| mentions | # of usernames specified in a tweet |
| time | time interval when the tweet was created (categorical variable)a |
| pos_iSOL | # of positive words (iSOL) |
| neg_iSOL | # of negative words (iSOL) |
| pos_NRC | # of positive words (NRC) |
| neg_NRC | # of negative words (NRC) |
| pos_mlS | # of positive words (ML-SentiCon) |
| neg_mlS | # of negative words (ML-SentiCon) |
| verified | whether the tweet is from a verified user |
| followers | # of users who follow the author of a tweet |
| friends | # of friends that the author is following |
| listed | # of lists which include the author of a tweet |
| user_fav | # of favourited tweets by a user |
| statuses | # of tweets made by the author since the creation of the account |
| months | # of months since the creation of the account |
aMorning (6.00–12.00), afternoon (12.00–18.00), evening (18.00–0.00) or night (0.00–6.00)
Descriptive statistics of numerical features.
| min | max | mean | median | s.d. | |
|---|---|---|---|---|---|
| retweet | 0 | 26 212 | 8.92 | 0 | 214.31 |
| favourite | 0 | 34 324 | 9.06 | 0 | 244.61 |
| hashtags | 1 | 10 | 1.79 | 1 | 1.13 |
| urls | 0 | 3.00 | 0 | 0 | 0.46 |
| mentions | 0 | 8 | 0.28 | 0 | 0.66 |
| pos_iSOL | 0 | 5 | 0.28 | 0 | 0.56 |
| neg_iSOL | 0 | 8 | 0.55 | 0 | 0.79 |
| pos_NRC | 0 | 6 | 0.43 | 0 | 0.69 |
| neg_NRC | 0 | 6 | 0.42 | 0 | 0.70 |
| pos_mlS | 0 | 4 | 0.07 | 0 | 0.26 |
| neg_mlS | 0 | 5 | 0.14 | 0 | 0.41 |
| followers | 0 | 6 408 669 | 11536.50 | 298 | 175546.78 |
| friends | 0 | 483 424 | 1178.87 | 383 | 5989.94 |
| listed | 0 | 54 377 | 113.54 | 5 | 1400.75 |
| user_fav | 0 | 770 740 | 6220.21 | 1067 | 23278.15 |
| statuses | 1 | 7 930 051 | 22827.33 | 4692 | 85552.64 |
| months | 1 | 134 | 59.86 | 68 | 30.39 |
Kendall’s tau coefficient for the number of positive words.
| pos_iSOL | pos_NRC | pos_mlS | |
|---|---|---|---|
| pos_iSOL | 1.00000 | 0.24798 | 0.37931 |
| pos_NRC | 1.00000 | 0.13987 | |
| pos_mlS | 1.00000 |
Kendall’s tau coefficient for the number of negative words.
| neg_iSOL | neg_NRC | neg_mlS | |
|---|---|---|---|
| neg_iSOL | 1.00000 | 0.45122 | 0.41194 |
| neg_NRC | 1.00000 | 0.27833 | |
| neg_mlS | 1.00000 |
Regression coefficient estimates in the fitted GW models with each lexicon.
| features | iSOL | NRC | ML-SentiCon |
|---|---|---|---|
| (intercept) | −2.363*** | −2.330*** | −2.336*** |
| positive_words | −0.074*** | −0.040*** | −0.104*** |
| negative_words | 0.034*** | −0.023* | −0.109*** |
| log(favourite + 1) | 1.263*** | 1.261*** | 1.261*** |
| reply | −0.326*** | −0.329*** | −0.328*** |
| quote | −0.051* | −0.057* | −0.050 |
| hashtags | 0.080*** | 0.077*** | 0.078*** |
| urls | 0.329*** | 0.325*** | 0.323*** |
| mentions | 0.037** | 0.035** | 0.036** |
| time_morning | 0.058* | 0.058* | 0.055* |
| time_afternoon | −0.054 | −0.058* | −0.061* |
| time_evening | −0.064 | −0.082 | −0.090 |
| verified | −0.268*** | −0.260*** | −0.261*** |
| log(followers+1) | 0.113*** | 0.113*** | 0.112*** |
| log(friends + 1) | 0.040*** | 0.039*** | 0.039*** |
| log(listed + 1) | 0.045*** | 0.045*** | 0.044*** |
| log(user_fav + 1) | −0.032*** | −0.032*** | −0.031*** |
| log(statuses) | 0.012 | 0.013* | 0.013* |
| months | −0.002*** | −0.002*** | −0.002*** |
***Significant at ; **Significant at ; *Significant at .