| Literature DB >> 26375581 |
Karolina Sylwester1, Matthew Purver1.
Abstract
Previous research has shown that political leanings correlate with various psychological factors. While surveys and experiments provide a rich source of information for political psychology, data from social networks can offer more naturalistic and robust material for analysis. This research investigates psychological differences between individuals of different political orientations on a social networking platform, Twitter. Based on previous findings, we hypothesized that the language used by liberals emphasizes their perception of uniqueness, contains more swear words, more anxiety-related words and more feeling-related words than conservatives' language. Conversely, we predicted that the language of conservatives emphasizes group membership and contains more references to achievement and religion than liberals' language. We analysed Twitter timelines of 5,373 followers of three Twitter accounts of the American Democratic and 5,386 followers of three accounts of the Republican parties' Congressional Organizations. The results support most of the predictions and previous findings, confirming that Twitter behaviour offers valid insights to offline behaviour.Entities:
Mesh:
Year: 2015 PMID: 26375581 PMCID: PMC4574198 DOI: 10.1371/journal.pone.0137422
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Predictions about language use by liberals and conservatives.
The “+” and “-” represent the direction of the expected relationship.
| Prediction Category | Measurement Category (with example words) | Prediction |
|---|---|---|
|
| 1st person singular pronouns (I, me, mine) | +DEM,-GOP due to higher perception [ |
|
| 1st person plural pronouns (we, our, us) | -DEM, +GOP due to conservatives’ perception of high in-group similarity [ |
|
| Swear words dictionary (damn, piss, fuck) | +DEM,-GOP due to reported politeness of conservatives [ |
|
| Positive emotion dictionary (love, nice, sweet) | +DEM,-GOP due to the finding that liberals express more happiness than conservatives [ |
|
| Negative emotion dictionary (hurt, ugly, nasty) | -DEM, +GOP, due to more frequent negative sentiment expressed in the language of conservatives [ |
|
| Anxiety dictionary (worried, fearful, nervous) | +DEM,-GOP due to reported higher neuroticism of liberals [ |
|
| Feeling dictionary (feels, touch) | +DEM,-GOP due to reported higher compassion and emotionality of liberals [ |
|
| Tentative dictionary (maybe, perhaps, guess) | ?DEM,? GOP, there is an established relationship between conservative orientation and ambiguity avoidance but it is difficult to predict how it would affect language use [ |
|
| Certainty dictionary (always, never) | ?DEM,? GOP, as above |
|
| Achievement dictionary (earn, hero, win) | -DEM, +GOP due to reported higher emphasis on achievement in conservatives [ |
|
| Religion dictionary (altar, church, mosque) | -DEM, +GOP due to known higher religiosity of conservatives [ |
|
| Death dictionary (bury, coffin, kill) | ?DEM,? GOP, conservatives report greater death anxiety but it is unclear whether this would lead to more frequent death-related discussions [ |
Fig 1Follower-friend ratio by political orientation.
Follower-friend ratio was calculated by dividing each user’s follower count (number of following users) by friend count (number of followed users). Boxplots represent interquartile regions with medians.
Fig 2Mention ratio by political orientation.
Mention ratio was calculated by dividing the total number of mentions per user by the total number of tweets. Boxplots represent interquartile regions with medians.
Fig 3Conditional density plot showing the change in probability of following Republicans vs. Democrats over the frequency of using 1st person singular pronouns.
The plot describes how the conditional distribution of political orientation changes over the use of the first person singular pronoun. For example, when the first person singular pronoun is 15, the probability of the political orientation being DEM is 100%, however, this changes as the first person singular pronoun usage increases.
Twenty most differentiating word stems between Republicans and Democrats obtained with 50-smoothing and weighted word frequency method (hashtags excluded).
| Top GOP word | Count GOP | Count DEM | Top DEM word | Count GOP | Count DEM |
|---|---|---|---|---|---|
|
| 339 | 11 |
| 80 | 315 |
|
| 272 | 14 |
| 20 | 132 |
|
| 326 | 26 |
| 12 | 105 |
|
| 259 | 16 |
| 99 | 317 |
|
| 708 | 116 |
| 57 | 207 |
|
| 720 | 126 |
| 48 | 181 |
|
| 299 | 33 |
| 224 | 591 |
|
| 393 | 61 |
| 28 | 125 |
|
| 230 | 23 |
| 13 | 91 |
|
| 2089 | 509 |
| 21 | 108 |
|
| 238 | 27 |
| 39 | 148 |
|
| 828 | 191 |
| 53 | 178 |
|
| 2266 | 586 |
| 330 | 778 |
|
| 867 | 207 |
| 11 | 82 |
|
| 274 | 42 |
| 18 | 97 |
|
| 674 | 162 |
| 14 | 87 |
|
| 349 | 69 |
| 18 | 94 |
|
| 10891 | 3226 |
| 62 | 186 |
|
| 296 | 57 |
| 139 | 344 |
|
| 1253 | 369 |
| 61 | 181 |
Initial logistic regression model.
| Predictors | Estimate | Standard Error | Z value | P value |
|---|---|---|---|---|
|
| 0.4711053 | 0.1192538 | -3.95 | 7.80E-05*** |
|
| 0.1036425 | 0.0103252 | 10.038 | 2.00E-16*** |
|
| -0.1361112 | 0.0310361 | -4.386 | 1.16E-05*** |
|
| 0.2490142 | 0.0512089 | 4.863 | 1.16E-06*** |
|
| 0.0406131 | 0.0094521 | 4.297 | 1.73E-05*** |
|
| -0.0595763 | 0.0261562 | -2.278 | 0.022744* |
|
| 0.3952645 | 0.0916744 | 4.312 | 1.62E-05*** |
|
| 0.1586861 | 0.0577905 | 2.746 | 0.006035** |
|
| -0.0908508 | 0.0272784 | -3.33 | 0.000867*** |
|
| 0.0003329 | 0.0325081 | 0.01 | 0.99183 |
|
| 0.0250449 | 0.0226362 | 1.106 | 0.26855 |
|
| -0.1362726 | 0.0236901 | -5.752 | 8.80E-09*** |
|
| 0.0887033 | 0.0815541 | 1.088 | 0.276744 |
Republican followers were coded as 0 and Democrat followers as 1.
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
Fisher's exact tests for political buzzwords, p < 0.001 for all tests.
| Buzz word | Count DEM | Count GOP | 95% Confidence intervals | Odds ratio |
|---|---|---|---|---|
|
| 446 | 1842 | 3.544189, 4.370964 | 3.932325 |
|
| 868 | 3068 | 3.120775, 3.633426 | 3.365708 |
|
| 5153 | 7561 | 1.348463, 1.447930 | 1.397302 |
|
| 31 | 5 | 2.510737, 21.453153 | 6.512738 |
|
| 113 | 32 | 2.486100, 5.678139 | 3.709079 |
Twenty most differentiating word stems between Democrats and Republicans based on difference in proportions.
| Top GOP word | Count GOP | Count DEM | Top DEM word | Count GOP | Count DEM |
|---|---|---|---|---|---|
|
| 11242 | 3514 |
| 16778 | 19732 |
|
| 4099 | 450 |
| 6129 | 8258 |
|
| 23516 | 19335 |
| 26654 | 27678 |
|
| 7798 | 5346 |
| 5386 | 7109 |
|
| 3089 | 879 |
| 2183 | 3852 |
|
| 3763 | 1828 |
| 22187 | 22695 |
|
| 2427 | 621 |
| 7731 | 8876 |
|
| 4383 | 2732 |
| 9620 | 10508 |
|
| 14825 | 12711 |
| 47050 | 45895 |
|
| 1845 | 449 |
| 7805 | 8578 |
|
| 2985 | 1648 |
| 7720 | 8462 |
|
| 1860 | 627 |
| 1700 | 2734 |
|
| 5288 | 3940 |
| 1086 | 2129 |
|
| 4583 | 3273 |
| 2533 | 3472 |
|
| 3826 | 2558 |
| 11043 | 11505 |
|
| 2576 | 1373 |
| 1840 | 2740 |
|
| 1252 | 280 |
| 17335 | 17405 |
|
| 6348 | 5148 |
| 14052 | 14242 |
|
| 1312 | 379 |
| 7297 | 7822 |
|
| 3027 | 2009 |
| 6195 | 6743 |
Logistic model including only predictors significant at p<0.01.
| Predictors | Estimate | Standard Error | Z value | P value | Odds Ratio |
|---|---|---|---|---|---|
|
| -0.490264 | 0.092818 | -5.282 | 1.28E-07 | 0.6124646 |
|
| 0.102213 | 0.009959 | 10.264 | 2.00E-16 | 1.1076199 |
|
| -0.13309 | 0.030918 | -4.305 | 1.67E-05 | 1.1423534 |
|
| 0.180094 | 0.04187 | 4.301 | 1.70E-05 | 1.1973295 |
|
| 0.044791 | 0.009083 | 4.932 | 8.16E-07 | 1.0458096 |
|
| 0.301711 | 0.081022 | 3.724 | 0.000196 | 1.3521706 |
|
| 0.151838 | 0.058548 | 2.593 | 0.009503 | 1.1639717 |
|
| -0.09837 | 0.02689 | -3.658 | 0.000254 | 1.1033705 |
|
| -0.139183 | 0.023423 | -5.942 | 2.81E-09 | 1.1493341 |
The odds ratios were calculated by exponentiating coefficients. Republican followers were coded as 0 and Democrat followers as 1.
Logistic regression model with all predictors using data without outliers.
| Predictors | Estimate | Standard Error | Z value | P value |
|---|---|---|---|---|
|
| -0.32546 | 0.22806 | -1.427 | 0.153555 |
|
| 0.08867 | 0.02054 | 4.316 | 1.59E-05*** |
|
| -0.3435 | 0.09793 | -3.507 | 0.000452*** |
|
| 0.88577 | 0.21676 | 4.086 | 4.38E-05*** |
|
| 0.13363 | 0.02517 | 5.31 | 1.10E-07*** |
|
| -0.12299 | 0.05362 | -2.294 | 0.021806* |
|
| 0.85417 | 0.21939 | 3.893 | 9.88E-05*** |
|
| -0.03407 | 0.14776 | -0.231 | 0.817626 |
|
| -0.11298 | 0.05405 | -2.09 | 0.036593* |
|
| -0.03662 | 0.07287 | -0.503 | 0.615252 |
|
| -0.09091 | 0.05773 | -1.575 | 0.115288 |
|
| -0.22651 | 0.1444 | -1.569 | 0.116749 |
|
| -0.28486 | 0.24512 | -1.162 | 0.245182 |
Republican followers were coded as 0 and Democrat followers as 1.
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
Final logistic regression model using data without outliers.
| Predictors | Estimate | Standard Error | Z value | P value | Odds Ratio |
|---|---|---|---|---|---|
|
| -0.90293 | 0.14419 | -6.262 | 3.79E-10*** | 0.4053813 |
|
| 0.09616 | 0.01888 | 5.092 | 3.53E-07*** | 1.1009364 |
|
| -0.36281 | 0.09659 | -3.756 | 0.000173*** | 1.4373656 |
|
| 0.61113 | 0.19703 | 3.102 | 0.001924** | 1.8425114 |
|
| 0.13338 | 0.02417 | 5.518 | 3.43E-08*** | 1.1426805 |
Republican followers were coded as 0 and Democrat followers as 1.
Results of the analyses against predictions.
Prediction category and outcome columns are in bold if the prediction is supported and not if there is insufficient evidence or if the direction of the prediction was not determined in the first place.
| Prediction category (Measurement category) | Prediction outcome | Evidence |
|---|---|---|
|
|
| “I”, “my”, “I’m” and “me” are the most frequently used unstemmed words by Democrats but not Republicans. Frequency of 1st person singular pronoun use is a significant predictor of following Democrats in all regression models. |
|
|
| “We”, “our” and “us” are among the most frequently used unstemmed words by Republicans but not Democrats. Frequency of 1st person plural pronoun use is a significant predictor of following Republicans in all regression models. |
|
|
| “Fuck” and “shit” are among the most frequently used stemmed words by Democrats but not Republicans. Frequency of swear words is a significant predictor of following Democrats in all regression models |
|
|
| In the most frequently used word stems Republicans use “great” but Democrats use “love”, “like”, “happi” and “amaz”, in unstemmed words Democrats use “lol”. Frequency of positive emotion words is a significant predictor of following Democrats in the regression models including outliers. |
|
| -DEM, +GOP | This prediction is mildly supported: Republicans frequently use “not” (unstemmed words analysis), and often address their adversaries: “obama” “obamacare”, “liberals”, “his”. The first regression shows weakly significant (p<0.05) effect for negative emotion word use predicting Republican affiliation, a consistent, yet not significant trend is present in the model with no outliers. |
|
|
| Frequency of anxiety-related words is a significant predictor of following Democrats in three out of four regression models. |
|
|
| “Feel” is one of the top words used by Democrats. Frequency of feeling-related words is a significant predictor of following Democrats in the regression models including outliers. |
|
| ?DEM,? GOP | Frequency of tentative words is a significant predictor of following Republicans in the model with outliers. |
|
| ?DEM,? GOP | No effect found. |
|
| -DEM, +GOP | No effect found. |
|
|
| “God” and “psalm” are among the top words used by Republicans. Frequency of religion-related words is a predictor of following Republicans in the regression models including outliers. |
|
| ?DEM,? GOP | No effect found. |