| Literature DB >> 24312181 |
Joseph Digrazia1, Karissa McKelvey, Johan Bollen, Fabio Rojas.
Abstract
Is social media a valid indicator of political behavior? There is considerable debate about the validity of data extracted from social media for studying offline behavior. To address this issue, we show that there is a statistically significant association between tweets that mention a candidate for the U.S. House of Representatives and his or her subsequent electoral performance. We demonstrate this result with an analysis of 542,969 tweets mentioning candidates selected from a random sample of 3,570,054,618, as well as Federal Election Commission data from 795 competitive races in the 2010 and 2012 U.S. congressional elections. This finding persists even when controlling for incumbency, district partisanship, media coverage of the race, time, and demographic variables such as the district's racial and gender composition. Our findings show that reliable data about political behavior can be extracted from social media.Entities:
Mesh:
Year: 2013 PMID: 24312181 PMCID: PMC3842288 DOI: 10.1371/journal.pone.0079449
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Results for Regression of Republican Vote Share on Tweet Share with Controls.
| Variable | Bivariate (SE) | Full Model (SE) |
| Republican Tweet Share | 0.268 (0.022) | 0.022 (0.01) |
| Republican Incumbent | 11.06 (0.66) | |
| % McCain | 0.776 (0.03) | |
| Median Age | 0.012 (0.09) | |
| % White | 0.129 (0.02) | |
| % College Educated | −0.004 (0.05) | |
| Median HH Income | 0.016 (0.03) | |
| % Female | 0.089 (0.30) | |
| CNN share | 0.002 (0.01) | |
|
| 37.042 (1.35) | −4.07 (15.04) |
|
| 406 | 406 |
|
| .26 | .92 |
Explaining Republican vote share with the proportion of tweets that included a Republican candidate during the three months before the 2010 election. The share of Republican tweets remains significant after adding controls. Standard error (SE) is in parentheses.
().
** ().
().
Results for Regression of Republican Vote share on User Share with Controls.
| Variable | Bivariate (SE) | Full Model (SE) |
| Republican User Share | 0.279 (0.02) | 0.027 (0.01) |
| Republican Incumbent | 10.956 (0.65) | |
| % McCain | 0.772 (0.03) | |
| Median Age | 0.010 (0.09) | |
| % White | 0.131 (0.02) | |
| % College Educated | −0.005 (0.05) | |
| Median HH Income | 0.017 (0.03) | |
| % Female | 0.117 (0.30) | |
| CNN share | 0.001 (0.01) | |
|
| 36.423 (1.32) | −5.474 (15.01) |
|
| 406 | 406 |
|
| .28 | .92 |
Explaining Republican vote share with the proportion of users who included a Republican candidate in at least one tweet during the three months before the 2010 election. The relationship remains significant after adding controls. Standard error (SE) is in parentheses.
* ().
().
().
Figure 12010 Republican Tweet Share vs. Vote Share.
Bivariate relationship between the share of occurrences of Republican names in tweets and vote share in the 2010 congressional elections. We show a significant positive relationship.
Figure 22012 Republican Tweet Share vs. Vote Share.
Bivariate relationship between the share of occurrences of Republican names in tweets and vote share in the 2012 congressional elections. We show a significant positive relationship.
Figure 3Effects of Name Share Mention by Month.
Effects of Republican tweet share during the months of August, September, and October with a 95% confidence interval.