| Literature DB >> 35729965 |
Abstract
Misinformation on social media has become a horrendous problem in our society. Fact-checks on information often fall behind the diffusion of misinformation, which can lead to negative impacts on society. This research studies how different factors may affect the spread of fact-checks over the internet. We collected a dataset of fact-checks in a six-month period and analyzed how they spread on Twitter. The spread of fact-checks is measured by the total retweet count. The factors/variables include the truthfulness rating, topic of information, source credibility, etc. The research identifies truthfulness rating as a significant factor: conclusive fact-checks (either true or false) tend to be shared more than others. In addition, the source credibility, political leaning, and the sharing count also affect the spread of fact-checks. The findings of this research provide practical insights into accelerating the spread of the truth in the battle against misinformation online.Entities:
Keywords: Credibility; Data analysis; Fact-check; Information sharing; Misinformation; Social media
Year: 2022 PMID: 35729965 PMCID: PMC9188446 DOI: 10.1007/s10796-022-10296-z
Source DB: PubMed Journal: Inf Syst Front ISSN: 1387-3326 Impact factor: 5.261
Distribution of statements with different ratings
| Rating | Fact-checks | Tweeted Fact-checks | ||
|---|---|---|---|---|
| 51 | 5.1% | 23 | 3.6% | |
| 49 | 4.9% | 30 | 4.7% | |
| 83 | 8.3% | 45 | 7.1% | |
| 144 | 14.4% | 103 | 16.2% | |
| 468 | 46.7% | 296 | 46.6% | |
| 208 | 20.7% | 138 | 21.7% | |
| Total | 1,003 | 100% | 635 | 100% |
Two topics identified using the LDA model
| Topic | Top 20 keywords | Interpretation |
|---|---|---|
| 1 | Biden, vaccine, Joe, covid, vote, Trump, year, people, show, president, Texas, house, elect, win, photo, state illegal, white, border, million | COVID-19 and vaccines |
| 2 | Elect, vote, ballot, state, Biden, show, covid, Trump, photo, people, vaccine, president, Joe, voter, video, Georgia, Donald, day, new, capitol | 2020 Presidential Election |
A mapping table to convert ratings to scores
| PolitiFact | |
|---|---|
| Rating | Score |
| 1 | |
| 0.5 | |
| 0 | |
| -0.5 | |
| -1 | |
| -2 | |
Summary of variables collected from 635 statements from November 2020 to May 2021
| Variables | Mean | Std. Dev. | Minimum | Maximum | Count |
|---|---|---|---|---|---|
|
| 127.8551 | 274.8686 | 1 | 3406 | - |
|
| - | - | - | - | 23 (3.6%) |
|
| - | - | - | - | 30 (4.7%) |
|
| - | - | - | - | 45 (7.1%) |
|
| - | - | - | - | 103 (16.2%) |
|
| - | - | - | - | 296 (46.6%) |
|
| - | - | - | - | 138 (21.7%) |
|
| 0.4951 | 0.2368 | 0.1454 | 0.8340 | - |
|
| 0.5049 | 0.2368 | 0.1660 | 0.8546 | - |
|
| -0.8400 | 0.4859 | -2 | 1 | - |
|
| 614.7228 | 573.6298 | 1 | 1300 | - |
|
| - | - | - | - | 70 (11.0%) |
|
| - | - | - | - | 191 (30.1%) |
|
| - | - | - | - | 374 (58.9%) |
|
| 1.9921 | 1.1240 | 1 | 10 | |
|
| 4.0252 | 13.2284 | 0 | 131 | - |
Fig. 1Histograms of Total Retweet Count, Source Statement Count, Tweet Count, Time Range, and their log transforms
Pairwise Pearson correlation coefficients among the numerical variables
| Variables | log( |
|
| log( | log( | log( |
|---|---|---|---|---|---|---|
| log( | 1.000 | -0.017 | -0.164 | 0.047 | 0.602 | 0.404 |
|
| - | 1.000 | -0.028 | -0.020 | 0.013 | -0.013 |
|
| - | - | 1.000 | -0.438*** | 0.057 | 0.101** |
| log( | - | - | - | 1.000 | -0.010 | -0.039 |
| log( | - | - | - | - | 1.000 | 0.708*** |
| log( | - | - | - | - | - | 1.000 |
Pearson correlation coefficients that are significant at the 0.1%, 1%, and 5% significance levels are marked by ***, **, and *, respectively
The estimated effects from the negative binomial regression models
| Dependent Variable: | Reduced Model | Full Model | |||
|---|---|---|---|---|---|
| Estimate | (s.e.) | Estimate | (s.e.) | ||
| Independent Variables |
| 1.8892 | (0.2449) *** | ||
|
| 0.9328 | (0.2239) *** | |||
|
| 0.9268 | (0.1771) *** | |||
|
| 1.4561 | (0.1690) *** | |||
|
| 1.6709 | (0.1842) *** | |||
|
| -0.1412 | (0.1567) | |||
| Control Variables |
| -0.5966 | (0.1222) *** | -0.4837 | (0.1308) *** |
| log ( | 0.0137 | (0.0222) | -0.0003 | (0.0211) | |
|
| 0.2017 | (0.2064) | 0.2273 | (0.1957) | |
|
| 0.4430 | (0.1390) ** | 0.4367 | (0.1331) ** | |
| log ( | 1.5068 | (0.1140) *** | 1.4238 | (0.1075) *** | |
| log ( | 0.0079 | (0.0107) | 0.0093 | (0.0101) | |
| Constant | 2.9551 | (0.2216) *** | 1.8602 | (0.2563) *** | |
| Others | Dispersion Parameter | 1.0346 | (0.0524) | 1.1816 | (0.0608) |
| AIC | 6982.2 | 6890.3 | |||
| 2 × log-likelihood | -6966.2 | -6862.3 | |||
Estimates that are significant at the 0.1%, 1%, and 5% significance levels are marked by ***, **, and *, respectively. A statement that is rated as Half True is used as the baseline. A source that is neither liberal nor conservative is used as the baseline
Fig. 2The 95% confidence intervals of the ratios of total retweets received by True, Mostly True, Mostly False, False, and Pants on Fire statements, compared to that of Half True statements. The centers of the 95% confidence intervals are marked by solid circles