| Literature DB >> 35494820 |
Michael C Galgoczy1, Atharva Phatak2, Danielle Vinson3, Vijay K Mago2, Philippe J Giabbanelli1.
Abstract
Influencing and framing debates on Twitter provides power to shape public opinion. Bots have become essential tools of 'computational propaganda' on social media such as Twitter, often contributing to a large fraction of the tweets regarding political events such as elections. Although analyses have been conducted regarding the first impeachment of former president Donald Trump, they have been focused on either a manual examination of relatively few tweets to emphasize rhetoric, or the use of Natural Language Processing (NLP) of a much larger corpus with respect to common metrics such as sentiment. In this paper, we complement existing analyses by examining the role of bots in the first impeachment with respect to three questions as follows. (Q1) Are bots actively involved in the debate? (Q2) Do bots target one political affiliation more than another? (Q3) Which sources are used by bots to support their arguments? Our methods start with collecting over 13M tweets on six key dates, from October 6th 2019 to January 21st 2020. We used machine learning to evaluate the sentiment of the tweets (via BERT) and whether it originates from a bot. We then examined these sentiments with respect to a balanced sample of Democrats and Republicans directly relevant to the impeachment, such as House Speaker Nancy Pelosi, senator Mitch McConnell, and (then former Vice President) Joe Biden. The content of posts from bots was further analyzed with respect to the sources used (with bias ratings from AllSides and Ad Fontes) and themes. Our first finding is that bots have played a significant role in contributing to the overall negative tone of the debate (Q1). Bots were targeting Democrats more than Republicans (Q2), as evidenced both by a difference in ratio (bots had more negative-to-positive tweets on Democrats than Republicans) and in composition (use of derogatory nicknames). Finally, the sources provided by bots were almost twice as likely to be from the right than the left, with a noticeable use of hyper-partisan right and most extreme right sources (Q3). Bots were thus purposely used to promote a misleading version of events. Overall, this suggests an intentional use of bots as part of a strategy, thus providing further confirmation that computational propaganda is involved in defining political events in the United States. As any empirical analysis, our work has several limitations. For example, Trump's rhetoric on Twitter has previously been characterized by an overly negative tone, thus tweets detected as negative may be echoing his message rather than acting against him. Previous works show that this possibility is limited, and its existence would only strengthen our conclusions. As our analysis is based on NLP, we focus on processing a large volume of tweets rather than manually reading all of them, thus future studies may complement our approach by using qualitative methods to assess the specific arguments used by bots.Entities:
Keywords: Bots; Impeachment; Politics; Sentiment; Twitter
Year: 2022 PMID: 35494820 PMCID: PMC9044321 DOI: 10.7717/peerj-cs.947
Source DB: PubMed Journal: PeerJ Comput Sci ISSN: 2376-5992
Distribution of tweets collected.
| Date | Tweets collected |
|---|---|
| 6 October 2019 | 1,968,943 |
| 17 October 2019 | 1,960,808 |
| 14 November 2019 | 1,977,712 |
| 5 December 2019 | 1,960,813 |
| 19 December 2019 | 2,041,924 |
| 21 January 2020 | 3,360,434 |
Language detection using langdetect.
| Tweet | Output |
|---|---|
| tüm dıŞ politikayı tek bir adama yani trump’a bağlamak tüm | tr |
| I’m sure Trump will tweet about rep Elijah Cummings passing | en |
Nicknames of key actors.
| Actor | Nicknames |
|---|---|
| Nancy Pelosi | Nervous Nancy |
| Adam Schiff | Shifty Schiff |
| Joe Biden | Sleepy Joe, Creepy Joe |
| Mitch McConnell | Midnight Mitch |
Comparison of classification accuracy for BERT (which we use in this study), TextBlob library, and two scikit-learn algorithms.
| Model name | Class | Metric | Value |
|---|---|---|---|
| Decision Tree | Overall | Accuracy | 0.62 |
| Precision | 0.64 | ||
| Recall | 0.8816 | ||
| Negative | F1 | 0.744 | |
| Precision | 0.537 | ||
| Recall | 0.2562 | ||
| Neutral | F1 | 0.3465 | |
| Precision | 0.478 | ||
| Recall | 0.2104 | ||
| Positive | F1 | 0.2922 | |
| Naive Bayes | Overall | Accuracy | 0.658 |
| Precision | 0.664 | ||
| Recall | 0.918 | ||
| Negative | F1 | 0.7707 | |
| Precision | 0.638 | ||
| Recall | 0.3607 | ||
| Neutral | F1 | 0.4608 | |
| Precision | 0.617 | ||
| Recall | 0.0979 | ||
| Positive | F1 | 0.1686 | |
| BERT | Overall | Accuracy | 0.7398 |
| Precision | 0.821 | ||
| Recall | 0.806 | ||
| Negative | F1 | 0.813 | |
| Precision | 0.6246 | ||
| Recall | 0.666 | ||
| Neutral | F1 | 0.6348 | |
| Precision | 0.612 | ||
| Recall | 0.584 | ||
| Positive | F1 | 0.598 | |
| Logistic Regressor | Overall | Accuracy | 0.635 |
| Precision | 0.652 | ||
| Recall | 0.9 | ||
| Negative | F1 | 0.7535 | |
| Precision | 0.543 | ||
| Recall | 0.303 | ||
| Neutral | F1 | 0.389 | |
| Precision | 0.72 | ||
| Recall | 0.13125 | ||
| Positive | F1 | 0.217 | |
| Support Vector Machine | Overall | Accuracy | 0.64 |
| Precision | 0.639 | ||
| Recall | 0.94 | ||
| Negative | F1 | 0.762 | |
| Precision | 0.637 | ||
| Recall | 0.238 | ||
| Neutral | F1 | 0.3465 | |
| Precision | 0.665 | ||
| Recall | 0.128 | ||
| Positive | F1 | 0.199 |
Botometer results.
| Twitter account ID | English | Content | … | Universal |
|---|---|---|---|---|
| 999800336750530560 | 0.067063 | 0.938607 | … | 0.536176 |
| 306127388 | 0.046774 | 0.833483 | … | 0.207906 |
Classification performances on bot detection.
| Approach | Hyper-parameters | Performances | |||||
|---|---|---|---|---|---|---|---|
| Values explored | Best values | Acc. | F1 | ROC-AUC | Precision | Recall | |
| Decision Tree | Criterion (gini/entropy), | entropy | 0.885 | 0.903 | 0.890 | 0.938 | 0.871 |
| max_depth (1, 2,…, 10), | 5 | ||||||
| min samples leaf (2, 3, …, 20), | 14 | ||||||
| max leaf nodes (1, 2, …, 20) | 17 | ||||||
| eXtreme Gradient Boosting (XGBoost) for trees | Max depth (5, 6, …, 10), | 5 | 0.892 | 0.909 | 0.893 | 0.933 | 0.887 |
| alpha (0.1, 0.3, 0.5), | 0.5 | ||||||
| learning rate (0.01, 0.02, …, 0.05), | 0.02 | ||||||
| estimators (100, 200, 300) | 200 | ||||||
| Random Forests | Max depth (5, 6, …, 10), | 8 | 0.898 | 0.915 | 0.899 | 0.934 | 0.897 |
| criterion (gini/entropy), | entropy | ||||||
| estimators (100, 200, 300) | 200 | ||||||
Distribution of tweets.
| Date | Tweets collected | Proportion remaining (%) |
|---|---|---|
| 6 October 2019 | 1,877,693 | 95.3 |
| 17 October 2019 | 1,830,552 | 93.4 |
| 14 November 2019 | 1,828,402 | 92.5 |
| 5 December 2019 | 1,855,771 | 94.6 |
| 19 December 2019 | 1,998,802 | 97.9 |
| 21 January 2020 | 3,230,843 | 96.1 |
Figure 1Classification results (sentiments and bots) for each actor.
The high-resolution figure can be zoomed in, using the digital version of this article.
Figure 2Percentage of tweets per category of sentiments over time for each actor.
(A) Donald Trump. (B) Nancy Pelosi. (C) Adam Schiff. (D) Joe Biden. Note that using a percentage instead of the absolute number of tweets allows to compare results across actors, since each one is associated with a different volume of tweets.
Two sample tweets for each day of data collection.
| Date | Tweets |
|---|---|
| 5 December 2019 | RT @usa4_trump: Ukraine Pulls Back Curtain On Biden – Claims Burisma Paid The Vice Presi… |
| @GOP @RepMikeJohnson Too much evidence to ignore. Trump won’t even let staff testify. No transparency, no innocence! | |
| 19 December 2019 | Merry Christmas Eve Patriots! I have a grateful heart for President Donald Trump, a man who loves our God and our country, and defends our people. Keep the faith Patriots, and know that we are on the right path. God Bless you all. #TRUMP2020Landside |
| @RyanHillMI @realDonaldTrump You’re right. It wasn’t a trial. It was a perfect example of tyranny. But, tweets can only be so long and impeachment didn’t fit. So, trial. | |
| 6 October 2019 | @AndyOstroy @realDonaldTrump What Law has been broken? What rule? Why are they not taking a formal vote to begin impeachment? What crime or misdemeanors took place? Pelosi cant articulate the actual crime! Schiff is 4–50 pinnochios deep in 3 years! |
| Jim Jordan Doesn’t ‘Think’ Trump Did Something He Actually Did On Camera | |
| 17 October 2019 | Survey: 54 percent of Americans support Trump impeachment inquiry |
| RT @weavejenn: Cause he’s a spineless coward #TrumpIsADisgrace | |
| 14 November 2019 | @CumeTalitha trump is our god and lord and savior. dont say gods name in vein. i will report you. |
| @realDonaldTrump Adam Schiff is the fake whistle blower! No one exists all made up! And what are they whistle blowing we have the transcripts end of story. | |
| 21 January 2020 | @CHHR01 Pence also tried to oust Trump from the ticket in 2016. They even created a website, but then abandoned the effort and threatened the life of the web designer. I will find that story for you. |
| @realDonaldTrump FATHER TODAY I ASK FOR A MIRACLE FOR PRWSIDENT TRUMP LET HIM SEE ALL HIS ENEMYS HANDCUFFED ESCORTED TO GITMO4 TREASON IN JESUSNAME.REMOVE SOROS ALL THE DEMOCRATES WHO WE SEE DESTROYING CALI NEW YORK CHICAGO ALL STATES/CITYS(ISA41:11-16)JNA |
Note:
Some links may have broken since the study was conducted, which is one indicator that they were propaganda posts.
Figure 3Joint examination of sentiments and bots for each major actor, with additional political figures included.
The high-resolution figure can be zoomed in, using the digital version of this article.
Figure 4Temporal trend in the number of posts by bots for each major actor, with additional political figures included.
The high-resolution figure can be zoomed in, using the digital version of this article.
Website categories per source for bias rating (AllSides, Ad Fontes), using their nomenclature (e.g., ‘very left’ in AllSides, ‘hyper-partisan left’ in Ad Fontes).
*Manual screening was applied to websites not rated by Ad Fontes.
| Leaning | AllSides | Ad Fontes | Manual* | Ad Fontes + Manual |
|
|---|---|---|---|---|---|
| Not rated | 40 | 16 | |||
| Very left/hyper-partisan left | 9 | 6 | 6 | Left: 25 | |
| Left | 12 | 19 | 19 | ||
| Center/balanced | 8 | 13 | 1 | 14 | |
| Right | 5 | 16 | 1 | 17 | Right: 46 |
| Very right/hyper-partisan right | 14 | 16 | 16 | ||
| Most extreme right | 2 | 11 | 13 | ||
| No longer accessible | 3 | 3 |
Sample tweets pointing to hyper-partisan right or extreme right websites.
| Website | Tweet |
|---|---|
|
| Hes one to ca’ll the kettle black, Where’s the whistle blower there Schifty?? You said there was, then he wasn’t, you werent clear either, but Pelosi had your backside, didn’t she? You, all the other DemocRATS. @realDonaldTrump @GodLovesUSA1 @FogCityMidge |
|
| Straightforward from here! Vox sounds pretty convinced that impeachment could ultimately lead to President Nancy Pelosi |
|
| RT @gatewaypundit: REMINDER: PELOSI AND SCHIFF BOTH CONNECTED TO UKRAINIAN ARMS DEALER @JoeHoft |
|
| RT @marklevinshow: Fascistic Nancy Pelosi, also the dumbest speaker |
|
| RT @SaraCarterDC: Trump Calls Speaker Pelosi ‘A Third-Rate Politician’ |
|
| This Says it All: Watch Chairman Nadler Fall ASLEEP During Impeachment Hearing | Dan Bongino |
Note:
Some links may have broken since the study was conducted, which is one indicator that they were propaganda posts.
Figure 5Interactive visualization for topics from bots on 21 January 2020.