| Literature DB >> 35465441 |
Joshua Uyheng1, Iain J Cruickshank1, Kathleen M Carley1.
Abstract
This paper presents a new computational framework for mapping state-sponsored information operations into distinct strategic units. Utilizing a novel method called multi-view modularity clustering (MVMC), we identify groups of accounts engaged in distinct narrative and network information maneuvers. We then present an analytical pipeline to holistically determine their coordinated and complementary roles within the broader digital campaign. Applying our proposed methodology to disclosed Chinese state-sponsored accounts on Twitter, we discover an overarching operation to protect and manage Chinese international reputation by attacking individual adversaries (Guo Wengui) and collective threats (Hong Kong protestors), while also projecting national strength during global crisis (the COVID-19 pandemic). Psycholinguistic tools quantify variation in narrative maneuvers employing hateful and negative language against critics in contrast to communitarian and positive language to bolster national solidarity. Network analytics further distinguish how groups of accounts used network maneuvers to act as balanced operators, organized masqueraders, and egalitarian echo-chambers. Collectively, this work breaks methodological ground on the interdisciplinary application of unsupervised and multi-view methods for characterizing not just digital campaigns in particular, but also coordinated activity more generally. Moreover, our findings contribute substantive empirical insights around how state-sponsored information operations combine narrative and network maneuvers to achieve interlocking strategic objectives. This bears both theoretical and policy implications for platform regulation and understanding the evolving geopolitical significance of cyberspace.Entities:
Keywords: COVID-19 pandemic; Information operations; Multi-view modularity clustering; Social cyber-security; State-sponsored disinformation; Unsupervised machine learning
Year: 2022 PMID: 35465441 PMCID: PMC9014406 DOI: 10.1140/epjds/s13688-022-00338-6
Source DB: PubMed Journal: EPJ Data Sci ISSN: 2193-1127 Impact factor: 3.630
Figure 1Tweet creation and account creation times of state-sponsored accounts. Top: A spike in tweet creation is observed in November 2019. Bottom: Accounts active before the spike (Era 1) are older than those active after it (Era 2)
Descriptive statistics for Chinese state-sponsored accounts. Statistics for Followers and Following are given as the mean and the range in brackets. Percentages are provided for retweets in each era, as well as percentage of tweets in Chinese/English. The remaining minority of tweets were in other languages or were not identified by Twitter
| Era | Followers | Following | Retweets | Language (Chinese/English) |
|---|---|---|---|---|
| 1 | 2.837 [0,806] | 4.375 [0,1556] | 68.12% | 85.91%/6.19% |
| 2 | 2.100 [0,806] | 2.806 [0,1556] | 68.51% | 79.11%/11.98% |
Summary of network views used to process state-sponsored actors for multi-view modularity clustering
| View | Node (From) | Node (To) | Edge Weights |
|---|---|---|---|
| Account Interaction | Twitter Account | Twitter Account | Sum of the number of shared accounts that those accounts mention, reply to, or retweet |
| Account Text | Twitter Account | Tweet Text | Counts of psycholinguistic features based on words used by accounts across all their tweets |
| Account Hashtags | Twitter Account | Hashtags | Counts of the hashtags that accounts use in all of their tweets |
Descriptive statistics for unimodal representations of each network view across time periods analyzed
| Era | View | Size | Density | Mean Link Weights |
|---|---|---|---|---|
| 1 | Account Interaction | 8268 | 3.334 × 10−4 | 4.3406 |
| Account Text | 8268 | 0.551 | 1.517 | |
| Account Hashtags | 8268 | 0.431 | 1.459 | |
| 2 | Account Interaction | 16,661 | 1.292 × 10−4 | 3.756 |
| Account Text | 16,661 | 0.448 | 1.599 | |
| Account Hashtags | 16,661 | 4.527 × 10−4 | 3.064 |
Figure 2Graphical depiction of the MVMC technique used in this study. In the first step of the method, A, a graph representation is learned for every view of the data. In the second step, B, the view graphs are all collectively clustered to produce a single clustering across all of the views. Figure adapted from prior foundational work on MVMC [21, 22]
Figure 3Conceptual framework of relationship between overarching information operations, specific narrative and network maneuvers enacted by account clusters, and the methods to characterize them
Synthesis of MVMC-based mapping of Chinese state-sponsored account clusters
| Strategic Theme | Narrative and Network Information Maneuvers | |
|---|---|---|
| Time 1 | Time 2 | |
| Hong Kong and Anti-Protest | - - - | - - - |
| Guo Wengui and Anti-Fugitive | - | - |
| Pandemic Care and Community | (not yet present) | - |
Figure 4Organization of clusters of Chinese state-sponsored accounts, visualized on ORA. Nodes represent MVMC-derived clusters, connected by edges with thickness proportional to their weight. Top: Clustering obtained on accounts in Era 1. Bottom: Clustering obtained on accounts in Era 2
Descriptive statistics of derived MVMC clusters. Centralities are calculated on the cluster by cluster networks per time period
| Era | Cluster | Size | Centrality (Total) | Centrality (Betweenness) |
|---|---|---|---|---|
| 1 | Cluster 0: Defend Hong Kong | 3397 | 0.094 | 0.000 |
| Cluster 1: Guo Wengui and Bannon | 2889 | 0.091 | 0.000 | |
| Cluster 2: Hong Kong and Taiwan Elections | 659 | 0.016 | 0.153 | |
| Cluster 3: Kpop Fans and Health Tips | 462 | 0.039 | 0.167 | |
| Cluster 4: Stop Violence and Love Country | 367 | 0.005 | 0.000 | |
| Cluster 5: Guo Liar and COVID-19 | 313 | 0.008 | 0.000 | |
| Cluster 6: Love Hong Kong and Support the Police | 170 | 0.003 | 0.236 | |
| 2 | Cluster 0: Stop Violence and the Color Revolution | 5945 | 0.060 | 0.083 |
| Cluster 1: Guo Wengui Con Man | 5534 | 0.057 | 0.276 | |
| Cluster 2: Encouraging China during COVID | 2746 | 0.031 | 0.032 | |
| Cluster 3: Carrie Lam and Hong Kong Peace | 534 | 0.008 | 0.000 | |
| Cluster 4: Stop Violence during Elections | 433 | 0.005 | 0.192 | |
| Cluster 5: Punishment for Protestors | 369 | 0.004 | 0.058 | |
| Cluster 6: Guo Wengui Fugitive | 338 | 0.004 | 0.208 | |
| Cluster 7: Western Musicals and the Pandemic | 331 | 0.014 | 0.052 | |
| Cluster 8: Assorted Entertainment and Hong Kong Encouragement | 314 | 0.007 | 0.038 | |
| Cluster 9: Justin Bieber and Stopping Hong Kong Violence | 83 | 0.001 | 0.244 | |
| Cluster 10: Hong Kong (Isolated) | 38 | 0.000 | 0.321 | |
| Cluster 11: Guo Wengui (Isolated) | 15 | 0.000 | 0.173 |
Figure 5Comparison of MVMC clustering results with unimodal Louvain and Leiden baselines. Top: Baselines produce a significant number of degenerate clusters. Bottom: Baselines also capture only a fraction of coordinated accounts
Summary of hashtags used by MVMC clusters of Chinese state-sponsored accounts. Hashtags marked with a (*) are the hashtags with the top 3 highest tf-idf scores. Italics indicate hashtags did not undergo translation; otherwise they were in Chinese
| Era | Cluster | Hashtags |
|---|---|---|
| Era 1 | Cluster 0: Defend Hong Kong | Hong Kong, Mob, Defending Hong Kong, Protest*, Black Police, Defend Hong Kong, Hong Kong Police, Waste Youth*, Hong Kong Protests*, Guo Wengui |
| Cluster 1: Guo Wengui and Bannon | Guo Wen Gui, Bannon, Hong Kong, Mob, Wang Yanping*, Defending Hong Kong, New York Times*, Wengui*, Protest, Hong Kong Police | |
| Cluster 2: Hong Kong and Taiwan Elections | Wang Liqiang*, Tsai Ingwen*, Hong Kong, Taiwan Election*, Hong Kong Election, Legislative Council, Guo Wengui, Hong Kong Human Rights And Democracy Act, Wang Liqiang, Defending Hong Kong | |
| Cluster 3: Kpop Fans and Health Tips | Hong Kong, Guo Wengui, Health Tips*, | |
| Cluster 4: Stop Violence and Love Country | Hong Kong, Guo Wengui, Defend Hong Kong*, Stop Violence And Chaos*, Defend Hong Kong, Bannon, Mob, Police*, Hong Kong Police, Love The Country And Love Hong Kong | |
| Cluster 5: Guo Liar and COVID-19 | Guo Wengui, Hong Kong, Pneumonia*, Fever*, Guo Liar*, | |
| Cluster 6: Love Hong Kong and Support the Police | Guo Wengui*, Hong Kong, Guo*, Defending Hong Kong, Black Police, Love Hong Kong And Support The Police*, Hong Kong Election, Legislative Council, Bannon, Guo Wengui | |
| Era 2 | Cluster 0: Stop Violence and the Color Revolution | Hong Kong, Guowengui, Guarding Hong Kong, Police*, Pneumonia, Stop Violence And Control Chaos*, Virus, Mob, Color Revolution*, Color Revolution |
| Cluster 1: Guo Wen Gui Con Man | Guo Wengui, Hong Kong, | |
| Cluster 2: Encouraging China during COVID | Pneumonia, Hong Kong, Guo Wengui, Virus, | |
| Cluster 3: Carrie Lam and Hong Kong Peace | Hong Kong, Guo*, Old Monk Says Hong Kong*, | |
| Cluster 4: Stop Violence during Elections | Hong Kong, Stop Violence And Control Chaos*, Defending Hong Kong*, Guo Wengui, Bannon*, Hong Kong Election, Legislative Council, Protests, Hong Kong Human Rights And Democracy Act, | |
| Cluster 5: Punishment for Protestors | Hong Kong, Mob, Protests*, Mobs Destroy Youth*, Severe Punishments For Mobs*, Cockroach, Defending Hong Kong, Hong Kong Protests, Hong Kong Mob, Defend Hong Kong | |
| Cluster 6: Guo Wengui Fugitive | Hong Kong, Hong Kong Election*, Legislative Council*, Hong Kong Human Rights and Democracy Act, Guo*, Guo Wengui, New York Times, Hypocrite, Wall Street, Fugitive | |
| Cluster 7: Western Musicals and the Pandemic | ||
| Cluster 8: Assorted Entertainment and Hong Kong Encouragement | ||
| Cluster 9: Justin Bieber and Stopping Hong Kong Violence | Hong Kong*, Hong Kong Garrison*, Stop Violence and Control Chaos*, Hong Kong Police, Port, | |
| Cluster 10: Hong Kong (Isolated) | Hong Kong* | |
| Cluster 11: Guo Wengui (Isolated) | Guo Wengui* |
Figure 6Two-dimensional visualization of variation in cluster-level psycholinguistic features based on PCA. Blue points depict the coordinates of account clusters based on the first two principal components. Red rays depict vector of each psycholinguistic feature based on the first two principal components. Top: PCA results in Era 1. Bottom: PCA results in Era 2
Figure 7Follower-hierarchy coefficients for account clusters normalized to values between 0 and 1. Most clusters feature similar levels of hierarchical organization, with a few outliers exhibiting much stronger hierarchy. Top: Results for Era 1. Bottom: Results for Era 2
Figure 8Relative interaction preferences for account clusters normalized to values between 0 and 1. Most clusters specialize in mentions and replies, with a few outliers specializing in retweets. Top: Results for Era 1. Bottom: Results for Era 2