| Literature DB >> 36118953 |
Wingyan Chung1, Yinqiang Zhang2, Jia Pan2.
Abstract
The spreading of disinformation in social media threatens cybersecurity and undermines market efficiency. Detecting disinformation is challenging due to large volumes of social media content and a rapidly changing environment. This research developed and validated a theory-based, novel deep-learning approach (called TRNN) to disinformation detection. Grounded in social and psychological theories, TRNN uses deep-learning and data-centric augmentation to enhance disinformation detection in financial social media. Temporal and contextual information is encoded as specific knowledge about human-validated disinformation, which was identified from our unique collection of 745,139 financial social media messages about four U.S. high-tech company stocks and their fine-grained trading data. TRNN uses multiple series of long short-term memory (LSTM) recurrent neurons to learn dynamic and hidden patterns to support disinformation detection. Our experimental findings show that TRNN significantly outperformed widely-used machine learning techniques in terms of precision, recall, F-score and accuracy, achieving consistently better classification performance in disinformation detection. A case study of Apple Inc.'s stock price movement demonstrates the potential usability of TRNN for secure knowledge management. The research contributes to developing novel approach and model, producing new information systems artifacts and dataset, and providing empirical findings of detecting online disinformation.Entities:
Keywords: Cybersecurity; Deep learning; Design science; Disinformation detection; Financial market; Machine learning; Secure knowledge management; Sequence prediction; Social media; Temporal recurrent neural network
Year: 2022 PMID: 36118953 PMCID: PMC9465158 DOI: 10.1007/s10796-022-10327-9
Source DB: PubMed Journal: Inf Syst Front ISSN: 1387-3326 Impact factor: 5.261
A summary of methods for disinformation detection
| Category | Sub-Category | Description | Strength | Weakness | Work |
|---|---|---|---|---|---|
| Manual | Rule based | Rule-based methods utilize | Rule based methods are | Trivial blacklist | (Lee et al., |
| Pattern | methods | user white lists or keyword | able to easily incorporate | keywords matching tend |
Ribeiro et al., |
| Matching | blacklists and manually | expert’s domain | to be error-prone when | ||
| crafted rules to detect | knowledge into the | the context contains | |||
| disinformation | blacklists to guide the | negation, sarcasm, etc. | |||
| disinformation detection | |||||
| Network | Semantic | Semantic based methods | Network based methods | If the entity to be checked | (Ciampaglia et al., |
| Based | network | capture the structure of the | are promising in accuracy | is not in the existing |
Del Vicario et al., |
| Methods | methods | knowledge network and use it | of statements of the form | database, the |
Ruchansky et al., |
| to infer the truthfulness of | “A is B” and it also | disinformation detection |
Shi & Weninger, | ||
| given information | reveals the topology | can not be done | |||
| dynamics of the social | |||||
| connections | |||||
| Diffusion based | Diffusion based methods look | Once suspicious accounts | Extremely | (Kuhlman et al., | |
| methods | for critical network links or | and initial spread of | computationally heavy in |
Nguyen et al., | |
| nodes to control the spread of | disinformation are | the context of billions of |
Pham et al., | ||
| disinformation | identified, the epidemic | nodes and links |
Vosoughi et al., | ||
| of devastating | |||||
| consequences can be | |||||
| avoided | |||||
| Machine | Traditional | Traditional ML methods use a | With different variations | Traditional ML methods | (Delort et al., |
| Learning | machine learning | collection of labeled instances | and kernel tricks, | might not model the |
Feng et al., |
| (ML) | methods | to train a classifier such as | traditional machine | complex social dynamics |
Shu et al., |
| Methods | support vector machine | learning methods are | exhibited in social media. |
Reis et al., | |
| (SVM), Decision Tree and | flexible in different |
Giasemidis et al., | |||
| logistic regression to learn the | situations to handle the |
Langley et al., | |||
| disinformation patterns | disinformation features | ||||
| Deep Learning | DL methods use multi-layered | Deep learning methods | Difficulty in model | (Zhang et al., | |
| (DL) Methods | massive computational units | are powerful in modeling | interpretation and |
Zhang et al., | |
| to learn disinformation | complex non-linear | explanation. Need large |
Volkova et al., | ||
| features with | social dynamics | amount of labeled |
Kumar et al., | ||
| back-propagation algorithm. | training data | ||||
| Representative techniques | |||||
| include RNN, LSTM, and | |||||
| CNN. |
Components of systems used in disinformation detection research
| Category | Article | System Input | Feature | Technique | Results |
|---|---|---|---|---|---|
| Textual | (Giasemidis et al., | Twitter messages | n-grams, | Semi-supervised | Improved speed with less labeled |
| feature | part-of-speech | learning algorithm | data for stance classification | ||
| (Wang et al., | Weibo messages | Event features | Adversarial neural | Improved accuracy on fake news | |
| network | detection | ||||
| (Vosoughi et al., | Twitter messages | Liguistic and network | Hidden Markov | Improved accuracy on unverified | |
| features | model | rumours | |||
| (Liu et al., | Twitter and Weibo | Linguistic features | Neural network | Improved performance in | |
| messages | and temporal feature | method | detecting disinformation | ||
| Network | (Nguyen et al., | Twitter, Pokec, DBLP | Network node | Linear threshold | Reduced complexity in stopping |
| feature | nodes and edges | activation feature | model | cyber-epidemics | |
| (Tong et al., | Wiki, YouTube, | Neighbour influence | Randomized | Reduced complexity in rumor | |
| Epinions nodes and edges | Algorithm | blocking | |||
| (Yan et al., | Wikipedia, Slashdot, | Node disseminating | Link deletion | Improved approximation of | |
| Google+ nodes and | influence | algorithm | minimizing rumour spread | ||
| edges | |||||
| (Zhang et al., | Twitter, Epinion, | Network propagation | Network monitor | Reduced # of monitors to place in | |
| Slashdot nodes and | feature | placement algorithm | social network in detecting online | ||
| edges | misinformation |
Fig. 1Architecture of the temporal recurrent neural network approach
Fig. 2A long short-term memory unit
Fig. 3An automated system to collect and transform data to support the TRNN approach
Fig. 4A sample of social media discussions about stock price movements
Categorization (and count) of Messages in Abnormal Stock Price Movements
| Categorization | Abnormal upward movement | Abnormal downward movement |
|---|---|---|
| Benign messages in abnormal | Benign messages in abnormal | |
| upward scenarios (500 messages) | downward scenarios (500 messages) | |
| Disinformation in abnormal | Disinformation in abnormal | |
| upward scenarios (500 messages) | downward scenarios (500 messages) |
Notation and its Meaning in the Eqs. 11 – 13
| Notation | Meaning |
|---|---|
| Cronbach’s alpha | |
| Total number of responses from the annotators | |
| Average of all covariances between pairs of responses | |
| Average variance of each response | |
| Variance of the errors of estimate response | |
| Variance in each response that can be accounted for the | |
| linear regression of all of the other responses | |
| Mean of correlation coefficients | |
| Guttman’s Lambda-6 estimate of reliability |
Reliability test results
| Measure | Value |
|---|---|
| Cronbach’s alpha, | 0.9094 |
| Cronbach’s standardized alpha, | 0.9074 |
| Guttman’s Lambda-6, | 0.9059 |
Hyperparameters Used in LSTM and CRNN
| Technique | Hyperparameter | Description | Value |
|---|---|---|---|
| LSTM (Yu et al., | Dense Units | Fully-connected sequential layers, | [32, 32, 1] |
| each having a specified number of computational nodes | |||
| Activation function | Functions to transform input | [‘relu’, ‘relu’, ‘sigmoid’ ] | |
| values to output values | |||
| CRNN (Wang et al., | CNN Channels | Number of channels used in | [8, 16, 32] |
| CNN layers | |||
| CNN Kernel | Same kernel size used in all CNN layers | [3, 3, 3] | |
| Pooling Layer | Input window size for max-pooling in CNN layers | [[2, 2], [2, 2], [1, 1]] | |
| LSTM Units | Number of units in bidirectional LSTM layers | [64, 64] | |
| LSTM Dropout rate | Fraction of the units to drop for linear transformation | 0.5 |
Fig. 5Accuracies achieved by different techniques on disinformation detection
Performance of disinformation detection in downward and upward scenarios
| Scenario | Model | Precision | Recall | F-score |
|---|---|---|---|---|
| Downward | TRNN | |||
| CRNN | 0.7290 | 0.8211 | 0.7717 | |
| LSTM | 0.7527 | 0.8013 | 0.7752 | |
| RNN | 0.7549 | 0.7426 | 0.7465 | |
| ANN | 0.7428 | 0.7578 | 0.7485 | |
| Upward | TRNN | 0.8481 | ||
| CRNN | 0.7178 | 0.7808 | ||
| LSTM | 0.7697 | 0.8287 | 0.7975 | |
| RNN | 0.7270 | 0.8024 | 0.7620 | |
| ANN | 0.7202 | 0.8116 | 0.7623 |
The bold numbers indicate the best performance achieved by a model among all models’ performances in downward or upward scenarios
Note: the values are averaged from 30 random samples
Pairwise Two-Sample t-Test of Models Using F-score
The bold numbers indicate the best performance achieved by a model among all models' performances in downward or upward scenarios
| Hypothesis | textitp-value | Significant? | Conclusion |
|---|---|---|---|
| F_score(TRNN) > F_score(RNN) | 6.0974e-08 | Yes | Hypothesis confirmed |
| F_score(TRNN) > F_score(ANN) | 2.2614e-08 | Yes | Hypothesis confirmed |
| F_score(TRNN) > F_score(LSTM) | 0.0001319 | Yes | Hypothesis confirmed |
| F_score(TRNN) > F_score(CRNN) | 8.5647e-07 | Yes | Hypothesis confirmed |
Fig. 6A tweet about abnormal stock price movement