| Literature DB >> 35414095 |
Xiaojun Li1, Xvhao Xiao1, Jia Li2, Changhua Hu1, Junping Yao1, Shaochen Li1.
Abstract
Videos, especially short videos, have become an increasingly important source of information in these years. However, many videos spread on video sharing platforms are misleading, which have negative social impacts. Therefore, it is necessary to find methods to automatically identify misleading videos. In this paper, three categories of features (content features, uploader features and environment features) are proposed to construct a convolutional neural network (CNN) for misleading video detection. The experiment showed that all the three proposed categories of features play a vital role in detecting misleading videos. Our proposed approach that combines three categories of features achieved the best performance with the accuracy of 0.90 and the F1 score of 0.89. It also outperformed other baselines such as SVM, k-NN, decision tree and random forest models by more than 22%.Entities:
Mesh:
Year: 2022 PMID: 35414095 PMCID: PMC9002042 DOI: 10.1038/s41598-022-10117-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Process of extracting sentiment polarity features in texts.
Figure 2The CNN-based misleading video detection model.
Figure 3A convolutional neural network (CNN).
Features of the misleading video dataset.
| Feature category | Mean | S.D | Min | Median | Max | N | |
|---|---|---|---|---|---|---|---|
| Content features | C-Sen-Po | 20.07 | 23.93 | − 17 | 13 | 202.6 | 700 |
| C-Num-MoPar | 23.09 | 28.07 | 0 | 12 | 209 | 700 | |
| C-Num-PerPro | − 1.36 | 10.40 | − 64 | 0 | 53 | 700 | |
| C-Vtext-Len | 706.90 | 634.76 | 26 | 475 | 3779 | 700 | |
| Uploader features | FF-R | 6754.39 | 41,781.37 | 0 | 38.5 | 584,000 | 700 |
| Num-Likes | 117,549.29 | 748,769.63 | 0 | 237.5 | 16,265,714 | 700 | |
| Re-upload | 80.51 | 158.05 | − 1743 | 115 | 181 | 700 | |
| Num-ToVi | 3,914,180.37 | 54,192,132.06 | 0 | 28,500 | 1,257,389,046 | 700 | |
| Environment features | Likes | 184.48 | – | 0 | – | 27,000 | 700 |
| Retweets | 29.84 | – | 0 | – | 3084 | 700 | |
| Favourites | 46.87 | – | 0 | – | 3982 | 700 | |
| Rewards | 22.75 | – | 0 | – | 5111 | 700 | |
| E-Sen-Po | 0.78 | 3.09 | − 8 | 0 | 28 | 700 | |
| E-Num-MoPar | 1.16 | 2.94 | 0 | 0 | 25 | 700 | |
| E-Num-PerPro | 0.18 | 1.21 | − 15 | 0 | 7 | 700 | |
| E-Vtext-Len | 32.50 | 85.68 | 0 | 0 | 771 | 700 | |
Comparison of different detection models.
| Misleading video detection models | Accuracy | Precision | Recall | F1 |
|---|---|---|---|---|
| CNN | 0.90 | 0.92 | 0.88 | 0.89 |
| CNN (for content-based features only) | 0.74 | 0.77 | 0.68 | 0.72 |
| CNN (for uploader-based features only) | 0.80 | 0.75 | 0.9 | 0.82 |
| CNN (for environment-based features only) | 0.83 | 0.88 | 0.77 | 0.82 |
| CNN (for upload-environment-based features) | 0.88 | 0.82 | 0.9 | 0.86 |
| SVM | 0.58 | 0.58 | 0.56 | 0.57 |
| k-NN | 0.66 | 0.68 | 0.60 | 0.64 |
| Decision tree | 0.60 | 0.63 | 0.48 | 0.55 |
| Random forest | 0.68 | 0.63 | 0.88 | 0.73 |
Figure 4Experiment results from different models.