| Literature DB >> 34764575 |
Deepika Varshney1, Dinesh Kumar Vishwakarma1.
Abstract
Clickbait is one of the form of false content, purposely designed to attract the user's attention and make them curious to follow the link and read, view, or listen to the attached content. The teaser aim behind this is to exploit the curiosity gap by giving information within the short statement. Still, the given statement is not sufficient enough to satisfy the curiosity without clicking through the linked content and lure the user to get into the respective page via playing with human psychology and degrades the user experience. To counter this problem, we develop a Clickbait Video Detector (CVD) scheme. The scheme leverages to learn three sets of latent features based on User Profiling, Video-Content, and Human Consensus, these are further used to retrieve cognitive evidence for the detection of clickbait videos on YouTube. The first step is to extract audio from the videos, which is further transformed to textual data, and later on, it is utilized for the extraction of video content-based features. Secondly, the comments are analyzed, and features are extracted based on human responses/reactions over the posted content. Lastly, user profile based features are extracted. Finally, all these features are fed into the classifier. The proposed method is tested on the publicly available fake video corpus [FVC], [FVC-2018] dataset, and a self-generated misleading video dataset [MVD]. The achieved result is compared with other state-of-the-art methods and demonstrates superior performance. © Springer Science+Business Media, LLC, part of Springer Nature 2021.Entities:
Keywords: Clickbait detection; Cognitive evidence; Dataset; User profiling; YouTube
Year: 2021 PMID: 34764575 PMCID: PMC7778503 DOI: 10.1007/s10489-020-02057-9
Source DB: PubMed Journal: Appl Intell (Dordr) ISSN: 0924-669X Impact factor: 5.086
Fig. 1Example of Clickbait Video
Fig. 2Example of Non-Clickbait
Fig. 3Flow of Proposed Methodology
Fig. 4Number of Videos by Category and Class
Detailed Description of the Self-Generated Dataset (MVD)
| S.No. | Category | Number of Videos | Class |
|---|---|---|---|
| 1 | Autos & Vehicles | 5 | clickbait |
| 2 | Autos & Vehicles | 4 | Non-Clickbait |
| 3 | Comedy | 12 | clickbait |
| 4 | Comedy | 43 | Non-Clickbait |
| 5 | Education | 6 | clickbait |
| 6 | Education | 2 | Non-Clickbait |
| 7 | Entertainment | 241 | clickbait |
| 8 | Entertainment | 227 | Non-Clickbait |
| 9 | Film & Animation | 28 | clickbait |
| 10 | Film & Animation | 35 | Non-Clickbait |
| 11 | Gaming | 8 | clickbait |
| 12 | Gaming | 23 | Non-Clickbait |
| 14 | Music | 6 | clickbait |
| 15 | Music | 82 | Non-Clickbait |
| 16 | News & Politics | 26 | clickbait |
| 17 | News & Politics | 1 | Non-Clickbait |
| 18 | Non-profits & Activism | 6 | Non-Clickbait |
| 19 | People & Blogs | 132 | clickbait |
| 20 | People & Blogs | 6 | Non-Clickbait |
| 21 | Pets & Animals | 1 | clickbait |
| 22 | Science & Technology | 3 | clickbait |
| 23 | Science & Technology | 61 | Non-Clickbait |
| 24 | Sports | 2 | clickbait |
| 25 | Sports | 22 | Non-Clickbait |
| 26 | Travel & Events | 4 | clickbait |
| Total | 474 clickbait 513 Non-Clickbait |
Possible cases for the clickbait’s detection
| S.No. | Detection Cases |
|---|---|
| 1. | Title faithfully represents the video speech content and comments are not in contradiction |
| 2. | Title does not faithfully represent the video speech content and both are in contradiction. |
| 3. | Title faithfully represents the video speech content and comments are in contradiction. |
Fig. 5An Overview of CVD
Fig. 6The Figure represents the process of retrieving Evidence 1
Video-Content based feature
| Feature | Description |
|---|---|
| Audio Transcript based Features (Avg_cs) | The average cosine similarity measure between audio transcripts and the title of the video. This is one of the novel features and very few studies incorporate it. |
| Number of Likes | This feature represents the number of likes on a video. |
| Number of Dislikes | This feature represents the number of dislikes on a video. |
| Dislike to Like Ratio | The ratio of the number of dislikes to like count on a video. It has been observed that clickbait videos received more dislikes compared to likes. |
| Number of Views | The number of views Received by a video. |
Fig. 7The Figure represents the process of retrieving Evidence 2
Human-Consensus based feature
| Feature | Description |
|---|---|
| Number of Comment c(x) | This feature represents the number of comments received on a video. To restrict our search analysis, in total maximum of 200 comments have been considered. The below equation represent the comment count. |
| Positive Polarity p(x) | This is the feature that indicates, how many comments showing a positive opinion towards a video. |
| Negative Polarity | This is the feature that indicates, how many comments showing a negative opinion towards a video. |
| Positive-Negative Polarity Ratio | This is the ratio of positive to negative comment polarity count. |
| Fake_Comment_Count | The fake comment count is the number of comments having clickbait phrases. Clickbait’s Phrases(CP) = {fake, bullshit, hoax, wrong… etc.} |
Fake_Comment_Count Ratio | It is the ratio of the number of fake_comment_count to the total number of comments encountered. |
User-Profile based feature
| Feature | Description |
|---|---|
| Registration Age | The age of the user is an indicative measure of the rounded number of days that the user has spent on YouTube, i.e. from the day account was created up to the day of the current post. |
| Channel Views | The total number of views received by the channel. |
| Total_no_of_Videos | The total number of videos has been posted by the channel till date. |
| Subscriber Count | The total number of subscribers count on the channel. |
| Video_to_Age_ratio | This is the ratio of the total number of videos uploaded by the channel to its registration age. |
| Subscribers_to_Age_ratio | This feature represents the ratio of the number of subscribers on the channel to its registration age. |
| Channel_Views_to_Subscribers | It is a ratio of the number of views received by the channel to its subscriber count. |
Fig. 8The Figure represents the process of retrieving Evidence 3
The table shows the cases and the needful evidence required for the detection
| S.NO | Detection Cases | Essential Measure | Desirable Measure |
|---|---|---|---|
| 1. | Title faithfully represents the video speech content and comments are not in contradiction | Evidence 1 Evidence 2 Evidence 3 | Evidence 1 Evidence 2 Evidence 3 |
| 2. | Title does not faithfully represent the video speech content and both are in contradiction. | Evidence 1 | Evidence 1 Evidence 2 |
| 3. | Title faithfully represents the video speech content and comments are in contradiction. | Evidence 1 Evidence 2 | Evidence 1 Evidence 2 |
Performance of the various classifier by employing Cross-Validation
| All Set of Features | ||||||||
|---|---|---|---|---|---|---|---|---|
| Classifiers | Fold1 | Fold2 | TP | FP | PRE | REC | FM | ACC |
| Random Forest | 10 | 0.974 | 0.025 | 0.975 | 0.974 | 0.974 | 97.37 | |
| Naïve Bayes | 10 | 0.964 | 0.034 | 0.965 | 0.964 | 0.964 | 96.37 | |
| Logistic | 10 | 0.909 | 0.085 | 0.922 | 0.909 | 0.909 | 90.92 | |
| SVM | 10 | 0.955 | 0.042 | 0.959 | 0.955 | 0.955 | 95.46 | |
| SGD | 10 | 0.955 | 0.043 | 0.957 | 0.955 | 0.955 | 95.46 | |
| k-nearest | 10 | 0.964 | 0.037 | 0.964 | 0.964 | 0.964 | 96.37 | |
| J48 | 10 | 0.017 | 0.983 | 0.983 | ||||
| Random Forest | 20 | 0.974 | 0.025 | 0.974 | 0.974 | 0.974 | 97.37 | |
| Naïve Bayes | 20 | 0.965 | 0.033 | 0.966 | 0.965 | 0.965 | 96.47 | |
| Logistic | 20 | 0.888 | 0.105 | 0.905 | 0.888 | 0.887 | 88.81 | |
| SVM | 20 | 0.955 | 0.042 | 0.959 | 0.955 | 0.955 | 95.46 | |
| SGD | 20 | 0.957 | 0.041 | 0.959 | 0.957 | 0.957 | 95.66 | |
| k-nearest | 20 | 0.965 | 0.035 | 0.965 | 0.965 | 0.965 | 96.47 | |
| J48 | 20 | 0.017 | 0.983 | 0.983 | ||||
| Video-based features | ||||||||
| Random Forest | 10 | – | 0.875 | 0.120 | 0.885 | 0.875 | 0.875 | 87.5 |
| Naïve Bayes | 10 | – | 0.772 | 0.243 | 0.830 | 0.772 | 0.760 | 77.21 |
| Logistic | 10 | – | 0.755 | 0.255 | 0.773 | 0.755 | 0.749 | 75.50 |
| SVM | 10 | – | 0.542 | 0.490 | 0.757 | 0.542 | 0.406 | 54.23 |
| SGD | 10 | – | 0.746 | 0.264 | 0.763 | 0.746 | 0.740 | 74.59 |
| k-nearest | 10 | – | 0.876 | 0.125 | 0.876 | 0.876 | 0.876 | 87.60 |
| J48 | 10 | – | 0.102 | 0.898 | 0.898 | |||
| Human Consensus-based Feature | ||||||||
| Random Forest | 10 | – | 0.884 | 0.124 | 0.904 | 0.884 | 0.882 | 88.40 |
| Naïve Bayes | 10 | – | 0.856 | 0.154 | 0.883 | 0.856 | 0.852 | 85.58 |
| Logistic | 10 | – | 0.795 | 0.218 | 0.845 | 0.795 | 0.786 | 79.53 |
| SGD | 10 | – | 0.823 | 0.190 | 0.868 | 0.823 | 0.816 | 82.25 |
| k-nearest | 10 | – | 0.891 | 0.111 | 0.893 | 0.891 | 0.891 | 89.11 |
| J48 | 10 | – | 0.104 | 0.902 | 0.901 | |||
| User Profile-based features | ||||||||
| Random Forest | 10 | – | 0.955 | 0.042 | 0.959 | 0.955 | 0.955 | 95.46 |
| Naïve Bayes | 10 | – | 0.949 | 0.049 | 0.952 | 0.949 | 0.949 | 94.85 |
| Logistic | 10 | – | 0.800 | 0.187 | 0.857 | 0.800 | 0.794 | 80.04 |
| SVM | 10 | – | 0.955 | 0.042 | 0.959 | 0.955 | 0.955 | 95.46 |
| SGD | 10 | – | 0.949 | 0.048 | 0.953 | 0.949 | 0.949 | 94.85 |
| k-nearest | 10 | – | 0.027 | 0.973 | 0.973 | |||
| J48 | 10 | – | 0.971 | 0.030 | 0.971 | 0.971 | 0.971 | 97.07 |
Performance of the various classifier by employing Percentage Split
| Classifiers | Split1 | Split2 | TP | FP | PRE | REC | FM | ACC |
|---|---|---|---|---|---|---|---|---|
| Random Forest | 70:30 | – | 0.977 | 0.024 | 0.977 | 0.977 | 0.977 | |
| Naïve Bayes | 70:30 | – | 0.963 | 0.037 | 0.964 | 0.963 | 0.963 | 96.30 |
| Logistic | 70:30 | – | 0.903 | 0.098 | 0.907 | 0.903 | 0.902 | 90.26 |
| SVM | 70:30 | – | 0.899 | 0.102 | 0.916 | 0.899 | 0.898 | 89.93 |
| SGD | 70:30 | – | 0.956 | 0.044 | 0.957 | 0.956 | 0.956 | 95.63 |
| K-nearest | 70:30 | – | 0.970 | 0.030 | 0.970 | 0.970 | 0.970 | 96.97 |
| J48 | 70:30 | – | 0.023 | 0.977 | 0.977 | 0.977 | ||
| Random Forest | – | 80:20 | 0.975 | 0.026 | 0.975 | 0.975 | 0.975 | |
| Naïve Bayes | – | 80:20 | 0.929 | 0.067 | 0.936 | 0.929 | 0.929 | 92.92 |
| Logistic | – | 80:20 | 0.899 | 0.105 | 0.907 | 0.899 | 0.898 | 89.89 |
| SVM | – | 80:20 | 0.949 | 0.054 | 0.954 | 0.949 | 0.949 | 94.94 |
| SGD | 80:20 | 0.960 | 0.042 | 0.961 | 0.960 | 0.960 | 95.95 | |
| k-nearest | 80:20 | 0.970 | 0.030 | 0.970 | 0.970 | 0.970 | 96.96 | |
| J48 | 80:20 | 0.025 | 0.975 | 0.975 | 0.975 |
Fig. 9Comparative analysis of various classifiers on different set of features
Fig. 10The AUC-ROC Curve
Fig. 11Plot Matrix representation of proposed features against all other features for Clickbait’s (Red) and Non-Clickbait’s (Green)
Description of features used for the plot of Scatter matrix representation, as shown in Fig. 11
| Feature No | Feature | Feature No | Feature |
|---|---|---|---|
| X1 | DL(x) | Y1 | CV(x) |
| X2 | CS(x) | Y2 | FCCR(x) |
| X3 | Avg_cs | Y3 | VA(x) |
| X4 | Video_id | Y4 | SA(x) |
| X5 | PN(x) | Y5 | PN(x) |
| X6 | SA(x) | Y6 | Video_id |
| X7 | VA(x) | Y7 | Avg_cs |
| X8 | FCCR(x) | Y8 | CS(x) |
| X9 | CV(x) | Y9 | DL(x) |
Comparative Analysis with the State-of-the-art
| Method | Classifier | Split/fold | PRE | REC | FM | Dataset |
|---|---|---|---|---|---|---|
| [ | SVM(Video feature) | 10 fold | 0.88 | 0.79 | 0.82 | FVC |
| SVM(Comment Feature) | 10 fold | 0.88 | 0.74 | 0.79 | FVC | |
| SVM(Fusion) | 10 fold | 0.88 | 0.82 | 0.85 | FVC | |
| SVM(Video feature)[YT] | 10 fold | 0.87 | 0.59 | 0.70 | FVC-2018 | |
| SVM(Comment Feature)[YT] | 10 fold | 0.91 | 0.53 | 0.67 | FVC-2018 | |
| SVM(Fusion)[YT] | 10 fold | 0.79 | 0.61 | 0.69 | FVC-2018 | |
| [ | SVM (Video Feature) | 10 fold | 0.88 | 0.79 | 0.82 | FVC |
| SVM (Comment Feature) | 10 fold | 0.88 | 0.74 | 0.79 | FVC | |
| SVM RBF(Fusion) | 10 fold | 1.00 | 0.83 | 0.90 | FVC | |
| [ | Random Forest | 70:30 | 0.74 | 0.73 | 0.73 | FVC |
| Decision- Tree | 70:30 | 0.73 | 0.67 | 0.67 | FVC | |
| SVM | 70:30 | 0.56 | 0.55 | 0.54 | FVC | |
| Logistic Regression | 70:30 | 0.53 | 0.53 | 0.53 | FVC | |
| UCNet | 70:30 | 0.82 | 0.82 | 0.82 | FVC | |
| Our Method | Random Forest | 70:30 | 0.84 | 0.78 | 0.77 | FVC |
| Decision- Tree | 70:30 | 0.77 | 0.75 | 0.73 | FVC | |
| SVM | 70:30 | 0.65 | 0.63 | 0.63 | FVC | |
| Logistic Regression | 70:30 | 0.65 | 0.65 | 0.65 | FVC | |
| SVM (Video Feature) | 10 fold | 0.87 | 0.83 | 0.83 | FVC | |
| SVM (Comment Feature) | 10 fold | 0.87 | 0.83 | 0.83 | FVC | |
| SVM (Fusion) | 10 fold | 0.87 | 0.85 | 0.85 | FVC | |
| SVM(Video Feature)[YT] | 10 fold | 0.57 | 0.57 | 0.57 | FVC-2018 | |
| SVM(Comment Feature)[YT] | 10 fold | 0.57 | 0.57 | 0.57 | FVC-2018 | |
| SVM (Fusion)[YT] | 10 fold | 0.69 | 0.69 | 0.69 | FVC-2018 |
Fig. 12Plot Matrix representation of features CS(x) against four other features on a MVD and b FVC-2018 dataset for Clickbait’s (Red) and Non-Clickbait’s (Green)