| Literature DB >> 29742134 |
Wei Zhou1, Junhao Wen2, Qiang Qu1, Jun Zeng2, Tian Cheng2.
Abstract
Recommender systems are vulnerable to shilling attacks. Forged user-generated content data, such as user ratings and reviews, are used by attackers to manipulate recommendation rankings. Shilling attack detection in recommender systems is of great significance to maintain the fairness and sustainability of recommender systems. The current studies have problems in terms of the poor universality of algorithms, difficulty in selection of user profile attributes, and lack of an optimization mechanism. In this paper, a shilling behaviour detection structure based on abnormal group user findings and rating time series analysis is proposed. This paper adds to the current understanding in the field by studying the credibility evaluation model in-depth based on the rating prediction model to derive proximity-based predictions. A method for detecting suspicious ratings based on suspicious time windows and target item analysis is proposed. Suspicious rating time segments are determined by constructing a time series, and data streams of the rating items are examined and suspicious rating segments are checked. To analyse features of shilling attacks by a group user's credibility, an abnormal group user discovery method based on time series and time window is proposed. Standard testing datasets are used to verify the effect of the proposed method.Entities:
Mesh:
Year: 2018 PMID: 29742134 PMCID: PMC5942815 DOI: 10.1371/journal.pone.0196533
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1The general composition of an attack profile.
Features of attack models.
| Attack Model | |||
|---|---|---|---|
| ∅ | random selected | ||
| ∅ | mean of items | ||
| popular items | |||
| segment items |
Fig 2Prediction shift in user-based collaborative filtering with attack size varies and filler size varies.
Fig 3Overall structure of the proposed shilling detecting method.
Fig 4Rating deviation distribution.
Attackers in suspicious rating segments ratio in phase 1 when attack size and confidence coefficient vary.
| confidence coefficient | Attack size | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 2% | 4% | 6% | 8% | 10% | 12% | 14% | 16% | 18% | 20% | |
| 80% | 0.895 | 0.915 | 0.935 | 0.942 | 0.95 | 0.955 | 0.955 | 0.96 | 0.965 | 0.97 |
| 85% | 0.90 | 0.919 | 0.941 | 0.945 | 0.954 | 0.957 | 0.96 | 0.963 | 0.965 | 0.971 |
| 90% | 0.91 | 0.93 | 0.945 | 0.948 | 0.958 | 0.962 | 0.965 | 0.967 | 0.97 | 0.972 |
| 95% | 0.92 | 0.931 | 0.945 | 0.952 | 0.96 | 0.965 | 0.966 | 0.965 | 0.97 | 0.971 |
Attack detection ratio when attack size varies under confidence coefficient 90%.
| Filler size (%) | Attack size (%) | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 2% | 4% | 6% | 8% | 10% | 12% | 14% | 16% | 18% | 20% | |
| 3% | 0.69 | 0.74 | 0.79 | 0.83 | 0.865 | 0.89 | 0.90 | 0.91 | 0.91 | 0.913 |
| 5% | 0.74 | 0.78 | 0.82 | 0.86 | 0.89 | 0.905 | 0.916 | 0.925 | 0.93 | 0.941 |
| 7% | 0.78 | 0.83 | 0.87 | 0.895 | 0.91 | 0.92 | 0.928 | 0.942 | 0.946 | 0.95 |
| 9% | 0.81 | 0.85 | 0.88 | 0.905 | 0.92 | 0.935 | 0.945 | 0.95 | 0.954 | 0.955 |
Fig 5Detection rate and false positive rate when attack size varies.
Fig 6Comparisons of detection results with other methods.
Fig 7Detection results with different datasets with attack size varies.