| Literature DB >> 35795012 |
Shuo Yu1, Jing Ren2, Shihao Li1, Mehdi Naseriparsa3, Feng Xia2.
Abstract
Fake reviews have become prevalent on various social networks such as e-commerce and social media platforms. As fake reviews cause a heavily negative influence on the public, timely detection and response are of great significance. To this end, effective fake review detection has become an emerging research area that attracts increasing attention from various disciplines like network science, computational social science, and data science. An important line of research in fake review detection is to utilize graph learning methods, which incorporate both the attribute features of reviews and their relationships into the detection process. To further compare these graph learning methods in this paper, we conduct a detailed survey on fake review detection. The survey presents a comprehensive taxonomy and covers advancements in three high-level categories, including fake review detection, fake reviewer detection, and fake review analysis. Different kinds of fake reviews and their corresponding examples are also summarized. Furthermore, we discuss the graph learning methods, including supervised and unsupervised learning approaches for fake review detection. Specifically, we outline the unsupervised learning approach that includes generation-based and contrast-based methods, respectively. In view of the existing problems in the current methods and data, we further discuss some challenges and open issues in this field, including the imperfect data, explainability, model efficiency, and lightweight models.Entities:
Keywords: anomaly detection; data science; fake review detection; graph learning; social computing
Year: 2022 PMID: 35795012 PMCID: PMC9251112 DOI: 10.3389/frai.2022.922589
Source DB: PubMed Journal: Front Artif Intell ISSN: 2624-8212
Figure 1Changes in the number of papers related to fake reviews in the past 5 years. (A) SCI. (B) EI. (C) DBLP.
Examples of fake reviews.
|
|
|
|
|---|---|---|
| Untruthful opinions | These reviews intentionally misguide users of the review system by unjustly reviewing and rating target objects to manipulate the products' reputation. | (1) This little place in Soho is wonderful. World-class service. |
| Exclusive reviews | These reviews are given exclusively to specific brands, manufacturers, or sellers. | (1) The food is amazing! My friends and me are definitely coming back to this place. |
| Non-reviews | Non-reviews include two main sub-streams: | (1) Register to receive a gift. |
| Duplicates reviews | Different accounts post duplicate or near-duplicate reviews on products, either the same or different. | (1) Really charming. It is a great place to have a low-key lunch. |
Figure 2A taxonomy of fake review detection approaches.
Commonly used notations with explanations.
|
|
|
|---|---|
|
| A graph. |
|
| The set of nodes in a graph. |
|
| The set of edges in a graph. |
|
| Node feature matrix of a graph |
|
| A node in the node set V |
|
| An edge in the edge set E |
|
| The node representation vector of node vi. |
|
| Unlabeled node set. |
|
| Output representation of the encoder. |
|
| The adjacency matrix of a graph. |
|
| The reconstruction adjacency matrix. |
|
| The reconstruction feature matrix. |
|
| The node degree matrix. |
| σ(·) | Activation function |
Comparative review of graph learning methods for fake review detection.
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
| GAS (Li A. et al., | FRD | Node | Supervised | ✓ | Xianyu Graph | |
| PC-GNN (Liu et al., | FRD& FRerD | Node | Supervised | ✓ | ✓ | YelpChi, Amazon, |
| and Alibaba Review Graph | ||||||
| IHGAT (Liu et al., | FRD | Link | Supervised | ✓ | Alibaba Group | |
| AO-GNN (Huang et al., | FRD | Node | Supervised | ✓ | ✓ | YelpChi, Amazon, and Books |
| DeepFD (Ding et al., | FRD | Node | Unsupervised | ✓ | ✓ | Yelp, Amazon, and DDos |
| IN-GNN (Liu B. et al., | FRD | Node | Unsupervised | ✓ | MisInfdect and Pheme | |
| PAMFUL (Zhao et al., | FRerD | Node | Unsupervised | Bitcoin-Alpha, Weibo |
Figure 3Supervised methods with Graph Neural Network for fake review detection.
Figure 4Generation-based methods with Auto-Encoder for fake review detection.
Figure 5Contrast-based methods for fake review detection.
The statistics of fake review datasets.
|
|
|
|
|
|
|---|---|---|---|---|
| YelpCHI | 38,063 | 201 | 67,395 | Yes |
| YelpNYC | 160,225 | 923 | 359,052 | Yes |
| YelpZip | 260,277 | 5,044 | 608,598 | Yes |
| Amazon Reviews | 34,686,770 | 6,643,669 | 2,441,053 | No |
| Amazon FineFoods | 256,059 | 74,258 | 568,454 | No |
| Amazon Movies | 889,176 | 253,059 | 7,911,684 | No |
| BeerAdvocate | 33,387 | 66,051 | 1,586,259 | No |
| RateBeer | 40,213 | 110,419 | 2,924,127 | No |
| CellarTracker | 44,268 | 485,179 | 2,025,995 | No |
| SWMReview | 966,942 | 15,094 | 1,132,373 | No |
| Epinions | 49, 290 | 139, 738 | 664, 824 | No |