| Literature DB >> 27657071 |
Dong Guo1,2, Jian Cao3,4, Xiaoqi Wang5,6, Qiang Fu7,8, Qiang Li9,10.
Abstract
Cyber Physical Social Sensing makes mobile social networks (MSNs) popular with users. However, such attacks are rampant as malicious URLs are spread covertly through quick response (QR) codes to control compromised accounts in MSNs to propagate malicious messages. Currently, there are generally two types of methods to identify compromised accounts in MSNs: one type is to analyze the potential threats on wireless access points and the potential threats on handheld devices' operation systems so as to stop compromised accounts from spreading malicious messages; the other type is to apply the method of detecting compromised accounts in online social networks to MSNs. The above types of methods above focus neither on the problems of MSNs themselves nor on the interaction of sensors' messages, which leads to the restrictiveness of platforms and the simplification of methods. In order to stop the spreading of compromised accounts in MSNs effectively, the attacks have to be traced to their sources first. Through sensors, users exchange information in MSNs and acquire information by scanning QR codes. Therefore, analyzing the traces of sensor-related information helps to identify the compromised accounts in MSNs. This paper analyzes the diversity of information sending modes of compromised accounts and normal accounts, analyzes the regularity of GPS (Global Positioning System)-based location information, and introduces the concepts of entropy and conditional entropy so as to construct an entropy-based model based on machine learning strategies. To achieve the goal, about 500,000 accounts of Sina Weibo and about 100 million corresponding messages are collected. Through the validation, the accuracy rate of the model is proved to be as high as 87.6%, and the false positive rate is only 3.7%. Meanwhile, the comparative experiments of the feature sets prove that sensor-based location information can be applied to detect the compromised accounts in MSNs.Entities:
Keywords: Cyber Physical Social Sensing; QR code; compromised accounts; location-based features; mobile social networks
Year: 2016 PMID: 27657071 PMCID: PMC5038795 DOI: 10.3390/s16091522
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1System overview.
Figure 2Activities of message data.
Figure 3Sample of items in sending messages.
Figure 4Entropy of users’ behavior.
Figure 5Regularity of users’ behavior.
Figure 6Conditional entropy of location-based features.
Figure 7Entropy of location-based features.
Feature sets.
| Without Behavior Feature | Number of fans | Friends in following list of accounts |
| Account reputation | Ratio between number of fans and sum of related friends | |
| Age | Age of accounts | |
| URL ratio | Ratio between number of URLs in sending messages of latest one week by accounts and sum of number of messages | |
| Hashtag (#) ratio | Ratio between number of hashtags (#) in sent messages of latest one week by accounts and sum of number of messages | |
| Reply (@) ratio | Ratio between number of reply (@) characters in sent messages of latest one week by accounts and sum of number of messages | |
| Forward ratio | Ratio between number of forward in sending messages of latest one week by accounts to sum of number of messages |
Performance of entropy-based classifier.
| Classifiers | Accuracy Rate | False Positive Rate |
|---|---|---|
| J48 | 89.6% | 4.9% |
| Random Forest | 93.7% | 2.1% |
| SVM | 91.2% | 3.3% |
Evaluation on effective of behavior-based features.
| Accuracy Rate | False Positive Rate | |
|---|---|---|
| Without Behavior-Based Feature | 85.4% | 7.2% |
| With Behavior-Based Feature | 91.5% | 3.4% |
Evaluation of location-based features’ effectiveness.
| Accuracy Rate | False Positive Rate | |
|---|---|---|
| Without Location-Based Feature | 82.7% | 8.9% |
| With Location-Based Feature | 87.6% | 3.6% |