| Literature DB >> 31575038 |
Bo Wei1, Rebeen Ali Hamad2, Longzhi Yang3, Xuan He4,5, Hao Wang6, Bin Gao7, Wai Lok Woo8.
Abstract
This paper designs an accurate and low-cost phishing detection sensor by exploring deep learning techniques. Phishing is a very common social engineering technique. The attackers try to deceive online users by mimicking a uniform resource locator (URL) and a webpage. Traditionally, phishing detection is largely based on manual reports from users. Machine learning techniques have recently been introduced for phishing detection. With the recent rapid development of deep learning techniques, many deep-learning-based recognition methods have also been explored to improve classification performance. This paper proposes a light-weight deep learning algorithm to detect the malicious URLs and enable a real-time and energy-saving phishing detection sensor. Experimental tests and comparisons have been conducted to verify the efficacy of the proposed method. According to the experiments, the true detection rate has been improved. This paper has also verified that the proposed method can run in an energy-saving embedded single board computer in real-time.Entities:
Keywords: cyber security; deep learning; phishing detection
Year: 2019 PMID: 31575038 PMCID: PMC6806092 DOI: 10.3390/s19194258
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1One example of a phishing website imitating the popular website facebook.com.
Figure 2Difficulties to recognise malicious URLs for experts, normal users, and machines.
Figure 3The phishing detection methods using classical machine learning methods and deep learning techniques.
Figure 4The mechanism of list-based phishing URL detection methods.
Figure 5The general process of a classical supervised learning-based malicious website detection method.
Figure 6Deep-learning-based malicious website detection.
Figure 7System structure.
Figure 8Structure of the proposed DNN model.
Architecture configuration of the proposed DNN model.
| Output Dimension | ||
| Word Embedding | 32 | |
| Number of Filters | Kernel Size | |
| Conv1D_1 | 256 | 2 |
| Conv1D_2 | 256 | 3 |
| Conv1D_3 | 256 | 4 |
| Conv1D_4 | 256 | 5 |
| Conv1D_5 | 256 | 10 |
| Dropout Rate | ||
| Dropout | 0.5 | |
| Number of Units | ||
| Dense_1 | 128 | |
| Dense_2 | 128 | |
| Dense_3 | 128 | |
Figure 9Example of one hot encoding.
Figure 10Example of word embedding.
The effect of the dense layers.
| Accuracy | |
|---|---|
| Proposed |
|
| 1 Dense Layer | 86.537% |
| 2 Dense Layers | 86.538% |
| 3 Dense Layers | 86.542% |
The effect of the convolutional layers.
| Accuracy | |
|---|---|
| Proposed |
|
| 1 Convolutional Layer | 85.401% |
| 2 Convolutional Layers | 85.832% |
| 3 Convolutional Layers | 86.169% |
| 4 Convolutional Layers | 86.439% |
The effect of the concatenation of the output from the embedding layer.
| Accuracy | |
|---|---|
| Proposed |
|
| No Concatenation | 83.472% |
Figure 11The implementation process.
Execution time in the prototype.
| Execution Time (ms) | |
|---|---|
| Data Sanitisation | 0.0106 |
| Tokenisation | 0.1997 |
| DNN Inference | 105 |