Literature DB >> 21824844

Textual and visual content-based anti-phishing: a Bayesian approach.

Haijun Zhang1, Gang Liu, Tommy W S Chow, Wenyin Liu.   

Abstract

A novel framework using a Bayesian approach for content-based phishing web page detection is presented. Our model takes into account textual and visual contents to measure the similarity between the protected web page and suspicious web pages. A text classifier, an image classifier, and an algorithm fusing the results from classifiers are introduced. An outstanding feature of this paper is the exploration of a Bayesian model to estimate the matching threshold. This is required in the classifier for determining the class of the web page and identifying whether the web page is phishing or not. In the text classifier, the naive Bayes rule is used to calculate the probability that a web page is phishing. In the image classifier, the earth mover's distance is employed to measure the visual similarity, and our Bayesian model is designed to determine the threshold. In the data fusion algorithm, the Bayes theory is used to synthesize the classification results from textual and visual content. The effectiveness of our proposed approach was examined in a large-scale dataset collected from real phishing cases. Experimental results demonstrated that the text classifier and the image classifier we designed deliver promising results, the fusion algorithm outperforms either of the individual classifiers, and our model can be adapted to different phishing cases.
© 2011 IEEE

Mesh:

Year:  2011        PMID: 21824844     DOI: 10.1109/TNN.2011.2161999

Source DB:  PubMed          Journal:  IEEE Trans Neural Netw        ISSN: 1045-9227


  4 in total

1.  Biomarker identification of hepatocellular carcinoma using a methodical literature mining strategy.

Authors:  Nai-Wen Chang; Hong-Jie Dai; Yung-Yu Shih; Chi-Yang Wu; Mira Anne C Dela Rosa; Rofeamor P Obena; Yu-Ju Chen; Wen-Lian Hsu; Yen-Jen Oyang
Journal:  Database (Oxford)       Date:  2017-01-01       Impact factor: 3.451

2.  A hybrid DNN-LSTM model for detecting phishing URLs.

Authors:  Alper Ozcan; Cagatay Catal; Emrah Donmez; Behcet Senturk
Journal:  Neural Comput Appl       Date:  2021-08-08       Impact factor: 5.102

3.  Data mining and machine learning approaches for prediction modelling of schistosomiasis disease vectors: Epidemic disease prediction modelling.

Authors:  Terence Fusco; Yaxin Bi; Haiying Wang; Fiona Browne
Journal:  Int J Mach Learn Cybern       Date:  2019-11-18

4.  Biomarker identification using text mining.

Authors:  Hui Li; Chunmei Liu
Journal:  Comput Math Methods Med       Date:  2012-11-11       Impact factor: 2.238

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.