Literature DB >> 35135236

Evading obscure communication from spam emails.

Khan Farhan Rafat1, Qin Xin2, Abdul Rehman Javed1, Zunera Jalil1, Rana Zeeshan Ahmad3.   

Abstract

Spam is any form of annoying and unsought digital communication sent in bulk and may contain offensive content feasting viruses and cyber-attacks. The voluminous increase in spam has necessitated developing more reliable and vigorous artificial intelligence-based anti-spam filters. Besides text, an email sometimes contains multimedia content such as audio, video, and images. However, text-centric email spam filtering employing text classification techniques remains today's preferred choice. In this paper, we show that text pre-processing techniques nullify the detection of malicious contents in an obscure communication framework. We use Spamassassin corpus with and without text pre-processing and examined it using machine learning (ML) and deep learning (DL) algorithms to classify these as ham or spam emails. The proposed DL-based approach consistently outperforms ML models. In the first stage, using pre-processing techniques, the long-short-term memory (LSTM) model achieves the highest results of 93.46% precision, 96.81% recall, and 95% F1-score. In the second stage, without using pre-processing techniques, LSTM achieves the best results of 95.26% precision, 97.18% recall, and 96% F1-score. Results show the supremacy of DL algorithms over the standard ones in filtering spam. However, the effects are unsatisfactory for detecting encrypted communication for both forms of ML algorithms.

Entities:  

Keywords:  Email classification ; ham ; machine learning ; spam ; stenography ; text pre-processing

Mesh:

Year:  2021        PMID: 35135236     DOI: 10.3934/mbe.2022091

Source DB:  PubMed          Journal:  Math Biosci Eng        ISSN: 1547-1063            Impact factor:   2.080


  1 in total

1.  An efficient approach for textual data classification using deep learning.

Authors:  Abdullah Alqahtani; Habib Ullah Khan; Shtwai Alsubai; Mohemmed Sha; Ahmad Almadhor; Tayyab Iqbal; Sidra Abbas
Journal:  Front Comput Neurosci       Date:  2022-09-15       Impact factor: 3.387

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.