A Hybrid Spam Detection Method Based on Unstructured datasets

Shao, Yeqin, Trovati, Marcello, Shi, Quan, Angelopoulou, Olga, Asimakopoulou, Eleana and Bessis, Nik (2015) A Hybrid Spam Detection Method Based on Unstructured datasets. Soft Computing - A Fusion of Foundations, Methodologies and Applications, 21 (1). pp. 233-243. ISSN 1432-7643 DOI https://doi.org/10.1007/s00500-015-1959-z

SOCO-S-15-01331 copy.pdf - Accepted Version
Available under License Creative Commons Attribution Non-commercial No Derivatives.


Identifying non-genuine or malicious messages is challenging because cyber-criminals continuously change their techniques. In this article, we propose a hybrid detection method that combines image and text spam recognition techniques. The image component uses sparse representation-based classification, which focuses on global and local image features, together with a dictionary learning technique that produces a spam sub-dictionary and a ham sub-dictionary. The textual component uses the semantic properties of documents to assess their level of maliciousness. More specifically, we are able to distinguish between meta-spam and real spam. Experimental results show the accuracy and potential of our approach.
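
To illustrate the general idea behind sparse representation-based classification with a spam and a ham sub-dictionary, here is a minimal sketch. It is not the authors' implementation. The function names (`omp`, `src_classify`), the use of Orthogonal Matching Pursuit as the sparse coder, and the hand-made toy dictionaries are all assumptions made for illustration. In the article the sub-dictionaries are learned from global and local image features, which this sketch does not attempt to reproduce.

```python
import numpy as np


def omp(D, y, n_nonzero):
    """Orthogonal Matching Pursuit (assumed sparse coder, not from the article).

    Greedily selects up to n_nonzero atoms (columns of D) and returns a
    coefficient vector x such that y is approximated by D @ x.
    Assumes n_nonzero >= 1.
    """
    residual = y.copy()
    mask = np.zeros(D.shape[1], dtype=bool)  # atoms selected so far
    x = np.zeros(D.shape[1])
    for _ in range(n_nonzero):
        # Pick the unselected atom most correlated with the current residual.
        corr = np.abs(D.T @ residual)
        corr[mask] = -np.inf
        mask[int(np.argmax(corr))] = True
        # Refit all selected atoms by least squares and update the residual.
        coef, *_ = np.linalg.lstsq(D[:, mask], y, rcond=None)
        residual = y - D[:, mask] @ coef
    x[mask] = coef
    return x


def src_classify(D_spam, D_ham, y, n_nonzero=3):
    """Label y by the sub-dictionary that reconstructs it with less error."""
    D = np.hstack([D_spam, D_ham])
    x = omp(D, y, n_nonzero)
    n_spam = D_spam.shape[1]
    # Class-wise residuals: keep only the coefficients of one sub-dictionary.
    r_spam = np.linalg.norm(y - D_spam @ x[:n_spam])
    r_ham = np.linalg.norm(y - D_ham @ x[n_spam:])
    return "spam" if r_spam < r_ham else "ham"


# Toy, hand-made dictionaries (hypothetical): spam atoms span the first three
# coordinates of a 6-dimensional feature space, ham atoms the last three.
toy_spam = np.eye(6)[:, :3]
toy_ham = np.eye(6)[:, 3:]
print(src_classify(toy_spam, toy_ham, np.array([2.0, 1.0, 0.0, 0.0, 0.0, 0.0])))
print(src_classify(toy_spam, toy_ham, np.array([0.0, 0.0, 0.0, 0.0, 3.0, 0.0])))
```

In this toy setting the first feature vector lies entirely in the span of the spam atoms and the second in the span of the ham atoms, so the class-wise residuals separate them cleanly. With learned dictionaries and real image features the residuals would be closer, and the sparsity level would need tuning.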

Item Type: Article
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Computing and Information Systems
Date Deposited: 21 Apr 2016 09:37
URI: http://repository.edgehill.ac.uk/id/eprint/7545
