Abstract:Image spam is a type of e-mail spam that embeds spam text content into graphical images to bypass traditional text-based e-mail spam filters. To effectively detect image spam, it is desirable to leverage image content analysis technologies. However, most previous works of image spam detection focus on filtering the image spam on the client side. We propose a more desirable comprehensive solution which embraces both server-side filtering and client-side detection to effectively mitigate image spam. On the server side, we present a nonnegative sparsity induced similarity measure for cluster analysis of spam images to filter the attack activities of spammers and fast trace back the spam sources. On the client side, we employ the principle of active learning where the learner guides the users to label as few images as possible while maximizing the classification accuracy. The server-side filtering identifies large image clusters as suspicious spam sources and further analysis can be performed to identify the real sources and block them from the beginning. For those spam images which survived the server-side filter, our active learner on the client side will further guide the users to interactively and efficiently filter them out. Our experiments on an image spam data-set collected from the e-mail server of our department demonstrate the efficacy of the proposed comprehensive solution.

Image Spam Identification Method Based on Gray-Gradient Co-Occurrence Matrix

Image spam identification method based on gray-gradient co-occurrence matrix

Source Camera Identification Using Support Vector Machines

Image Classification Method by Combining Multi-features and Sparse Coding

Image Spam Identifying Algorithm Based on Color and Corner Feature

Image Spam Filtering Based on Gradient and Color Feature

Spam image discrimination using support vector machine based on higher-order local autocorrelation feature extraction

Camouflaged Chinese Spam Content Detection with Semi-supervised Generative Active Learning.

A Fast Image Retrieval Method Based on Svm and Imbalanced Samples in Filtering Multimedia Message Spam

Graph-Based Semi-supervised Feature Selection with Application to Automatic Spam Image Identification

An Efficient Color Image Classification Method Using Gradient Magnitude Based Angle Cooccurrence Matrix.

A comprehensive approach to image spam detection

Method of Image-Based Spam Filtering of E-Mails Based on Image Similarity Detection

Efficient Modeling of Spam Images

Fusion of text and image features: A new approach to image spam filtering

A Novel Spam Image Filtering Framework with Multi-Label Classification

A Comprehensive Approach to Image Spam Detection: From Server to Client Solution

Noise analysis for text-based spam images

A Nonnegative Sparsity Induced Similarity Measure with Application to Cluster Analysis of Spam Images

Text Region Extraction in Image-Based Spam Email

Research on the Classification of SVM-Based Image Texture Features