scispace - formally typeset
Open AccessJournal ArticleDOI

CPSFS: A Credible Personalized Spam Filtering Scheme by Crowdsourcing

Reads0
Chats0
TLDR
The experimental results show that the proposed CPSFS can improve the accuracy rate of distinguishing spam from legitimate emails compared with that of Bayesian filter alone.
Abstract
Email spam consumes a lot of network resources and threatens many systems because of its unwanted or malicious content. Most existing spam filters only target complete-spam but ignore semispam. This paper proposes a novel and comprehensive CPSFS scheme: Credible Personalized Spam Filtering Scheme, which classifies spam into two categories: complete-spam and semispam, and targets filtering both kinds of spam. Complete-spam is always spam for all users; semispam is an email identified as spam by some users and as regular email by other users. Most existing spam filters target complete-spam but ignore semispam. In CPSFS, Bayesian filtering is deployed at email servers to identify complete-spam, while semispam is identified at client side by crowdsourcing. An email user client can distinguish junk from legitimate emails according to spam reports from credible contacts with the similar interests. Social trust and interest similarity between users and their contacts are calculated so that spam reports are more accurately targeted to similar users. The experimental results show that the proposed CPSFS can improve the accuracy rate of distinguishing spam from legitimate emails compared with that of Bayesian filter alone.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

A lifelong spam emails classification model

TL;DR: An enhanced model is proposed for ensuring lifelong spam classification model and the overall performance of the suggested model is contrasted against various other stream mining classification techniques to prove the success of the proposed model as a lifelong spam emails classification method.
Journal ArticleDOI

Blockchain-Based Crowdsourcing Makes Training Dataset of Machine Learning No Longer Be in Short Supply

TL;DR: This paper reviews studies applying mobile crowdsourcing to training dataset collection and annotation and proposes a new possible combination of machine learning and crowdsourcing systems.
Journal ArticleDOI

SentiFilter: A Personalized Filtering Model for Arabic Semi-Spam Content based on Sentimental and Behavioral Analysis

TL;DR: The proposed SentiFilter model is a hybrid model that combines both sentimental and behavioral factors to detect unwanted content for each user towards pre-defined topics and is expected to provide an effective automated solution for filtering semi-spam content in favor of personalized preferences.
Journal ArticleDOI

Hyperparameter Optimization of Ensemble Models for Spam Email Detection

TL;DR: In this paper , the authors developed baseline models of random forest and extreme gradient boost (XGBoost) ensemble algorithms for the detection and classification of spam emails using the Enron1 dataset.
References
More filters
Proceedings ArticleDOI

SOAP: A Social network Aided Personalized and effective spam filter to clean your e-mail box

TL;DR: Experimental results show that SOAP can greatly improve the performance of Bayesian spam filters in terms of the accuracy, attack-resilience and efficiency of spam detection, and it is found that the performance is the lower bound of SOAP.
Journal ArticleDOI

Leveraging Social Networks for Effective Spam Filtering

TL;DR: Experimental results show that SOAP can greatly improve the performance of Bayesian spam filters in terms of accuracy, attack-resilience, and efficiency of spam detection.
Proceedings ArticleDOI

Supervised clustering of streaming data for email batch detection

TL;DR: This work devise a sequential decoding procedure and derive the corresponding optimization problem of supervised clustering, and study the impact of collective attributes of email batches on the effectiveness of recognizing spam emails.
Journal ArticleDOI

Computational social science and social computing

TL;DR: The related research area of social computing deals with the mechanisms through which people interact with computational systems, examining questions such as how and why people contribute user-generated content and how to design systems that better enable them to do so.
Journal ArticleDOI

Recommending social network applications via social filtering mechanisms

TL;DR: The experiment shows that the model outperforms other methodologies and indicates that social relationships play a more important role than the preferences of a user and the popularity of an application.
Related Papers (5)