A Combining Classifiers Approach for Detecting Email Spams

doi:10.1109/WAINA.2016.127

Proceedings ArticleDOI

A Combining Classifiers Approach for Detecting Email Spams

Shrawan Kumar Trivedi, +1 more

- pp 355-360

Chats0

TLDR

Results show the best results of novel combining classifier approach in compression with individual classifiers compared in terms of good performance accuracy and low false positives.

Abstract:

Email is a rapid and cheap communication medium for sending and receiving information where spam is becoming a nuisance for such communication. A good spam filtering cannot only be achieved by high performance accuracy but low false positive is also necessary. This paper presents a combining classifiers approach with committee selection mechanism where the main objective is to combine individual decisions of the good classifiers for utmost classification outcome in spam classification domain. In this context, three different classifiers have been selected i.e. "Boosted Bayesian", "Boosted Naive Bayes and Support Vector Machine (SVM). For combining classifiers, boosted bayesian and boosted naive bayes are chosen as members of committee and SVM is taken as the president. The member of committee have been selected from our previous study where we have identified boosting with adaboost improves the performance of probabilistic classifier. Results show the best results of novel combining classifier approach in compression with individual classifiers compared in terms of good performance accuracy and low false positives. In addition, greedy step wise feature search method is found to be good in this study.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

A Survey on Machine Learning Techniques for Cyber Security in the Last Decade

Kamran Shaukat, +4 more

- 02 Dec 2020 -

IEEE Access

TL;DR: This paper aims to provide a comprehensive overview of the challenges that ML techniques face in protecting cyberspace against attacks, by presenting a literature on ML techniques for cyber security including intrusion detection, spam detection, and malware detection on computer networks and mobile networks in the last decade.

...read moreread less

Journal ArticleDOI

Spam filtering using integrated distribution-based balancing approach and regularized deep neural networks

Aliaksandr Barushka, +1 more

- 01 Oct 2018 -

Applied Intelligence

TL;DR: A novel spam filter integrating an N-gram tf.idf feature selection, modified distribution-based balancing algorithm and a regularized deep multi-layer perceptron NN model with rectified linear units is proposed, which outperforms state-of-the-art spam filters and several machine learning algorithms commonly used to classify text.

...read moreread less

Journal ArticleDOI

Spam filtering using a logistic regression model trained by an artificial bee colony algorithm

Bilge Kagan Dedeturk, +1 more

- 01 Jun 2020 -

Applied Soft Computing

TL;DR: A novel spam detection method that combines the artificial bee colony algorithm with a logistic regression classification model is proposed that outperforms other spam detection techniques considered in this study in terms of classification accuracy.

...read moreread less

Journal ArticleDOI

A multi class random forest (MCRF) model for classification of small plant peptides

Ankita Tripathi, +3 more

TL;DR: Results of this study show that the proposed MCRF classifier has potential to accurately classify multi-level imbalanced data.

...read moreread less

Book ChapterDOI

Spam Detection Using Ensemble Learning

Vashu Gupta, +4 more

TL;DR: Voting classifier, a type of ensemble learning to calculate the accuracy of different combinations of classifiers is used, and results show that use of voting classifier produces more accurate prediction than individual classifier.

...read moreread less

References

PDF

Open Access

More filters

Statistical learning theory

Vladimir Vapnik

TL;DR: Presenting a method for determining the necessary and sufficient conditions for consistency of learning process, the author covers function estimates from small data pools, applying these estimations to real-life problems, and much more.

...read moreread less

Book ChapterDOI

Text Categorization with Suport Vector Machines: Learning with Many Relevant Features

Thorsten Joachims

TL;DR: This paper explores the use of Support Vector Machines for learning text classifiers from examples and analyzes the particular properties of learning with text data and identifies why SVMs are appropriate for this task.

...read moreread less

Book ChapterDOI

Naive (Bayes) at forty: the independence assumption in information retrieval

David D. Lewis

TL;DR: The naive Bayes classifier, currently experiencing a renaissance in machine learning, has long been a core technique in information retrieval, and some of the variations used for text retrieval and classification are reviewed.

...read moreread less

Journal ArticleDOI

Jackknife, Bootstrap and Other Resampling Methods in Regression Analysis

Chien-Fu Wu

- 01 Dec 1986 -

Annals of Statistics

TL;DR: In this paper, a class of weighted jackknife variance estimators for the least square estimator by deleting any fixed number of observations at a time was proposed, and three bootstrap methods were considered.

...read moreread less

Proceedings Article

A Bayesian Approach to Filtering Junk E-Mail

Mehran Sahami, +3 more

TL;DR: This work examines methods for the automated construction of filters to eliminate such unwanted messages from a user’s mail stream, and shows the efficacy of such filters in a real world usage scenario, arguing that this technology is mature enough for deployment.

...read moreread less

A Combining Classifiers Approach for Detecting Email Spams

Citations

A Survey on Machine Learning Techniques for Cyber Security in the Last Decade

Spam filtering using integrated distribution-based balancing approach and regularized deep neural networks

Spam filtering using a logistic regression model trained by an artificial bee colony algorithm

A multi class random forest (MCRF) model for classification of small plant peptides

Spam Detection Using Ensemble Learning

References

Statistical learning theory

Text Categorization with Suport Vector Machines: Learning with Many Relevant Features

Naive (Bayes) at forty: the independence assumption in information retrieval

Jackknife, Bootstrap and Other Resampling Methods in Regression Analysis

A Bayesian Approach to Filtering Junk E-Mail

Related Papers (5)

Trees of classifiers for detecting email spam

An Enhanced Genetic Programming Approach for Detecting Unsolicited Emails

A Novel Feature Selection Technique for Text Classification Using Naïve Bayes

Classification of Text Documents

Design of effective multiple classifier systems by clustering of classifiers