Search or ask a question

Home
/
Authors
/
Liny Varghese

Author

Liny Varghese

Bio: Liny Varghese is an academic researcher. The author has contributed to research in topics: Statistical classification & Spambot. The author has an hindex of 1, co-authored 2 publications receiving 2 citations.

Topics: Statistical classification, Spambot, Bag-of-words model, Forum spam, Spamming ...read more

Papers

PDF

Open Access

More filters

Journal Article•

Spam: A Big Data Challenge

[...]

Liny Varghese, Supriya M.H, K. Poulose Jacob

30 Apr 2017-International Journal of Advanced Research in Computer Science

TL;DR: This paper uses mahout framework to analyse the time and accuracy efficiencies of the results of two Naive Bayes classification algorithms.

...read moreread less

Abstract: Spam consists of varieties of contents like text, image, embedded HTML, MIME attachments and also the volume of spam mails sent per day is massive. To handle this high volume, high velocity and large varieties of spam, a scalable spam filtering solution is required. Scalable solutions available for machine learning and statistical studies can be used to implement a scalable solution for spam filtering also. From Big data Analytics domain, Mahout is an open source library from Apache for building scalable solutions in machine learning. This paper uses mahout framework to analyse the time and accuracy efficiencies of the results of two Naive Bayes classification algorithms. Keywords: Apache Mahout, big data, scalable algorithms, Naive Bayes algorithms

...read moreread less

1 citations

Journal Article•DOI•

Filtering Template Driven Spam Mails using Vector Space Models

[...]

Liny Varghese, Supriya M.H, K. Poulose Jacob

29 Feb 2012-International Journal of Computer Applications

TL;DR: The main objective in this paper is to find out semantic distance and evaluate the applicability of the two information retrieval techniques, Simple Vector Space Models (VSM) and VSM using Rocchio Classification in the spam context.

...read moreread less

Abstract: Spam became a big problem to the society. Some spammers are using templates for sending spam. To send a particular promotion they create some template and merge the details of receivers with the template. Similarities can find among these mails and easily ignore the forthcoming spam. Most highvolume spam is sent using tools those randomizes parts of the message - subject, body, sender address etc. The general form of the template that the spammer is using can often guess by inspecting the features of messages. Most of the spam filters are either rule based models or Bayesian models. The main objective in this paper is to find out semantic distance and evaluate the applicability of the two information retrieval techniques, Simple Vector Space Models (VSM) and VSM using Rocchio Classification in the spam context. Both methods are using cosine similarities to identify the spam

...read moreread less

1 citations

Journal Article•DOI•

Impact of Different Chelating Agents on Fibrin Clot Adhesion to the Exposed Root Surface: A Comparative Study

[...]

Debjit Dhamali, Liny Varghese, M. Jalaluddin, Praveen Kumar Bankur, Deesha Kumari, E. David - Show less +2 more

02 Jun 2023-World Journal of Dentistry

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Development of Proposed Ensemble Model for Spam e-mail Classification

[...]

Akhilesh Kumar Shrivas¹, Amit Kumar Dewangan², S. M. Ghosh², Devendra Singh•Institutions (2)

Guru Ghasidas University¹, Dr. C. V. Raman University²

24 Sep 2021-Information Technology and Control

TL;DR: In this article, an Ensemble Model-1 that is an ensemble of Multilayer Perceptron (MLP), Naive Bayes and Random Forest (RF) was proposed for classification of spam and ham documents.

...read moreread less

Abstract: Spam e-mail documents classification is a very challenging task for e-mail users, especially non IT users. Billions of people using the internet and face the problem of spam e-mails. The automatic identification and classification of spam e-mails help to reduce the problem of e-mail users in managing a large amount of e-mails. This work aims to do a significant contribution by building a robust model for classification of spam e-mail documents using data mining techniques. In this paper, we use Enorn1 data set which consists of spam and ham documents collected from Kaggle repository. We propose an Ensemble Model-1 that is an ensemble of Multilayer Perceptron (MLP), Naive Bayes and Random Forest (RF) to obtain better accuracy for the classification of spam and hame-mail documents. Experimental results reveal that the proposed Ensemble Model-1 outperforms other existing classifiers as well as other proposed ensemble models in terms of classification accuracy. The suggested and proposed Ensemble Model-1 produces a high accuracy of 97.25% for classification of spam e-mail documents.

...read moreread less

3 citations

Journal Article•DOI•

Performance Evaluation of Data Mining based Classifier for Classification of Spam E-Mail

[...]

Manish Kumar Sahu

27 Apr 2017-International Journal for Research in Applied Science and Engineering Technology

TL;DR: This research work has recommended the Multilayer perceptron (MLP) as a best classifier for classification of spam which gives 93.15% accuracy with 10-fold cross validation.

...read moreread less

Abstract: E-mail is one of the important and economical communication media to transfer the information from one person to others. Due to increase number of E-mails resulted drastic increases spam E-mail. In this research work, we have used various classification techniques to classification of spam E-mail and non spam E-mails. The experiment done in Tanagra data mining tool. We have recommended the Multilayer perceptron (MLP) as a best classifier for classification of spam which gives 93.15% accuracy with 10-fold cross validation.

...read moreread less

SciSpace

About Careers Resources Support Browse Papers Pricing SciSpace Affiliate Program Cancellation & Refund Policy

Tools

Citation generator AI Detector Paraphraser

Extensions

SciSpace

Directories

Papers Topics Journals Authors Conferences Institutions Questions Citation Styles

Contact

support@typeset.io +91 8431021544