Optimization Scheme for Text Classification Using Machine Learning Naïve Bayes Classifier

doi:10.1007/978-981-15-1420-3_61

Book ChapterDOI

Optimization Scheme for Text Classification Using Machine Learning Naïve Bayes Classifier

- pp 576-586

TLDR

A naive bayes classifier which scales directly with number of indicators and data points which can be used for both binary and multi-class classification problems, which demonstrates the performance improvement in the classification technique.

Abstract:

Text classification is an essential advance in characteristic dialect processing. It very well may be performed utilizing different classification algorithms. Hadoop Map Reduce is widely utilized in text classification to perform classification on colossal measure of text data. However, Map Reduce required a ton of time to perform the tasks thereby increasing latency and since the data is distributed over the cluster it builds time and thus reducing processing speed. Also, Hadoop utilizes long queue of code. Motivated by this, we propose a basic yet compelling machine learning method which uses Naive Bayes classifier for text data. In Machine Learning approach, the classifier is built automatically by learning the properties of categories from a set of predefined training data. Hence, it can process complex furthermore, multi assortmentinformation in dynamic situations. Here we propose a naive bayes classifier which scales directly with number of indicators and data points which can be used for both binary and multi-class classification problems. We implemented the presented schemes using Machine Learning tool. The experimental results demonstrate the performance improvement in the classification technique.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

Sentiment Analysis and Classification of Restaurant Reviews using Machine Learning

Kanwal Zahoor, +2 more

TL;DR: In this article, the authors analyzed the customer reviews about various restaurants across Karachi - one of the biggest cities of Pakistan, by using text categorization techniques, comments are automatically classified according to feedback about food taste, ambiance, service, and value for money.

...read moreread less

Journal ArticleDOI

An efficient approach for textual data classification using deep learning

Abdullah Alqahtani, +6 more

- 15 Sep 2022 -

Frontiers in Computational Neuroscience

TL;DR: This paper employs machine and deep learning techniques to classify textual data, and reveals that LSTM achieves 92% accuracy outperforming all other model and baseline studies.

...read moreread less

Proceedings ArticleDOI

C4.5 and Naive Bayes for Sentiment Analysis Indonesian Tweet on E-Money User During Pandemic

Wahyu S.J. Saputra, +2 more

TL;DR: In this article, the authors presented classification performance results for the C4.5 decision tree and Naive Bayes classifiers and compared each classifier's output using the voting-by-majority method to final predict positive or negative opinions about e-money in Indonesia.

...read moreread less

Journal ArticleDOI

Text Classification Using Genetic Programming with Implementation of Map Reduce and Scraping

Wirarama Wedashwara, +4 more

- 08 May 2023 -

JOIV : International Journal on Informat...

TL;DR: In this paper , the authors presented text classification using genetic programming by pre-processing text using Hadoop map-reduce and collecting data using web scraping, where genetic programming is used to perform association rule mining (ARM) before text classification to analyze big data patterns.

...read moreread less

Proceedings ArticleDOI

Feature level Fine Grained Sentiment Analysis for Classifying Online Restaurant Reviews

A Rama Satish

TL;DR: In this article , the authors conducted sentiment analysis on online restaurant reviews given by customers and classified each and every comment given in the text into five labels i.e. positive, negative, neutral, highly positive and highly negative.

...read moreread less

References

PDF

Open Access

More filters

Book ChapterDOI

Naive (Bayes) at forty: the independence assumption in information retrieval

David D. Lewis

TL;DR: The naive Bayes classifier, currently experiencing a renaissance in machine learning, has long been a core technique in information retrieval, and some of the variations used for text retrieval and classification are reviewed.

...read moreread less

Proceedings Article

Spam Filtering with Naive Bayes - Which Naive Bayes?

Vangelis Metsis, +2 more

TL;DR: An experimental procedure that emulates the incremental training of person- alized spam filters is adopted, and roc curves that allow us to compare the dierent versions of nb over the entire tradeo between true positives and true negatives are plotted.

...read moreread less

Journal ArticleDOI

Some Effective Techniques for Naive Bayes Text Classification

Sang-Bum Kim, +3 more

- 01 Nov 2006 -

IEEE Transactions on Knowledge and Data ...

TL;DR: This paper proposed two empirical heuristics: per-document text normalization and feature weighting method, which performed very well in the standard benchmark collections, competing with state-of-the-art text classifiers based on a highly complex learning method such as SVM.

...read moreread less

Book ChapterDOI

Multinomial naive bayes for text categorization revisited

Ashraf M. Kibriya, +3 more

TL;DR: It is shown how the performance of multinomial naive Bayes can be improved using locally weighted learning, and that support vector machines are still the method of choice if the aim is to maximize accuracy.

...read moreread less

Journal ArticleDOI

An investigation of machine learning based prediction systems

Carolyn Mair, +6 more

- 15 Jul 2000 -

Journal of Systems and Software

TL;DR: It is shown that ANN methods have superior accuracy and that RI methods are least accurate, however, this view is somewhat counteracted by problems with explanatory value and configurability.

...read moreread less

Optimization Scheme for Text Classification Using Machine Learning Naïve Bayes Classifier

Citations

Sentiment Analysis and Classification of Restaurant Reviews using Machine Learning

An efficient approach for textual data classification using deep learning

C4.5 and Naive Bayes for Sentiment Analysis Indonesian Tweet on E-Money User During Pandemic

Text Classification Using Genetic Programming with Implementation of Map Reduce and Scraping

Feature level Fine Grained Sentiment Analysis for Classifying Online Restaurant Reviews

References

Naive (Bayes) at forty: the independence assumption in information retrieval

Spam Filtering with Naive Bayes - Which Naive Bayes?

Some Effective Techniques for Naive Bayes Text Classification

Multinomial naive bayes for text categorization revisited

An investigation of machine learning based prediction systems

Related Papers (5)

Classification and Optimization Scheme for Text Data using Machine Learning Naïve Bayes Classifier

A MapReduce based approach for classification

Emotion based social media text classification using optimized improved ID3 classifier

A parallel learning algorithm for text classification

Dynamically evolving fuzzy classifier for real-time classification of data streams