scispace - formally typeset
Open AccessJournal Article

Statistical Validation of ACO-KNN Algorithm for Sentiment Analysis

Reads0
Chats0
TLDR
The experimental results have proven that the ACO-KNN can be used as a feature selection technique in sentiment analysis to obtain quality, optimal feature subset that can represent the actual data in customer review data.
Abstract
This research paper aims to propose a hybrid of ant colony optimization (ACO) and k-nearest neighbour (KNN) algorithms as feature selections for selecting and choosing relevant features from customer review datasets. Information gain (IG), genetic algorithm (GA), and rough set attribute reduction (RSAR) were used as baseline algorithms in a performance comparison with the proposed algorithm. This paper will also discuss the significance test, which was used to evaluate the performance differences between the ACO-KNN, the IG-GA, and the IG-RSAR algorithms. The dependency relation algorithm was used to identify actual features commented by customers by linking the dependency relation between product feature and sentiment words in customers sentences. This study evaluated the performance of the ACOKNN algorithm using precision, recall, and F-score, which was validated using the parametric statistical significance tests. The evaluation process has statistically proven that this ACO-KNN algorithm has been significantly improved compared to the baseline algorithms. In addition, the experimental results have proven that the ACO-KNN can be used as a feature selection technique in sentiment analysis to obtain quality, optimal feature subset that can represent the actual data in customer review data.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

A comparative study on bio-inspired algorithms for sentiment analysis

TL;DR: A comprehensive review of the significant bio-inspired algorithms that are popularly applied in sentiment analysis, which consist of swarm intelligence based and non-swarm intelligence-based algorithms.
Journal ArticleDOI

An improved version of multi-view k-nearest neighbors (MVKNN) for multiple view learning

TL;DR: The experimental results show that a significant improvement is achieved by the proposed MVKNN algorithm compared to the well-known machine learning algorithms (KNN, support vector machine, decision tree, and naive bayes) in the case of multi-view data.

Harmony Gradient Boosting Random Forest Machine Learning Algorithms for Sentiment Classification

TL;DR: In this paper , the authors proposed a harmony gradient boosting random forest machine learning algorithm for sentiment analysis on social media, which has a classification accuracy of 5.68% compared to the harmonic random forest.
Related Papers (5)