Opinion mining using ensemble text hidden Markov models for text classification

doi:10.1016/J.ESWA.2017.07.019

Journal ArticleDOI

Opinion mining using ensemble text hidden Markov models for text classification

Mangi Kang, +2 more

- 15 Mar 2018 -

Expert Systems With Applications

- Vol. 94, pp 218-227

TLDR

A new sentiment analysis method, based on text-based hidden Markov models (TextHMMs), for text classification that uses a sequence of words in training texts instead of a predefined sentiment lexicon and has potential to classify implicit opinions.

Abstract:

Proposed a new sentiment analysis method, based on text-based hidden Markov models, that uses word orders without the need of sentiment lexicons.Proposed an ensemble of text-based hidden Markov models using boosting and clusters of words produced by latent semantic analysis.Showed the method has potential to classify implicit opinions by the proposed ensemble method.Showed better performance in comparison to several previous algorithms in several datasets.Applied it to a real-life dataset to classify paper titles. With the rapid growth of social media, text mining is extensively utilized in practical fields, and opinion mining, also known as sentiment analysis, plays an important role in analyzing opinion and sentiment in texts. Methods in opinion mining generally depend on a sentiment lexicon, which is a set of predefined key words that express sentiment. Opinion mining requires proper sentiment words to be extracted in advance and has difficulty classifying sentences that imply an opinion without using any sentiment key words. This paper presents a new sentiment analysis method, based on text-based hidden Markov models (TextHMMs), for text classification that uses a sequence of words in training texts instead of a predefined sentiment lexicon. We sought to learn text patterns representing sentiment through ensemble TextHMMs. Our method defines hidden variables in TextHMMs by semantic cluster information in consideration of the co-occurrence of words, and thus calculates the sentiment orientation of sentences by fitted TextHMMs. To reflect diverse patterns, we applied an ensemble of TextHMM-based classifiers. In the experiments with a benchmark data set, we show that this method is superior to some existing methods and particularly has potential to classify implicit opinions. We also demonstrate the practicality of the proposed method in a real-life data set of online market reviews.

Opinion mining using ensemble text hidden Markov models for text classification

Citations

Text Classification Algorithms: A Survey

Text Classification Algorithms: A Survey

A survey of sentiment analysis in social media

A recent overview of the state-of-the-art elements of text classification

Bi-LSTM Model to Increase Accuracy in Text Classification: Combining Word2vec CNN and Attention Mechanism

References

Indexing by Latent Semantic Analysis

Convolutional Neural Networks for Sentence Classification

Mining and summarizing customer reviews

Thumbs up? Sentiment Classiflcation using Machine Learning Techniques

Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank

Related Papers (5)

A survey on opinion mining and sentiment analysis

Opinion Mining and Sentiment Analysis

Deep learning-based sentiment classification of evaluative text based on Multi-feature fusion

Latent dirichlet allocation

Mining and summarizing customer reviews