Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews

Open Access, Posted Content

Peter D. Turney

- 11 Dec 2002 -

arXiv: Learning

TLDR

A simple unsupervised learning algorithm that classifies a review as recommended (thumbs up) when the average semantic orientation of its phrases is positive, and as not recommended (thumbs down) otherwise.

Abstract:

This paper presents a simple unsupervised learning algorithm for classifying reviews as recommended (thumbs up) or not recommended (thumbs down). The classification of a review is predicted by the average semantic orientation of the phrases in the review that contain adjectives or adverbs. A phrase has a positive semantic orientation when it has good associations (e.g., "subtle nuances") and a negative semantic orientation when it has bad associations (e.g., "very cavalier"). In this paper, the semantic orientation of a phrase is calculated as the mutual information between the given phrase and the word "excellent" minus the mutual information between the given phrase and the word "poor". A review is classified as recommended if the average semantic orientation of its phrases is positive. The algorithm achieves an average accuracy of 74% when evaluated on 410 reviews from Epinions, sampled from four different domains (reviews of automobiles, banks, movies, and travel destinations). The accuracy ranges from 84% for automobile reviews to 66% for movie reviews.
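The scoring rule described in the abstract can be sketched as follows. This is a minimal illustration, not the paper's implementation: the function names, the raw co-occurrence counts passed as arguments, and the small smoothing constant are all assumptions introduced here. It takes pointwise mutual information in the usual form PMI(a, b) = log2(p(a, b) / (p(a) p(b))), so the difference of the two PMI terms reduces to a single log ratio in which the phrase's own frequency cancels.

```python
import math


def semantic_orientation(near_pos, near_neg, count_pos, count_neg, smoothing=0.01):
    """Semantic orientation of a phrase, in bits.

    SO = PMI(phrase, "excellent") - PMI(phrase, "poor"), which simplifies to
    log2((near_pos * count_neg) / (near_neg * count_pos)).

    near_pos / near_neg: hypothetical counts of the phrase co-occurring with
        the positive / negative reference word.
    count_pos / count_neg: hypothetical counts of the reference words alone.
    smoothing: assumed small constant that avoids division by zero.
    """
    numerator = (near_pos + smoothing) * count_neg
    denominator = (near_neg + smoothing) * count_pos
    return math.log2(numerator / denominator)


def classify_review(phrase_orientations):
    """Label a review by the sign of its average phrase orientation."""
    if not phrase_orientations:
        raise ValueError("review has no scored phrases")
    average = sum(phrase_orientations) / len(phrase_orientations)
    return "recommended" if average > 0 else "not recommended"
```

For instance, a phrase seen 8 times near "excellent" and 2 times near "poor", with both reference words equally frequent, scores about 2 bits, and a review whose phrase scores average above zero is labeled recommended.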

Citations

Book

Opinion Mining and Sentiment Analysis

Bo Pang, +1 more

TL;DR: This survey covers techniques and approaches that promise to directly enable opinion-oriented information-seeking systems and focuses on methods that seek to address the new challenges raised by sentiment-aware applications, as compared to those that are already present in more traditional fact-based analysis.

Proceedings Article

Mining and summarizing customer reviews

Minqing Hu, +1 more

TL;DR: This research aims to mine and to summarize all the customer reviews of a product, and proposes several novel techniques to perform these tasks.

Thumbs up? Sentiment Classification using Machine Learning Techniques

Bo Pang, +2 more

TL;DR: This paper considers the problem of classifying documents not by topic but by overall sentiment, e.g., determining whether a review is positive or negative, and employs three machine learning methods (Naive Bayes, maximum entropy classification, and support vector machines).

Book

Sentiment Analysis and Opinion Mining

Bing Liu

TL;DR: Sentiment analysis and opinion mining is the field of study that analyzes people's opinions, sentiments, evaluations, attitudes, and emotions from written language. It is one of the most active research areas in natural language processing and is also widely studied in data mining, Web mining, and text mining.

Proceedings Article

A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts

Bo Pang, +1 more

TL;DR: This paper proposes a machine learning method that applies text-categorization techniques to just the subjective portions of the document. Extracting these portions can be implemented using efficient techniques for finding minimum cuts in graphs, which greatly facilitates incorporation of cross-sentence contextual constraints.

References

Book

An introduction to categorical data analysis

Alan Agresti

TL;DR: This book presents a tour of categorical data analysis, covering contingency tables, logit and loglinear models for contingency tables, generalized linear models, and models for matched pairs.

Journal Article

A Solution to Plato's Problem: The Latent Semantic Analysis Theory of Acquisition, Induction, and Representation of Knowledge.

Thomas K. Landauer, +1 more

- 01 Apr 1997 -

Psychological Review

TL;DR: A new general theory of acquired similarity and knowledge representation, latent semantic analysis (LSA), is presented and used to successfully simulate such learning and several other psycholinguistic phenomena.

Journal Article

Word association norms, mutual information, and lexicography

Kenneth Church, +1 more

- 01 Mar 1990 -

Computational Linguistics

TL;DR: The proposed measure, the association ratio, estimates word association norms directly from computer readable corpora, making it possible to estimate norms for tens of thousands of words.

Proceedings Article

Predicting the Semantic Orientation of Adjectives

Vasileios Hatzivassiloglou, +1 more

TL;DR: A log-linear regression model uses constraints from conjunctions to predict whether conjoined adjectives are of same or different orientations, achieving 82% accuracy in this task when each conjunction is considered independently.

Posted Content

Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL

Peter D. Turney

- 11 Dec 2002 -

arXiv: Learning

TL;DR: This article presents an unsupervised learning algorithm for recognizing synonyms, based on statistical data acquired by querying a web search engine. The algorithm, called PMI-IR, uses Pointwise Mutual Information (PMI) and Information Retrieval (IR) to measure the similarity of pairs of words.

Related Papers (5)

Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews

Unsupervised Learning of Semantic Orientation from a Hundred-Billion-Word Corpus

Learning to classify semantic orientation on on-line document

SENTIMENT CLASSIFICATION of MOVIE REVIEWS USING CONTEXTUAL VALENCE SHIFTERS

Adjective Intensity and Sentiment Analysis