Negative Deceptive Opinion Spam

Open AccessProceedings Article

Negative Deceptive Opinion Spam

Myle Ott, +2 more

- pp 497-501

Chats0

TLDR

This work creates and study the first dataset of deceptive opinion spam with negative sentiment reviews, and finds that standard n-gram text categorization techniques can detect negative deceptive opinions spam with performance far surpassing that of human judges.

Abstract:

The rising influence of user-generated online reviews (Cone, 2011) has led to growing incentive for businesses to solicit and manufacture DECEPTIVE OPINION SPAM—fictitious reviews that have been deliberately written to sound authentic and deceive the reader. Recently, Ott et al. (2011) have introduced an opinion spam dataset containing gold standard deceptive positive hotel reviews. However, the complementary problem of negative deceptive opinion spam, intended to slander competitive offerings, remains largely unstudied. Following an approach similar to Ott et al. (2011), in this work we create and study the first dataset of deceptive opinion spam with negative sentiment reviews. Based on this dataset, we find that standard n-gram text categorization techniques can detect negative deceptive opinion spam with performance far surpassing that of human judges. Finally, in conjunction with the aforementioned positive review dataset, we consider the possible interactions between sentiment and deception, and present initial results that encourage further exploration of this relationship.

Citations

PDF

Open Access

More filters

Patent

Tree kernel learning for text classification into classes of intent

Boris Galitsky

TL;DR: In this article, an intent classification application was proposed to determine the intent of a sentence from a predefined list of intent classes by applying a classification model to the parse thicket.

...read moreread less

Posted Content

Online Deception Detection Refueled by Real World Data Collection

Wenlin Yao, +3 more

- 28 Jul 2017 -

arXiv: Computation and Language

TL;DR: In this article, the authors apply a data collection method based on social network analysis to quickly identify high-quality deceptive and truthful online reviews from Amazon, which contains more than 10,000 deceptive reviews and is diverse in product domains and reviewers.

...read moreread less

Proceedings ArticleDOI

Think Outside the Dataset: Finding Fraudulent Reviews using Cross-Dataset Analysis

Shirin Nilizadeh, +4 more

TL;DR: This work proposes OneReview, a method for locating fraudulent reviews, correlating data from multiple crowd-sourced review sites, and applies the created model on suspicious reviews, which detected about 62K fraudulent reviews.

...read moreread less

Proceedings ArticleDOI

Identifying the sentiment styles of YouTube’s vloggers

Bennett Kleinberg, +2 more

TL;DR: The authors examined the continuous sentiment styles employed in 27,333 vlogs using a dynamic intra-textual approach to sentiment analysis and identified seven distinct continuous sentiment trajectories characterized by fluctuations of sentiment throughout a vlog's narrative time.

...read moreread less

Proceedings Article

DecOp: A Multilingual and Multi-domain Corpus For Detecting Deception In Typed Text.

Pasquale Capuozzo, +4 more

TL;DR: DecOp (Deceptive Opinions), a new language resource developed for automatic deception detection in cross-domain and cross-language scenarios, is introduced and the collection procedure of the DecOp corpus and his main characteristics are described.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

The measurement of observer agreement for categorical data

J. R. Landis, +1 more

- 01 Mar 1977 -

Biometrics

TL;DR: A general statistical methodology for the analysis of multivariate categorical data arising from observer reliability studies is presented and tests for interobserver bias are presented in terms of first-order marginal homogeneity and measures of interob server agreement are developed as generalized kappa-type statistics.

...read moreread less

Book

Longman Grammar of Spoken and Written English

Douglas Biber, +1 more

TL;DR: The authors compare the frequency of constructions in different contexts, from conversation to fiction to academic prose, using the 40 million-word Longman Spoken and Written English Corpus (LSEE).

...read moreread less

Journal ArticleDOI

Generalized additive models for location, scale and shape

Robert A. Rigby, +1 more

- 01 Jun 2005 -

Journal of The Royal Statistical Society...

TL;DR: The generalized additive model for location, scale and shape (GAMLSS) as mentioned in this paper is a general class of statistical models for a univariate response variable, which assumes independent observations of the response variable y given the parameters, the explanatory variables and the values of the random effects.

...read moreread less

Journal ArticleDOI

Nonverbal Leakage and Clues to Deception

Paul Ekman, +1 more

- 01 Feb 1969 -

Psychiatry MMC

TL;DR: The study explores the interaction situation, and considers how within deception interactions differences in neuroanatomy and cultural influences combine to produce specific types of body movements and facial expressions which escape efforts to deceive and emerge as leakage or deception clues.

...read moreread less

Journal ArticleDOI

Accuracy of Deception Judgments

Charles F. Bond, +1 more

- 01 Jan 2006 -

Personality and Social Psychology Review

TL;DR: It is proposed that people judge others' deceptions more harshly than their own and that this double standard in evaluating deceit can explain much of the accumulated literature.

...read moreread less

Negative Deceptive Opinion Spam

Citations

Tree kernel learning for text classification into classes of intent

Online Deception Detection Refueled by Real World Data Collection

Think Outside the Dataset: Finding Fraudulent Reviews using Cross-Dataset Analysis

Identifying the sentiment styles of YouTube’s vloggers

DecOp: A Multilingual and Multi-domain Corpus For Detecting Deception In Typed Text.

References

The measurement of observer agreement for categorical data

Longman Grammar of Spoken and Written English

Generalized additive models for location, scale and shape

Nonverbal Leakage and Clues to Deception

Accuracy of Deception Judgments

Related Papers (5)

Opinion spam and analysis

Finding Deceptive Opinion Spam by Any Stretch of the Imagination

Syntactic Stylometry for Deception Detection

What Yelp Fake Review Filter Might Be Doing

Detecting product review spammers using rating behaviors