scispace - formally typeset
Open AccessProceedings Article

Negative Deceptive Opinion Spam

Reads0
Chats0
TLDR
This work creates and study the first dataset of deceptive opinion spam with negative sentiment reviews, and finds that standard n-gram text categorization techniques can detect negative deceptive opinions spam with performance far surpassing that of human judges.
Abstract
The rising influence of user-generated online reviews (Cone, 2011) has led to growing incentive for businesses to solicit and manufacture DECEPTIVE OPINION SPAM—fictitious reviews that have been deliberately written to sound authentic and deceive the reader. Recently, Ott et al. (2011) have introduced an opinion spam dataset containing gold standard deceptive positive hotel reviews. However, the complementary problem of negative deceptive opinion spam, intended to slander competitive offerings, remains largely unstudied. Following an approach similar to Ott et al. (2011), in this work we create and study the first dataset of deceptive opinion spam with negative sentiment reviews. Based on this dataset, we find that standard n-gram text categorization techniques can detect negative deceptive opinion spam with performance far surpassing that of human judges. Finally, in conjunction with the aforementioned positive review dataset, we consider the possible interactions between sentiment and deception, and present initial results that encourage further exploration of this relationship.

read more

Citations
More filters
Patent

Tree kernel learning for text classification into classes of intent

TL;DR: In this article, an intent classification application was proposed to determine the intent of a sentence from a predefined list of intent classes by applying a classification model to the parse thicket.
Posted Content

Online Deception Detection Refueled by Real World Data Collection

TL;DR: In this article, the authors apply a data collection method based on social network analysis to quickly identify high-quality deceptive and truthful online reviews from Amazon, which contains more than 10,000 deceptive reviews and is diverse in product domains and reviewers.
Proceedings ArticleDOI

Think Outside the Dataset: Finding Fraudulent Reviews using Cross-Dataset Analysis

TL;DR: This work proposes OneReview, a method for locating fraudulent reviews, correlating data from multiple crowd-sourced review sites, and applies the created model on suspicious reviews, which detected about 62K fraudulent reviews.
Proceedings ArticleDOI

Identifying the sentiment styles of YouTube’s vloggers

TL;DR: The authors examined the continuous sentiment styles employed in 27,333 vlogs using a dynamic intra-textual approach to sentiment analysis and identified seven distinct continuous sentiment trajectories characterized by fluctuations of sentiment throughout a vlog's narrative time.
Proceedings Article

DecOp: A Multilingual and Multi-domain Corpus For Detecting Deception In Typed Text.

TL;DR: DecOp (Deceptive Opinions), a new language resource developed for automatic deception detection in cross-domain and cross-language scenarios, is introduced and the collection procedure of the DecOp corpus and his main characteristics are described.
References
More filters
Journal ArticleDOI

The measurement of observer agreement for categorical data

TL;DR: A general statistical methodology for the analysis of multivariate categorical data arising from observer reliability studies is presented and tests for interobserver bias are presented in terms of first-order marginal homogeneity and measures of interob server agreement are developed as generalized kappa-type statistics.
Book

Longman Grammar of Spoken and Written English

TL;DR: The authors compare the frequency of constructions in different contexts, from conversation to fiction to academic prose, using the 40 million-word Longman Spoken and Written English Corpus (LSEE).
Journal ArticleDOI

Generalized additive models for location, scale and shape

TL;DR: The generalized additive model for location, scale and shape (GAMLSS) as mentioned in this paper is a general class of statistical models for a univariate response variable, which assumes independent observations of the response variable y given the parameters, the explanatory variables and the values of the random effects.
Journal ArticleDOI

Nonverbal Leakage and Clues to Deception

Paul Ekman, +1 more
- 01 Feb 1969 - 
TL;DR: The study explores the interaction situation, and considers how within deception interactions differences in neuroanatomy and cultural influences combine to produce specific types of body movements and facial expressions which escape efforts to deceive and emerge as leakage or deception clues.
Journal ArticleDOI

Accuracy of Deception Judgments

TL;DR: It is proposed that people judge others' deceptions more harshly than their own and that this double standard in evaluating deceit can explain much of the accumulated literature.
Related Papers (5)