Open Access Journal Article

Crowdsourcing a word–emotion association lexicon

Saif M. Mohammad, +1 more
- Vol. 29, Iss: 3, pp 436-465
TLDR
It is shown how the combined strength and wisdom of the crowds can be used to generate a large, high‐quality, word–emotion and word–polarity association lexicon quickly and inexpensively.
Abstract
Even though considerable attention has been given to the polarity of words (positive and negative) and the creation of large polarity lexicons, research in emotion analysis has had to rely on limited and small emotion lexicons. In this paper, we show how the combined strength and wisdom of the crowds can be used to generate a large, high-quality, word–emotion and word–polarity association lexicon quickly and inexpensively. We enumerate the challenges in emotion annotation in a crowdsourcing scenario and propose solutions to address them. Most notably, in addition to questions about emotions associated with terms, we show how the inclusion of a word choice question can discourage malicious data entry, help to identify instances where the annotator may not be familiar with the target term (allowing us to reject such annotations), and help to obtain annotations at sense level (rather than at word level). We conducted experiments on how to formulate the emotion-annotation questions, and show that asking if a term is associated with an emotion leads to markedly higher interannotator agreement than that obtained by asking if a term evokes an emotion.
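The role of the word choice question can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the `Annotation` fields, the sample rows, and the simple majority-vote aggregation are all assumptions made for illustration. The sketch discards any annotation whose word choice answer does not match the expected near-synonym, then takes a majority vote over the remaining answers to the association question.

```python
from collections import Counter, defaultdict
from dataclasses import dataclass


@dataclass
class Annotation:
    """One crowd response for a target term (hypothetical schema)."""
    annotator: str
    term: str
    chosen_word: str    # annotator's answer to the word choice question
    correct_word: str   # expected near-synonym for the intended sense
    associated: bool    # "Is the term associated with the emotion?"


def filter_by_word_choice(annotations):
    """Keep only annotations whose word choice answer is correct.

    A wrong answer suggests random entry or unfamiliarity with the
    term, so the whole annotation is rejected.
    """
    return [a for a in annotations if a.chosen_word == a.correct_word]


def majority_association(annotations):
    """Map each term to True if a strict majority marked it associated.

    Ties count as not associated (an assumption of this sketch).
    """
    votes = defaultdict(Counter)
    for a in annotations:
        votes[a.term][a.associated] += 1
    return {term: c[True] > c[False] for term, c in votes.items()}


# Made-up responses for the term "startle" and the emotion "surprise".
SAMPLE = [
    Annotation("a1", "startle", "shake", "shake", True),
    Annotation("a2", "startle", "shake", "shake", True),
    Annotation("a3", "startle", "automobile", "shake", False),  # rejected
    Annotation("a4", "startle", "shake", "shake", False),
]

if __name__ == "__main__":
    kept = filter_by_word_choice(SAMPLE)
    print(len(kept), majority_association(kept))
```

Because the expected near-synonym is tied to one sense of the target term, the same check also pins the retained annotations to that sense rather than to the word in general.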


Citations
Journal Article

The spread of true and false news online

TL;DR: A large-scale analysis of tweets reveals that false rumors spread further and faster than the truth, and false news was more novel than true news, which suggests that people were more likely to share novel information.
Proceedings Article

SemEval-2018 Task 1: Affect in Tweets

TL;DR: This work presents the SemEval-2018 Task 1: Affect in Tweets, which includes an array of subtasks on inferring the affectual state of a person from their tweet, with a focus on the techniques and resources that are particularly useful.
Journal Article

Deep Learning-Based Document Modeling for Personality Detection from Text

TL;DR: This article presents a deep learning-based method for determining an author's personality type from text: given a text, the presence or absence of each of the Big Five traits in the author's psychological profile is detected. The implementation is freely available for research purposes.
Proceedings Article

Emotional Tweets

TL;DR: This paper describes how a Twitter emotion corpus is created from Twitter posts using emotion-word hashtags, and extracts a word-emotion association lexicon that leads to significantly better results than the manually crafted WordNet Affect lexicon in an emotion classification task.
Proceedings Article

Obtaining Reliable Human Ratings of Valence, Arousal, and Dominance for 20,000 English Words

TL;DR: The NRC VAD Lexicon is presented, which has human ratings of valence, arousal, and dominance for more than 20,000 English words and it is shown that the ratings obtained are vastly more reliable than those in existing lexicons.
References
Journal Article

The measurement of observer agreement for categorical data

TL;DR: A general statistical methodology for the analysis of multivariate categorical data arising from observer reliability studies is presented. Tests for interobserver bias are given in terms of first-order marginal homogeneity, and measures of interobserver agreement are developed as generalized kappa-type statistics.
Journal Article

A Coefficient of Agreement for Nominal Scales

TL;DR: This article presents a procedure for having two or more judges independently categorize a sample of units and for determining the degree, significance, and sampling stability of their agreement, motivated by the need to establish the extent to which such judgments are reproducible, i.e., reliable.
Book

The Expression of the Emotions in Man and Animals

TL;DR: The Expression of the Emotions in Man and Animals Introduction to the First Edition and Discussion Index, by Phillip Prodger and Paul Ekman.
Book

Opinion Mining and Sentiment Analysis

TL;DR: This survey covers techniques and approaches that promise to directly enable opinion-oriented information-seeking systems and focuses on methods that seek to address the new challenges raised by sentiment-aware applications, as compared to those that are already present in more traditional fact-based analysis.