scispace - formally typeset
Open AccessJournal ArticleDOI

A Pattern-Based Approach for Sarcasm Detection on Twitter

Mondher Bouazizi, +1 more
- 24 Aug 2016 - 
- Vol. 4, pp 5477-5488
TLDR
This paper proposes a pattern-based approach to detect sarcasm on Twitter and proposes four sets of features that cover the different types of sarcasm, which are used to classify tweets as sarcastic and non-sarcastic.
Abstract
Sarcasm is a sophisticated form of irony widely used in social networks and microblogging websites. It is usually used to convey implicit information within the message a person transmits. Sarcasm might be used for different purposes, such as criticism or mockery. However, it is hard even for humans to recognize. Therefore, recognizing sarcastic statements can be very useful to improve automatic sentiment analysis of data collected from microblogging websites or social networks. Sentiment Analysis refers to the identification and aggregation of attitudes and opinions expressed by Internet users toward a specific topic. In this paper, we propose a pattern-based approach to detect sarcasm on Twitter. We propose four sets of features that cover the different types of sarcasm we defined. We use those to classify tweets as sarcastic and non-sarcastic. Our proposed approach reaches an accuracy of 83.1% with a precision equal to 91.1%. We also study the importance of each of the proposed sets of features and evaluate its added value to the classification. In particular, we emphasize the importance of pattern-based features for the detection of sarcastic statements.

read more

Citations
More filters
Journal ArticleDOI

Hate Speech on Twitter: A Pragmatic Approach to Collect Hateful and Offensive Expressions and Perform Hate Speech Detection

TL;DR: This paper proposes an approach to detect hate expressions on Twitter based on unigrams and patterns that are automatically collected from the training set and used, among others, as features to train a machine learning algorithm.
Journal ArticleDOI

A Term Weighted Neural Language Model and Stacked Bidirectional LSTM Based Framework for Sarcasm Identification

TL;DR: In this paper, an effective sarcasm identification framework on social media data by pursuing the paradigms of neural language models and deep neural networks is presented. But sarcasm detection on text documents is one of the most challenging tasks in NLP.
Journal ArticleDOI

Sarcasm Detection Using Soft Attention-Based Bidirectional Long Short-Term Memory Model With Convolution Network

TL;DR: A deep learning model called sAtt-BLSTM convNet that is based on the hybrid of soft attention-based bidirectional long short-term memory (sAtt- BLSTM) and convolution neural network (convNet) applying global vectors for word representation (GLoVe) for building semantic word embeddings is proposed.
Journal ArticleDOI

Systematic literature review of sentiment analysis on Twitter using soft computing techniques

TL;DR: This work presents a systematic literature review to collate, explore, understand, understand and analyze the efforts and trends in a well‐structured manner to identify research gaps defining the future prospects of this coupling of soft computing techniques for sentiment analysis on Twitter.
Journal ArticleDOI

Approaches to Cross-Domain Sentiment Analysis: A Systematic Literature Review

TL;DR: A comprehensive systematic literature review of the methods and techniques employed in a cross-domain sentiment analysis during the period of 2010–2016 to assist researchers in developing new and more accurate techniques in the future.
References
More filters
Journal ArticleDOI

Random Forests

TL;DR: Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the forest, and are also applicable to regression.
Journal ArticleDOI

LIBSVM: A library for support vector machines

TL;DR: Issues such as solving SVM optimization problems theoretical convergence multiclass classification probability estimates and parameter selection are discussed in detail.
Journal ArticleDOI

The WEKA data mining software: an update

TL;DR: This paper provides an introduction to the WEKA workbench, reviews the history of the project, and, in light of the recent 3.6 stable release, briefly discusses what has been added since the last stable version (Weka 3.4) released in 2003.

Thumbs up? Sentiment Classiflcation using Machine Learning Techniques

TL;DR: In this paper, the problem of classifying documents not by topic, but by overall sentiment, e.g., determining whether a review is positive or negative, was considered and three machine learning methods (Naive Bayes, maximum entropy classiflcation, and support vector machines) were employed.
Proceedings ArticleDOI

Thumbs up? Sentiment Classification using Machine Learning Techniques

TL;DR: This work considers the problem of classifying documents not by topic, but by overall sentiment, e.g., determining whether a review is positive or negative, and concludes by examining factors that make the sentiment classification problem more challenging.
Related Papers (5)