Open Access · Posted Content
The Hateful Memes Challenge: Detecting Hate Speech in Multimodal Memes
Douwe Kiela, Hamed Firooz, Aravind Mohan, Vedanuj Goswami, Amanpreet Singh, Pratik Ringshia, Davide Testuggine
TL;DR: The authors propose a new challenge set for multimodal classification, focusing on detecting hate speech in multimodal memes, where difficult examples are added to the dataset to make it hard to rely on unimodal signals.
Abstract:
This work proposes a new challenge set for multimodal classification, focusing on detecting hate speech in multimodal memes. It is constructed such that unimodal models struggle and only multimodal models can succeed: difficult examples ("benign confounders") are added to the dataset to make it hard to rely on unimodal signals. The task requires subtle reasoning, yet is straightforward to evaluate as a binary classification problem. We provide baseline performance numbers for unimodal models, as well as for multimodal models with various degrees of sophistication. We find that state-of-the-art methods perform poorly compared to humans (64.73% vs. 84.7% accuracy), illustrating the difficulty of the task and highlighting the challenge that this important problem poses to the community.
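The effect of benign confounders can be sketched with a hypothetical toy example (all strings, image tags, and both "models" below are invented for illustration and are not from the dataset): the same text appears with two different images and opposite labels, so a text-only classifier is capped at 50% accuracy on these pairs, while a classifier that also sees the image is not.

```python
# Toy illustration of "benign confounders": the same text appears with
# different images and opposite labels, so a text-only model cannot
# separate the pair. All examples here are hypothetical.

examples = [
    # (text, image_tag, label)  where label 1 = hateful, 0 = benign
    ("look how many people love you", "desert", 1),
    ("look how many people love you", "crowd", 0),   # benign confounder
    ("smells great today", "skunk", 1),
    ("smells great today", "roses", 0),              # benign confounder
]

def text_only(text):
    # A unimodal model must give one answer per text string, so it can
    # get at most one of each confounder pair right.
    return 1

def multimodal(text, image_tag):
    # A multimodal model can condition on the image as well.
    negative_images = {"desert", "skunk"}
    return 1 if image_tag in negative_images else 0

def accuracy(predict, use_image):
    correct = sum(
        (predict(t, img) if use_image else predict(t)) == y
        for t, img, y in examples
    )
    return correct / len(examples)

print(accuracy(text_only, use_image=False))   # 0.5
print(accuracy(multimodal, use_image=True))   # 1.0
```

On each confounder pair, any function of the text alone must output the same label twice, so unimodal accuracy is capped at 50% there; this is the construction that forces models to use both modalities.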
Citations
Posted Content
Detecting Hate Speech in Memes Using Multimodal Deep Learning Approaches: Prize-winning solution to Hateful Memes Challenge.
Riza Velioglu, Jewgeni Rose
TL;DR: This article used VisualBERT to detect hate speech in multimodal memes and achieved an accuracy of 0.765 on the challenge test set and placed third out of 3,173 participants.
Journal ArticleDOI
AOMD: An analogy-aware approach to offensive meme detection on social media
TL;DR: Zhang et al. developed a deep-learning-based Analogy-aware Offensive Meme Detection (AOMD) framework to learn the implicit analogy from the multi-modal contents of a meme and effectively detect offensive analogy memes.
Journal ArticleDOI
Combating the hate speech in Thai textual memes
TL;DR: Thai textual meme detection is introduced as a new research problem in Thai natural language processing (Thai-NLP), linking scene text localization, Thai optical character recognition (Thai-OCR), and language understanding.
Posted Content
Unsupervised Vision-and-Language Pre-training Without Parallel Images and Captions
TL;DR: This paper proposed mask-and-predict pre-training on text-only and image-only corpora, and introduced object tags detected by an object recognition model as anchor points to bridge the two modalities.
Posted Content
A Survey on Multimodal Disinformation Detection
Firoj Alam, Stefano Cresci, Tanmoy Chakraborty, Fabrizio Silvestri, Dimiter Dimitrov, Giovanni Da San Martino, Shaden Shaar, Hamed Firooz, Preslav Nakov
TL;DR: This survey covers the state-of-the-art in multimodal disinformation detection across various combinations of modalities: text, images, audio, video, network structure, and temporal information.
References
Proceedings ArticleDOI
Deep Residual Learning for Image Recognition
TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, winning 1st place in the ILSVRC 2015 classification task.
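The residual idea in the entry above can be sketched in a toy one-dimensional form (an illustration only, not the paper's convolutional architecture): each block adds a learned residual F(x) to an identity shortcut, so a zero-initialized deep stack starts out as exactly the identity function, which is what makes very deep networks easier to optimize.

```python
# Toy 1-D sketch of residual learning: each block computes x + F(x).
# With zero residual weights, even a 50-block stack is the identity.

def residual_block(x, weight):
    f = weight * x   # stand-in for the learned residual mapping F(x)
    return x + f     # identity shortcut connection

def deep_stack(x, weights):
    for w in weights:
        x = residual_block(x, w)
    return x

print(deep_stack(3.0, [0.0] * 50))  # zero residuals -> identity: 3.0
```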
Proceedings Article
Adam: A Method for Stochastic Optimization
Diederik P. Kingma, Jimmy Ba
TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
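The adaptive estimates of lower-order moments described above can be sketched as a single-parameter Adam loop. The β and ε values below follow the paper's defaults; the learning rate, step count, and the quadratic objective are illustrative choices, not from the paper.

```python
import math

# Minimal single-parameter Adam sketch (paper defaults for beta1,
# beta2, eps; lr, steps, and the toy objective are illustrative).
def adam_minimize(grad, x, steps=500, lr=0.1,
                  beta1=0.9, beta2=0.999, eps=1e-8):
    m = v = 0.0  # first- and second-moment estimates
    for t in range(1, steps + 1):
        g = grad(x)
        m = beta1 * m + (1 - beta1) * g       # biased first moment
        v = beta2 * v + (1 - beta2) * g * g   # biased second moment
        m_hat = m / (1 - beta1 ** t)          # bias-corrected moments
        v_hat = v / (1 - beta2 ** t)
        x -= lr * m_hat / (math.sqrt(v_hat) + eps)
    return x

# Minimizing f(x) = x^2, whose gradient is 2x; Adam drives x toward 0.
x_min = adam_minimize(lambda x: 2 * x, x=5.0)
print(x_min)
```

Because the update is scaled by the square root of the second-moment estimate, the effective step size is roughly bounded by the learning rate regardless of the gradient's magnitude, which is the property the TL;DR's "adaptive estimates of lower-order moments" refers to.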
Book ChapterDOI
Microsoft COCO: Common Objects in Context
Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, C. Lawrence Zitnick
TL;DR: This work introduces a new dataset aimed at advancing the state-of-the-art in object recognition by placing it in the broader context of scene understanding, gathering images of complex everyday scenes containing common objects in their natural context.
Proceedings ArticleDOI
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
TL;DR: BERT as mentioned in this paper pre-trains deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers, which can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.