Hate speech detection: Challenges and solutions.

doi:10.1371/JOURNAL.PONE.0221152

Open AccessJournal ArticleDOI

Hate speech detection: Challenges and solutions.

Sean MacAvaney, +5 more

- 20 Aug 2019 -

PLOS ONE

- Vol. 14, Iss: 8

Chats0

TLDR

This work identifies and examines challenges faced by online automatic approaches for hate speech detection in text, and proposes a multi-view SVM approach that achieves near state-of-the-art performance, while being simpler and producing more easily interpretable decisions than neural methods.

Abstract:

As online content continues to grow, so does the spread of hate speech. We identify and examine challenges faced by online automatic approaches for hate speech detection in text. Among these difficulties are subtleties in language, differing definitions on what constitutes hate speech, and limitations of data availability for training and testing of these systems. Furthermore, many recent approaches suffer from an interpretability problem-that is, it can be difficult to understand why the systems make the decisions that they do. We propose a multi-view SVM approach that achieves near state-of-the-art performance, while being simpler and producing more easily interpretable decisions than neural methods. We also discuss both technical and practical challenges that remain for this task.

Citations

PDF

Open Access

More filters

Book ChapterDOI

What about Hate Speech

Kevin W. Saunders

TL;DR: In echten (analogen) Leben sind wir eher selten offenen Beleidigungen oder Hass ausgesetzt.

...read moreread less

Journal ArticleDOI

Developing an online hate classifier for multiple social media platforms

Joni Salminen, +6 more

- 01 Jan 2020 -

Human-centric Computing and Information ...

TL;DR: While all the models significantly outperform the keyword-based baseline classifier, XGBoost using all features performs the best and feature importance analysis indicates that BERT features are the most impactful for the predictions.

...read moreread less

Journal ArticleDOI

Detecting weak and strong Islamophobic hate speech on social media

Bertie Vidgen, +1 more

- 02 Jan 2020 -

Journal of Information Technology & Poli...

TL;DR: Islamophobic hate speech on social media is a growing concern in contemporary Western politics and society as discussed by the authors, and it can inflict considerable harm on any victims who are targeted, create a sense of fear and cause considerable harm.

...read moreread less

Journal ArticleDOI

A deep neural network based multi-task learning approach to hate speech detection

Prashant Kapil, +1 more

- 27 Dec 2020 -

Knowledge Based Systems

TL;DR: A deep multi-task learning (MTL) framework is proposed to leverage useful information from multiple related classification tasks in order to improve the performance of the individual task.

...read moreread less

Posted Content

Contextualizing Hate Speech Classifiers with Post-hoc Explanation

Brendan F. Kennedy, +4 more

- 05 May 2020 -

arXiv: Computation and Language

TL;DR: This work extracts post-hoc explanations from fine-tuned BERT classifiers to detect bias towards identity terms and proposes a novel regularization technique based on these explanations that encourages models to learn from the context of group identifiers in addition to the identifiers themselves.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Posted Content

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Jacob Devlin, +3 more

- 11 Oct 2018 -

arXiv: Computation and Language

TL;DR: A new language representation model, BERT, designed to pre-train deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers, which can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.

...read moreread less

Proceedings Article

Distributed Representations of Words and Phrases and their Compositionality

Tomas Mikolov, +4 more

TL;DR: This paper presents a simple method for finding phrases in text, and shows that learning good vector representations for millions of phrases is possible and describes a simple alternative to the hierarchical softmax called negative sampling.

...read moreread less

Posted Content

Distributed Representations of Words and Phrases and their Compositionality

Tomas Mikolov, +4 more

- 16 Oct 2013 -

arXiv: Computation and Language

TL;DR: In this paper, the Skip-gram model is used to learn high-quality distributed vector representations that capture a large number of precise syntactic and semantic word relationships and improve both the quality of the vectors and the training speed.

...read moreread less

Collapse

ACM Computing Surveys

Hate speech detection: Challenges and solutions.

Citations

What about Hate Speech

Developing an online hate classifier for multiple social media platforms

Detecting weak and strong Islamophobic hate speech on social media

A deep neural network based multi-task learning approach to hate speech detection

Contextualizing Hate Speech Classifiers with Post-hoc Explanation

References

Scikit-learn: Machine Learning in Python

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Scikit-learn: Machine Learning in Python

Distributed Representations of Words and Phrases and their Compositionality

Distributed Representations of Words and Phrases and their Compositionality

Related Papers (5)

Hateful Symbols or Hateful People? Predictive Features for Hate Speech Detection on Twitter

A Survey on Hate Speech Detection using Natural Language Processing

Automated Hate Speech Detection and the Problem of Offensive Language

Deep Learning for Hate Speech Detection in Tweets

A Survey on Automatic Detection of Hate Speech in Text