Benchmarking Aggression Identification in Social Media.

The Shared Task on Aggression Identification, organised as part of the First Workshop on Trolling, Aggression and Cyberbullying (TRAC-1) at COLING 2018, was to develop a classifier that could discriminate between Overtly Aggressive, Covertly Aggressive, and Non-aggressive texts.

Abstract:

In this paper, we present the report and findings of the Shared Task on Aggression Identification organised as part of the First Workshop on Trolling, Aggression and Cyberbullying (TRAC-1) at COLING 2018. The task was to develop a classifier that could discriminate between Overtly Aggressive, Covertly Aggressive, and Non-aggressive texts. For this task, the participants were provided with a dataset of 15,000 aggression-annotated Facebook posts and comments each in Hindi (in both Roman and Devanagari script) and English for training and validation. For testing, two different sets, one from Facebook and another from a different social media platform, were provided. A total of 130 teams registered to participate in the task, 30 teams submitted their test runs, and finally 20 teams also sent their system description papers, which are included in the TRAC workshop proceedings. The best system obtained a weighted F-score of 0.64 for both Hindi and English on the Facebook test sets, while the best scores on the surprise set were 0.60 and 0.50 for English and Hindi respectively. The results presented in this report show how challenging the task is. The positive response from the community and the high level of participation in the first edition of this shared task also highlight the interest in this topic.
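The weighted F-score reported above can be illustrated with a minimal sketch. It assumes the common definition (per-class F1 averaged with weights proportional to each class's share of the gold labels); the abstract does not spell out the exact formula the organisers used. The label names OAG, CAG, and NAG are shorthand chosen here for the three classes, and the sample data is invented for illustration.

```python
from collections import Counter

# Assumed shorthand for the three classes named in the abstract:
# Overtly Aggressive, Covertly Aggressive, Non-aggressive.
LABELS = ("OAG", "CAG", "NAG")


def weighted_f1(gold, predicted, labels=LABELS):
    """Support-weighted average of per-class F1 (a common definition,
    assumed here rather than taken from the shared task's scorer)."""
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")
    if not gold:
        raise ValueError("at least one example is required")

    support = Counter(gold)  # number of gold examples per class
    pairs = list(zip(gold, predicted))
    score = 0.0
    for label in labels:
        tp = sum(1 for g, p in pairs if g == label and p == label)
        fp = sum(1 for g, p in pairs if g != label and p == label)
        fn = sum(1 for g, p in pairs if g == label and p != label)
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        if precision + recall:
            f1 = 2 * precision * recall / (precision + recall)
        else:
            f1 = 0.0
        # Weight each class by its share of the gold labels.
        score += f1 * support[label] / len(gold)
    return score


# Invented toy data, for illustration only.
gold = ["OAG", "OAG", "CAG", "NAG", "NAG", "NAG"]
pred = ["OAG", "CAG", "CAG", "NAG", "NAG", "OAG"]
print(round(weighted_f1(gold, pred), 4))
```

Because each class is weighted by its support, a frequent class such as Non-aggressive contributes more to the final score than a rare one, which is worth keeping in mind when comparing systems on imbalanced test sets.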

Citations


Proceedings Article

Predicting the Type and Target of Offensive Posts in Social Media

Marcos Zampieri, Shervin Malmasi, Preslav Nakov, Sara Rosenthal, Noura Farra, Ritesh Kumar (University of Wolverhampton, Brigham and Women's Hospital, Sofia University, Columbia University, Indian Institutes of Technology)

TL;DR: The Offensive Language Identification Dataset (OLID), a new dataset with tweets annotated for offensive content using a fine-grained three-layer annotation scheme, is compiled and made publicly available.

Proceedings Article

SemEval-2019 Task 6: Identifying and Categorizing Offensive Language in Social Media (OffensEval).

Marcos Zampieri, Shervin Malmasi, Preslav Nakov, Sara Rosenthal, Noura Farra, Ritesh Kumar (University of Wolverhampton, Brigham and Women's Hospital, Qatar Computing Research Institute, Columbia University, Indian Institutes of Technology)

TL;DR: SemEval-2019 Task 6 on Identifying and Categorizing Offensive Language in Social Media (OffensEval) was based on a new dataset, the Offensive Language Identification Dataset (OLID), which contains over 14,000 English tweets, and featured three sub-tasks.

Journal Article

Hate speech detection: Challenges and solutions.

Sean MacAvaney, Hao-Ren Yao, Eugene Yang, Katina Russell, Nazli Goharian, Ophir Frieder (Georgetown University)

20 Aug 2019, PLOS ONE

TL;DR: This work identifies and examines challenges faced by online automatic approaches for hate speech detection in text, and proposes a multi-view SVM approach that achieves near state-of-the-art performance, while being simpler and producing more easily interpretable decisions than neural methods.

Proceedings Article

SemEval-2020 Task 12: Multilingual Offensive Language Identification in Social Media (OffensEval 2020)

Marcos Zampieri, Preslav Nakov, Sara Rosenthal, Pepa Atanasova, Georgi Karadzhov, Hamdy Mubarak, Leon Derczynski, Zeses Pitenis, Çağrı Çöltekin (Rochester Institute of Technology, Qatar Computing Research Institute, IBM, University of Copenhagen, Massachusetts Institute of Technology, IT University of Copenhagen, University of Wolverhampton, University of Tübingen)

TL;DR: SemEval-2020 Task 12 on Multilingual Offensive Language Identification in Social Media (OffensEval 2020) included three subtasks corresponding to the hierarchical taxonomy of the OLID schema, and was offered in five languages: Arabic, Danish, English, Greek, and Turkish.

Journal Article

Resources and benchmark corpora for hate speech detection: a systematic review

Fabio Poletto, Valerio Basile, Manuela Sanguinetti, Cristina Bosco, Viviana Patti (University of Turin)

TL;DR: This review systematically analyzes the resources made available by the community at large, including their development methodology, topical focus, language coverage, and other factors, to highlight a heterogeneous, growing landscape.


References


Journal Article

Bullying, cyberbullying, and suicide

Sameer Hinduja, Justin W. Patchin (Florida Atlantic University, University of Wisconsin–Eau Claire)

22 Jul 2010, Archives of Suicide Research

TL;DR: Examining the extent to which a nontraditional form of peer aggression—cyberbullying—is also related to suicidal ideation among adolescents suggests that a suicide prevention and intervention component is essential within comprehensive bullying response programs implemented in schools.

Proceedings Article

Automated Hate Speech Detection and the Problem of Offensive Language

Thomas Davidson, Dana Warmsley, Michael W. Macy, Ingmar Weber (Cornell University, Khalifa University)

TL;DR: This work uses a crowd-sourced hate speech lexicon to collect tweets containing hate speech keywords and labels a sample of these tweets into three categories: those containing hate speech, those containing only offensive language, and those with neither.

Proceedings Article

Hateful Symbols or Hateful People? Predictive Features for Hate Speech Detection on Twitter

Zeerak Waseem, Dirk Hovy (University of Copenhagen)

TL;DR: A list of criteria founded in critical race theory is provided and used to annotate a publicly available corpus of more than 16k tweets, and a dictionary based on the most indicative words in the data is presented.

Proceedings Article

A Survey on Hate Speech Detection using Natural Language Processing

Anna Schmidt, Michael Wiegand

TL;DR: A survey on hate speech detection describes key areas that have been explored to automatically recognize these types of utterances using natural language processing and discusses limits of those approaches.

Proceedings Article

Abusive Language Detection in Online User Content

Chikashi Nobata, Joel Tetreault, Achint Oommen Thomas, Yashar Mehdad, Yi Chang (Yahoo!)

TL;DR: This work presents a machine-learning-based method to detect hate speech in online user comments from two domains, which outperforms a state-of-the-art deep learning approach, and a corpus of user comments annotated for abusive language, the first of its kind.

