Detecting Offensive Language in Social Media to Protect Adolescent Online Safety

This work proposes the Lexical Syntactic Feature (LSF) architecture to detect offensive content and identify potential offensive users in social media, and incorporates a user's writing style, structure and specific cyber bullying content as features to predict the user's potentiality to send out offensive content.

Abstract:

Since the textual contents on online social media are highly unstructured, informal, and often misspelled, existing research on message-level offensive language detection cannot accurately detect offensive content. Meanwhile, user-level offensiveness detection seems a more feasible approach but it is an under researched area. To bridge this gap, we propose the Lexical Syntactic Feature (LSF) architecture to detect offensive content and identify potential offensive users in social media. We distinguish the contribution of pejoratives/profanities and obscenities in determining offensive content, and introduce hand-authoring syntactic rules in identifying name-calling harassments. In particular, we incorporate a user's writing style, structure and specific cyber bullying content as features to predict the user's potentiality to send out offensive content. Results from experiments showed that our LSF framework performed significantly better than existing methods in offensive content detection. It achieves precision of 98.24% and recall of 94.34% in sentence offensive detection, as well as precision of 77.9% and recall of 77.8% in user offensive detection. Meanwhile, the processing speed of LSF is approximately 10msec per sentence, suggesting the potential for effective deployment in social media.

Content maybe subject to copyright Report

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

Hateful Symbols or Hateful People? Predictive Features for Hate Speech Detection on Twitter

Zeerak Waseem,Dirk Hovy +1 moreUniversity of Copenhagen

Show Less

TL;DR: A list of criteria founded in critical race theory is provided, and these are used to annotate a publicly available corpus of more than 16k tweets and present a dictionary based the most indicative words in the data.

...read moreread less

Proceedings ArticleDOI

A Survey on Hate Speech Detection using Natural Language Processing

Anna Schmidt,Michael Wiegand +1 more

Show Less

TL;DR: A survey on hate speech detection describes key areas that have been explored to automatically recognize these types of utterances using natural language processing and discusses limits of those approaches.

...read moreread less

Proceedings ArticleDOI

Abusive Language Detection in Online User Content

Chikashi Nobata,Joel Tetreault,Achint Oommen Thomas,Yashar Mehdad,Yi Chang +4 moreYahoo!

Show Less

TL;DR: A machine learning based method to detect hate speech on online user comments from two domains which outperforms a state-of-the-art deep learning approach and a corpus of user comments annotated for abusive language, the first of its kind.

...read moreread less

Journal ArticleDOI

A Survey on Automatic Detection of Hate Speech in Text

Paula Fortuna,Sérgio Nunes +1 moreUniversity of Porto

- 31 Jul 2018 -

ACM Computing Surveys

Show Less

TL;DR: This survey organizes and describes the current state of the field, providing a structured overview of previous approaches, including core algorithms, methods, and main features used, and provides a unifying definition of hate speech.

...read moreread less

Journal ArticleDOI

Cyber Hate Speech on Twitter: An Application of Machine Classification and Statistical Modeling for Policy and Decision Making

Peter Burnap,Matthew Leighton Williams +1 more

- 01 Jun 2015 -

Policy & Internet

Show Less

TL;DR: It is demonstrated how the results of the classifier can be robustly utilized in a statistical model used to forecast the likely spread of cyber hate in a sample of Twitter data.

...read moreread less

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112

Collapse

References

PDF

Open Access

More filters

Thumbs up? Sentiment Classiflcation using Machine Learning Techniques

Bo Pang,Lillian Lee,Shivakumar Vaithyanathan +2 more

Show Less

TL;DR: In this paper, the problem of classifying documents not by topic, but by overall sentiment, e.g., determining whether a review is positive or negative, was considered and three machine learning methods (Naive Bayes, maximum entropy classiflcation, and support vector machines) were employed.

...read moreread less

Proceedings ArticleDOI

Thumbs up? Sentiment Classification using Machine Learning Techniques

Bo Pang,Lillian Lee,Shivakumar Vaithyanathan +2 moreCornell University,IBM

Show Less

TL;DR: This work considers the problem of classifying documents not by topic, but by overall sentiment, e.g., determining whether a review is positive or negative, and concludes by examining factors that make the sentiment classification problem more challenging.

...read moreread less

Posted Content

Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews

Peter D. TurneyNational Research Council

- 11 Dec 2002 -

arXiv: Learning

Show Less

TL;DR: A simple unsupervised learning algorithm for classifying reviews as recommended (thumbs up) or not recommended (Thumbs down) if the average semantic orientation of its phrases is positive.

...read moreread less

Proceedings Article

Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews

Peter,Turney +1 more

Show Less

TL;DR: This article proposed an unsupervised learning algorithm for classifying reviews as recommended (thumbs up) or not recommended(thumbs down) based on the average semantic orientation of phrases in the review that contain adjectives or adverbs.

...read moreread less

Proceedings Article

Generating Typed Dependency Parses from Phrase Structure Parses

Marie-Catherine de Marneffe,Bill MacCartney,Christopher D. Manning +2 moreStanford University

Show Less

TL;DR: A system for extracting typed dependency parses of English sentences from phrase structure parses that captures inherent relations occurring in corpus texts that can be critical in real-world applications is described.

...read moreread less

1
2
3
4
…
5
6
7

Collapse

SciSpace

About Careers Resources Support Browse Papers Pricing SciSpace Affiliate Program Cancellation & Refund Policy Terms Privacy

Tools

Citation generator AI Detector Paraphraser Citation Booster

Extensions

SciSpace

Directories

Papers Topics Journals Authors Conferences Institutions Questions Citation Styles

Contact

support@typeset.io +91 8431021544

Detecting Offensive Language in Social Media to Protect Adolescent Online Safety

Citations

Hateful Symbols or Hateful People? Predictive Features for Hate Speech Detection on Twitter

A Survey on Hate Speech Detection using Natural Language Processing

Abusive Language Detection in Online User Content

A Survey on Automatic Detection of Hate Speech in Text

Cyber Hate Speech on Twitter: An Application of Machine Classification and Statistical Modeling for Policy and Decision Making

References

Thumbs up? Sentiment Classiflcation using Machine Learning Techniques

Thumbs up? Sentiment Classification using Machine Learning Techniques

Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews

Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews

Generating Typed Dependency Parses from Phrase Structure Parses

Related Papers (5)

Abusive Language Detection in Online User Content

Hate Speech Detection with Comment Embeddings

Hateful Symbols or Hateful People? Predictive Features for Hate Speech Detection on Twitter

Detecting Hate Speech on the World Wide Web

A Survey on Hate Speech Detection using Natural Language Processing