Surfacing contextual hate speech words within social media
Citations
46 citations
24 citations
22 citations
Cites background from "Surfacing contextual hate speech wo..."
..., codewords (Taylor et al., 2017), novel forms of offense (Jain et al....
[...]
...…approaches (Waseem and Hovy, 2016; Davidson et al., 2017) and thus are ineffective at detecting forms of veiled toxicity; e.g., codewords (Taylor et al., 2017), novel forms of offense (Jain et al., 2018), and subtle and often unintentional manifestations of social bias such as…...
[...]
2 citations
2 citations
References
24,012 citations
"Surfacing contextual hate speech wo..." refers methods in this paper
...Topical Context is the context used by word embedding approaches like word2vec [9], that utilize a bag-of-words in an effort to rank words by their domain similarity....
[...]
14,400 citations
"Surfacing contextual hate speech wo..." refers methods in this paper
...• Using graph expansion and PageRank scores to bootstrap our initial HS seed words....
[...]
...To further reduce the search space we use PageRank [13] to rank out-of-dictionary words in a graph where some of the vertices are known hate speech keywords....
[...]
...Concisely, this boosting is done to set known hate speech words as the important “pages" that pass on their weight during the PageRank computation....
[...]
...For the PageRank scores we set d = 0.85 as it is the standard rate of decay used for the algorithm....
[...]
...Using cosine similarity scores alone as the edge weight would not allow us to model the idea that hate speech words are the important “pages" in the graph, the key concept behind PageRank....
[...]
11,343 citations
7,537 citations
2,425 citations