
Hope Speech Detection for Dravidian Languages Using Cross-Lingual Embeddings with Stacked Encoder Architecture.

TLDR
In this paper, a multilingual model, with main emphasis on Dravidian languages, was proposed to automatically detect hope speech, which achieved an F1-score of 0.61 and 0.85 for Tamil and Malayalam, respectively.
Abstract
The task of hope speech detection has gained traction in natural language processing owing to the need for increased positive reinforcement online during the COVID-19 pandemic. Hope speech detection focuses on identifying texts among social media comments that could invoke positive emotions in people. Students and working adults alike report experiencing significant work-induced stress, further demonstrating a need for external inspiration, which in the current scenario is mostly found online. In this paper, we propose a multilingual model, with an emphasis on Dravidian languages, to automatically detect hope speech. We employ a stacked encoder architecture that makes use of language-agnostic cross-lingual word embeddings, as the dataset consists of code-mixed YouTube comments. Additionally, we carried out an empirical analysis, testing our architecture against various traditional, transformer, and transfer learning methods. Furthermore, a k-fold paired t-test was conducted, which corroborates that our model outperforms the other approaches. Our methodology achieved F1-scores of 0.61 and 0.85 for Tamil and Malayalam, respectively, and is competitive with state-of-the-art methods. The code for our work can be found in our GitHub repository (https://github.com/arunimasundar/Hope-Speech-LT-EDI).


Citations
Journal ArticleDOI

Language-agnostic deep learning framework for automatic monitoring of population-level mental health from social networks

TL;DR: This article presents a framework for monitoring real-time mental health indicators from social media data without using labeled datasets in low-resource languages. Because of the limits of fundamental natural language processing tools and labeled corpora in countries with limited natural language resources, implementing social media systems to monitor mental health signals can be challenging.
Journal ArticleDOI

Hope speech detection in Spanish

TL;DR: The authors define hope speech as speech that can relax a hostile environment and that helps, gives suggestions, and inspires good in people in times of illness, stress, loneliness, or depression.
Posted ContentDOI

On finetuning Adapter-based Transformer models for classifying Abusive Social Media Tamil Comments

TL;DR: The authors use the abusive Tamil-language comments released by the “Tamil DravidianLangTech@ACL 2022” workshop and develop adapter-based multilingual transformer models, namely MuRIL, XLM-RoBERTa, and mBERT, to classify the abusive comments.
References
Journal Article

Scikit-learn: Machine Learning in Python

TL;DR: Scikit-learn is a Python module integrating a wide range of state-of-the-art machine learning algorithms for medium-scale supervised and unsupervised problems, focusing on bringing machine learning to non-specialists using a general-purpose high-level language.
Journal ArticleDOI

SMOTE: synthetic minority over-sampling technique

TL;DR: This article presents a method of over-sampling the minority class by creating synthetic minority-class examples, evaluated using the area under the Receiver Operating Characteristic curve (AUC) and the ROC convex hull strategy.
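The core SMOTE step summarized above is linear interpolation between a minority-class sample and one of its minority-class nearest neighbors. An illustrative sketch of just that step (not the reference implementation; real usage would go through a library such as imbalanced-learn):

```python
import numpy as np

def smote_interpolate(x, neighbor, rng):
    """Create one synthetic sample on the line segment between a minority
    sample and one of its minority-class neighbors (SMOTE's core step)."""
    gap = rng.random()  # random position along the segment, in [0, 1)
    return x + gap * (neighbor - x)

rng = np.random.default_rng(0)
x = np.array([1.0, 2.0])        # a minority-class sample
nb = np.array([3.0, 4.0])       # one of its minority-class neighbors
synth = smote_interpolate(x, nb, rng)
```

Repeating this for many sample/neighbor pairs balances the class distribution without duplicating points exactly.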
Posted Content

Attention Is All You Need

TL;DR: A new, simple network architecture, the Transformer, based solely on attention mechanisms and dispensing with recurrence and convolutions entirely, is proposed; it generalizes well to other tasks, applying successfully to English constituency parsing with both large and limited training data.
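The attention mechanism at the heart of the Transformer is scaled dot-product attention, softmax(QKᵀ/√d_k)·V. A minimal numpy sketch (illustrative only; the random Q, K, V below are placeholders):

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Compute softmax(Q K^T / sqrt(d_k)) V, the Transformer's core operation."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)
    scores -= scores.max(axis=-1, keepdims=True)    # numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)  # each row sums to 1
    return weights @ V

rng = np.random.default_rng(0)
Q = rng.standard_normal((2, 4))  # 2 queries, dimension 4
K = rng.standard_normal((3, 4))  # 3 keys
V = rng.standard_normal((3, 4))  # 3 values
out = scaled_dot_product_attention(Q, K, V)  # shape (2, 4)
```

Each output row is a convex combination of the value rows, weighted by query-key similarity.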
Posted Content

Bidirectional LSTM-CRF Models for Sequence Tagging

TL;DR: This work is the first to apply a bidirectional LSTM-CRF model to NLP benchmark sequence tagging data sets; the BI-LSTM-CRF model can efficiently use both past and future input features thanks to its bidirectional LSTM component.
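The "past and future features" idea in the summary above comes from running a recurrent encoder in both directions and concatenating the two states at each position. A toy sketch using a plain tanh recurrence rather than an LSTM (illustrative assumption, not the paper's model; weights and inputs are random placeholders):

```python
import numpy as np

def simple_rnn_pass(xs, W, reverse=False):
    """Minimal recurrent pass (a tanh RNN, not an LSTM):
    h_t = tanh(W @ [h_{t-1}; x_t])."""
    seq = xs[::-1] if reverse else xs
    h = np.zeros(W.shape[0])
    hs = []
    for x in seq:
        h = np.tanh(W @ np.concatenate([h, x]))
        hs.append(h)
    return hs[::-1] if reverse else hs  # realign to original time order

rng = np.random.default_rng(1)
xs = [rng.standard_normal(3) for _ in range(4)]  # 4 time steps, input dim 3
W = rng.standard_normal((2, 5))                  # hidden dim 2, input 2+3
fwd = simple_rnn_pass(xs, W)                     # sees the past at each step
bwd = simple_rnn_pass(xs, W, reverse=True)       # sees the future at each step
bi = [np.concatenate([f, b]) for f, b in zip(fwd, bwd)]  # both contexts
```

In the BI-LSTM-CRF model, these concatenated states would then feed a CRF layer that scores whole tag sequences.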