Author

Jamin Shin

Bio: Jamin Shin is an academic researcher from the Hong Kong University of Science and Technology. The author has contributed to research in topics including computer science and named-entity recognition, has an h-index of 10, and has co-authored 21 publications receiving 528 citations.

Papers
Proceedings ArticleDOI
22 Aug 2018
TL;DR: This paper applies three mitigation methods (debiased word embeddings, gender swap data augmentation, and fine-tuning with a larger corpus) to abusive language detection models, reducing model bias by 90-98%; the methods can be extended to correct model bias in other scenarios.
Abstract: Abusive language detection models tend to be biased toward identity words of certain groups of people because of imbalanced training datasets. For example, a model trained on an existing dataset labeled “You are a good woman” as “sexist.” Such model bias is an obstacle for models to be robust enough for practical use. In this work, we measure these biases on models trained with different datasets, while analyzing the effect of different pre-trained word embeddings and model architectures. We also experiment with three mitigation methods: (1) debiased word embeddings, (2) gender swap data augmentation, and (3) fine-tuning with a larger corpus. These methods can effectively reduce model bias by 90-98% and can be extended to correct model bias in other scenarios.
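To make the second mitigation method concrete, here is a minimal Python sketch of gender swap data augmentation, assuming a small hand-written word-pair lexicon; the actual lexicon, tokenization, and training pipeline in the paper may differ.

```python
# Minimal sketch of gender swap data augmentation for an abusive-language
# training set. The word-pair lexicon is illustrative, not the paper's.
GENDER_PAIRS = {
    "he": "she", "she": "he",
    "him": "her", "her": "him",
    "man": "woman", "woman": "man",
    "boy": "girl", "girl": "boy",
}

def gender_swap(tokens):
    """Return a copy of the token list with gendered identity words swapped."""
    return [GENDER_PAIRS.get(tok.lower(), tok) for tok in tokens]

def augment(dataset):
    """Double the dataset by adding a gender-swapped copy of every example."""
    augmented = list(dataset)
    for tokens, label in dataset:
        augmented.append((gender_swap(tokens), label))
    return augmented

if __name__ == "__main__":
    data = [(["You", "are", "a", "good", "woman"], "not_sexist")]
    print(augment(data))
```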

214 citations

Proceedings ArticleDOI
01 Nov 2019
TL;DR: A novel end-to-end approach for modeling empathy in dialogue systems, Mixture of Empathetic Listeners (MoEL), which outperforms a multitask training baseline in terms of empathy, relevance, and fluency.
Abstract: Previous research on empathetic dialogue systems has mostly focused on generating responses given certain emotions. However, being empathetic not only requires the ability to generate emotional responses, but more importantly, requires understanding user emotions and replying appropriately. In this paper, we propose a novel end-to-end approach for modeling empathy in dialogue systems: Mixture of Empathetic Listeners (MoEL). Our model first captures the user emotions and outputs an emotion distribution. Based on this, MoEL softly combines the output states of the appropriate Listener(s), which are each optimized to react to certain emotions, and generates an empathetic response. Human evaluations on the EMPATHETIC-DIALOGUES dataset confirm that MoEL outperforms a multitask training baseline in terms of empathy, relevance, and fluency. Furthermore, a case study on the responses generated by different Listeners shows the high interpretability of our model.
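A minimal sketch of the soft-combination step the abstract describes: each Listener emits an output state, and the states are mixed using the predicted emotion distribution as weights. The shapes, names, and random inputs below are illustrative assumptions, not the paper's actual Transformer architecture.

```python
import numpy as np

def soft_combine(listener_states, emotion_probs):
    """Mix per-emotion Listener states with the predicted emotion distribution.

    listener_states: (n_emotions, hidden_dim) array, one output state per Listener.
    emotion_probs:   (n_emotions,) array summing to 1.
    """
    return emotion_probs @ listener_states  # weighted sum -> (hidden_dim,)

n_emotions, hidden_dim = 32, 300
states = np.random.randn(n_emotions, hidden_dim)   # stand-ins for Listener outputs
logits = np.random.randn(n_emotions)               # stand-in emotion classifier scores
probs = np.exp(logits) / np.exp(logits).sum()      # softmax -> emotion distribution
mixed_state = soft_combine(states, probs)          # state passed on for response generation
```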

150 citations

Journal ArticleDOI
03 Apr 2020
TL;DR: CAiRE is an end-to-end generative empathetic chatbot that adapts GPT via transfer learning to recognize user emotions and respond in an empathetic manner, with a primary focus on integrating empathy into fully data-driven generative dialogue systems.
Abstract: We present CAiRE, an end-to-end generative empathetic chatbot designed to recognize user emotions and respond in an empathetic manner. Our system adapts the Generative Pre-trained Transformer (GPT) to the empathetic response generation task via transfer learning. CAiRE is built primarily to focus on empathy integration in fully data-driven generative dialogue systems. We create a web-based user interface that allows multiple users to chat with CAiRE asynchronously. CAiRE also collects user feedback and continues to improve its response quality by discarding undesirable generations via active learning and negative training.

90 citations

Proceedings ArticleDOI
01 Nov 2019
TL;DR: The authors propose a zero-shot adaptation of task-oriented dialogue systems to low-resource languages, using a set of very few parallel word pairs to refine aligned cross-lingual word-level representations and a latent variable model to cope with the variance of similar sentences across different languages.
Abstract: Despite the surging demand for multilingual task-oriented dialogue systems (e.g., Alexa, Google Home), relatively little research has been done on multilingual or cross-lingual scenarios. Hence, we propose a zero-shot adaptation of task-oriented dialogue systems to low-resource languages. To tackle this challenge, we first use a set of very few parallel word pairs to refine the aligned cross-lingual word-level representations. We then employ a latent variable model to cope with the variance of similar sentences across different languages, which is induced by imperfect cross-lingual alignments and inherent differences between languages. Finally, the experimental results show that even though we utilize far fewer external resources, our model achieves better adaptation performance for the natural language understanding tasks (i.e., intent detection and slot filling) than the current state-of-the-art model in the zero-shot scenario.
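The word-level refinement step could look like the standard orthogonal Procrustes alignment sketched below, fit on the handful of parallel word pairs. This is a common alignment technique and only an assumption about the paper's exact refinement procedure; the latent variable model is not shown.

```python
import numpy as np

def procrustes_align(src_vecs, tgt_vecs):
    """Orthogonal map W minimizing ||src_vecs @ W - tgt_vecs|| over the word pairs."""
    u, _, vt = np.linalg.svd(src_vecs.T @ tgt_vecs)
    return u @ vt

dim, n_pairs = 300, 10                       # a "very few" parallel word pairs
src = np.random.randn(n_pairs, dim)          # stand-ins for source-language word vectors
tgt = np.random.randn(n_pairs, dim)          # stand-ins for target-language word vectors
W = procrustes_align(src, tgt)
aligned_src = src @ W                        # source vectors mapped into the target space
```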

61 citations

Posted Content
TL;DR: A Sentiment Predictor is trained to estimate the user sentiment look-ahead towards the generated system responses, which is then used as the reward function for generating more empathetic responses.
Abstract: Recent neural conversation models that attempted to incorporate emotion and generate empathetic responses either focused on conditioning the output to a given emotion, or incorporating the current user emotional state. While these approaches have been successful to some extent in generating more diverse and seemingly engaging utterances, they do not factor in how the user would feel towards the generated dialogue response. Hence, in this paper, we advocate such look-ahead of user emotion as the key to modeling and generating empathetic dialogue responses. We thus train a Sentiment Predictor to estimate the user sentiment look-ahead towards the generated system responses, which is then used as the reward function for generating more empathetic responses. Human evaluation results show that our model outperforms other baselines in empathy, relevance, and fluency.
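A minimal sketch of the sentiment look-ahead idea: a sentiment predictor scores how the user would likely feel after each candidate response, and that score acts as the reward. The predictor, reward shaping, and candidate ranking here are illustrative assumptions; the paper trains its own Sentiment Predictor and uses it inside a reinforcement-learning objective rather than simple reranking.

```python
def sentiment_reward(sentiment_predictor, history, candidate):
    """Look-ahead reward: predicted sentiment of the user's next turn
    if the system replied with `candidate`."""
    return sentiment_predictor(history + [candidate])

def rank_candidates(sentiment_predictor, history, candidates):
    """Pick the candidate response with the highest look-ahead reward."""
    return max(candidates,
               key=lambda c: sentiment_reward(sentiment_predictor, history, c))

if __name__ == "__main__":
    # Toy predictor: longer replies get a slightly higher sentiment score.
    toy_predictor = lambda turns: min(len(turns[-1]) / 100.0, 1.0)
    history = ["I failed my exam today."]
    candidates = ["Oh.",
                  "I'm so sorry to hear that. Do you want to talk about it?"]
    print(rank_candidates(toy_predictor, history, candidates))
```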

47 citations


Cited by
Proceedings ArticleDOI
01 Jul 2019
TL;DR: This work proposes *dialect* and *race priming* as ways to reduce the racial bias in annotation, showing that when annotators are made explicitly aware of an AAE tweet’s dialect they are significantly less likely to label the tweet as offensive.
Abstract: We investigate how annotators’ insensitivity to differences in dialect can lead to racial bias in automatic hate speech detection models, potentially amplifying harm against minority populations. We first uncover unexpected correlations between surface markers of African American English (AAE) and ratings of toxicity in several widely-used hate speech datasets. Then, we show that models trained on these corpora acquire and propagate these biases, such that AAE tweets and tweets by self-identified African Americans are up to two times more likely to be labelled as offensive compared to others. Finally, we propose *dialect* and *race priming* as ways to reduce the racial bias in annotation, showing that when annotators are made explicitly aware of an AAE tweet’s dialect they are significantly less likely to label the tweet as offensive.

611 citations

Posted Content
TL;DR: WILDS is presented, a benchmark of in-the-wild distribution shifts spanning diverse data modalities and applications, and is hoped to encourage the development of general-purpose methods that are anchored to real-world distribution shifts and that work well across different applications and problem settings.
Abstract: Distribution shifts -- where the training distribution differs from the test distribution -- can substantially degrade the accuracy of machine learning (ML) systems deployed in the wild. Despite their ubiquity, these real-world distribution shifts are under-represented in the datasets widely used in the ML community today. To address this gap, we present WILDS, a curated collection of 8 benchmark datasets that reflect a diverse range of distribution shifts which naturally arise in real-world applications, such as shifts across hospitals for tumor identification; across camera traps for wildlife monitoring; and across time and location in satellite imaging and poverty mapping. On each dataset, we show that standard training results in substantially lower out-of-distribution than in-distribution performance, and that this gap remains even with models trained by existing methods for handling distribution shifts. This underscores the need for new training methods that produce models which are more robust to the types of distribution shifts that arise in practice. To facilitate method development, we provide an open-source package that automates dataset loading, contains default model architectures and hyperparameters, and standardizes evaluations. Code and leaderboards are available at this https URL.
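For reference, loading a WILDS dataset follows roughly the quickstart published with the package; the dataset name, image size, and loader settings below are arbitrary examples, and exact function names may differ across package versions.

```python
import torchvision.transforms as transforms
from wilds import get_dataset
from wilds.common.data_loaders import get_train_loader

# Download one of the WILDS benchmarks and build a standard training loader.
dataset = get_dataset(dataset="iwildcam", download=True)
train_data = dataset.get_subset(
    "train",
    transform=transforms.Compose([transforms.Resize((448, 448)),
                                  transforms.ToTensor()]),
)
train_loader = get_train_loader("standard", train_data, batch_size=16)
```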

579 citations

Posted Content
TL;DR: The authors survey 146 papers analyzing "bias" in NLP systems, finding that their motivations are often vague, inconsistent, and lacking in normative reasoning, despite the fact that analyzing bias is an inherently normative process.
Abstract: We survey 146 papers analyzing "bias" in NLP systems, finding that their motivations are often vague, inconsistent, and lacking in normative reasoning, despite the fact that analyzing "bias" is an inherently normative process. We further find that these papers' proposed quantitative techniques for measuring or mitigating "bias" are poorly matched to their motivations and do not engage with the relevant literature outside of NLP. Based on these findings, we describe the beginnings of a path forward by proposing three recommendations that should guide work analyzing "bias" in NLP systems. These recommendations rest on a greater recognition of the relationships between language and social hierarchies, encouraging researchers and practitioners to articulate their conceptualizations of "bias"---i.e., what kinds of system behaviors are harmful, in what ways, to whom, and why, as well as the normative reasoning underlying these statements---and to center work around the lived experiences of members of communities affected by NLP systems, while interrogating and reimagining the power relations between technologists and such communities.

465 citations

Proceedings ArticleDOI
01 May 2019
TL;DR: This article examines racial bias in five different sets of Twitter data annotated for hate speech and abusive language, finding that abusive language detection systems may discriminate against the groups who are often the targets of the abuse these systems aim to detect.
Abstract: Technologies for abusive language detection are being developed and applied with little consideration of their potential biases. We examine racial bias in five different sets of Twitter data annotated for hate speech and abusive language. We train classifiers on these datasets and compare the predictions of these classifiers on tweets written in African-American English with those written in Standard American English. The results show evidence of systematic racial bias in all datasets, as classifiers trained on them tend to predict that tweets written in African-American English are abusive at substantially higher rates. If these abusive language detection systems are used in the field they will therefore have a disproportionate negative impact on African-American social media users. Consequently, these systems may discriminate against the groups who are often the targets of the abuse we are trying to detect.
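A minimal sketch of the kind of comparison the abstract describes: measure how often a trained classifier flags tweets as abusive, grouped by dialect, and compare the rates. The classifier, dialect groupings, and example tweets below are toy placeholders, not the paper's data or models.

```python
def abusive_rate(classifier, tweets):
    """Fraction of tweets the classifier labels as abusive."""
    return sum(classifier(t) for t in tweets) / max(len(tweets), 1)

def dialect_gap(classifier, aae_tweets, sae_tweets):
    """Ratio of predicted-abusive rates (values above 1 indicate bias against AAE)."""
    return abusive_rate(classifier, aae_tweets) / abusive_rate(classifier, sae_tweets)

if __name__ == "__main__":
    # Toy classifier that flags any tweet containing "bad".
    toy_clf = lambda tweet: int("bad" in tweet)
    aae = ["that movie was bad af", "bad day fr"]
    sae = ["that movie was not good", "it was a decent day"]
    print(dialect_gap(toy_clf, aae, sae))
```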

328 citations

Proceedings ArticleDOI
01 Jul 2019
TL;DR: This paper discusses gender bias based on four forms of representation bias, analyzes methods for recognizing gender bias in NLP, and reviews the advantages and drawbacks of existing gender debiasing methods.
Abstract: As Natural Language Processing (NLP) and Machine Learning (ML) tools rise in popularity, it becomes increasingly vital to recognize the role they play in shaping societal biases and stereotypes. Although NLP models have shown success in modeling various applications, they propagate and may even amplify gender bias found in text corpora. While the study of bias in artificial intelligence is not new, methods to mitigate gender bias in NLP are relatively nascent. In this paper, we review contemporary studies on recognizing and mitigating gender bias in NLP. We discuss gender bias based on four forms of representation bias and analyze methods recognizing gender bias. Furthermore, we discuss the advantages and drawbacks of existing gender debiasing methods. Finally, we discuss future studies for recognizing and mitigating gender bias in NLP.

327 citations