Topic

Word embedding

About: Word embedding is a research topic. Over its lifetime, 4,683 publications have been published within this topic, receiving 153,378 citations. The topic is also known as: word embeddings.

Papers published on a yearly basis

[Chart: number of papers per year, 2003 to 2023]

Papers

Proceedings Article•DOI•

A Transparent Framework for Evaluating Unintended Demographic Bias in Word Embeddings.

Chris Sweeney¹, Maryam Najafian²•Institutions (2)

Massachusetts Institute of Technology¹, University of Texas at Dallas²

01 Jul 2019

TL;DR: This work presents a transparent framework and metric for evaluating discrimination across protected groups with respect to their word embedding bias, using the relative negative sentiment associated with demographic identity terms from those groups, and shows that the framework and metric enable useful analysis of the bias in word embeddings.

Abstract: Word embedding models have gained a lot of traction in the Natural Language Processing community; however, they suffer from unintended demographic biases. Most approaches to evaluating these biases rely on vector-space-based metrics like the Word Embedding Association Test (WEAT). While these approaches offer great geometric insights into unintended biases in the embedding vector space, they fail to offer an interpretable meaning for how the embeddings could cause discrimination in downstream NLP applications. In this work, we present a transparent framework and metric for evaluating discrimination across protected groups with respect to their word embedding bias. Our metric (Relative Negative Sentiment Bias, RNSB) measures fairness in word embeddings via the relative negative sentiment associated with demographic identity terms from various protected groups. We show that our framework and metric enable useful analysis into the bias in word embeddings.
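
The abstract does not spell out how RNSB is computed. The following is a minimal sketch, assuming the metric is the KL divergence between the normalized negative-sentiment probabilities of the identity terms and a uniform distribution. The function name `rnsb`, the group labels, and the probability values are hypothetical; in practice the probabilities would come from a sentiment classifier applied to each identity term's embedding.

```python
import math


def rnsb(negative_probs):
    """KL divergence from uniform of normalized negative-sentiment probabilities.

    negative_probs maps each identity term to the probability (assumed to come
    from some sentiment classifier) that the term is classified as negative.
    """
    total = sum(negative_probs.values())
    if total <= 0:
        raise ValueError("negative sentiment probabilities must sum to a positive value")
    uniform = 1.0 / len(negative_probs)
    score = 0.0
    for raw in negative_probs.values():
        p = raw / total  # normalize so the values form a distribution
        if p > 0:  # a zero term contributes nothing to the KL sum
            score += p * math.log(p / uniform)
    return score


# Hypothetical classifier outputs for three identity terms.
balanced = {"group_a": 0.30, "group_b": 0.30, "group_c": 0.30}
skewed = {"group_a": 0.80, "group_b": 0.10, "group_c": 0.10}
print(rnsb(balanced), rnsb(skewed))
```

Under this assumption, a score of zero means every group receives the same share of negative sentiment, and larger scores mean the negative sentiment is concentrated on fewer groups.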

53 citations

Proceedings Article•DOI•

Query and Output: Generating Words by Querying Distributed Word Representations for Paraphrase Generation

Shuming Ma¹, Xu Sun¹, Wei Li, Sujian Li¹, Wenjie Li², Xuancheng Ren¹•Institutions (2)

Peking University¹, Hong Kong Polytechnic University²

01 Mar 2018

TL;DR: This article proposes a word embedding attention network (WEAN) that generates words by querying distributed word representations (i.e., neural word embeddings), aiming to capture the meaning of the corresponding words.

Abstract: Most recent approaches use the sequence-to-sequence model for paraphrase generation. The existing sequence-to-sequence model tends to memorize the words and the patterns in the training dataset instead of learning the meaning of the words. Therefore, the generated sentences are often grammatically correct but semantically improper. In this work, we introduce a novel model based on the encoder-decoder framework, called Word Embedding Attention Network (WEAN). Our proposed model generates the words by querying distributed word representations (i.e., neural word embeddings), hoping to capture the meaning of the corresponding words. Following previous work, we evaluate our model on two paraphrase-oriented tasks, namely text simplification and short text abstractive summarization. Experimental results show that our model outperforms the sequence-to-sequence baseline by 6.3 and 5.5 BLEU on two English text simplification datasets, and by 5.7 ROUGE-2 F1 on a Chinese summarization dataset. Moreover, our model achieves state-of-the-art performance on these three benchmark datasets.
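
The idea of generating a word by querying word embeddings can be illustrated with a small sketch. It assumes a plain dot-product score between a decoder query vector and each candidate word's embedding, followed by a softmax; the scoring function actually used in the paper is not given in the abstract and may differ. The function name `query_word` and the toy vocabulary are hypothetical.

```python
import math


def query_word(query, embeddings):
    """Score each word embedding against the query and return the best word.

    Assumption: dot-product scoring followed by a softmax over the vocabulary.
    """
    scores = {
        word: sum(q * e for q, e in zip(query, vec))
        for word, vec in embeddings.items()
    }
    top = max(scores.values())
    # Subtract the maximum score before exponentiating for numerical stability.
    exps = {word: math.exp(s - top) for word, s in scores.items()}
    norm = sum(exps.values())
    probs = {word: v / norm for word, v in exps.items()}
    best = max(probs, key=probs.get)
    return best, probs


# Toy two-dimensional embeddings (hypothetical values).
toy_embeddings = {"cat": [1.0, 0.0], "dog": [0.0, 1.0], "car": [-1.0, 0.0]}
word, distribution = query_word([0.9, 0.1], toy_embeddings)
print(word, distribution)
```

Because the output distribution is computed from the embeddings themselves, words with similar embeddings receive similar scores for a given query, which is the intuition behind querying the embedding table instead of learning a separate output layer.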

53 citations

Journal Article•DOI•

An artificial intelligence approach to COVID-19 infection risk assessment in virtual visits: A case report.

Jihad S. Obeid¹, Matthew Davis¹, Matthew Turner¹, Stéphane M. Meystre¹, Paul M. Heider¹, Edward C. O'Bryan¹, Leslie A. Lenert¹•Institutions (1)

Medical University of South Carolina¹

01 Aug 2020-Journal of the American Medical Informatics Association

TL;DR: Informatics tools such as natural language processing and artificial intelligence methods can have significant clinical impacts when applied to data streams early in the development of clinical systems for outbreak response.

52 citations

Posted Content•

Integrating Scene Text and Visual Appearance for Fine-Grained Image Classification

Xiang Bai¹, Mingkun Yang¹, Pengyuan Lyu¹, Yongchao Xu¹, Jiebo Luo²•Institutions (2)

Huazhong University of Science and Technology¹, University of Rochester²

15 Apr 2017-arXiv: Computer Vision and Pattern Recognition

TL;DR: The main idea is combining word representations and deep visual features in a globally trainable deep convolutional neural network for fine-grained image classification, which significantly outperforms classification with only visual representation.

Abstract: Text in natural images contains rich semantics that are often highly relevant to the objects or scene. In this paper, we focus on the problem of fully exploiting scene text for visual understanding. The main idea is combining word representations and deep visual features into a globally trainable deep convolutional neural network. First, the recognized words are obtained by a scene text reading system. Then, we combine the word embedding of the recognized words and the deep visual features into a single representation, which is optimized by a convolutional neural network for fine-grained image classification. In our framework, the attention mechanism is adopted to reveal the relevance between each recognized word and the given image, which further enhances the recognition performance. We have performed experiments on two datasets, the Con-Text dataset and the Drink Bottle dataset, which were proposed for fine-grained classification of business places and drink bottles, respectively. The experimental results consistently demonstrate that the proposed method combining textual and visual cues significantly outperforms classification with only visual representations. Moreover, we have shown that the learned representation improves the retrieval performance on the drink bottle images by a large margin, making it potentially useful in product search.

52 citations

Journal Article•DOI•

Thai sentiment analysis with deep learning techniques: A comparative study based on word embedding, POS-tag, and sentic features

Kitsuchart Pasupa¹, Thititorn Seneewong Na Ayutthaya¹•Institutions (1)

King Mongkut's Institute of Technology Ladkrabang¹

01 Oct 2019-Sustainable Cities and Society

TL;DR: This experiment evaluated and compared the performance of several conventional deep learning models, namely Convolutional Neural Network (CNN), Long Short-Term Memory (LSTM), and Bidirectional LSTM, in sentiment analysis of Thai children's tales, and showed that the CNN model that used all three features gave the best result.

52 citations

Network Information

Performance Metrics

Papers: 5,718
Citations: 201,647

No. of papers in the topic in previous years
Year	Papers
2023	317
2022	716
2021	736
2020	1,025
2019	1,078
2018	788
