scispace - formally typeset
Topic

Word embedding

About: Word embedding is a research topic. Over the lifetime, 4683 publications have been published within this topic receiving 153378 citations. The topic is also known as: word embeddings.


Papers
Journal ArticleDOI
TL;DR: This work tunes generated word vectors to their lemma forms using linear compositionality to produce lemma-based embeddings, and shows improvements over existing state-of-the-art methods for Arabic word embedding.

25 citations
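The "linear compositionality" idea above can be read as vector arithmetic: the offset between an inflected surface form and its lemma is roughly constant, so an average offset learned from known pairs can map other forms to lemma-based vectors. A minimal sketch under that reading, with made-up toy vectors (the pairs, values, and the offset interpretation are illustrative assumptions, not the paper's actual procedure):

```python
import numpy as np

# Toy 2-d vectors (hypothetical values); keys are surface forms and lemmas.
vec = {
    "walked": np.array([0.5, 0.9]), "walk": np.array([0.4, 0.3]),
    "jumped": np.array([0.7, 1.0]), "jump": np.array([0.6, 0.4]),
    "played": np.array([0.3, 0.8]),
}

# Estimate one offset vector that maps an inflected form to its lemma,
# averaged over known (surface form, lemma) pairs.
pairs = [("walked", "walk"), ("jumped", "jump")]
offset = np.mean([vec[lemma] - vec[surface] for surface, lemma in pairs], axis=0)

# Apply the offset to infer a lemma-based vector for another form.
play_hat = vec["played"] + offset
```

With these toy values both training pairs share the offset (-0.1, -0.6), so the averaged offset transfers cleanly to "played".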

Journal ArticleDOI
TL;DR: In this paper, a survey classifies approaches to calculating sentence similarity, based on the adopted methodology, into three categories: word-to-word based, structure-based, and vector-based.
Abstract: Objective/Methods: This study reviews the approaches used for measuring sentence similarity. Measuring similarity between natural language sentences is a crucial task for many Natural Language Processing applications such as text classification, information retrieval, question answering, and plagiarism detection. This survey classifies approaches to calculating sentence similarity, based on the adopted methodology, into three categories: word-to-word based, structure-based, and vector-based, which are the most widely used approaches for finding sentence similarity. Findings/Application: Each approach measures relatedness between short texts from a specific perspective. In addition, the datasets most often used as benchmarks for evaluating techniques in this field are introduced, to provide a complete view of the issue. Approaches that combine more than one perspective give better results. Moreover, structure-based similarity, which measures similarity between sentences' structures, needs further investigation. Keywords: Sentence Representation, Sentences Similarity, Structural Similarity, Word Embedding, Words Similarity

25 citations
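Of the three categories in the survey, the vector-based one is the simplest to illustrate: represent each sentence as the mean of its word embeddings and compare sentences by cosine similarity. A minimal sketch, using made-up toy vectors in place of pretrained embeddings (the vectors and tokens are assumptions for illustration only):

```python
import numpy as np

# Toy word vectors standing in for pretrained embeddings (hypothetical values).
word_vectors = {
    "cat": np.array([0.9, 0.1, 0.0]),
    "sat": np.array([0.2, 0.8, 0.1]),
    "mat": np.array([0.7, 0.2, 0.1]),
    "dog": np.array([0.8, 0.2, 0.1]),
}

def sentence_vector(tokens):
    """Represent a sentence as the mean of its known word vectors."""
    vecs = [word_vectors[t] for t in tokens if t in word_vectors]
    return np.mean(vecs, axis=0)

def cosine_similarity(a, b):
    """Cosine of the angle between two vectors, in [-1, 1]."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

s1 = sentence_vector(["cat", "sat", "mat"])
s2 = sentence_vector(["dog", "sat", "mat"])
sim = cosine_similarity(s1, s2)
```

The two sentences differ in one word out of three, so averaging keeps their vectors close and the cosine similarity high; word-to-word and structure-based approaches would instead align tokens or parse trees directly.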

Journal ArticleDOI
TL;DR: A novel negative-sampling (NEG) strategy that samples negatives based on Term Frequency-Inverse Document Frequency (NEG-TFIDF); it outperforms Mikolov's NEG on both word analogy and word similarity test tasks, particularly for medium-frequency words.

25 citations
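The core mechanism can be sketched as follows: instead of word2vec's unigram-based negative-sampling distribution, draw negatives in proportion to each word's TF-IDF weight over the corpus. The toy corpus and the exact weighting formula below are assumptions for illustration; the paper's precise formulation may differ:

```python
import math
import random
from collections import Counter

# Toy corpus (hypothetical); each inner list is one document.
docs = [
    ["the", "cat", "sat"],
    ["the", "dog", "ran"],
    ["the", "cat", "ran", "fast"],
]

# Corpus-level term frequency and per-word document frequency.
tf = Counter(w for d in docs for w in d)
df = Counter(w for d in docs for w in set(d))
n_docs = len(docs)

# TF-IDF weight per word: a word like "the" that occurs in every
# document gets IDF = log(1) = 0 and is never drawn as a negative.
weights = {w: tf[w] * math.log(n_docs / df[w]) for w in tf}

vocab = sorted(weights)

def sample_negatives(k, rng):
    """Draw k negative samples in proportion to TF-IDF weight."""
    return rng.choices(vocab, weights=[weights[w] for w in vocab], k=k)

negs = sample_negatives(5, random.Random(0))
```

Compared with sampling from the raw unigram distribution, this shifts probability mass away from very frequent stop words toward more informative medium-frequency words, which matches where the paper reports its gains.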

Proceedings ArticleDOI
12 Sep 2019
TL;DR: This work proposes a post-processing approach to retrofit the contextualized word embedding with paraphrases, which seeks to minimize the variance of word representations on paraphrased contexts and significantly improves ELMo on various sentence classification and inference tasks.
Abstract: Contextualized word embeddings, such as ELMo, provide meaningful representations for words and their contexts. They have been shown to have a great impact on downstream applications. However, we observe that the contextualized embeddings of a word might change drastically when its contexts are paraphrased. As these embeddings are over-sensitive to the context, the downstream model may make different predictions when the input sentence is paraphrased. To address this issue, we propose a post-processing approach to retrofit the embedding with paraphrases. Our method learns an orthogonal transformation on the input space of the contextualized word embedding model, which seeks to minimize the variance of word representations on paraphrased contexts. Experiments show that the proposed method significantly improves ELMo on various sentence classification and inference tasks.

25 citations
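The quantity this method minimizes can be sketched directly: take the contextualized vectors of one word across paraphrased contexts and measure their spread around the group mean. The random vectors below are hypothetical stand-ins for ELMo outputs, and the learned orthogonal transformation itself is not reproduced here; the sketch only illustrates the variance objective:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical stand-ins for the contextualized vectors of one word as it
# appears in an original sentence and in three paraphrases of it (dim 4).
contexts = rng.normal(loc=1.0, scale=0.3, size=(4, 4))

def paraphrase_variance(vectors):
    """Total squared deviation of each context vector from the group
    mean: the quantity the retrofitting objective drives down."""
    centered = vectors - vectors.mean(axis=0)
    return float((centered ** 2).sum())

before = paraphrase_variance(contexts)

# Any map that pulls a paraphrase group toward its mean lowers the
# objective; halving the deviations cuts it to a quarter.
mean = contexts.mean(axis=0)
after = paraphrase_variance(mean + 0.5 * (contexts - mean))
```

A lower value means the word's representation is more stable under paraphrasing, which is what makes the downstream classifier's predictions consistent across paraphrased inputs.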

Proceedings ArticleDOI
09 Apr 2018
TL;DR: This paper analyses various unsupervised automatic keyphrase extraction methods based on graphs, as well as the impact of word embedding, and shows that there is no difference between using word embeddings and not using them.
Abstract: This paper analyses various unsupervised automatic keyphrase extraction methods based on graphs, as well as the impact of word embedding. Evaluation is made on three datasets. We show that there is no difference between using word embeddings and not using them.

25 citations


Network Information
Related Topics (5)
Recurrent neural network
29.2K papers, 890K citations
87% related
Unsupervised learning
22.7K papers, 1M citations
86% related
Deep learning
79.8K papers, 2.1M citations
85% related
Reinforcement learning
46K papers, 1M citations
84% related
Graph (abstract data type)
69.9K papers, 1.2M citations
84% related
Performance
Metrics
No. of papers in the topic in previous years
Year    Papers
2023    317
2022    716
2021    736
2020    1,025
2019    1,078
2018    788