Topic

Word embedding

About: Word embedding is a research topic. Over its lifetime, 4,683 publications have been published within this topic, receiving 153,378 citations. The topic is also known as: word embeddings.


Papers
Proceedings Article
12 Feb 2016
TL;DR: The semantic composition of word embeddings is analyzed by cross-referencing their clusters with the manually curated lexical database WordNet; the word embedding clusters are shown to correlate highly with WordNet's synonym and hyponym sets.
Abstract: In this paper, we first analyze the semantic composition of word embeddings by cross-referencing their clusters with the manually curated lexical database WordNet. We then evaluate a variety of word embedding approaches by comparing their contributions to two NLP tasks. Our experiments show that the word embedding clusters correlate highly with the synonym and hyponym sets in WordNet, and give 0.88% and 0.17% absolute improvements in accuracy on named entity recognition and part-of-speech tagging, respectively.

22 citations
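
The cluster-vs-WordNet analysis described above can be prototyped in a few lines. Below is a minimal sketch, assuming pretrained GloVe vectors (loaded via gensim) as a stand-in for the paper's embeddings and KMeans as the clustering algorithm, neither of which the summary specifies; it measures how often words sharing a cluster also share a WordNet synset.

```python
# A minimal sketch, not the paper's method: pretrained GloVe vectors,
# KMeans, and the 200-cluster setting are all illustrative assumptions.
from collections import defaultdict

import gensim.downloader as api
import nltk
from nltk.corpus import wordnet as wn
from sklearn.cluster import KMeans

nltk.download("wordnet", quiet=True)

vectors = api.load("glove-wiki-gigaword-100")        # stand-in embeddings
words = [w for w in vectors.index_to_key[:5000] if wn.synsets(w)]
labels = KMeans(n_clusters=200, n_init=10, random_state=0).fit_predict(vectors[words])

def synonyms(word):
    """All WordNet lemmas sharing a synset with `word`."""
    return {l.name() for s in wn.synsets(word) for l in s.lemmas()} - {word}

# Group words by cluster, then count same-cluster pairs that WordNet
# also marks as synonyms.
clusters = defaultdict(list)
for w, c in zip(words, labels):
    clusters[c].append(w)

hits = total = 0
for group in clusters.values():
    for w in group:
        syn = synonyms(w)
        hits += sum(v in syn for v in group if v != w)
        total += len(group) - 1
print(f"Fraction of same-cluster pairs that are WordNet synonyms: {hits/total:.4f}")
```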

Posted Content
TL;DR: In this article, the authors presented preliminary work on using Word Embedding (word2vec) for query expansion in the context of Personalized Information Retrieval (PIR).
Abstract: This paper presents preliminary work on using Word Embedding (word2vec) for query expansion in the context of Personalized Information Retrieval. Traditionally, word embeddings are learned on a general corpus, such as Wikipedia. In this work we try to personalize the learning of the word embeddings by training them on the user's profile, so that the embeddings share the same context as the user's interests. Our proposal is evaluated on the CLEF Social Book Search 2016 collection. The results show that further effort is needed in how Word Embedding is applied in the context of Personalized Information Retrieval.

22 citations
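
As a rough illustration of the approach above, the sketch below trains word2vec on a tiny stand-in for a user-profile corpus and expands a query with each term's nearest neighbors. The toy corpus, the expansion depth k, and the helper name expand_query are all hypothetical, not taken from the paper.

```python
# A minimal sketch of word2vec-based query expansion; the profile corpus
# and hyperparameters are illustrative assumptions.
from gensim.models import Word2Vec

profile_docs = [
    ["fantasy", "novels", "dragons", "epic", "quest"],
    ["science", "fiction", "space", "opera", "aliens"],
]  # hypothetical tokenized documents drawn from the user's profile

model = Word2Vec(profile_docs, vector_size=100, window=5, min_count=1, epochs=50)

def expand_query(terms, k=3):
    """Append the k nearest neighbors of each query term the model knows."""
    expanded = list(terms)
    for t in terms:
        if t in model.wv:
            expanded += [w for w, _ in model.wv.most_similar(t, topn=k)]
    return expanded

print(expand_query(["space", "novels"]))
```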

Journal ArticleDOI
TL;DR: Word and graph embedding techniques can be used to harness the terms and relations in the UMLS to measure semantic relatedness between concepts, and word embedding can be further enhanced by combining it with graph embedding.

22 citations
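
The combination this study describes can be sketched as a weighted sum of two cosine similarities, one from text-derived vectors and one from graph-derived vectors. Everything below is an illustrative assumption rather than the paper's setup: the random embeddings, the equal weighting alpha=0.5, and the use of example UMLS CUIs as keys.

```python
# A minimal sketch of combining word- and graph-embedding similarities to
# score concept relatedness; vectors and weighting are assumptions.
import numpy as np

def cosine(u, v):
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

# Hypothetical embeddings for two UMLS concepts: one table learned from
# text, one from the UMLS relation graph.
word_emb  = {"C0020538": np.random.rand(100), "C0027051": np.random.rand(100)}
graph_emb = {"C0020538": np.random.rand(64),  "C0027051": np.random.rand(64)}

def relatedness(c1, c2, alpha=0.5):
    """Weighted combination of text-based and graph-based similarity."""
    return (alpha * cosine(word_emb[c1], word_emb[c2])
            + (1 - alpha) * cosine(graph_emb[c1], graph_emb[c2]))

print(relatedness("C0020538", "C0027051"))
```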

Journal ArticleDOI
TL;DR: A Superior Arabic Text Categorization Deep Model (SATCDM) achieves very high accuracy compared to current research in Arabic text categorization, evaluated on 15 freely available datasets, and is superior to similar studies on the Arabic document classification task.
Abstract: Categorizing Arabic text documents is considered an important research topic in the fields of Natural Language Processing (NLP) and Machine Learning (ML). The number of Arabic documents is increasing tremendously every day as new web pages, news articles, and social media content are added. Hence, classifying such documents into specific classes is of high importance to many people and applications. The Convolutional Neural Network (CNN) is a class of deep learning model that has been shown to be useful for many NLP tasks, including text translation and text categorization for the English language. Word embedding is a text representation that encodes terms as real-valued vectors in a vector space, capturing both the syntactic and semantic traits of text. Current research studies in classifying Arabic text documents mostly use traditional text representations such as bag-of-words and TF-IDF weighting; few use word embedding. Traditional ML algorithms have already been used in Arabic text categorization, with good results. In this study, we present a multi-kernel CNN model for classifying Arabic news documents, enriched with n-gram word embedding, which we call the Superior Arabic Text Categorization Deep Model (SATCDM). The proposed solution achieves very high accuracy compared to current research in Arabic text categorization, evaluated on 15 freely available datasets. The model achieves an accuracy ranging from 97.58% to 99.90%, which is superior to similar studies on the Arabic document classification task.

22 citations
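
A multi-kernel CNN of the kind SATCDM builds on runs several convolutions with different kernel widths over the embedded token sequence in parallel, max-pools each feature map over time, and concatenates the results before classification. The PyTorch sketch below is a generic instance of that pattern; the layer sizes, kernel widths, and vocabulary size are assumptions, not the paper's configuration.

```python
# A minimal multi-kernel CNN text classifier; all dimensions are
# illustrative assumptions, not SATCDM's reported architecture.
import torch
import torch.nn as nn

class MultiKernelCNN(nn.Module):
    def __init__(self, vocab_size=50_000, emb_dim=300, n_classes=10,
                 kernel_sizes=(2, 3, 4), n_filters=100):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        # One 1-D convolution per kernel width, applied over the word axis.
        self.convs = nn.ModuleList(
            nn.Conv1d(emb_dim, n_filters, k) for k in kernel_sizes)
        self.fc = nn.Linear(n_filters * len(kernel_sizes), n_classes)

    def forward(self, token_ids):                    # (batch, seq_len)
        x = self.embed(token_ids).transpose(1, 2)    # (batch, emb_dim, seq_len)
        # Max-pool each feature map over time, then concatenate branches.
        pooled = [conv(x).relu().max(dim=2).values for conv in self.convs]
        return self.fc(torch.cat(pooled, dim=1))

model = MultiKernelCNN()
logits = model(torch.randint(0, 50_000, (8, 120)))   # 8 docs, 120 tokens each
print(logits.shape)                                  # torch.Size([8, 10])
```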

Book ChapterDOI
30 Nov 2016
TL;DR: Existing knowledge (word relations) in the medical domain is leveraged to constrain word embeddings, using the principle that related words should have similar embeddings; the constrained embeddings show superior effectiveness to unsupervised word embeddings.
Abstract: Word embedding has been used in many NLP tasks and has shown some capability to capture semantic features. It has also been used in several recent studies in IR. However, word embeddings trained in an unsupervised manner may fail to capture some of the semantic relations in a specific area (e.g. healthcare). In this paper, we leverage existing knowledge (word relations) in the medical domain to constrain word embeddings, using the principle that related words should have similar embeddings. The resulting constrained word embeddings are used to rerank documents, showing superior effectiveness to unsupervised word embeddings.

22 citations
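
One well-known way to impose the "related words should have similar embeddings" constraint is retrofitting (Faruqui et al., 2015), which pulls each vector toward the average of its lexicon neighbors while keeping it close to its original position. The sketch below implements that update; the paper above may use a different objective, and the toy relation lexicon and hyperparameters here are assumptions.

```python
# A minimal retrofitting sketch, in the spirit of (but not necessarily
# identical to) the constrained-embedding method above.
import numpy as np

def retrofit(vectors, relations, alpha=1.0, beta=1.0, iters=10):
    """Iteratively pull each word toward the mean of its related words.

    vectors:   dict word -> np.ndarray (original embeddings)
    relations: dict word -> list of related words (e.g. from a medical lexicon)
    """
    new = {w: v.copy() for w, v in vectors.items()}
    for _ in range(iters):
        for w, neighbors in relations.items():
            nbrs = [n for n in neighbors if n in new]
            if not nbrs:
                continue
            # Weighted average of the original vector and the current
            # vectors of the related words.
            new[w] = (alpha * vectors[w] + beta * sum(new[n] for n in nbrs)) \
                     / (alpha + beta * len(nbrs))
    return new

emb = {w: np.random.rand(50) for w in ["aspirin", "ibuprofen", "fever"]}
rel = {"aspirin": ["ibuprofen"], "ibuprofen": ["aspirin"]}  # hypothetical lexicon
constrained = retrofit(emb, rel)
```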


Network Information
Related Topics (5)
Recurrent neural network: 29.2K papers, 890K citations (87% related)
Unsupervised learning: 22.7K papers, 1M citations (86% related)
Deep learning: 79.8K papers, 2.1M citations (85% related)
Reinforcement learning: 46K papers, 1M citations (84% related)
Graph (abstract data type): 69.9K papers, 1.2M citations (84% related)
Performance Metrics
No. of papers in the topic in previous years:

Year    Papers
2023    317
2022    716
2021    736
2020    1,025
2019    1,078
2018    788