Topic

Word embedding

About: Word embedding is a research topic. Over the lifetime, 4683 publications have been published within this topic receiving 153378 citations. The topic is also known as: word embeddings.

...read moreread less

Papers published on a yearly basis

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Using semantic similarity to reduce wrong labels in distant supervision for relation extraction

[...]

Chengsen Ru¹, Jintao Tang¹, Shasha Li¹, Songxian Xie¹, Ting Wang¹ - Show less +1 more•Institutions (1)

National University of Defense Technology¹

01 Jul 2018-Information Processing and Management

TL;DR: The relation extraction performance of the CNN model using the core dependency phrases as input of CNN is the best of all, which indicates that the semantic similarity based method is effective in reducing wrong labels.

...read moreread less

Abstract: Distant supervision (DS) has the advantage of automatically generating large amounts of labelled training data and has been widely used for relation extraction. However, there are usually many wrong labels in the automatically labelled data in distant supervision (Riedel, Yao, & McCallum, 2010). This paper presents a novel method to reduce the wrong labels. The proposed method uses the semantic Jaccard with word embedding to measure the semantic similarity between the relation phrase in the knowledge base and the dependency phrases between two entities in a sentence to filter the wrong labels. In the process of reducing wrong labels, the semantic Jaccard algorithm selects a core dependency phrase to represent the candidate relation in a sentence, which can capture features for relation classification and avoid the negative impact from irrelevant term sequences that previous neural network models of relation extraction often suffer. In the process of relation classification, the core dependency phrases are also used as the input of a convolutional neural network (CNN) for relation classification. The experimental results show that compared with the methods using original DS data, the methods using filtered DS data performed much better in relation extraction. It indicates that the semantic similarity based method is effective in reducing wrong labels. The relation extraction performance of the CNN model using the core dependency phrases as input is the best of all, which indicates that using the core dependency phrases as input of CNN is enough to capture the features for relation classification and could avoid negative impact from irrelevant terms.

...read moreread less

40 citations

Journal Article•DOI•

OWL2Vec*: Embedding of OWL Ontologies

[...]

Jiaoyan Chen¹, Pan Hu¹, Ernesto Jiménez-Ruiz², Ernesto Jiménez-Ruiz³, Ole Magnus Holter², Denvar Antonyrajah⁴, Ian Horrocks¹ - Show less +3 more•Institutions (4)

University of Oxford¹, University of Oslo², City University London³, Samsung⁴

16 Jun 2021-Machine Learning

TL;DR: In this paper, a random walk and word embedding based ontology embedding method named OWL2Vec*, which encodes the semantics of an OWL ontology by taking into account its graph structure, lexical information and logical constructors.

...read moreread less

Abstract: Semantic embedding of knowledge graphs has been widely studied and used for prediction and statistical analysis tasks across various domains such as Natural Language Processing and the Semantic Web. However, less attention has been paid to developing robust methods for embedding OWL (Web Ontology Language) ontologies, which contain richer semantic information than plain knowledge graphs, and have been widely adopted in domains such as bioinformatics. In this paper, we propose a random walk and word embedding based ontology embedding method named OWL2Vec*, which encodes the semantics of an OWL ontology by taking into account its graph structure, lexical information and logical constructors. Our empirical evaluation with three real world datasets suggests that OWL2Vec* benefits from these three different aspects of an ontology in class membership prediction and class subsumption prediction tasks. Furthermore, OWL2Vec* often significantly outperforms the state-of-the-art methods in our experiments.

...read moreread less

40 citations

Proceedings Article•DOI•

Synergistic union of Word2Vec and lexicon for domain specific semantic similarity

[...]

Keet Sugathadasa¹, Buddhi Ayesha¹, Nisansa de Silva¹, Amal Shehan Perera¹, Vindula Jayawardana¹, Dimuthu Lakmal¹, Madhavi Perera² - Show less +3 more•Institutions (2)

University of Moratuwa¹, University of London²

01 Dec 2017

TL;DR: The authors proposed a domain specific semantic similarity measure that was created by the synergistic union of word2vec, a word embedding method that is used for semantic similarity calculation and lexicon based (lexical) semantic similarity methods.

...read moreread less

Abstract: Semantic similarity measures are an important part in Natural Language Processing tasks. However Semantic similarity measures built for general use do not perform well within specific domains. Therefore in this study we introduce a domain specific semantic similarity measure that was created by the synergistic union of word2vec, a word embedding method that is used for semantic similarity calculation and lexicon based (lexical) semantic similarity methods. We prove that this proposed methodology outperforms both, word embedding methods trained on a generic corpus and word embedding methods trained on a domain specific corpus, which do not use lexical semantic similarity methods to augment the results. Further, we prove that text lemmatization can improve the performance of word embedding methods.

...read moreread less

40 citations

Journal Article•DOI•

Code authorship identification using convolutional neural networks

[...]

Mohammed Abuhamad¹, Ji su Rhim¹, Tamer AbuHmed¹, Sana Ullah², Sanggil Kang¹, DaeHun Nyang - Show less +2 more•Institutions (2)

Inha University¹, Gyeongsang National University²

01 Jun 2019-Future Generation Computer Systems

TL;DR: The proposed convolutional neural network based code authorship identification system exploits term frequency-inverse document frequency, word embedding modeling, and feature learning techniques for code representation to identify the code’s author.

...read moreread less

40 citations

Journal Article•DOI•

Automatic detection and interpretation of nominal metaphor based on the theory of meaning

[...]

Chang Su¹, Shuman Huang¹, Yijiang Chen¹•Institutions (1)

Xiamen University¹

05 Jan 2017-Neurocomputing

TL;DR: An approach to recognize nominal metaphorical references and to interpret metaphors by exploiting distributional semantics word embedding techniques and semantic relatedness in the metaphor detection and interpretation fields is presented.

...read moreread less

40 citations

Collapse

Network Information

Performance

Metrics

5,718

Papers

201,647

Citations

No. of papers in the topic in previous years
Year	Papers
2023	317
2022	716
2021	736
2020	1,025
2019	1,078
2018	788

Word embedding

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics