Topic

Word embedding

About: Word embedding is a research topic. Over its lifetime, 4,683 publications have been published on this topic, receiving 153,378 citations. The topic is also known as: word embeddings.


Papers
Journal ArticleDOI
TL;DR: In this article, the authors present a novel technique that incorporates a BERT-based multilingual model in bioinformatics to represent the information in DNA sequences, treating DNA sequences as natural sentences and using BERT models to transform them into fixed-length numerical matrices.
Abstract: Recently, language representation models have drawn a lot of attention in the natural language processing field due to their remarkable results. Among them, bidirectional encoder representations from transformers (BERT) has proven to be a simple yet powerful language model that achieved novel state-of-the-art performance. BERT adopts contextualized word embeddings to capture the semantics of words in the contexts in which they appear. In this study, we present a novel technique that incorporates a BERT-based multilingual model in bioinformatics to represent the information in DNA sequences. We treat DNA sequences as natural sentences and then use BERT models to transform them into fixed-length numerical matrices. As a case study, we applied our method to DNA enhancer prediction, a well-known and challenging problem in this field. We observed that our BERT-based features improved sensitivity, specificity, accuracy, and Matthews correlation coefficient by more than 5-10% compared to the current state-of-the-art features in bioinformatics. Moreover, further experiments show that deep learning (represented here by 2D convolutional neural networks, CNNs) holds potential to learn BERT features better than other traditional machine learning techniques. In conclusion, we suggest that BERT and 2D CNNs could open a new avenue in biological modeling using sequence information.

69 citations
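As a rough illustration of the core idea, the sketch below treats a DNA sequence as a "sentence" of overlapping k-mers and feeds it through a multilingual BERT model to obtain a fixed-length embedding matrix. It assumes the HuggingFace transformers library; the k-mer size, maximum length, and tokenization scheme are illustrative choices, not necessarily the paper's exact recipe.

```python
# Minimal sketch: embed a DNA sequence with a multilingual BERT model.
# The k-mer tokenization below is an illustrative assumption.
import torch
from transformers import BertTokenizer, BertModel

tokenizer = BertTokenizer.from_pretrained("bert-base-multilingual-cased")
model = BertModel.from_pretrained("bert-base-multilingual-cased")
model.eval()

def dna_to_matrix(sequence: str, k: int = 3, max_len: int = 128) -> torch.Tensor:
    """Treat a DNA sequence as a 'sentence' of overlapping k-mers and
    return a fixed-length (max_len x hidden_size) embedding matrix."""
    kmers = [sequence[i:i + k] for i in range(len(sequence) - k + 1)]
    inputs = tokenizer(" ".join(kmers), return_tensors="pt",
                       padding="max_length", truncation=True, max_length=max_len)
    with torch.no_grad():
        outputs = model(**inputs)
    return outputs.last_hidden_state.squeeze(0)  # shape: (max_len, 768)

matrix = dna_to_matrix("ACGTACGTGGCA")
print(matrix.shape)  # torch.Size([128, 768])
```

The resulting fixed-size matrix can then serve as input to a downstream 2D CNN classifier, in line with the paper's conclusion.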

Posted Content
Chengyue Gong, Di He, Xu Tan, Tao Qin, Liwei Wang, Tie-Yan Liu
TL;DR: This paper develops a neat, simple yet effective way to learn FRequency-AGnostic word Embedding (FRAGE) using adversarial training and shows that with FRAGE, the model achieves higher performance than the baselines in all tasks.
Abstract: Continuous word representation (aka word embedding) is a basic building block in many neural network-based models used in natural language processing tasks. Although it is widely accepted that words with similar semantics should be close to each other in the embedding space, we find that word embeddings learned in several tasks are biased towards word frequency: the embeddings of high-frequency and low-frequency words lie in different subregions of the embedding space, and the embedding of a rare word and a popular word can be far from each other even if they are semantically similar. This makes learned word embeddings ineffective, especially for rare words, and consequently limits the performance of these neural network models. In this paper, we develop a neat, simple yet effective way to learn FRequency-AGnostic word Embedding (FRAGE) using adversarial training. We conducted comprehensive studies on ten datasets across four natural language processing tasks, including word similarity, language modeling, machine translation and text classification. Results show that with FRAGE, we achieve higher performance than the baselines in all tasks.

69 citations
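The adversarial idea can be sketched as follows: a small discriminator tries to predict a word's frequency class from its embedding, while the embeddings are trained to fool it, on top of the usual task loss. The PyTorch sketch below is a minimal illustration; the frequency split, network sizes, and training schedule are assumptions, not the paper's configuration.

```python
# Minimal sketch of frequency-adversarial embedding training (FRAGE-style).
import torch
import torch.nn as nn

vocab_size, dim = 10_000, 128
embedding = nn.Embedding(vocab_size, dim)
# Discriminator tries to tell high-frequency words from rare ones.
discriminator = nn.Sequential(nn.Linear(dim, 64), nn.ReLU(), nn.Linear(64, 1))

# Assumption: word ids are sorted by frequency; top 20% are labeled "frequent".
freq_label = torch.zeros(vocab_size)
freq_label[: vocab_size // 5] = 1.0

opt_d = torch.optim.Adam(discriminator.parameters(), lr=1e-3)
opt_e = torch.optim.Adam(embedding.parameters(), lr=1e-3)
bce = nn.BCEWithLogitsLoss()

for step in range(100):
    ids = torch.randint(0, vocab_size, (256,))
    emb = embedding(ids)

    # 1) Train the discriminator to predict the frequency class.
    d_loss = bce(discriminator(emb.detach()).squeeze(1), freq_label[ids])
    opt_d.zero_grad()
    d_loss.backward()
    opt_d.step()

    # 2) Train the embeddings to FOOL the discriminator (flipped labels).
    #    In the real setup this is added to the task loss,
    #    e.g. loss = task_loss + lambda * adv_loss.
    adv_loss = bce(discriminator(emb).squeeze(1), 1.0 - freq_label[ids])
    opt_e.zero_grad()
    adv_loss.backward()
    opt_e.step()
```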

Journal ArticleDOI
Xiangjie Kong, Mengyi Mao, Wei Wang, Jiaying Liu, Bo Xu
TL;DR: Through the APS data set, it is shown that VOPRec outperforms state-of-the-art paper recommendation baselines measured by precision, recall, F1, and NDCG.
Abstract: Finding relevant papers is a non-trivial problem for scholars due to the tremendous amount of academic information in the era of scholarly big data. Scientific paper recommendation systems have been developed to solve this problem by recommending relevant papers to scholars. However, previous paper recommendation systems calculate paper similarity based on hand-engineered features, which are inflexible. To address this problem, we develop a scientific paper recommendation system, namely VOPRec, by vector representation learning of papers in citation networks. VOPRec takes advantage of recent research in both text and network representation learning for unsupervised feature design. In VOPRec, the text information is represented with word embeddings to find papers of similar research interest. Then, the structural identity is converted into vectors to find papers of similar network topology. After bridging text information and structural identity with the citation network, the vector representation of papers can be learned with network embedding. Finally, a top-Q recommendation list is generated based on the similarity calculated with the paper vectors. Through the APS data set, we show that VOPRec outperforms state-of-the-art paper recommendation baselines measured by precision, recall, F1, and NDCG.

69 citations
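The final step, generating the top-Q list from learned paper vectors, reduces to a nearest-neighbor search under cosine similarity. Below is a minimal NumPy sketch that assumes the fused paper vectors (text, structural identity, and network embedding) have already been learned.

```python
# Minimal sketch of the top-Q recommendation step over learned paper vectors.
import numpy as np

def recommend(paper_vecs: np.ndarray, query_idx: int, Q: int = 10) -> np.ndarray:
    """Return indices of the top-Q papers most similar to paper `query_idx`."""
    norms = np.linalg.norm(paper_vecs, axis=1, keepdims=True)
    unit = paper_vecs / np.clip(norms, 1e-12, None)
    scores = unit @ unit[query_idx]   # cosine similarity to the query paper
    scores[query_idx] = -np.inf       # exclude the query paper itself
    return np.argsort(-scores)[:Q]

vecs = np.random.rand(1000, 64)       # stand-in for learned paper vectors
print(recommend(vecs, query_idx=0, Q=5))
```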

Journal ArticleDOI
TL;DR: This paper develops a word embedding-based text summarizer, shows that the Word2Vec representation gives better results than the traditional BOW representation, and proposes three ensemble techniques that improve the quality of ATS.
Abstract: The vast amounts of data being collected and analyzed have become an invaluable source of information, which needs to be easily handled by humans. Automatic Text Summarization (ATS) systems enable users to get the gist of information and knowledge in a short time in order to make critical decisions quickly. Deep neural networks have proven their ability to achieve excellent performance in many real-world Natural Language Processing and computer vision applications; however, they have received little attention in ATS. The key problem of traditional applications is that they involve high-dimensional and sparse data, which makes it difficult to capture relevant information. One technique for overcoming these problems is learning features via dimensionality reduction. Word embedding, on the other hand, is a neural network technique that generates a much more compact word representation than the traditional Bag-of-Words (BOW) approach. In this paper, we seek to enhance the quality of ATS by integrating unsupervised deep neural network techniques with the word embedding approach. First, we develop a word embedding-based text summarizer and show that the Word2Vec representation gives better results than the traditional BOW representation. Second, we propose further models that combine Word2Vec and unsupervised feature learning methods in order to merge information from different sources. We show that unsupervised neural network models trained on the Word2Vec representation give better results than those trained on the BOW representation. Third, we propose three ensemble techniques: the first combines BOW and Word2Vec using a majority voting technique; the second aggregates the information provided by the BOW approach and unsupervised neural networks; and the third aggregates the information provided by Word2Vec and unsupervised neural networks. We show that the ensemble methods improve the quality of ATS, and in particular the Word2Vec-based ensemble gives better results. Finally, we perform different experiments to evaluate the performance of the investigated models, using two kinds of publicly available datasets for the ATS task. Statistical studies affirm that word embedding-based models outperform BOW-based models on the summarization task. In particular, the ensemble learning technique with the Word2Vec representation surpasses all the investigated models.

68 citations
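A minimal sketch of the Word2Vec-based extractive summarizer and a two-way voting ensemble with a BOW scorer is shown below. It assumes gensim for Word2Vec; the centroid-similarity scoring and the trivial BOW scorer are illustrative stand-ins, not the paper's exact models.

```python
# Minimal sketch: extractive summarization on Word2Vec features, plus a
# simple voting ensemble with a BOW scorer. Both scorers are illustrative.
import numpy as np
from gensim.models import Word2Vec

def summarize(sentences, n=2):
    tokens = [s.lower().split() for s in sentences]
    # Train a small Word2Vec on the document itself as a stand-in for a
    # model trained on a large corpus.
    w2v = Word2Vec(tokens, vector_size=50, min_count=1, epochs=50)
    sent_vecs = np.array([np.mean([w2v.wv[w] for w in t], axis=0) for t in tokens])

    # Word2Vec scorer: cosine similarity to the document centroid.
    centroid = sent_vecs.mean(axis=0)
    sims = sent_vecs @ centroid / (
        np.linalg.norm(sent_vecs, axis=1) * np.linalg.norm(centroid) + 1e-12)

    # Trivial BOW scorer: fraction of the document vocabulary a sentence covers.
    vocab = set(w for t in tokens for w in t)
    bow = np.array([len(set(t)) / len(vocab) for t in tokens])

    # Two-way vote: keep sentences both scorers place in their top-n.
    top_w2v, top_bow = set(np.argsort(-sims)[:n]), set(np.argsort(-bow)[:n])
    keep = sorted(top_w2v & top_bow) or sorted(top_w2v)
    return [sentences[i] for i in keep]

doc = ["Deep models need large amounts of data.",
       "Word embeddings give compact representations of sparse text.",
       "Compact embedding representations capture word semantics.",
       "The cat sat on the mat."]
print(summarize(doc, n=2))
```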

Posted Content
TL;DR: A joint model for performing unsupervised morphological analysis on words and learning a character-level composition function from morphemes to word embeddings, which is comparable to dedicated morphological analyzers at the task of morpheme boundary recovery and performs better than word-based embedding models at the task of syntactic analogy answering.
Abstract: This paper presents a joint model for performing unsupervised morphological analysis on words, and learning a character-level composition function from morphemes to word embeddings. Our model splits individual words into segments, and weights each segment according to its ability to predict context words. Our morphological analysis is comparable to dedicated morphological analyzers at the task of morpheme boundary recovery, and also performs better than word-based embedding models at the task of syntactic analogy answering. Finally, we show that incorporating morphology explicitly into character-level models helps them produce embeddings for unseen words that correlate better with human judgments.

68 citations
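The composition function can be sketched as a weighted sum of morpheme (segment) embeddings, with a learned score deciding each segment's contribution. In the paper the weights come from each segment's ability to predict context words; the softmax attention below, and the segmentation and ids in the usage example, are simplified, hypothetical stand-ins.

```python
# Minimal sketch: compose a word embedding from weighted morpheme segments.
import torch
import torch.nn as nn

class MorphemeComposer(nn.Module):
    def __init__(self, n_morphemes: int, dim: int):
        super().__init__()
        self.morph_emb = nn.Embedding(n_morphemes, dim)
        self.scorer = nn.Linear(dim, 1)  # scores each segment's contribution

    def forward(self, morpheme_ids: torch.Tensor) -> torch.Tensor:
        segs = self.morph_emb(morpheme_ids)                # (n_segs, dim)
        weights = torch.softmax(self.scorer(segs), dim=0)  # per-segment weight
        return (weights * segs).sum(dim=0)                 # composed word vector

# e.g. "unhappiness" segmented as ["un", "happy", "ness"] -> ids [3, 17, 42]
composer = MorphemeComposer(n_morphemes=100, dim=64)
word_vec = composer(torch.tensor([3, 17, 42]))
print(word_vec.shape)  # torch.Size([64])
```

Because the vector is built from segments rather than a fixed vocabulary entry, the same composer can produce embeddings for unseen words, which is the property the abstract highlights.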


Network Information
Related Topics (5)
- Recurrent neural network: 29.2K papers, 890K citations (87% related)
- Unsupervised learning: 22.7K papers, 1M citations (86% related)
- Deep learning: 79.8K papers, 2.1M citations (85% related)
- Reinforcement learning: 46K papers, 1M citations (84% related)
- Graph (abstract data type): 69.9K papers, 1.2M citations (84% related)
Performance Metrics
No. of papers in the topic in previous years:

Year    Papers
2023    317
2022    716
2021    736
2020    1,025
2019    1,078
2018    788