Proceedings ArticleDOI

Glove: Global Vectors for Word Representation

TLDR
A new global log-bilinear regression model that combines the advantages of the two major model families in the literature (global matrix factorization and local context window methods) and produces a vector space with meaningful substructure.
Abstract
Recent methods for learning vector space representations of words have succeeded in capturing fine-grained semantic and syntactic regularities using vector arithmetic, but the origin of these regularities has remained opaque. We analyze and make explicit the model properties needed for such regularities to emerge in word vectors. The result is a new global log-bilinear regression model that combines the advantages of the two major model families in the literature: global matrix factorization and local context window methods. Our model efficiently leverages statistical information by training only on the nonzero elements in a word-word co-occurrence matrix, rather than on the entire sparse matrix or on individual context windows in a large corpus. The model produces a vector space with meaningful substructure, as evidenced by its performance of 75% on a recent word analogy task. It also outperforms related models on similarity tasks and named entity recognition.
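The training scheme the abstract describes, a weighted least-squares regression fit only to the nonzero entries of the word-word co-occurrence matrix, can be sketched as below. This is a minimal illustrative sketch, not the authors' reference implementation; the function name `train_glove` and the hyperparameter values (`x_max=100`, `alpha=0.75`, learning rate, epoch count) are assumptions chosen for a toy example.

```python
import numpy as np

def train_glove(X, dim=8, epochs=50, lr=0.05, x_max=100.0, alpha=0.75, seed=0):
    """Minimal GloVe-style trainer over the nonzero entries of a
    co-occurrence matrix X. Returns the sum of word and context
    embeddings as the final word vectors."""
    rng = np.random.default_rng(seed)
    V = X.shape[0]
    W = rng.normal(scale=0.1, size=(V, dim))   # word vectors
    C = rng.normal(scale=0.1, size=(V, dim))   # context vectors
    bw = np.zeros(V)                           # word biases
    bc = np.zeros(V)                           # context biases
    pairs = list(zip(*np.nonzero(X)))          # train only where X_ij > 0
    for _ in range(epochs):
        for i, j in pairs:
            x = X[i, j]
            f = min(1.0, (x / x_max) ** alpha)         # weighting f(X_ij)
            diff = W[i] @ C[j] + bw[i] + bc[j] - np.log(x)
            g = f * diff                               # shared gradient factor
            wi = W[i].copy()                           # keep pre-update copy
            W[i] -= lr * g * C[j]
            C[j] -= lr * g * wi
            bw[i] -= lr * g
            bc[j] -= lr * g
    return W + C
```

Because the loop skips zero entries entirely, the cost per epoch scales with the number of observed co-occurrences rather than with the full |V|² matrix, which is the efficiency argument the abstract makes.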


Citations
Journal ArticleDOI

Comparison of neutrosophic approach to various deep learning models for sentiment analysis

TL;DR: In this article, the authors proposed a novel framework to implement neutrosophy in deep learning models, where instead of just predicting a single class as output, they quantified the sentiments using three membership functions to understand them better.
Proceedings ArticleDOI

Mittens: an Extension of GloVe for Learning Domain-Specialized Representations

TL;DR: In this article, a simple extension of the GloVe representation learning model is presented, which can lead to faster learning and better results on a variety of tasks in a specialized domain.
Proceedings ArticleDOI

Learning with Weak Supervision for Email Intent Detection

TL;DR: In this article, the authors propose to leverage user actions as a source of weak supervision, in addition to a limited set of annotated examples, to detect intents in emails.
Proceedings ArticleDOI

Learn to Select via Hierarchical Gate Mechanism for Aspect-Based Sentiment Analysis.

TL;DR: A novel architecture named Hierarchical Gate Memory Network (HGMN) is proposed for ABSA; it employs a hierarchical gate mechanism to learn to select the parts of a sentence related to the given aspect while preserving the sentence's original sequence structure.
References
Proceedings Article

Distributed Representations of Words and Phrases and their Compositionality

TL;DR: This paper presents a simple method for finding phrases in text, shows that learning good vector representations for millions of phrases is possible, and describes a simple alternative to the hierarchical softmax called negative sampling.
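The negative-sampling alternative mentioned in this summary replaces the full softmax with a binary discrimination between an observed (center, context) pair and a few sampled "noise" words. A minimal sketch of the per-pair loss is below; the function name and the shape conventions for the embedding tables are illustrative assumptions, not the paper's code.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def negative_sampling_loss(center, context, negatives, W_in, W_out):
    """Skip-gram negative-sampling loss for one (center, context) pair.

    center, context: word indices; negatives: indices of k sampled noise
    words; W_in, W_out: input and output embedding matrices (V x dim).
    """
    v_c = W_in[center]
    pos = np.log(sigmoid(W_out[context] @ v_c))             # pull the true pair together
    neg = np.sum(np.log(sigmoid(-(W_out[negatives] @ v_c))))  # push noise words apart
    return -(pos + neg)
```

Each update thus touches only k+1 output vectors instead of all |V| of them, which is what makes the objective cheap compared with a full or hierarchical softmax.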
Journal ArticleDOI

Indexing by Latent Semantic Analysis

TL;DR: A new method for automatic indexing and retrieval that takes advantage of implicit higher-order structure in the association of terms with documents ("semantic structure") in order to improve the detection of relevant documents on the basis of terms found in queries.
Proceedings Article

Efficient Estimation of Word Representations in Vector Space

TL;DR: Two novel model architectures for computing continuous vector representations of words from very large data sets are proposed and it is shown that these vectors provide state-of-the-art performance on the authors' test set for measuring syntactic and semantic word similarities.
Book

Learning Deep Architectures for AI

TL;DR: The motivations and principles regarding learning algorithms for deep architectures are discussed, in particular those exploiting as building blocks unsupervised learning of single-layer models such as Restricted Boltzmann Machines, used to construct deeper models such as Deep Belief Networks.