Learning Grounded Meaning Representations with Autoencoders
Carina Silberer, Mirella Lapata
pp. 721–732
TL;DR: A new model is introduced which uses stacked autoencoders to learn higher-level embeddings from textual and visual input and which outperforms baselines and related models on similarity judgments and concept categorization.

Abstract: In this paper we address the problem of grounding distributional representations of lexical meaning. We introduce a new model which uses stacked autoencoders to learn higher-level embeddings from textual and visual input. The two modalities are encoded as vectors of attributes and are obtained automatically from text and images, respectively. We evaluate our model on its ability to simulate similarity judgments and concept categorization. On both tasks, our approach outperforms baselines and related models.
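The bimodal architecture described in the abstract can be sketched roughly as follows. This is a minimal illustration with hypothetical dimensions and untrained, random weights; it only shows how a per-modality encoder layer for each attribute vector feeds a joint layer stacked on top, not the paper's actual training procedure:

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

# Hypothetical dimensions; the paper's attribute vectors differ.
d_text, d_img, d_hid, d_joint = 50, 40, 20, 10

# One randomly initialised encoder layer per modality, plus a joint
# layer stacked on their concatenated codes.
W_text = rng.normal(0, 0.1, (d_text, d_hid))
W_img = rng.normal(0, 0.1, (d_img, d_hid))
W_joint = rng.normal(0, 0.1, (2 * d_hid, d_joint))

def encode(text_vec, img_vec):
    h_text = sigmoid(text_vec @ W_text)  # unimodal textual code
    h_img = sigmoid(img_vec @ W_img)     # unimodal visual code
    h = np.concatenate([h_text, h_img])  # fuse the two modalities
    return sigmoid(h @ W_joint)          # joint multimodal embedding

embedding = encode(rng.random(d_text), rng.random(d_img))
print(embedding.shape)  # (10,)
```

In the actual model the stack is trained (e.g. layer-wise, with reconstruction objectives) so that the joint code captures shared textual–visual structure; the similarity judgments evaluated in the paper would then be computed between such joint embeddings.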
Citations
Journal ArticleDOI
Multimodal Machine Learning: A Survey and Taxonomy
TL;DR: This paper surveys the recent advances in multimodal machine learning itself and presents them in a common taxonomy to enable researchers to better understand the state of the field and identify directions for future research.
Journal ArticleDOI
Simlex-999: Evaluating semantic models with genuine similarity estimation
TL;DR: SimLex-999 is presented, a gold standard resource for evaluating distributional semantic models that improves on existing resources in several important ways, and explicitly quantifies similarity rather than association or relatedness so that pairs of entities that are associated but not actually similar have a low rating.
Journal ArticleDOI
A Survey of Multi-View Representation Learning
TL;DR: A comprehensive survey of multi-view representation learning, a rapidly growing direction in the machine learning and data mining areas.
Journal ArticleDOI
Deep Multimodal Representation Learning: A Survey
TL;DR: Highlights the key issues of newly developed technologies, such as the encoder–decoder model, generative adversarial networks, and attention mechanisms, from a multimodal representation learning perspective; these, to the best of the authors' knowledge, have never been reviewed previously.
References
Proceedings ArticleDOI
ImageNet: A large-scale hierarchical image database
TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.
Journal ArticleDOI
Distinctive Image Features from Scale-Invariant Keypoints
TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.
Journal ArticleDOI
Latent dirichlet allocation
TL;DR: This work proposes a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model.
Journal ArticleDOI
Reducing the Dimensionality of Data with Neural Networks
TL;DR: Describes an effective way of initializing weights that allows deep autoencoder networks to learn low-dimensional codes that work much better than principal components analysis as a tool for reducing the dimensionality of data.
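The core idea of that reference, using an autoencoder to learn a low-dimensional code of the data, can be sketched with a toy example. This is a single tied-weight linear autoencoder trained by plain gradient descent (a deliberate simplification: the cited work trains deep, pretrained stacks), fitted to data whose intrinsic dimensionality is 2:

```python
import numpy as np

rng = np.random.default_rng(1)

# Toy data with intrinsic dimensionality 2 embedded in 10 dimensions.
Z = rng.normal(size=(200, 2))
X = Z @ rng.normal(size=(2, 10))

# Single linear encoder; the decoder reuses W.T (tied weights).
W = rng.normal(0, 0.1, (10, 2))

def loss(W):
    X_hat = (X @ W) @ W.T
    return np.mean((X - X_hat) ** 2)

lr, losses = 0.01, []
for _ in range(200):
    err = (X @ W) @ W.T - X
    # Gradient of the mean squared reconstruction error w.r.t. the
    # tied weights (encoder and decoder terms combined).
    grad = 2 * (X.T @ err @ W + err.T @ X @ W) / X.size
    W -= lr * grad
    losses.append(loss(W))

print(losses[0], losses[-1])  # reconstruction error drops sharply
```

Since the data lie exactly in a 2-dimensional subspace, the 2-unit code can in principle reconstruct them perfectly; a linear autoencoder like this recovers the same subspace as PCA, and it is the deep, nonlinear variant of the cited work that goes beyond it.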