Semantically Consistent Regularization for Zero-Shot Recognition

doi:10.1109/CVPR.2017.220

Open AccessProceedings ArticleDOI

Semantically Consistent Regularization for Zero-Shot Recognition

- pp 2037-2046

TLDR

In this article, the role of semantics in zero-shot learning is considered and the effectiveness of previous approaches is analyzed according to the form of supervision provided, while some learn semantics independently, others only supervise the semantic subspace explained by training classes.

Abstract:

The role of semantics in zero-shot learning is considered. The effectiveness of previous approaches is analyzed according to the form of supervision provided. While some learn semantics independently, others only supervise the semantic subspace explained by training classes. Thus, the former is able to constrain the whole space but lacks the ability to model semantic correlations. The latter addresses this issue but leaves part of the semantic space unsupervised. This complementarity is exploited in a new convolutional neural network (CNN) framework, which proposes the use of semantics as constraints for recognition. Although a CNN trained for classification has no transfer ability, this can be encouraged by learning an hidden semantic layer together with a semantic code for classification. Two forms of semantic constraints are then introduced. The first is a loss-based regularizer that introduces a generalization constraint on each semantic predictor. The second is a codeword regularizer that favors semantic-to-class mappings consistent with prior semantic knowledge while allowing these to be learned from data. Significant improvements over the state-of-the-art are achieved on several datasets.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

Zero-Shot Visual Recognition Using Semantics-Preserving Adversarial Embedding Networks

Long Chen, +4 more

TL;DR: Through adversarial learning of the two subspaces, SP-AEN can transfer the semantics from the reconstructive subspace to the discriminative one, accomplishing the improved zero-shot recognition of unseen classes.

...read moreread less

Proceedings ArticleDOI

Attentive Region Embedding Network for Zero-Shot Learning

Li Liu, +6 more

TL;DR: To discover (semantic) regions, the attentive region embedding network (AREN) is proposed, which is tailored to advance the ZSL task and achieves state-of-the-art performances under ZSLSetting, and compelling results under generalized ZSL setting.

...read moreread less

Proceedings ArticleDOI

Transductive Unbiased Embedding for Zero-Shot Learning

Jie Song, +4 more

TL;DR: Quasi-Fully Supervised Learning (QFSL) as mentioned in this paper follows the way of transductive learning, which assumes that both the labeled source images and unlabeled target images are available for training.

...read moreread less

Proceedings ArticleDOI

Transferable Contrastive Network for Generalized Zero-Shot Learning

Huajie Jiang, +3 more

TL;DR: A novel Transferable Contrastive Network (TCN) is proposed that explicitly transfers knowledge from the source classes to the target classes, and is more robust to recognize the target images.

...read moreread less

Proceedings ArticleDOI

Preserving Semantic Relations for Zero-Shot Learning

Soma Biswas, +1 more

TL;DR: In this article, the structure of the space spanned by the attributes using a set of relations is utilized to preserve these relations in the embedding space, thereby inducing semanticity to the embedded space.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

...read moreread less

Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.

...read moreread less

Proceedings ArticleDOI

ImageNet: A large-scale hierarchical image database

Jia Deng, +5 more

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.

...read moreread less

Proceedings ArticleDOI

Going deeper with convolutions

Christian Szegedy, +8 more

TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).

...read moreread less

Journal ArticleDOI

Support-Vector Networks

Corinna Cortes, +1 more

- 15 Sep 1995 -

Machine Learning

TL;DR: High generalization ability of support-vector networks utilizing polynomial input transformations is demonstrated and the performance of the support- vector network is compared to various classical learning algorithms that all took part in a benchmark study of Optical Character Recognition.

...read moreread less

Collapse

Semantically Consistent Regularization for Zero-Shot Recognition

Citations

Zero-Shot Visual Recognition Using Semantics-Preserving Adversarial Embedding Networks

Attentive Region Embedding Network for Zero-Shot Learning

Transductive Unbiased Embedding for Zero-Shot Learning

Transferable Contrastive Network for Generalized Zero-Shot Learning

Preserving Semantic Relations for Zero-Shot Learning

References

ImageNet Classification with Deep Convolutional Neural Networks

Very Deep Convolutional Networks for Large-Scale Image Recognition

ImageNet: A large-scale hierarchical image database

Going deeper with convolutions

Support-Vector Networks

Related Papers (5)

An embarrassingly simple approach to zero-shot learning

Deep Residual Learning for Image Recognition

DeViSE: A Deep Visual-Semantic Embedding Model

Describing objects by their attributes

Learning to detect unseen object classes by between-class attribute transfer