scispace - formally typeset
Open AccessProceedings ArticleDOI

Semantically Consistent Regularization for Zero-Shot Recognition

TLDR
In this article, the role of semantics in zero-shot learning is considered and the effectiveness of previous approaches is analyzed according to the form of supervision provided, while some learn semantics independently, others only supervise the semantic subspace explained by training classes.
Abstract
The role of semantics in zero-shot learning is considered. The effectiveness of previous approaches is analyzed according to the form of supervision provided. While some learn semantics independently, others only supervise the semantic subspace explained by training classes. Thus, the former is able to constrain the whole space but lacks the ability to model semantic correlations. The latter addresses this issue but leaves part of the semantic space unsupervised. This complementarity is exploited in a new convolutional neural network (CNN) framework, which proposes the use of semantics as constraints for recognition. Although a CNN trained for classification has no transfer ability, this can be encouraged by learning an hidden semantic layer together with a semantic code for classification. Two forms of semantic constraints are then introduced. The first is a loss-based regularizer that introduces a generalization constraint on each semantic predictor. The second is a codeword regularizer that favors semantic-to-class mappings consistent with prior semantic knowledge while allowing these to be learned from data. Significant improvements over the state-of-the-art are achieved on several datasets.

read more

Content maybe subject to copyright    Report

Citations
More filters
Proceedings ArticleDOI

Zero-Shot Visual Recognition Using Semantics-Preserving Adversarial Embedding Networks

TL;DR: Through adversarial learning of the two subspaces, SP-AEN can transfer the semantics from the reconstructive subspace to the discriminative one, accomplishing the improved zero-shot recognition of unseen classes.
Proceedings ArticleDOI

Attentive Region Embedding Network for Zero-Shot Learning

TL;DR: To discover (semantic) regions, the attentive region embedding network (AREN) is proposed, which is tailored to advance the ZSL task and achieves state-of-the-art performances under ZSLSetting, and compelling results under generalized ZSL setting.
Proceedings ArticleDOI

Transductive Unbiased Embedding for Zero-Shot Learning

TL;DR: Quasi-Fully Supervised Learning (QFSL) as mentioned in this paper follows the way of transductive learning, which assumes that both the labeled source images and unlabeled target images are available for training.
Proceedings ArticleDOI

Transferable Contrastive Network for Generalized Zero-Shot Learning

TL;DR: A novel Transferable Contrastive Network (TCN) is proposed that explicitly transfers knowledge from the source classes to the target classes, and is more robust to recognize the target images.
Proceedings ArticleDOI

Preserving Semantic Relations for Zero-Shot Learning

TL;DR: In this article, the structure of the space spanned by the attributes using a set of relations is utilized to preserve these relations in the embedding space, thereby inducing semanticity to the embedded space.
References
More filters
Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.
Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.
Proceedings ArticleDOI

ImageNet: A large-scale hierarchical image database

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.
Proceedings ArticleDOI

Going deeper with convolutions

TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).
Journal ArticleDOI

Support-Vector Networks

TL;DR: High generalization ability of support-vector networks utilizing polynomial input transformations is demonstrated and the performance of the support- vector network is compared to various classical learning algorithms that all took part in a benchmark study of Optical Character Recognition.