Write a Classifier: Zero-Shot Learning Using Purely Textual Descriptions

doi:10.1109/ICCV.2013.321

Open Access Proceedings Article

Mohamed Elhoseiny, +2 more

- pp 2584-2591

TLDR

Proposes an approach for zero-shot learning of object categories in which unseen categories are described by typical text, such as an encyclopedia entry, without the need for explicitly defined attributes.

Abstract:

The main question we address in this paper is how to use purely textual descriptions of categories, with no training images, to learn visual classifiers for these categories. We propose an approach for zero-shot learning of object categories where the description of unseen categories comes in the form of typical text, such as an encyclopedia entry, without the need for explicitly defined attributes. We propose and investigate two baseline formulations, based on regression and domain adaptation. Then, we propose a new constrained optimization formulation that combines a regression function and a knowledge transfer function with additional constraints to predict the classifier parameters for new classes. We applied the proposed approach to two fine-grained categorization datasets, and the results indicate successful classifier prediction.
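The regression baseline mentioned in the abstract can be illustrated with a minimal sketch. It assumes that each seen class has a text feature vector (for example TF-IDF) and a linear classifier already trained on images, and it uses closed-form ridge regression as a stand-in for whatever regressor the paper actually uses. The function names, dimensions, and synthetic data below are hypothetical and are not taken from the paper.

```python
import numpy as np


def fit_text_to_classifier(T_seen, W_seen, lam=1.0):
    """Fit a ridge regression from text features to classifier weights.

    T_seen: (n_classes, d_text) text features of the seen classes.
    W_seen: (n_classes, d_visual) linear classifier weights of the seen classes.
    Returns M with shape (d_text, d_visual) such that t @ M approximates w.
    """
    d_text = T_seen.shape[1]
    # Closed-form ridge solution: (T'T + lam * I)^{-1} T'W
    A = T_seen.T @ T_seen + lam * np.eye(d_text)
    return np.linalg.solve(A, T_seen.T @ W_seen)


def predict_classifier(M, t_new):
    """Predict classifier weights for unseen classes from their text features."""
    return t_new @ M


def classify(x, W):
    """Return the index of the classifier (row of W) with the highest score on x."""
    return int(np.argmax(W @ x))


# Synthetic data (assumption): a hidden linear map ties text to classifier weights.
rng = np.random.default_rng(0)
M_true = rng.normal(size=(5, 8))      # 5 text dims, 8 visual dims
T_seen = rng.normal(size=(20, 5))     # 20 seen classes
W_seen = T_seen @ M_true              # their "trained" classifiers

M = fit_text_to_classifier(T_seen, W_seen, lam=1e-6)

# Three unseen classes: text only, no training images.
T_unseen = rng.normal(size=(3, 5))
W_unseen = predict_classifier(M, T_unseen)
```

With a small regularizer and more seen classes than text dimensions, the predicted weights closely match the synthetic ground-truth map. Real text features usually have far more dimensions than there are seen classes, which is where the regularizer starts to matter.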

Citations

Proceedings Article

Prototypical Networks for Few-shot Learning

Jake Snell, +2 more

TL;DR: Prototypical Networks learn a metric space in which classification can be performed by computing distances to prototype representations of each class, and achieve state-of-the-art results on the CU-Birds dataset.

Posted Content

Zero-Shot Learning - A Comprehensive Evaluation of the Good, the Bad and the Ugly

Yongqin Xian, +3 more

- 03 Jul 2017 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: This work compares and analyzes a significant number of state-of-the-art methods in depth, and proposes a new zero-shot learning dataset, Animals with Attributes 2 (AWA2), which is made publicly available both as image features and as the images themselves.

Proceedings Article

Learning Deep Representations of Fine-Grained Visual Descriptions

Scott Reed, +3 more

TL;DR: This model achieves strong performance on zero-shot text-based image retrieval and significantly outperforms the attribute-based state-of-the-art for zero-shot classification on the Caltech-UCSD Birds 200-2011 dataset.

Proceedings Article

Low-Shot Visual Recognition by Shrinking and Hallucinating Features

Bharath Hariharan, +1 more

TL;DR: This work presents a low-shot learning benchmark on complex images that mimics challenges faced by recognition systems in the wild, and proposes representation regularization techniques and techniques to hallucinate additional training examples for data-starved classes.

Proceedings Article

Semantic Autoencoder for Zero-Shot Learning

Elyor Kodirov, +2 more

TL;DR: In this paper, an encoder projects a visual feature vector into the semantic space, as in existing ZSL models, while the decoder imposes an additional constraint: the projection (code) must be able to reconstruct the original visual feature.

References

Proceedings Article

ImageNet: A large-scale hierarchical image database

Jia Deng, +5 more

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.

Journal Article

WordNet: a lexical database for English

George A. Miller

- 01 Nov 1995 -

Communications of The ACM

TL;DR: WordNet provides a more effective combination of traditional lexicographic information and modern computing, and is an online lexical database designed for use under program control.

Book

Gaussian Processes for Machine Learning

Carl Edward Rasmussen, +1 more

TL;DR: The treatment is comprehensive and self-contained, targeted at researchers and students in machine learning and applied statistics, and deals with the supervised learning problem for both regression and classification.

Journal Article

Term Weighting Approaches in Automatic Text Retrieval

Gerard Salton, +1 more

- 01 Aug 1988 -

Information Processing and Management

TL;DR: This paper summarizes the insights gained in automatic term weighting, and provides baseline single term indexing models with which other more elaborate content analysis procedures can be compared.

Journal Article

Ridge regression: biased estimation for nonorthogonal problems

Arthur E. Hoerl, +1 more

- 01 Feb 2000 -

Technometrics

TL;DR: In this paper, an estimation procedure based on adding small positive quantities to the diagonal of X′X is proposed, and the ridge trace, a method for showing in two dimensions the effects of nonorthogonality, is introduced.

Related Papers (5)

Describing objects by their attributes

An embarrassingly simple approach to zero-shot learning

DeViSE: A Deep Visual-Semantic Embedding Model

Learning to detect unseen object classes by between-class attribute transfer

Attribute-Based Classification for Zero-Shot Visual Object Categorization