Generalized Semantic Preserving Hashing for N-Label Cross-Modal Retrieval

doi:10.1109/CVPR.2017.282

Proceedings ArticleDOI

Generalized Semantic Preserving Hashing for N-Label Cross-Modal Retrieval

- pp 2633-2641

TLDR

This paper proposes a simple, yet effective generalized hashing framework which can work for all the different scenarios, while preserving the semantic distance between the data points, and learns the optimum hash codes for the two modalities simultaneously.

Abstract:

Due to availability of large amounts of multimedia data, cross-modal matching is gaining increasing importance. Hashing based techniques provide an attractive solution to this problem when the data size is large. Different scenarios of cross-modal matching are possible, for example, data from the different modalities can be associated with a single label or multiple labels, and in addition may or may not have one-to-one correspondence. Most of the existing approaches have been developed for the case where there is one-to-one correspondence between the data of the two modalities. In this paper, we propose a simple, yet effective generalized hashing framework which can work for all the different scenarios, while preserving the semantic distance between the data points. The approach first learns the optimum hash codes for the two modalities simultaneously, so as to preserve the semantic similarity between the data points, and then learns the hash functions to map from the features to the hash codes. Extensive experiments on single label dataset like Wiki and multi-label datasets like NUS-WIDE, Pascal and LabelMe under all the different scenarios and comparisons with the state-of-the-art shows the effectiveness of the proposed approach.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

Self-Supervised Adversarial Hashing Networks for Cross-Modal Retrieval

Chao Li, +5 more

TL;DR: Li et al. as discussed by the authors proposed a self-supervised adversarial hashing (SSAH) approach, which leveraged two adversarial networks to maximize the semantic correlation and consistency of the representations between different modalities.

...read moreread less

Posted Content

Self-Supervised Adversarial Hashing Networks for Cross-Modal Retrieval

Chao Li, +5 more

- 04 Apr 2018 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: Li et al. as discussed by the authors proposed a self-supervised adversarial hashing (SSAH) approach, which leveraged two adversarial networks to maximize the semantic correlation and consistency of the representations between different modalities.

...read moreread less

Proceedings ArticleDOI

Zero-Shot Sketch-Image Hashing

Yuming Shen, +3 more

TL;DR: Zhang et al. as discussed by the authors proposed a zero-shot sketch-image hashing (ZSIH) model with an end-to-end three-network architecture, two of which are treated as binary encoders and the third network mitigates the sketch image heterogeneity and enhances the semantic relations among data.

...read moreread less

Posted Content

Zero-Shot Sketch-Image Hashing

Yuming Shen, +3 more

- 06 Mar 2018 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: ZSIH is the first zero- shot hashing work suitable for SBIR and cross-modal search and forms a generative hashing scheme in reconstructing semantic knowledge representations for zero-shot retrieval.

...read moreread less

Journal ArticleDOI

Adaptive Semi-Supervised Feature Selection for Cross-Modal Retrieval

En Yu, +5 more

- 01 May 2019 -

IEEE Transactions on Multimedia

TL;DR: The semi-supervised model named adaptive semi- supervised feature selection for cross-modal retrieval uses the semantic regression to strengthen the neighboring relationship between the data with the same semantic and an efficient joint optimization algorithm is proposed to update the mapping matrices and the label matrix for unlabeled data simultaneously and iteratively.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

...read moreread less

Journal ArticleDOI

The Pascal Visual Object Classes (VOC) Challenge

Mark Everingham, +4 more

- 01 Jun 2010 -

International Journal of Computer Vision

TL;DR: The state-of-the-art in evaluated methods for both classification and detection are reviewed, whether the methods are statistically different, what they are learning from the images, and what the methods find easy or confuse.

...read moreread less

Book ChapterDOI

Relations Between Two Sets of Variates

Harold Hotelling

- 01 Dec 1936 -

Biometrika

TL;DR: The concept of correlation and regression may be applied not only to ordinary one-dimensional variates but also to variates of two or more dimensions as discussed by the authors, where the correlation of the horizontal components is ordinarily discussed, whereas the complex consisting of horizontal and vertical deviations may be even more interesting.

...read moreread less

Journal ArticleDOI

LabelMe: A Database and Web-Based Tool for Image Annotation

Bryan Russell, +3 more

- 01 May 2008 -

International Journal of Computer Vision

TL;DR: In this article, a large collection of images with ground truth labels is built to be used for object detection and recognition research, such data is useful for supervised learning and quantitative evaluation.

...read moreread less

Journal ArticleDOI

Canonical Correlation Analysis: An Overview with Application to Learning Methods

David R. Hardoon, +2 more

- 01 Dec 2004 -

Neural Computation

TL;DR: A general method using kernel canonical correlation analysis to learn a semantic representation to web images and their associated text and compares orthogonalization approaches against a standard cross-representation retrieval technique known as the generalized vector space model is presented.

...read moreread less

Collapse

Generalized Semantic Preserving Hashing for N-Label Cross-Modal Retrieval

Citations

Self-Supervised Adversarial Hashing Networks for Cross-Modal Retrieval

Self-Supervised Adversarial Hashing Networks for Cross-Modal Retrieval

Zero-Shot Sketch-Image Hashing

Zero-Shot Sketch-Image Hashing

Adaptive Semi-Supervised Feature Selection for Cross-Modal Retrieval

References

ImageNet Classification with Deep Convolutional Neural Networks

The Pascal Visual Object Classes (VOC) Challenge

Relations Between Two Sets of Variates

LabelMe: A Database and Web-Based Tool for Image Annotation

Canonical Correlation Analysis: An Overview with Application to Learning Methods

Related Papers (5)

Large-scale supervised multimodal hashing with semantic correlation maximization

NUS-WIDE: a real-world web image database from National University of Singapore

Inter-media hashing for large-scale retrieval from heterogeneous data sources

A new approach to cross-modal multimedia retrieval

The MIR flickr retrieval evaluation