Attribute-augmented semantic hierarchy: towards bridging semantic gap and intention gap in image retrieval

doi:10.1145/2502081.2502093

Proceedings ArticleDOI

Attribute-augmented semantic hierarchy: towards bridging semantic gap and intention gap in image retrieval

- pp 33-42

TLDR

Experimental results show that the proposed A2 SH can characterize the semantic affinities among images accurately and can shape user search intent precisely and quickly, leading to more accurate search results as compared to state-of-the-art CBIR solutions.

Abstract:

This paper presents a novel Attribute-augmented Semantic Hierarchy (A2 SH) and demonstrates its effectiveness in bridging both the semantic and intention gaps in Content-based Image Retrieval (CBIR). A2 SH organizes the semantic concepts into multiple semantic levels and augments each concept with a set of related attributes, which describe the multiple facets of the concept and act as the intermediate bridge connecting the concept and low-level visual content. A hierarchical semantic similarity function is learnt to characterize the semantic similarities among images for retrieval. To better capture user search intent, a hybrid feedback mechanism is developed, which collects hybrid feedbacks on attributes and images. These feedbacks are then used to refine the search results based on A2 SH. We develop a content-based image retrieval system based on the proposed A2 SH. We conduct extensive experiments on a large-scale data set of over one million Web images. Experimental results show that the proposed A2 SH can characterize the semantic affinities among images accurately and can shape user search intent precisely and quickly, leading to more accurate search results as compared to state-of-the-art CBIR solutions.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

Neural Factorization Machines for Sparse Predictive Analytics

Xiangnan He, +1 more

TL;DR: Neural Factorization Machines (NFM) as discussed by the authors is a special case of NFM without hidden layers, which combines the linearity of FM in modelling second-order feature interactions and the non-linearity of neural network in modelling higher-order features.

...read moreread less

Proceedings ArticleDOI

Attentive Collaborative Filtering: Multimedia Recommendation with Item- and Component-Level Attention

Jingyuan Chen, +5 more

TL;DR: A novel attention mechanism in CF is introduced to address the challenging item- and component-level implicit feedback in multimedia recommendation, dubbed Attentive Collaborative Filtering (ACF), which significantly outperforms state-of-the-art CF methods.

...read moreread less

Posted Content

Neural Factorization Machines for Sparse Predictive Analytics

Xiangnan He, +1 more

- 16 Aug 2017 -

arXiv: Learning

TL;DR: Neural Factorization Machines (NFM) as mentioned in this paper is a special case of NFM without hidden layers, which combines the linearity of FM in modelling second-order feature interactions and the non-linearity of neural network in modelling higher order feature interactions.

...read moreread less

Proceedings ArticleDOI

Video Question Answering via Gradually Refined Attention over Appearance and Motion

Dejing Xu, +6 more

TL;DR: This paper proposes an end-to-end model which gradually refines its attention over the appearance and motion features of the video using the question as guidance and demonstrates the effectiveness of the model by analyzing the refined attention weights during the question answering procedure.

...read moreread less

Proceedings ArticleDOI

Zero-Shot Visual Recognition Using Semantics-Preserving Adversarial Embedding Networks

Long Chen, +4 more

TL;DR: Through adversarial learning of the two subspaces, SP-AEN can transfer the semantics from the reconstructive subspace to the discriminative one, accomplishing the improved zero-shot recognition of unseen classes.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings ArticleDOI

ImageNet: A large-scale hierarchical image database

Jia Deng, +5 more

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.

...read moreread less

Journal ArticleDOI

Distinctive Image Features from Scale-Invariant Keypoints

David G. Lowe

- 01 Nov 2004 -

International Journal of Computer Vision

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.

...read moreread less

Distinctive Image Features from Scale-Invariant Keypoints

Matthijs Dorst

TL;DR: The Scale-Invariant Feature Transform (or SIFT) algorithm is a highly robust method to extract and consequently match distinctive invariant features from images that can then be used to reliably match objects in diering images.

...read moreread less

Journal ArticleDOI

Content-based image retrieval at the end of the early years

Arnold W. M. Smeulders, +4 more

- 01 Dec 2000 -

IEEE Transactions on Pattern Analysis an...

TL;DR: The working conditions of content-based retrieval: patterns of use, types of pictures, the role of semantics, and the sensory gap are discussed, as well as aspects of system engineering: databases, system architecture, and evaluation.

...read moreread less

Proceedings Article

Distance Metric Learning for Large Margin Nearest Neighbor Classification

Kilian Q. Weinberger, +2 more

TL;DR: In this article, a Mahanalobis distance metric for k-NN classification is trained with the goal that the k-nearest neighbors always belong to the same class while examples from different classes are separated by a large margin.

...read moreread less

Collapse

Attribute-augmented semantic hierarchy: towards bridging semantic gap and intention gap in image retrieval

Citations

Neural Factorization Machines for Sparse Predictive Analytics

Attentive Collaborative Filtering: Multimedia Recommendation with Item- and Component-Level Attention

Neural Factorization Machines for Sparse Predictive Analytics

Video Question Answering via Gradually Refined Attention over Appearance and Motion

Zero-Shot Visual Recognition Using Semantics-Preserving Adversarial Embedding Networks

References

ImageNet: A large-scale hierarchical image database

Distinctive Image Features from Scale-Invariant Keypoints

Distinctive Image Features from Scale-Invariant Keypoints

Content-based image retrieval at the end of the early years

Distance Metric Learning for Large Margin Nearest Neighbor Classification

Related Papers (5)

Deep Residual Learning for Image Recognition

Very Deep Convolutional Networks for Large-Scale Image Recognition

Caffe: Convolutional Architecture for Fast Feature Embedding

Learning to detect unseen object classes by between-class attribute transfer

ImageNet Classification with Deep Convolutional Neural Networks