scispace - formally typeset
Proceedings ArticleDOI

Attribute-augmented semantic hierarchy: towards bridging semantic gap and intention gap in image retrieval

TLDR
Experimental results show that the proposed A2 SH can characterize the semantic affinities among images accurately and can shape user search intent precisely and quickly, leading to more accurate search results as compared to state-of-the-art CBIR solutions.
Abstract
This paper presents a novel Attribute-augmented Semantic Hierarchy (A2 SH) and demonstrates its effectiveness in bridging both the semantic and intention gaps in Content-based Image Retrieval (CBIR). A2 SH organizes the semantic concepts into multiple semantic levels and augments each concept with a set of related attributes, which describe the multiple facets of the concept and act as the intermediate bridge connecting the concept and low-level visual content. A hierarchical semantic similarity function is learnt to characterize the semantic similarities among images for retrieval. To better capture user search intent, a hybrid feedback mechanism is developed, which collects hybrid feedbacks on attributes and images. These feedbacks are then used to refine the search results based on A2 SH. We develop a content-based image retrieval system based on the proposed A2 SH. We conduct extensive experiments on a large-scale data set of over one million Web images. Experimental results show that the proposed A2 SH can characterize the semantic affinities among images accurately and can shape user search intent precisely and quickly, leading to more accurate search results as compared to state-of-the-art CBIR solutions.

read more

Citations
More filters
Proceedings ArticleDOI

Neural Factorization Machines for Sparse Predictive Analytics

TL;DR: Neural Factorization Machines (NFM) as discussed by the authors is a special case of NFM without hidden layers, which combines the linearity of FM in modelling second-order feature interactions and the non-linearity of neural network in modelling higher-order features.
Proceedings ArticleDOI

Attentive Collaborative Filtering: Multimedia Recommendation with Item- and Component-Level Attention

TL;DR: A novel attention mechanism in CF is introduced to address the challenging item- and component-level implicit feedback in multimedia recommendation, dubbed Attentive Collaborative Filtering (ACF), which significantly outperforms state-of-the-art CF methods.
Posted Content

Neural Factorization Machines for Sparse Predictive Analytics

TL;DR: Neural Factorization Machines (NFM) as mentioned in this paper is a special case of NFM without hidden layers, which combines the linearity of FM in modelling second-order feature interactions and the non-linearity of neural network in modelling higher order feature interactions.
Proceedings ArticleDOI

Video Question Answering via Gradually Refined Attention over Appearance and Motion

TL;DR: This paper proposes an end-to-end model which gradually refines its attention over the appearance and motion features of the video using the question as guidance and demonstrates the effectiveness of the model by analyzing the refined attention weights during the question answering procedure.
Proceedings ArticleDOI

Zero-Shot Visual Recognition Using Semantics-Preserving Adversarial Embedding Networks

TL;DR: Through adversarial learning of the two subspaces, SP-AEN can transfer the semantics from the reconstructive subspace to the discriminative one, accomplishing the improved zero-shot recognition of unseen classes.
References
More filters
Proceedings ArticleDOI

ImageNet: A large-scale hierarchical image database

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.
Journal ArticleDOI

Distinctive Image Features from Scale-Invariant Keypoints

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.

Distinctive Image Features from Scale-Invariant Keypoints

TL;DR: The Scale-Invariant Feature Transform (or SIFT) algorithm is a highly robust method to extract and consequently match distinctive invariant features from images that can then be used to reliably match objects in diering images.
Journal ArticleDOI

Content-based image retrieval at the end of the early years

TL;DR: The working conditions of content-based retrieval: patterns of use, types of pictures, the role of semantics, and the sensory gap are discussed, as well as aspects of system engineering: databases, system architecture, and evaluation.
Proceedings Article

Distance Metric Learning for Large Margin Nearest Neighbor Classification

TL;DR: In this article, a Mahanalobis distance metric for k-NN classification is trained with the goal that the k-nearest neighbors always belong to the same class while examples from different classes are separated by a large margin.
Related Papers (5)