scispace - formally typeset
Proceedings ArticleDOI

Multiple feature hashing for real-time large scale near-duplicate video retrieval

Reads0
Chats0
TLDR
This paper presents a novel approach - Multiple Feature Hashing (MFH) to tackle both the accuracy and the scalability issues of NDVR and shows that the proposed method outperforms the state-of-the-art techniques in both accuracy and efficiency.
Abstract
Near-duplicate video retrieval (NDVR) has recently attracted lots of research attention due to the exponential growth of online videos. It helps in many areas, such as copyright protection, video tagging, online video usage monitoring, etc. Most of existing approaches use only a single feature to represent a video for NDVR. However, a single feature is often insufficient to characterize the video content. Besides, while the accuracy is the main concern in previous literatures, the scalability of NDVR algorithms for large scale video datasets has been rarely addressed. In this paper, we present a novel approach - Multiple Feature Hashing (MFH) to tackle both the accuracy and the scalability issues of NDVR. MFH preserves the local structure information of each individual feature and also globally consider the local structures for all the features to learn a group of hash functions which map the video keyframes into the Hamming space and generate a series of binary codes to represent the video dataset. We evaluate our approach on a public video dataset and a large scale video dataset consisting of 132,647 videos, which was collected from YouTube by ourselves. The experiment results show that the proposed method outperforms the state-of-the-art techniques in both accuracy and efficiency.

read more

Citations
More filters
Proceedings Article

Large-scale supervised multimodal hashing with semantic correlation maximization

TL;DR: A novel SMH method, called semantic correlation maximization (SCM), is proposed to seamlessly integrate semantic labels into the hashing learning procedure for large-scale data modeling, and experimental results show that SCM can significantly outperform the state-of-the-art SMH methods, in terms of both accuracy and scalability.
Proceedings ArticleDOI

Inter-media hashing for large-scale retrieval from heterogeneous data sources

TL;DR: A novel inter-media hashing (IMH) model is proposed to explore the correlations among multiple media types from different data sources and tackle the scalability issue, which transforms multimedia data from heterogeneous data sources into a common Hamming space, in which fast search can be easily implemented by XOR and bit-count operations.
Journal ArticleDOI

Learning to Hash for Indexing Big Data—A Survey

TL;DR: Learning-to-Hash (LHT) as mentioned in this paper is one of the most popular methods for approximate nearest neighbor (ANN) search in big data applications, which can exploit information such as data distributions or class labels when optimizing the hash codes or functions.
Proceedings ArticleDOI

Deep Cross-Modal Hashing

TL;DR: Deep cross-modal hashing (DCMH) as mentioned in this paper is an end-to-end learning framework with deep neural networks, one for each modality, to perform feature learning from scratch.
Posted Content

Deep Cross-Modal Hashing

TL;DR: DCMH is an end-to-end learning framework with deep neural networks, one for each modality, to perform feature learning from scratch and can outperform other baselines to achieve the state-of-the-art performance in cross-modal retrieval applications.
References
More filters
Book ChapterDOI

SURF: speeded up robust features

TL;DR: A novel scale- and rotation-invariant interest point detector and descriptor, coined SURF (Speeded Up Robust Features), which approximates or even outperforms previously proposed schemes with respect to repeatability, distinctiveness, and robustness, yet can be computed and compared much faster.
Journal ArticleDOI

A performance evaluation of local descriptors

TL;DR: It is observed that the ranking of the descriptors is mostly independent of the interest region detector and that the SIFT-based descriptors perform best and Moments and steerable filters show the best performance among the low dimensional descriptors.
Proceedings ArticleDOI

A performance evaluation of local descriptors

TL;DR: It is observed that the ranking of the descriptors is mostly independent of the interest region detector and that the SIFT-based descriptors perform best and Moments and steerable filters show the best performance among the low dimensional descriptors.
Proceedings ArticleDOI

Locality-sensitive hashing scheme based on p-stable distributions

TL;DR: A novel Locality-Sensitive Hashing scheme for the Approximate Nearest Neighbor Problem under lp norm, based on p-stable distributions that improves the running time of the earlier algorithm and yields the first known provably efficient approximate NN algorithm for the case p<1.
Proceedings Article

Spectral Hashing

TL;DR: The problem of finding a best code for a given dataset is closely related to the problem of graph partitioning and can be shown to be NP hard and a spectral method is obtained whose solutions are simply a subset of thresholded eigenvectors of the graph Laplacian.
Related Papers (5)