scispace - formally typeset
Journal ArticleDOI

Compressed Histogram of Gradients: A Low-Bitrate Descriptor

TLDR
A framework for computing low bit-rate feature descriptors with a 20× reduction in bit rate compared to state-of-the-art descriptors is proposed and it is shown how to efficiently compute distances between descriptors in the compressed domain eliminating the need for decoding.
Abstract
Establishing visual correspondences is an essential component of many computer vision problems, which is often done with local feature-descriptors Transmission and storage of these descriptors are of critical importance in the context of mobile visual search applications We propose a framework for computing low bit-rate feature descriptors with a 20× reduction in bit rate compared to state-of-the-art descriptors The framework offers low complexity and has significant speed-up in the matching stage We show how to efficiently compute distances between descriptors in the compressed domain eliminating the need for decoding We perform a comprehensive performance comparison with SIFT, SURF, BRIEF, MPEG-7 image signatures and other low bit-rate descriptors and show that our proposed CHoG descriptor outperforms existing schemes significantly over a wide range of bitrates We implement the descriptor in a mobile image retrieval system and for a database of 1 million CD, DVD and book covers, we achieve 96% retrieval accuracy using only 4 KB of data per query image

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Mobile Visual Search

TL;DR: Mobile phones have evolved into powerful image and video processing devices equipped with high-resolution cameras, color displays, and hardware-accelerated graphics, which enables a new class of applications that use the camera phone to initiate search queries about objects in visual proximity to the user.
Book

Fundamentals of multimedia

TL;DR: This textbook introduces the Fundamentals of Multimedia, addressing real issues commonly faced in the workplace, and includes coverage of such topics as 3D TV, social networks, high-efficiency video compression and conferencing, wireless and mobile networks, and their attendant technologies.
Proceedings ArticleDOI

Multimodal feature fusion for robust event detection in web videos

TL;DR: This work rigorously analyzes and combines a large set of low-level features that capture appearance, color, motion, audio and audio-visual co-occurrence patterns in videos and exploits multimodal information by analyzing available spoken and videotext content using state-of-the-art automatic speech recognition (ASR) and Videotext recognition systems.
Journal ArticleDOI

Understanding network failures in data centers

TL;DR: In this paper, the authors present the first large-scale analysis of failures in a data center network and seek to answer several fundamental questions: which devices/links are most unreliable, what...

IEEE International Conference on Image Processing (ICIP)

TL;DR: The IVMSP Technical Committee will review the proposal, and if it so chooses, will endorse the proposal and forward it to the Conference Board, which will recommend the proposal to the Board of Governors for final approval.
References
More filters
Journal ArticleDOI

Distinctive Image Features from Scale-Invariant Keypoints

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.
Book

Elements of information theory

TL;DR: The author examines the role of entropy, inequality, and randomness in the design of codes and the construction of codes in the rapidly changing environment.
Proceedings ArticleDOI

Histograms of oriented gradients for human detection

TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.
Proceedings ArticleDOI

Object recognition from local scale-invariant features

TL;DR: Experimental results show that robust object recognition can be achieved in cluttered partially occluded images with a computation time of under 2 seconds.

Distinctive Image Features from Scale-Invariant Keypoints

TL;DR: The Scale-Invariant Feature Transform (or SIFT) algorithm is a highly robust method to extract and consequently match distinctive invariant features from images that can then be used to reliably match objects in diering images.
Related Papers (5)