Three things everyone should know to improve object retrieval

doi:10.1109/CVPR.2012.6248018

Proceedings ArticleDOI

Three things everyone should know to improve object retrieval

Relja Arandjelovic, +1 more

- pp 2911-2918

Chats0

TLDR

A new method to compare SIFT descriptors (RootSIFT) which yields superior performance without increasing processing or storage requirements, and a novel method for query expansion where a richer model for the query is learnt discriminatively in a form suited to immediate retrieval through efficient use of the inverted index.

Abstract:

The objective of this work is object retrieval in large scale image datasets, where the object is specified by an image query and retrieval should be immediate at run time in the manner of Video Google [28]. We make the following three contributions: (i) a new method to compare SIFT descriptors (RootSIFT) which yields superior performance without increasing processing or storage requirements; (ii) a novel method for query expansion where a richer model for the query is learnt discriminatively in a form suited to immediate retrieval through efficient use of the inverted index; (iii) an improvement of the image augmentation method proposed by Turcot and Lowe [29], where only the augmenting features which are spatially consistent with the augmented image are kept. We evaluate these three methods over a number of standard benchmark datasets (Oxford Buildings 5k and 105k, and Paris 6k) and demonstrate substantial improvements in retrieval performance whilst maintaining immediate retrieval speeds. Combining these complementary methods achieves a new state-of-the-art performance on these datasets.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

ImageNet Large Scale Visual Recognition Challenge

Olga Russakovsky, +11 more

- 01 Dec 2015 -

International Journal of Computer Vision

TL;DR: The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) as mentioned in this paper is a benchmark in object category classification and detection on hundreds of object categories and millions of images, which has been run annually from 2010 to present, attracting participation from more than fifty institutions.

...read moreread less

Posted Content

CNN Features off-the-shelf: an Astounding Baseline for Recognition

Ali Sharif Razavian, +3 more

- 23 Mar 2014 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: A series of experiments conducted for different recognition tasks using the publicly available code and model of the OverFeat network which was trained to perform object classification on ILSVRC13 suggest that features obtained from deep learning with convolutional nets should be the primary candidate in most visual recognition tasks.

...read moreread less

Proceedings ArticleDOI

Scalable Person Re-identification: A Benchmark

Liang Zheng, +6 more

TL;DR: A minor contribution, inspired by recent advances in large-scale image search, an unsupervised Bag-of-Words descriptor is proposed that yields competitive accuracy on VIPeR, CUHK03, and Market-1501 datasets, and is scalable on the large- scale 500k dataset.

...read moreread less

Proceedings ArticleDOI

Return of the Devil in the Details: Delving Deep into Convolutional Nets

Ken Chatfield, +3 more

TL;DR: It is shown that the data augmentation techniques commonly applied to CNN-based methods can also be applied to shallow methods, and result in an analogous performance boost, and it is identified that the dimensionality of the CNN output layer can be reduced significantly without having an adverse effect on performance.

...read moreread less

Proceedings ArticleDOI

Action Recognition with Improved Trajectories

Heng Wang, +1 more

TL;DR: Dense trajectories were shown to be an efficient video representation for action recognition and achieved state-of-the-art results on a variety of datasets are improved by taking into account camera motion to correct them.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Distinctive Image Features from Scale-Invariant Keypoints

David G. Lowe

- 01 Nov 2004 -

International Journal of Computer Vision

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.

...read moreread less

Journal ArticleDOI

LIBSVM: A library for support vector machines

Chih-Chung Chang, +1 more

- 06 May 2011 -

ACM Transactions on Intelligent Systems ...

TL;DR: Issues such as solving SVM optimization problems theoretical convergence multiclass classification probability estimates and parameter selection are discussed in detail.

...read moreread less

Proceedings ArticleDOI

Video Google: a text retrieval approach to object matching in videos

Sivic, +1 more

TL;DR: An approach to object and scene retrieval which searches for and localizes all the occurrences of a user outlined object in a video, represented by a set of viewpoint invariant region descriptors so that recognition can proceed successfully despite changes in viewpoint, illumination and partial occlusion.

...read moreread less

Journal ArticleDOI

Scale & Affine Invariant Interest Point Detectors

Krystian Mikolajczyk, +1 more

- 01 Oct 2004 -

International Journal of Computer Vision

TL;DR: A comparative evaluation of different detectors is presented and it is shown that the proposed approach for detecting interest points invariant to scale and affine transformations provides better results than existing methods.

...read moreread less

Proceedings ArticleDOI

Scalable Recognition with a Vocabulary Tree

David Nister, +1 more

TL;DR: A recognition scheme that scales efficiently to a large number of objects and allows a larger and more discriminatory vocabulary to be used efficiently is presented, which it is shown experimentally leads to a dramatic improvement in retrieval quality.

...read moreread less

Collapse

Three things everyone should know to improve object retrieval

Citations

ImageNet Large Scale Visual Recognition Challenge

CNN Features off-the-shelf: an Astounding Baseline for Recognition

Scalable Person Re-identification: A Benchmark

Return of the Devil in the Details: Delving Deep into Convolutional Nets

Action Recognition with Improved Trajectories

References

Distinctive Image Features from Scale-Invariant Keypoints

LIBSVM: A library for support vector machines

Video Google: a text retrieval approach to object matching in videos

Scale & Affine Invariant Interest Point Detectors

Scalable Recognition with a Vocabulary Tree

Related Papers (5)

Distinctive Image Features from Scale-Invariant Keypoints

Object retrieval with large vocabularies and fast spatial matching

Video Google: a text retrieval approach to object matching in videos

Scalable Recognition with a Vocabulary Tree

Improving the fisher kernel for large-scale image classification