Distinctive Image Features from Scale-Invariant Keypoints

doi:10.1023/B:VISI.0000029664.99615.94

Journal ArticleDOI

Distinctive Image Features from Scale-Invariant Keypoints

David G. Lowe

- 01 Nov 2004 -

International Journal of Computer Vision

- Vol. 60, Iss: 2, pp 91-110

Chats0

TLDR

This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.

Abstract:

This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene. The features are invariant to image scale and rotation, and are shown to provide robust matching across a substantial range of affine distortion, change in 3D viewpoint, addition of noise, and change in illumination. The features are highly distinctive, in the sense that a single feature can be correctly matched with high probability against a large database of features from many images. This paper also describes an approach to using these features for object recognition. The recognition proceeds by matching individual features to a database of features from known objects using a fast nearest-neighbor algorithm, followed by a Hough transform to identify clusters belonging to a single object, and finally performing verification through least-squares solution for consistent pose parameters. This approach to recognition can robustly identify objects among clutter and occlusion while achieving near real-time performance.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

Learning and Transferring Mid-level Image Representations Using Convolutional Neural Networks

Maxime Oquab, +4 more

TL;DR: This work designs a method to reuse layers trained on the ImageNet dataset to compute mid-level image representation for images in the PASCAL VOC dataset, and shows that despite differences in image statistics and tasks in the two datasets, the transferred representation leads to significantly improved results for object and action classification.

...read moreread less

Proceedings ArticleDOI

BRISK: Binary Robust invariant scalable keypoints

Stefan Leutenegger, +2 more

TL;DR: A comprehensive evaluation on benchmark datasets reveals BRISK's adaptive, high quality performance as in state-of-the-art algorithms, albeit at a dramatically lower computational cost (an order of magnitude faster than SURF in cases).

...read moreread less

Proceedings ArticleDOI

Object retrieval with large vocabularies and fast spatial matching

James Philbin, +4 more

TL;DR: To improve query performance, this work adds an efficient spatial verification stage to re-rank the results returned from the bag-of-words model and shows that this consistently improves search quality, though by less of a margin when the visual vocabulary is large.

...read moreread less

Journal ArticleDOI

Pedestrian Detection: An Evaluation of the State of the Art

Piotr Dollár, +3 more

- 01 Apr 2012 -

IEEE Transactions on Pattern Analysis an...

TL;DR: An extensive evaluation of the state of the art in a unified framework of monocular pedestrian detection using sixteen pretrained state-of-the-art detectors across six data sets and proposes a refined per-frame evaluation methodology.

...read moreread less

Journal ArticleDOI

Tracking-Learning-Detection

Zdenek Kalal, +2 more

- 01 Jul 2012 -

IEEE Transactions on Pattern Analysis an...

TL;DR: A novel tracking framework (TLD) that explicitly decomposes the long-term tracking task into tracking, learning, and detection, and develops a novel learning method (P-N learning) which estimates the errors by a pair of “experts”: P-expert estimates missed detections, and N-ex Expert estimates false alarms.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings ArticleDOI

Object recognition from local scale-invariant features

David G. Lowe

TL;DR: Experimental results show that robust object recognition can be achieved in cluttered partially occluded images with a computation time of under 2 seconds.

...read moreread less

Book

Multiple view geometry in computer vision

Richard Hartley, +1 more

TL;DR: In this article, the authors provide comprehensive background material and explain how to apply the methods and implement the algorithms directly in a unified framework, including geometric principles and how to represent objects algebraically so they can be computed and applied.

...read moreread less

Multiple View Geometry in Computer Vision.

Bernhard P. Wrobel

TL;DR: This book is referred to read because it is an inspiring book to give you more chance to get experiences and also thoughts and it will show the best book collections and completed collections.

...read moreread less

Proceedings ArticleDOI

A Combined Corner and Edge Detector

Chris Harris, +1 more

TL;DR: The problem the authors are addressing in Alvey Project MMI149 is that of using computer vision to understand the unconstrained 3D world, in which the viewed scenes will in general contain too wide a diversity of objects for topdown recognition techniques to work.

...read moreread less

Journal ArticleDOI

Robust wide-baseline stereo from maximally stable extremal regions

Jiri Matas, +3 more

- 01 Sep 2004 -

Image and Vision Computing

TL;DR: The high utility of MSERs, multiple measurement regions and the robust metric is demonstrated in wide-baseline experiments on image pairs from both indoor and outdoor scenes.

...read moreread less

Collapse

Distinctive Image Features from Scale-Invariant Keypoints

Citations

Learning and Transferring Mid-level Image Representations Using Convolutional Neural Networks

BRISK: Binary Robust invariant scalable keypoints

Object retrieval with large vocabularies and fast spatial matching

Pedestrian Detection: An Evaluation of the State of the Art

Tracking-Learning-Detection

References

Object recognition from local scale-invariant features

Multiple view geometry in computer vision

Multiple View Geometry in Computer Vision.

A Combined Corner and Edge Detector

Robust wide-baseline stereo from maximally stable extremal regions

Related Papers (5)

SURF: speeded up robust features

Histograms of oriented gradients for human detection

Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography

Speeded-Up Robust Features (SURF)

A Combined Corner and Edge Detector

Trending Questions (1)