Object recognition from local scale-invariant features

doi:10.1109/ICCV.1999.790410

Proceedings ArticleDOI

Object recognition from local scale-invariant features

- Vol. 2, pp 1150-1157

TLDR

Experimental results show that robust object recognition can be achieved in cluttered partially occluded images with a computation time of under 2 seconds.

Abstract:

An object recognition system has been developed that uses a new class of local image features. The features are invariant to image scaling, translation, and rotation, and partially invariant to illumination changes and affine or 3D projection. These features share similar properties with neurons in inferior temporal cortex that are used for object recognition in primate vision. Features are efficiently detected through a staged filtering approach that identifies stable points in scale space. Image keys are created that allow for local geometric deformations by representing blurred image gradients in multiple orientation planes and at multiple scales. The keys are used as input to a nearest neighbor indexing method that identifies candidate object matches. Final verification of each match is achieved by finding a low residual least squares solution for the unknown model parameters. Experimental results show that robust object recognition can be achieved in cluttered partially occluded images with a computation time of under 2 seconds.

Citations

PDF

Open Access

More filters

Posted Content

#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning

Haoran Tang, +8 more

- 15 Nov 2016 -

arXiv: Artificial Intelligence

TL;DR: A simple generalization of the classic count-based approach can reach near state-of-the-art performance on various high-dimensional and/or continuous deep RL benchmarks, and is found that simple hash functions can achieve surprisingly good results on many challenging tasks.

...read moreread less

Proceedings Article

Distributional Semantics in Technicolor

Elia Bruni, +3 more

TL;DR: While visual models with state-of-the-art computer vision techniques perform worse than textual models in general tasks, they are as good or better models of the meaning of words with visual correlates such as color terms, even in a nontrivial task that involves nonliteral uses of such words.

...read moreread less

Proceedings ArticleDOI

Nonparametric scene parsing: Label transfer via dense scene alignment

Ce Liu, +2 more

TL;DR: Compared to existing object recognition approaches that require training for each object category, the proposed nonparametric scene parsing system is easy to implement, has few parameters, and embeds contextual information naturally in the retrieval/alignment procedure.

...read moreread less

Posted Content

Human Action Recognition using Factorized Spatio-Temporal Convolutional Networks

Lin Sun, +3 more

- 02 Oct 2015 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: In this paper, a factorized spatio-temporal convolutional networks (FstCN) is proposed to factorize the original 3D convolution kernel learning as a sequential process of learning 2D spatial kernels in the lower layers, followed by learning 1D temporal kernel in the upper layers.

...read moreread less

Proceedings ArticleDOI

POOF: Part-Based One-vs.-One Features for Fine-Grained Categorization, Face Verification, and Attribute Estimation

Thomas Berg, +1 more

TL;DR: A method to automatically learn a large and diverse set of highly discriminative intermediate features that are called Part-based One-vs-One Features (POOFs), each of these features specializes in discrimination between two particular classes based on the appearance at a particular part.

...read moreread less

Zhengyou Zhang, +3 more

- 15 Oct 1995 -

Artificial Intelligence

TL;DR: A robust approach to image matching by exploiting the only available geometric constraint, namely, the epipolar constraint, is proposed and a new strategy for updating matches is developed, which only selects those matches having both high matching support and low matching ambiguity.

...read moreread less

Object recognition from local scale-invariant features

Citations

#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning

Distributional Semantics in Technicolor

Nonparametric scene parsing: Label transfer via dense scene alignment

Human Action Recognition using Factorized Spatio-Temporal Convolutional Networks

POOF: Part-Based One-vs.-One Features for Fine-Grained Categorization, Face Verification, and Attribute Estimation

References

Color indexing

Generalizing the hough transform to detect arbitrary shapes

Visual learning and recognition of 3-D objects from appearance

Local grayvalue invariants for image retrieval

A robust technique for matching two uncalibrated images through the recovery of the unknown epipolar geometry

Related Papers (5)

Distinctive Image Features from Scale-Invariant Keypoints

SURF: speeded up robust features

Histograms of oriented gradients for human detection

A Combined Corner and Edge Detector

Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography