Object recognition from local scale-invariant features

doi:10.1109/ICCV.1999.790410

Proceedings ArticleDOI

Object recognition from local scale-invariant features

- Vol. 2, pp 1150-1157

TLDR

Experimental results show that robust object recognition can be achieved in cluttered partially occluded images with a computation time of under 2 seconds.

Abstract:

An object recognition system has been developed that uses a new class of local image features. The features are invariant to image scaling, translation, and rotation, and partially invariant to illumination changes and affine or 3D projection. These features share similar properties with neurons in inferior temporal cortex that are used for object recognition in primate vision. Features are efficiently detected through a staged filtering approach that identifies stable points in scale space. Image keys are created that allow for local geometric deformations by representing blurred image gradients in multiple orientation planes and at multiple scales. The keys are used as input to a nearest neighbor indexing method that identifies candidate object matches. Final verification of each match is achieved by finding a low residual least squares solution for the unknown model parameters. Experimental results show that robust object recognition can be achieved in cluttered partially occluded images with a computation time of under 2 seconds.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

NutriNet: A Deep Learning Food and Drink Image Recognition System for Dietary Assessment

Simon Mezgec, +1 more

- 27 Jun 2017 -

Nutrients

TL;DR: This work presents a novel approach to the problem of food and drink image detection and recognition that uses a newly-defined deep convolutional neural network architecture, called NutriNet, which is being used in practice as part of a mobile app for the dietary assessment of Parkinson’s disease patients.

...read moreread less

Proceedings ArticleDOI

From UI design image to GUI skeleton: a neural machine translator to bootstrap mobile GUI implementation

Chunyang Chen, +4 more

TL;DR: This paper presents a neural machine translator that combines recent advances in computer vision and machine translation for translating a UI design image into a GUI skeleton, without requiring manual rule development.

...read moreread less

Proceedings ArticleDOI

Group-sensitive multiple kernel learning for object categorization

Jingjing Yang, +4 more

TL;DR: A group-sensitive multiple kernel learning method to accommodate the intra-class diversity and the inter-class correlation for object categorization by introducing an intermediate representation “group” between images and object categories is proposed.

...read moreread less

Journal ArticleDOI

A review of recent advances in visual speech decoding

Ziheng Zhou, +3 more

- 01 Sep 2014 -

Image and Vision Computing

TL;DR: A detailed review of recent advances in visual speech decoding, focusing on the important questions asked by researchers and summarize the recent studies that attempt to answer them, and providing details of audio-visual speech databases.

...read moreread less

Proceedings ArticleDOI

N-sift: n-dimensional scale invariant feature transform for matching medical images

Warren A. Cheung, +1 more

TL;DR: This method extends the concepts used in the computer vision SIFT technique for extracting and matching distinctive scale invariant features in 2D scalar images to scalar image of arbitrary dimensionality by using hyperspherical coordinates for gradients and multidimensional histograms to create the feature vectors.

...read moreread less

Zhengyou Zhang, +3 more

- 15 Oct 1995 -

Artificial Intelligence

TL;DR: A robust approach to image matching by exploiting the only available geometric constraint, namely, the epipolar constraint, is proposed and a new strategy for updating matches is developed, which only selects those matches having both high matching support and low matching ambiguity.

...read moreread less

Object recognition from local scale-invariant features

Citations

NutriNet: A Deep Learning Food and Drink Image Recognition System for Dietary Assessment

From UI design image to GUI skeleton: a neural machine translator to bootstrap mobile GUI implementation

Group-sensitive multiple kernel learning for object categorization

A review of recent advances in visual speech decoding

N-sift: n-dimensional scale invariant feature transform for matching medical images

References

Color indexing

Generalizing the hough transform to detect arbitrary shapes

Visual learning and recognition of 3-D objects from appearance

Local grayvalue invariants for image retrieval

A robust technique for matching two uncalibrated images through the recovery of the unknown epipolar geometry

Related Papers (5)

Distinctive Image Features from Scale-Invariant Keypoints

SURF: speeded up robust features

Histograms of oriented gradients for human detection

A Combined Corner and Edge Detector

Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography