Object recognition from local scale-invariant features

doi:10.1109/ICCV.1999.790410

Proceedings ArticleDOI

Object recognition from local scale-invariant features

- Vol. 2, pp 1150-1157

TLDR

Experimental results show that robust object recognition can be achieved in cluttered partially occluded images with a computation time of under 2 seconds.

Abstract:

An object recognition system has been developed that uses a new class of local image features. The features are invariant to image scaling, translation, and rotation, and partially invariant to illumination changes and affine or 3D projection. These features share similar properties with neurons in inferior temporal cortex that are used for object recognition in primate vision. Features are efficiently detected through a staged filtering approach that identifies stable points in scale space. Image keys are created that allow for local geometric deformations by representing blurred image gradients in multiple orientation planes and at multiple scales. The keys are used as input to a nearest neighbor indexing method that identifies candidate object matches. Final verification of each match is achieved by finding a low residual least squares solution for the unknown model parameters. Experimental results show that robust object recognition can be achieved in cluttered partially occluded images with a computation time of under 2 seconds.

Citations

PDF

Open Access

More filters

Posted Content

Generic decoding of seen and imagined objects using hierarchical visual features

Tomoyasu Horikawa, +1 more

- 22 Oct 2015 -

arXiv: Neurons and Cognition

TL;DR: In this article, the authors present a decoding approach for arbitrary objects, using the machine vision principle that an object category is represented by a set of features rendered invariant through hierarchical processing.

...read moreread less

Patent

Systems and methods for using multiple hypotheses in a visual simultaneous localization and mapping system

Niklas Karlsson, +3 more

TL;DR: In this paper, the authors use a visual sensor and dead reckoning sensors to process simultaneous localization and mapping (SLAM) in robot navigation, which can be used to autonomously generate and update a map.

...read moreread less

Patent

Methods and System for Performing 3-D Tool Tracking by Fusion of Sensor and/or Camera Derived Data During Minimally Invasive Robotic Surgery

Brian D. Hoffman, +4 more

TL;DR: In this paper, the authors used triangulation techniques or a Bayesian filter to determine tool states using both non-and endoscopically derived and visually derived tool state information.

...read moreread less

Proceedings ArticleDOI

Learning Image Embeddings using Convolutional Neural Networks for Improved Multi-Modal Semantics

Douwe Kiela, +1 more

TL;DR: This work constructs multi-modal concept representations by concatenating a skip-gram linguistic representation vector with a visual concept representation vector computed using the feature extraction layers of a deep convolutional neural network trained on a large labeled object recognition dataset.

...read moreread less

Journal ArticleDOI

A Review on Human Activity Recognition Using Vision-Based Method

Shugang Zhang, +5 more

- 20 Jul 2017 -

Journal of Healthcare Engineering

TL;DR: This review highlights the advances of state-of-the-art activity recognition approaches, especially for the activity representation and classification methods, and classify existing literatures with a detailed taxonomy including representation and Classification methods, as well as the datasets they used.

...read moreread less

Zhengyou Zhang, +3 more

- 15 Oct 1995 -

Artificial Intelligence

TL;DR: A robust approach to image matching by exploiting the only available geometric constraint, namely, the epipolar constraint, is proposed and a new strategy for updating matches is developed, which only selects those matches having both high matching support and low matching ambiguity.

...read moreread less

Object recognition from local scale-invariant features

Citations

Generic decoding of seen and imagined objects using hierarchical visual features

Systems and methods for using multiple hypotheses in a visual simultaneous localization and mapping system

Methods and System for Performing 3-D Tool Tracking by Fusion of Sensor and/or Camera Derived Data During Minimally Invasive Robotic Surgery

Learning Image Embeddings using Convolutional Neural Networks for Improved Multi-Modal Semantics

A Review on Human Activity Recognition Using Vision-Based Method

References

Color indexing

Generalizing the hough transform to detect arbitrary shapes

Visual learning and recognition of 3-D objects from appearance

Local grayvalue invariants for image retrieval

A robust technique for matching two uncalibrated images through the recovery of the unknown epipolar geometry

Related Papers (5)

Distinctive Image Features from Scale-Invariant Keypoints

SURF: speeded up robust features

Histograms of oriented gradients for human detection

A Combined Corner and Edge Detector

Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography