Proceedings Article

Object recognition from local scale-invariant features

David G. Lowe
Vol. 2, pp. 1150-1157
Abstract
An object recognition system has been developed that uses a new class of local image features. The features are invariant to image scaling, translation, and rotation, and partially invariant to illumination changes and affine or 3D projection. These features share similar properties with neurons in inferior temporal cortex that are used for object recognition in primate vision. Features are efficiently detected through a staged filtering approach that identifies stable points in scale space. Image keys are created that allow for local geometric deformations by representing blurred image gradients in multiple orientation planes and at multiple scales. The keys are used as input to a nearest neighbor indexing method that identifies candidate object matches. Final verification of each match is achieved by finding a low residual least squares solution for the unknown model parameters. Experimental results show that robust object recognition can be achieved in cluttered partially occluded images with a computation time of under 2 seconds.
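The verification step mentioned in the abstract (a low-residual least-squares solution for the unknown model parameters) can be illustrated with a small sketch. It assumes the model parameters are those of a 2D affine transform between matched model and image key locations; the abstract does not state this. The function names `solve_linear`, `fit_affine`, and `rms_residual` are hypothetical and chosen only for this example. It is not the author's implementation.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]
AffineParams = Tuple[Tuple[float, ...], Tuple[float, ...]]


def solve_linear(a: List[List[float]], b: List[float]) -> List[float]:
    """Solve a small dense system a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Augmented matrix [a | b]; rows are copied so the caller's data is untouched.
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("degenerate point configuration (e.g. collinear points)")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    # Back substitution on the upper-triangular system.
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - tail) / m[r][r]
    return x


def fit_affine(model_pts: Sequence[Point], image_pts: Sequence[Point]) -> AffineParams:
    """Least-squares fit of u = a*x + b*y + tx, v = c*x + d*y + ty.

    Assumption for this sketch: an affine model. Returns ((a, b, tx), (c, d, ty)).
    """
    if len(model_pts) != len(image_pts) or len(model_pts) < 3:
        raise ValueError("need at least 3 matched point pairs")
    rows = [(x, y, 1.0) for x, y in model_pts]
    # Normal matrix A^T A; it is shared by the u and v equations.
    ata = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    params = []
    for k in (0, 1):  # k = 0 fits the u coordinate, k = 1 fits v
        atb = [sum(r[i] * p[k] for r, p in zip(rows, image_pts)) for i in range(3)]
        params.append(tuple(solve_linear(ata, atb)))
    return params[0], params[1]


def rms_residual(params: AffineParams, model_pts: Sequence[Point],
                 image_pts: Sequence[Point]) -> float:
    """Root-mean-square distance between projected model points and image points."""
    (a, b, tx), (c, d, ty) = params
    sq = 0.0
    for (x, y), (u, v) in zip(model_pts, image_pts):
        sq += (a * x + b * y + tx - u) ** 2 + (c * x + d * y + ty - v) ** 2
    return (sq / len(model_pts)) ** 0.5
```

In a pipeline of this kind, a candidate match would typically be accepted only when the residual is small relative to some tolerance; the choice of tolerance is left open here because the abstract does not specify one.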


Citations
Journal Article

Multi-image Photogrammetry for Underwater Archaeological Site Recording: An Accessible, Diver-Based Approach

TL;DR: It is argued that the availability of integrated multi-image photogrammetry software, highly light-sensitive digital sensors and wide-aperture compact cameras now allows for simple workflows with minimal equipment and excellent natural colour images even at depths of up to 30 m.
Proceedings Article

Multi-view facial expression recognition

TL;DR: The authors' extensive person-independent experiments suggest that the SIFT descriptor outperforms HoG and LBP, and that LPP outperforms PCA and LDA in this application, but classifier fusion does not show a significant advantage over the SIFT-only classifier.
Journal Article

Assessing the performance of structure-from-motion photogrammetry and terrestrial LiDAR for reconstructing soil surface microtopography of naturally vegetated plots

TL;DR: This article assesses the performance of SfM and TLS technologies at reconstructing soil microtopography on 6 m × 2 m erosion plots with vegetation cover ranging from 0% to 77%.
Journal Article

Mapping folds and fractures in basement and cover rocks using UAV photogrammetry, Cape Liptrap and Cape Paterson, Victoria, Australia

TL;DR: In this article, a 3D implicit structural trend model was used to visualise along-strike changes of Devonian (Tabberabberan) folds at the Fold Stack locality and to estimate bulk shortening strain.
Dissertation

Interactions of visual attention and object recognition: computational modeling, algorithms, and psychophysics

TL;DR: A new model of bottom-up salient region selection is developed, which estimates the approximate extent of attended proto-objects in a biologically realistic manner and shows that attentional grouping based on bottom-up processes enables successive learning and recognition of multiple objects in cluttered natural scenes.