Object recognition from local scale-invariant features

doi:10.1109/ICCV.1999.790410

Proceedings ArticleDOI

Object recognition from local scale-invariant features

- Vol. 2, pp 1150-1157

TLDR

Experimental results show that robust object recognition can be achieved in cluttered partially occluded images with a computation time of under 2 seconds.

Abstract:

An object recognition system has been developed that uses a new class of local image features. The features are invariant to image scaling, translation, and rotation, and partially invariant to illumination changes and affine or 3D projection. These features share similar properties with neurons in inferior temporal cortex that are used for object recognition in primate vision. Features are efficiently detected through a staged filtering approach that identifies stable points in scale space. Image keys are created that allow for local geometric deformations by representing blurred image gradients in multiple orientation planes and at multiple scales. The keys are used as input to a nearest neighbor indexing method that identifies candidate object matches. Final verification of each match is achieved by finding a low residual least squares solution for the unknown model parameters. Experimental results show that robust object recognition can be achieved in cluttered partially occluded images with a computation time of under 2 seconds.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

An Introduction to Deep Learning for the Physical Layer

Timothy J. O'Shea, +1 more

- 02 Oct 2017 -

IEEE Transactions on Cognitive Communica...

TL;DR: In this article, an end-to-end reconstruction task was proposed to jointly optimize transmitter and receiver components in a single process, which can be extended to networks of multiple transmitters and receivers.

...read moreread less

Proceedings ArticleDOI

FREAK: Fast Retina Keypoint

Alexandre Alahi, +2 more

TL;DR: This work proposes a novel keypoint descriptor inspired by the human visual system and more precisely the retina, coined Fast Retina Keypoint (FREAK), which is in general faster to compute with lower memory load and also more robust than SIFT, SURF or BRISK.

...read moreread less

Journal ArticleDOI

80 Million Tiny Images: A Large Data Set for Nonparametric Object and Scene Recognition

Antonio Torralba, +2 more

- 01 Nov 2008 -

IEEE Transactions on Pattern Analysis an...

TL;DR: For certain classes that are particularly prevalent in the dataset, such as people, this work is able to demonstrate a recognition performance comparable to class-specific Viola-Jones style detectors.

...read moreread less

Book ChapterDOI

Stacked convolutional auto-encoders for hierarchical feature extraction

Jonathan Masci, +3 more

TL;DR: A novel convolutional auto-encoder (CAE) for unsupervised feature learning that initializing a CNN with filters of a trained CAE stack yields superior performance on a digit and an object recognition benchmark.

...read moreread less

Journal ArticleDOI

Deep learning applications and challenges in big data analytics

Maryam M. Najafabadi, +5 more

- 24 Feb 2015 -

Journal of Big Data

TL;DR: This study explores how Deep Learning can be utilized for addressing some important problems in Big Data Analytics, including extracting complex patterns from massive volumes of data, semantic indexing, data tagging, fast information retrieval, and simplifying discriminative tasks.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Color indexing

Michael J. Swain, +1 more

- 01 Nov 1991 -

International Journal of Computer Vision

TL;DR: In this paper, color histograms of multicolored objects provide a robust, efficient cue for indexing into a large database of models, and they can differentiate among a large number of objects.

...read moreread less

Journal ArticleDOI

Generalizing the hough transform to detect arbitrary shapes

Dana H. Ballard

- 01 Jan 1987 -

Pattern Recognition

TL;DR: It is shown how the boundaries of an arbitrary non-analytic shape can be used to construct a mapping between image space and Hough transform space, which makes the generalized Houghtransform a kind of universal transform which can beused to find arbitrarily complex shapes.

...read moreread less

Journal ArticleDOI

Visual learning and recognition of 3-D objects from appearance

Hiroshi Murase, +1 more

- 01 Jan 1995 -

International Journal of Computer Vision

TL;DR: A near real-time recognition system with 20 complex objects in the database has been developed and a compact representation of object appearance is proposed that is parametrized by pose and illumination.

...read moreread less

Journal ArticleDOI

Local grayvalue invariants for image retrieval

Cordelia Schmid, +1 more

- 01 May 1997 -

IEEE Transactions on Pattern Analysis an...

TL;DR: This paper addresses the problem of retrieving images from large image databases with a method based on local grayvalue invariants which are computed at automatically detected interest points and allows for efficient retrieval from a database of more than 1,000 images.

...read moreread less

Journal ArticleDOI

A robust technique for matching two uncalibrated images through the recovery of the unknown epipolar geometry

Zhengyou Zhang, +3 more

- 15 Oct 1995 -

Artificial Intelligence

TL;DR: A robust approach to image matching by exploiting the only available geometric constraint, namely, the epipolar constraint, is proposed and a new strategy for updating matches is developed, which only selects those matches having both high matching support and low matching ambiguity.

...read moreread less

Object recognition from local scale-invariant features

Citations

An Introduction to Deep Learning for the Physical Layer

FREAK: Fast Retina Keypoint

80 Million Tiny Images: A Large Data Set for Nonparametric Object and Scene Recognition

Stacked convolutional auto-encoders for hierarchical feature extraction

Deep learning applications and challenges in big data analytics

References

Color indexing

Generalizing the hough transform to detect arbitrary shapes

Visual learning and recognition of 3-D objects from appearance

Local grayvalue invariants for image retrieval

A robust technique for matching two uncalibrated images through the recovery of the unknown epipolar geometry

Related Papers (5)

Distinctive Image Features from Scale-Invariant Keypoints

SURF: speeded up robust features

Histograms of oriented gradients for human detection

A Combined Corner and Edge Detector

Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography