Open Access
Distinctive Image Features from Scale-Invariant Keypoints
Reads0
Chats0
TLDR
The Scale-Invariant Feature Transform (or SIFT) algorithm is a highly robust method to extract and consequently match distinctive invariant features from images that can then be used to reliably match objects in diering images.Abstract:
The Scale-Invariant Feature Transform (or SIFT) algorithm is a highly robust method to extract and consequently match distinctive invariant features from images. These features can then be used to reliably match objects in diering images. The algorithm was rst proposed by Lowe [12] and further developed to increase performance resulting in the classic paper [13] that served as foundation for SIFT which has played an important role in robotic and machine vision in the past decade.read more
Citations
More filters
Proceedings ArticleDOI
The One-Shot similarity kernel
TL;DR: The One-Shot similarity score is analyzed and it is shown that when using a version of LDA as the underlying classifier, this score is a Conditionally Positive Definite kernel and may be used within kernel-methods (e.g., SVM) and is effective as an underlying mechanism for image representation.
Proceedings ArticleDOI
Learning Important Spatial Pooling Regions for Scene Classification
TL;DR: This work addresses the false response influence problem when learning and applying discriminative parts to construct the mid-level representation in scene classification by learning important spatial pooling regions along with their appearance and achieves state-of-the-art performance on several datasets.
Journal ArticleDOI
Real-time monocular object SLAM
TL;DR: In this article, a real-time object-based SLAM system is presented, which leverages the largest object database to date, which consists of two main components: (1) a monocular SLAM algorithm that exploits object rigidity constraints to improve the map and find its real scale, and (2) a novel object recognition algorithm based on bags of binary words.
Proceedings ArticleDOI
Clustered Synopsis of Surveillance Video
TL;DR: A new methodology for the generation of short and coherent video summaries is presented, based on clustering of similar activities, which is suitable for efficient creation of ground truth data.
Journal ArticleDOI
First-Person Vision
Takeo Kanade,Martial Hebert +1 more
TL;DR: This paper argues that the first-person vision (FPV), which senses the environment and the subject's activities from a wearable sensor, is more advantageous with images about thesubject's environment as taken from his/her view points, and with readily available information about head motion and gaze through eye tracking.
References
More filters
Journal ArticleDOI
Distinctive Image Features from Scale-Invariant Keypoints
TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.
Proceedings ArticleDOI
Object recognition from local scale-invariant features
TL;DR: Experimental results show that robust object recognition can be achieved in cluttered partially occluded images with a computation time of under 2 seconds.
Proceedings ArticleDOI
A Combined Corner and Edge Detector
Chris Harris,Mike Stephens +1 more
TL;DR: The problem the authors are addressing in Alvey Project MMI149 is that of using computer vision to understand the unconstrained 3D world, in which the viewed scenes will in general contain too wide a diversity of objects for topdown recognition techniques to work.
Journal ArticleDOI
A performance evaluation of local descriptors
TL;DR: It is observed that the ranking of the descriptors is mostly independent of the interest region detector and that the SIFT-based descriptors perform best and Moments and steerable filters show the best performance among the low dimensional descriptors.
Journal ArticleDOI
Robust wide-baseline stereo from maximally stable extremal regions
TL;DR: The high utility of MSERs, multiple measurement regions and the robust metric is demonstrated in wide-baseline experiments on image pairs from both indoor and outdoor scenes.