scispace - formally typeset
Open AccessProceedings ArticleDOI

Hierarchical object geometric categorization and appearance classification for mobile manipulation

TLDR
This paper tackles the problem of recognizing everyday objects that are useful for a personal robotic assistant in fulfilling its tasks, using a hierarchical multi-modal 3D-2D processing and classification system.
Abstract
In this paper we present a comprehensive object categorization and classification system, of great importance for mobile manipulation applications in indoor environments. In detail, we tackle the problem of recognizing everyday objects that are useful for a personal robotic assistant in fulfilling its tasks, using a hierarchical multi-modal 3D-2D processing and classification system. The acquired 3D data is used to estimate geometric labels (plane, cylinder, edge, rim, sphere) at each voxel cell using the Radius-based Surface Descriptor (RSD). Then, we propose the use of a Global RSD feature (GRSD) to categorize point clusters that are geometrically identical into one of the object categories. Once a geometric category and a 3D position is obtained for each object cluster, we extract the region of interest in the camera image and compute a SURF-based feature vector for it. Thus we obtain the exact object instance and the orientation around the object's up-right axis from the appearance. The resultant system provides a hierarchical categorization of objects into basic classes from their geometry and identifies objects and their poses based on their appearance, with near real-time performance. We validate our approach on an extensive database of objects that we acquired using real sensing devices, and on both unseen views and unseen objects.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Combined 2D-3D categorization and classification for multimodal perception systems

TL;DR: The system employs a library of specialized perception routines that solve different, well-defined perceptual sub-tasks and can be combined into composite perceptual activities including the construction of an object model database, multimodal object classification, and object model reconstruction for grasping.
Journal ArticleDOI

Automatic apple recognition based on the fusion of color and 3D feature for robotic fruit picking

TL;DR: An automatic recognition method was proposed to achieve apple recognition from point cloud data using an improved 3D descriptor with the fusion of color features and 3D geometry features extracted from the preprocessed point clouds.
Proceedings ArticleDOI

3D object recognition in range images using visibility context

TL;DR: Recognizing and localizing queried objects in range images plays an important role for robotic manipulation and navigation and is still a challenging task for scenes with occlusion and clutter.
Journal ArticleDOI

GOOD: A global orthographic object descriptor for 3D object recognition and manipulation

TL;DR: A novel sign disambiguation method is proposed, for computing a unique reference frame from the eigenvectors obtained through Principal Component Analysis of the point cloud of the target object view captured by a 3D sensor.
References
More filters
Journal ArticleDOI

Distinctive Image Features from Scale-Invariant Keypoints

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.
Journal ArticleDOI

Speeded-Up Robust Features (SURF)

TL;DR: A novel scale- and rotation-invariant detector and descriptor, coined SURF (Speeded-Up Robust Features), which approximates or even outperforms previously proposed schemes with respect to repeatability, distinctiveness, and robustness, yet can be computed and compared much faster.
Journal ArticleDOI

Least-Squares Fitting of Two 3-D Point Sets

TL;DR: An algorithm for finding the least-squares solution of R and T, which is based on the singular value decomposition (SVD) of a 3 × 3 matrix, is presented.
Proceedings ArticleDOI

Object class recognition by unsupervised scale-invariant learning

TL;DR: The flexible nature of the model is demonstrated by excellent results over a range of datasets including geometrically constrained classes (e.g. faces, cars) and flexible objects (such as animals).
Journal ArticleDOI

Towards 3D Point cloud based object maps for household environments

TL;DR: The novel techniques include statistical analysis, persistent histogram features estimation that allows for a consistent registration, resampling with additional robust fitting techniques, and segmentation of the environment into meaningful regions.
Related Papers (5)