Open Access Proceedings Article (DOI)

Detection-based object labeling in 3D scenes

TL;DR
This work utilizes sliding window detectors trained from object views to assign class probabilities to pixels in every RGB-D frame, and performs efficient inference on a Markov Random Field over the voxels, combining cues from view-based detection and 3D shape, to label the scene.
Abstract
We propose a view-based approach for labeling objects in 3D scenes reconstructed from RGB-D (color+depth) videos. We utilize sliding window detectors trained from object views to assign class probabilities to pixels in every RGB-D frame. These probabilities are projected into the reconstructed 3D scene and integrated using a voxel representation. We perform efficient inference on a Markov Random Field over the voxels, combining cues from view-based detection and 3D shape, to label the scene. Our detection-based approach produces accurate scene labeling on the RGB-D Scenes Dataset and improves the robustness of object detection.
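To make the pipeline concrete, here is a minimal sketch of the two steps the abstract describes: back-projecting per-pixel class probabilities into a voxel grid using depth and camera pose, and then labeling the voxels. The camera intrinsics (fx, fy, cx, cy), the log-probability accumulation, and the simple ICM-style smoothing used in place of the paper's graph-cut MRF inference are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def project_pixel_probs(depth, probs, pose, fx, fy, cx, cy, voxel_size, grid, counts):
    """Back-project per-pixel class probabilities (H x W x C) from one RGB-D frame
    into a voxel grid. `grid` accumulates per-class log-probabilities and `counts`
    tracks how often each voxel was observed. The pinhole model and the naive
    independent-view fusion are illustrative assumptions."""
    h, w, _ = probs.shape
    v, u = np.mgrid[0:h, 0:w]
    z = depth
    valid = z > 0
    x = (u - cx) * z / fx                       # back-project to camera coordinates
    y = (v - cy) * z / fy
    pts_cam = np.stack([x[valid], y[valid], z[valid], np.ones(valid.sum())], axis=1)
    pts_world = (pose @ pts_cam.T).T[:, :3]     # pose: 4x4 camera-to-world transform
    idx = np.floor(pts_world / voxel_size).astype(int)
    inb = np.all((idx >= 0) & (idx < np.array(grid.shape[:3])), axis=1)
    idx, p = idx[inb], probs[valid][inb]
    np.add.at(grid, (idx[:, 0], idx[:, 1], idx[:, 2]), np.log(p + 1e-6))
    np.add.at(counts, (idx[:, 0], idx[:, 1], idx[:, 2]), 1)

def label_voxels(grid, counts, smooth=0.5, iters=5):
    """Greedy ICM-style smoothing of the accumulated per-voxel class scores,
    standing in for the paper's graph-cut MRF inference."""
    labels = np.argmax(grid, axis=-1)
    observed = counts > 0
    num_classes = grid.shape[-1]
    for _ in range(iters):
        for axis in range(3):
            nb = np.roll(labels, 1, axis=axis)
            # Unary evidence plus a Potts-like bonus for agreeing with one neighbor.
            bonus = smooth * (np.arange(num_classes)[None, None, None, :] == nb[..., None])
            labels = np.where(observed, np.argmax(grid + bonus, axis=-1), labels)
    return labels
```

Accumulating log-probabilities treats the views as independent; the paper's MRF additionally folds in 3D shape cues, which this sketch leaves out.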



Citations
Proceedings Article (DOI)

Real-time grasp detection using convolutional neural networks

TL;DR: An accurate, real-time approach to robotic grasp detection based on convolutional neural networks that outperforms state-of-the-art approaches by 14 percentage points and runs at 13 frames per second on a GPU.
Proceedings Article (DOI)

Multimodal deep learning for robust RGB-D object recognition

TL;DR: This paper leverages recent progress on Convolutional Neural Networks (CNNs) and proposes a novel RGB-D architecture for object recognition that is composed of two separate CNN processing streams - one for each modality - which are consecutively combined with a late fusion network.
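A minimal layout in the spirit of that description: two independent convolutional streams, one for RGB and one for a colorized depth map, whose pooled features are concatenated and passed through a small fusion network. The layer sizes, the 3-channel depth encoding, and the 51-class output (the RGB-D Object Dataset category count) are placeholder assumptions; the cited architecture uses much deeper, pretrained streams.

```python
import torch
import torch.nn as nn

class TwoStreamRGBD(nn.Module):
    """Toy two-stream CNN with late fusion; all shapes are illustrative."""
    def __init__(self, num_classes=51):
        super().__init__()
        def stream():
            return nn.Sequential(
                nn.Conv2d(3, 32, 5, stride=2, padding=2), nn.ReLU(),
                nn.MaxPool2d(2),
                nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(),
                nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            )
        self.rgb_stream = stream()
        self.depth_stream = stream()    # expects a 3-channel colorized depth map
        self.fusion = nn.Sequential(
            nn.Linear(128, 128), nn.ReLU(),
            nn.Linear(128, num_classes),
        )

    def forward(self, rgb, depth):
        fused = torch.cat([self.rgb_stream(rgb), self.depth_stream(depth)], dim=1)
        return self.fusion(fused)

# Usage: logits = TwoStreamRGBD()(torch.randn(2, 3, 224, 224), torch.randn(2, 3, 224, 224))
```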
Book Chapter (DOI)

Sliding Shapes for 3D Object Detection in Depth Images

TL;DR: This paper proposes using depth maps for object detection and designs a 3D detector to overcome the major difficulties of recognition, namely variations in texture, illumination, shape, viewpoint, clutter, occlusion, self-occlusion, and sensor noise.
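In the spirit of that TL;DR, a 3D detector can be sketched as a window slid over an occupancy (or TSDF) grid and scored with a linear classifier. The grid source, window size, stride, and weight vector below are assumptions; the cited paper's features and exemplar SVMs are more involved.

```python
import numpy as np

def slide_3d(grid, weights, bias, window=(10, 10, 10), stride=2):
    """Score every window position of a 3D volume with a linear classifier.

    grid    : (X, Y, Z) occupancy or TSDF volume (assumed precomputed from depth).
    weights : flat weight vector of length prod(window), e.g. from a trained SVM.
    Returns (score, (x, y, z)) tuples for positions scoring above zero.
    """
    wx, wy, wz = window
    X, Y, Z = grid.shape
    detections = []
    for x in range(0, X - wx + 1, stride):
        for y in range(0, Y - wy + 1, stride):
            for z in range(0, Z - wz + 1, stride):
                feat = grid[x:x + wx, y:y + wy, z:z + wz].ravel()
                score = float(weights @ feat) + bias
                if score > 0:
                    detections.append((score, (x, y, z)))
    return sorted(detections, reverse=True)
```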
Posted Content

Real-Time Grasp Detection Using Convolutional Neural Networks

TL;DR: In this paper, a convolutional neural network (CNN) is used for robotic grasp detection; it performs single-stage regression to graspable bounding boxes without using standard sliding window or region proposal techniques.
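Single-stage regression here means one forward pass that outputs grasp-rectangle parameters directly, with no window scanning or proposal step. A toy head in that spirit (the backbone, the x/y/width/height/angle parameterization, and the layer sizes are assumptions, not the cited model):

```python
import torch
import torch.nn as nn

class GraspRegressor(nn.Module):
    """One-pass regression to a single grasp rectangle (toy parameterization)."""
    def __init__(self):
        super().__init__()
        self.backbone = nn.Sequential(
            nn.Conv2d(3, 32, 5, stride=2, padding=2), nn.ReLU(),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
        )
        # Outputs: x, y, width, height, sin(2*theta), cos(2*theta)
        self.head = nn.Linear(64, 6)

    def forward(self, img):
        return self.head(self.backbone(img))
```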
Proceedings Article (DOI)

Unsupervised feature learning for 3D scene labeling

TL;DR: This paper presents an approach for labeling objects in 3D scenes that combines features learned directly from raw RGB-D images and 3D point clouds, without any hand-designed features, to assign an object label to every 3D point in the scene.
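One common way to learn features from raw patches without hand design is to learn a codebook and encode each patch against it. The sketch below uses plain k-means and a soft-assignment encoding as a simple stand-in for the sparse-coding dictionaries the cited work actually learns; the patch shape and codebook size are assumptions.

```python
import numpy as np

def learn_patch_dictionary(patches, k=64, iters=20, seed=0):
    """Learn a codebook from raw RGB-D patches (N x D) with plain k-means."""
    rng = np.random.default_rng(seed)
    centers = patches[rng.choice(len(patches), k, replace=False)].astype(float)
    for _ in range(iters):
        # Assign each patch to its nearest codeword, then recompute the means.
        d = ((patches[:, None, :] - centers[None, :, :]) ** 2).sum(-1)
        assign = d.argmin(1)
        for j in range(k):
            members = patches[assign == j]
            if len(members):
                centers[j] = members.mean(0)
    return centers

def encode(patches, centers):
    """'Triangle' soft-assignment encoding: distances to codewords become features."""
    d = ((patches[:, None, :] - centers[None, :, :]) ** 2).sum(-1)
    return np.maximum(0.0, d.mean(1, keepdims=True) - d)
```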
References
Proceedings Article (DOI)

ImageNet: A large-scale hierarchical image database

TL;DR: A new database called “ImageNet” is introduced: a large-scale ontology of images built upon the backbone of the WordNet structure that is much larger in scale and diversity, and much more accurate, than current image datasets.
Proceedings Article (DOI)

Histograms of oriented gradients for human detection

TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.
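A quick way to see what a grid of HOG descriptors looks like in practice, using skimage's `hog` for illustration. The cell and block parameters below follow the commonly used 8x8-cell, 2x2-block setup rather than anything specific to this paper.

```python
from skimage import color, data
from skimage.feature import hog

# 9 orientation bins over 8x8-pixel cells, with L2-Hys-normalized 2x2-cell blocks.
image = color.rgb2gray(data.astronaut())
features = hog(image,
               orientations=9,
               pixels_per_cell=(8, 8),
               cells_per_block=(2, 2),
               block_norm='L2-Hys',
               feature_vector=True)
print(features.shape)  # one long descriptor; a sliding-window detector scores such vectors
```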
Journal Article (DOI)

Object Detection with Discriminatively Trained Part-Based Models

TL;DR: An object detection system based on mixtures of multiscale deformable part models that is able to represent highly variable object classes and achieves state-of-the-art results in the PASCAL object detection challenges is described.
Journal Article (DOI)

Fast approximate energy minimization via graph cuts

TL;DR: This work presents two algorithms based on graph cuts that efficiently find a local minimum with respect to two types of large moves, namely expansion moves and swap moves, which handle important cases of discontinuity-preserving energies.
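The energy these moves minimize has the familiar form E(L) = sum_p D_p(L_p) + lambda * sum_{neighbors p,q} [L_p != L_q]; for two labels a single max-flow computation finds its global minimum, and expansion/swap moves reduce the multi-label case to a sequence of such cuts. A minimal binary example using the PyMaxflow library (the unary costs and the Potts weight are made-up inputs):

```python
import maxflow  # PyMaxflow

def binary_graph_cut(unary_cost_0, unary_cost_1, lam=1.0):
    """Minimize a two-label Potts energy on a 2D grid with one max-flow computation.

    unary_cost_k[i, j] is the cost of giving pixel (i, j) label k; `lam` weights
    the pairwise smoothness term. Which boolean output value corresponds to which
    label depends on PyMaxflow's t-edge convention, so check it on a toy input.
    """
    g = maxflow.Graph[float]()
    nodes = g.add_grid_nodes(unary_cost_0.shape)
    g.add_grid_edges(nodes, lam)                  # 4-connected Potts smoothness
    g.add_grid_tedges(nodes, unary_cost_1, unary_cost_0)
    g.maxflow()
    return g.get_grid_segments(nodes)             # per-pixel side of the min-cut
```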
Book

Probabilistic Robotics

TL;DR: This research presents a novel approach to planning and navigation algorithms that exploit statistics gleaned from uncertain, imperfect real-world environments to guide robots toward their goals and around obstacles.