scispace - formally typeset
Open AccessProceedings ArticleDOI

PlaneRCNN: 3D Plane Detection and Reconstruction From a Single Image

Reads0
Chats0
TLDR
In this article, a deep neural architecture, PlaneRCNN, is proposed to detect and reconstruct piecewise planar regions from a single RGB image, which employs a variant of Mask R-CNN to detect planes with their plane parameters and segmentation masks.
Abstract
This paper proposes a deep neural architecture, PlaneRCNN, that detects and reconstructs piecewise planar regions from a single RGB image. PlaneRCNN employs a variant of Mask R-CNN to detect planes with their plane parameters and segmentation masks. PlaneRCNN then refines an arbitrary number of segmentation masks with a novel loss enforcing the consistency with a nearby view during training. The paper also presents a new benchmark with more fine-grained plane segmentations in the ground-truth, in which, PlaneRCNN outperforms existing state-of-the-art methods with significant margins in the plane detection, segmentation, and reconstruction metrics. PlaneRCNN makes an important step towards robust plane extraction method, which would have immediate impact on a wide range of applications including Robotics, Augmented Reality, and Virtual Reality.

read more

Content maybe subject to copyright    Report

Citations
More filters
Posted Content

Structured3D: A Large Photo-realistic Dataset for Structured 3D Modeling

TL;DR: This paper presents a new synthetic dataset, Structured3D, with the aim of providing large-scale photo-realistic images with rich 3D structure annotations for a wide spectrum of structured 3D modeling tasks, and takes advantage of the availability of professional interior designs to automatically extract 3D structures from them.
Proceedings ArticleDOI

Single-Image Piece-Wise Planar 3D Reconstruction via Associative Embedding

TL;DR: Zhang et al. as mentioned in this paper proposed a two-stage method based on associative embedding, inspired by its recent success in instance segmentation, which is able to detect an arbitrary number of planes.
Book ChapterDOI

Structured3D: A Large Photo-Realistic Dataset for Structured 3D Modeling

TL;DR: Structured3D as mentioned in this paper is a large-scale photo-realistic image dataset with rich 3D structure annotations for a wide spectrum of structured 3D modeling tasks, including room layout estimation.
Journal ArticleDOI

State-of-the-art in Automatic 3D Reconstruction of Structured Indoor Environments

TL;DR: An up‐to‐date integrative view of the field, bridging complementary views coming from computer graphics and computer vision is provided, and the structure of output models and the priors exploited to bridge the gap between imperfect sources and desired output are defined.
Proceedings ArticleDOI

End-to-End CAD Model Retrieval and 9DoF Alignment in 3D Scans

TL;DR: In this paper, a differentiable Procrustes alignment is proposed to align CAD models to 3D scans of a scene, enabling transformation of a noisy, incomplete 3D scan to a compact, CAD reconstruction with clean, complete object geometry.
References
More filters
Book ChapterDOI

U-Net: Convolutional Networks for Biomedical Image Segmentation

TL;DR: Neber et al. as discussed by the authors proposed a network and training strategy that relies on the strong use of data augmentation to use the available annotated samples more efficiently, which can be trained end-to-end from very few images and outperforms the prior best method (a sliding-window convolutional network) on the ISBI challenge for segmentation of neuronal structures in electron microscopic stacks.
Posted Content

U-Net: Convolutional Networks for Biomedical Image Segmentation

TL;DR: It is shown that such a network can be trained end-to-end from very few images and outperforms the prior best method (a sliding-window convolutional network) on the ISBI challenge for segmentation of neuronal structures in electron microscopic stacks.
Proceedings ArticleDOI

Feature Pyramid Networks for Object Detection

TL;DR: This paper exploits the inherent multi-scale, pyramidal hierarchy of deep convolutional networks to construct feature pyramids with marginal extra cost and achieves state-of-the-art single-model results on the COCO detection benchmark without bells and whistles.
Proceedings ArticleDOI

Mask R-CNN

TL;DR: This work presents a conceptually simple, flexible, and general framework for object instance segmentation, which extends Faster R-CNN by adding a branch for predicting an object mask in parallel with the existing branch for bounding box recognition.
Posted Content

Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification

TL;DR: This work proposes a Parametric Rectified Linear Unit (PReLU) that generalizes the traditional rectified unit and derives a robust initialization method that particularly considers the rectifier nonlinearities.
Related Papers (5)