3D Semantic Parsing of Large-Scale Indoor Spaces

doi:10.1109/CVPR.2016.170

Proceedings ArticleDOI

3D Semantic Parsing of Large-Scale Indoor Spaces

Iro Armeni, +6 more

- pp 1534-1543

Chats0

TLDR

This paper argues that identification of structural elements in indoor spaces is essentially a detection problem, rather than segmentation which is commonly used, and proposes a method for semantic parsing the 3D point cloud of an entire building using a hierarchical approach.

Abstract:

In this paper, we propose a method for semantic parsing the 3D point cloud of an entire building using a hierarchical approach: first, the raw data is parsed into semantically meaningful spaces (e.g. rooms, etc) that are aligned into a canonical reference coordinate system. Second, the spaces are parsed into their structural and building elements (e.g. walls, columns, etc). Performing these with a strong notation of global 3D space is the backbone of our method. The alignment in the first step injects strong 3D priors from the canonical coordinate system into the second step for discovering elements. This allows diverse challenging scenarios as man-made indoor spaces often show recurrent geometric patterns while the appearance features can change drastically. We also argue that identification of structural elements in indoor spaces is essentially a detection problem, rather than segmentation which is commonly used. We evaluated our method on a new dataset of several buildings with a covered area of over 6, 000m2 and over 215 million points, demonstrating robust results readily useful for practical applications.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation

R. Qi Charles, +3 more

TL;DR: This paper designs a novel type of neural network that directly consumes point clouds, which well respects the permutation invariance of points in the input and provides a unified architecture for applications ranging from object classification, part segmentation, to scene semantic parsing.

...read moreread less

Journal ArticleDOI

Dynamic Graph CNN for Learning on Point Clouds

Yue Wang, +5 more

- 10 Oct 2019 -

ACM Transactions on Graphics

TL;DR: This work proposes a new neural network module suitable for CNN-based high-level tasks on point clouds, including classification and segmentation called EdgeConv, which acts on graphs dynamically computed in each layer of the network.

...read moreread less

Proceedings ArticleDOI

ScanNet: Richly-Annotated 3D Reconstructions of Indoor Scenes

Angela Dai, +5 more

TL;DR: This work introduces ScanNet, an RGB-D video dataset containing 2.5M views in 1513 scenes annotated with 3D camera poses, surface reconstructions, and semantic segmentations, and shows that using this data helps achieve state-of-the-art performance on several 3D scene understanding tasks.

...read moreread less

Proceedings ArticleDOI

KPConv: Flexible and Deformable Convolution for Point Clouds

Hugues Thomas, +5 more

TL;DR: KPConv is a new design of point convolution, i.e. that operates on point clouds without any intermediate representation, that outperform state-of-the-art classification and segmentation approaches on several datasets.

...read moreread less

Proceedings Article

PointCNN: convolution on Χ -transformed points

Yangyan Li, +5 more

TL;DR: This work proposes to learn an Χ-transformation from the input points to simultaneously promote two causes: the first is the weighting of the input features associated with the points, and the second is the permutation of the points into a latent and potentially canonical order.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

The Pascal Visual Object Classes (VOC) Challenge

Mark Everingham, +4 more

- 01 Jun 2010 -

International Journal of Computer Vision

TL;DR: The state-of-the-art in evaluated methods for both classification and detection are reviewed, whether the methods are statistically different, what they are learning from the images, and what the methods find easy or confuse.

...read moreread less

Journal Article

LIBLINEAR: A Library for Large Linear Classification

Rong-En Fan, +4 more

- 01 Jun 2008 -

Journal of Machine Learning Research

TL;DR: LIBLINEAR is an open source library for large-scale linear classification that supports logistic regression and linear support vector machines and provides easy-to-use command-line tools and library calls for users and developers.

...read moreread less

Journal ArticleDOI

Vision meets robotics: The KITTI dataset

Andreas Geiger, +3 more

- 01 Sep 2013 -

The International Journal of Robotics Re...

TL;DR: A novel dataset captured from a VW station wagon for use in mobile robotics and autonomous driving research, using a variety of sensor modalities such as high-resolution color and grayscale stereo cameras and a high-precision GPS/IMU inertial navigation system.

...read moreread less

Book

Probabilistic graphical models : principles and techniques

Daniel L. Koller, +1 more

TL;DR: The framework of probabilistic graphical models, presented in this book, provides a general approach for causal reasoning and decision making under uncertainty, allowing interpretable models to be constructed and then manipulated by reasoning algorithms.

...read moreread less

Journal ArticleDOI

Clustering by Passing Messages Between Data Points

Brendan J. Frey, +1 more

- 16 Feb 2007 -

Science

TL;DR: A method called “affinity propagation,” which takes as input measures of similarity between pairs of data points, which found clusters with much lower error than other methods, and it did so in less than one-hundredth the amount of time.

...read moreread less

Collapse

ACM Transactions on Graphics

3D Semantic Parsing of Large-Scale Indoor Spaces

Citations

PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation

Dynamic Graph CNN for Learning on Point Clouds

ScanNet: Richly-Annotated 3D Reconstructions of Indoor Scenes

KPConv: Flexible and Deformable Convolution for Point Clouds

PointCNN: convolution on Χ -transformed points

References

The Pascal Visual Object Classes (VOC) Challenge

LIBLINEAR: A Library for Large Linear Classification

Vision meets robotics: The KITTI dataset

Probabilistic graphical models : principles and techniques

Clustering by Passing Messages Between Data Points

Related Papers (5)

PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation

PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space

3D ShapeNets: A deep representation for volumetric shapes

VoxNet: A 3D Convolutional Neural Network for real-time object recognition

Dynamic Graph CNN for Learning on Point Clouds