Open Access · Proceedings Article · DOI

3D Shape Induction from 2D Views of Multiple Objects

TL;DR: The approach, called "projective generative adversarial networks" (PrGANs), trains a deep generative model of 3D shapes whose projections match the distributions of the input 2D views, allowing it to predict 3D shape and viewpoint, and to generate novel views from an input image, in a completely unsupervised manner.
Abstract
In this paper, we investigate the problem of inducing a distribution over three-dimensional structures given two-dimensional views of multiple objects taken from unknown viewpoints. Our approach, called "projective generative adversarial networks" (PrGANs), trains a deep generative model of 3D shapes whose projections match the distributions of the input 2D views. The addition of a projection module allows us to infer the underlying 3D shape distribution without using any 3D data, viewpoint information, or annotations during the learning phase. We show that our approach produces 3D shapes of quality comparable to GANs trained on 3D data for a number of shape categories, including chairs, airplanes, and cars. Experiments also show that the disentangled representation of 2D shapes into geometry and viewpoint leads to a good generative model of 2D shapes. The key advantage is that our model allows us to predict 3D shape and viewpoint, and to generate novel views from an input image, in a completely unsupervised manner.
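
The core of the method is the projection module: a differentiable operator that turns a generated 3D occupancy grid into a 2D silhouette, so the discriminator only ever sees 2D views. Below is a minimal sketch of how such a module could look in PyTorch, assuming a voxel occupancy grid and the smooth absorption-style projection described for PrGANs; the function name and the rotation comment are illustrative, not the authors' code:

```python
import torch

def project_voxels(voxels: torch.Tensor) -> torch.Tensor:
    """Differentiable orthographic projection of a voxel occupancy grid
    to a 2D silhouette. `voxels`: (batch, D, H, W) with values in [0, 1].
    A pixel is occupied unless every voxel along its ray is empty."""
    ray_sum = voxels.sum(dim=1)        # integrate occupancy along depth
    return 1.0 - torch.exp(-ray_sum)   # smooth in [0, 1), so gradients flow

# Viewpoints can be handled by resampling the grid on rotated coordinates
# (e.g., with torch.nn.functional.grid_sample) before projecting, so the
# generator also receives gradients through the sampled viewpoint.
```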


Citations
Proceedings Article · DOI

Unsupervised Learning of Depth and Ego-Motion from Video

TL;DR: In this paper, an unsupervised learning framework for monocular depth and camera-motion estimation from unstructured video sequences is presented; it couples a single-view depth network with a multi-view pose network and trains both with a loss based on warping nearby views to the target using the computed depth and pose.
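
A hedged sketch of the view-synthesis supervision this TL;DR describes: warp a nearby source frame into the target frame using the predicted depth and relative pose, then penalize the photometric difference. The intrinsics `K`, the `[R|t]` pose format, and all names are assumptions for illustration, not the authors' implementation:

```python
import torch
import torch.nn.functional as F

def view_synthesis_loss(target, source, depth, pose, K):
    """target, source: (B, 3, H, W) images; depth: (B, 1, H, W) predicted
    target depth; pose: (B, 3, 4) relative motion [R|t]; K: (B, 3, 3)."""
    B, _, H, W = target.shape
    ys, xs = torch.meshgrid(torch.arange(H), torch.arange(W), indexing="ij")
    pix = torch.stack([xs, ys, torch.ones_like(xs)], 0).float().reshape(1, 3, -1)
    # Back-project target pixels to 3D, then move them into the source frame.
    cam = torch.inverse(K) @ pix.expand(B, -1, -1) * depth.reshape(B, 1, -1)
    cam = torch.cat([cam, torch.ones(B, 1, H * W)], dim=1)   # homogeneous
    src = K @ (pose @ cam)                                   # (B, 3, H*W)
    z = src[:, 2].clamp(min=1e-6)
    # Normalize projected coordinates to [-1, 1] for grid_sample.
    u = 2 * (src[:, 0] / z) / (W - 1) - 1
    v = 2 * (src[:, 1] / z) / (H - 1) - 1
    grid = torch.stack([u, v], dim=-1).reshape(B, H, W, 2)
    warped = F.grid_sample(source, grid, align_corners=True)
    return (warped - target).abs().mean()   # photometric (L1) loss
```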
Proceedings Article · DOI

Occupancy Networks: Learning 3D Reconstruction in Function Space

TL;DR: This paper proposes Occupancy Networks, a new representation for learning-based 3D reconstruction that implicitly represents the 3D surface as the continuous decision boundary of a deep neural network classifier; the representation describes the 3D output at effectively infinite resolution without an excessive memory footprint, efficiently encodes 3D structure, and can be inferred from various kinds of input.
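
The representation itself is just a classifier over 3D points. A minimal sketch, assuming a latent shape code and a plain MLP; the published model conditions with ResNet-style blocks, so this is deliberately simplified:

```python
import torch
import torch.nn as nn

class OccupancyNetwork(nn.Module):
    """f(p, z) -> probability that 3D point p is inside the shape coded by z."""
    def __init__(self, latent_dim=128, hidden=256):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(3 + latent_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, 1),
        )

    def forward(self, points, z):
        # points: (B, N, 3); z: (B, latent_dim), broadcast to every point.
        z = z.unsqueeze(1).expand(-1, points.shape[1], -1)
        return torch.sigmoid(self.net(torch.cat([points, z], dim=-1)))

# The surface is the decision boundary {p : f(p, z) = 0.5}, so meshes can be
# extracted at any resolution, e.g. by evaluating on a grid + marching cubes.
```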
Proceedings Article · DOI

Differentiable Volumetric Rendering: Learning Implicit 3D Representations Without 3D Supervision

TL;DR: This work proposes a differentiable rendering formulation for implicit shape and texture representations, showing that depth gradients can be derived analytically using the concept of implicit differentiation, and finds that this method can be used for multi-view 3D reconstruction, directly resulting in watertight meshes.
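
The key step can be reconstructed with implicit differentiation; the notation below (occupancy field f_theta, ray p(d) = r_0 + d*w, surface level tau) is assumed from the paper's setting rather than quoted from it:

```latex
f_\theta\big(p(\hat d)\big) = \tau
\;\Longrightarrow\;
\frac{\partial f_\theta}{\partial \theta}
  + \big(\nabla_p f_\theta \cdot w\big)\frac{\partial \hat d}{\partial \theta} = 0
\;\Longrightarrow\;
\frac{\partial \hat d}{\partial \theta}
  = -\big(\nabla_p f_\theta \cdot w\big)^{-1}
    \frac{\partial f_\theta}{\partial \theta}
```

Because the gradient only involves quantities at the surface point, no intermediate ray-marching steps need to be stored for backpropagation.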
Proceedings Article · DOI

Octree Generating Networks: Efficient Convolutional Architectures for High-resolution 3D Outputs

TL;DR: In this paper, a deep convolutional decoder architecture is proposed that generates volumetric 3D outputs in a compute- and memory-efficient manner by using an octree representation.
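
Structurally, the decoder labels each octree cell as empty, filled, or mixed, and only mixed cells are subdivided at the next level, so memory scales with surface area rather than volume. A hypothetical data-structure sketch of that idea (not the paper's code):

```python
from dataclasses import dataclass
from typing import List, Optional

EMPTY, FILLED, MIXED = 0, 1, 2

@dataclass
class OctreeCell:
    label: int
    children: Optional[List["OctreeCell"]] = None  # 8 children iff MIXED

def count_filled(cell: OctreeCell, size: int) -> int:
    """Filled unit voxels represented by a cell of edge length `size`."""
    if cell.label == FILLED:
        return size ** 3
    if cell.label == EMPTY or not cell.children:
        return 0
    return sum(count_filled(c, size // 2) for c in cell.children)
```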
References
Proceedings Article · DOI

Deep Residual Learning for Image Recognition

TL;DR: In this article, the authors propose a residual learning framework to ease the training of networks that are substantially deeper than those used previously; it won 1st place in the ILSVRC 2015 classification task.
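
The central idea fits in a few lines: the stacked layers learn a residual F(x) = H(x) - x, and the input is added back through an identity shortcut. A standard basic block (same-dimension case) as a sketch:

```python
import torch.nn as nn

class BasicBlock(nn.Module):
    """Two conv layers learn the residual F(x); output is relu(F(x) + x)."""
    def __init__(self, channels):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, 3, padding=1, bias=False)
        self.bn1 = nn.BatchNorm2d(channels)
        self.conv2 = nn.Conv2d(channels, channels, 3, padding=1, bias=False)
        self.bn2 = nn.BatchNorm2d(channels)
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x):
        out = self.relu(self.bn1(self.conv1(x)))
        out = self.bn2(self.conv2(out))
        return self.relu(out + x)  # identity shortcut
```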
Journal Article · DOI

Generative Adversarial Nets

TL;DR: A new framework for estimating generative models via an adversarial process, in which two models are trained simultaneously: a generative model G that captures the data distribution, and a discriminative model D that estimates the probability that a sample came from the training data rather than from G.
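
The adversarial process is the two-player minimax game over the value function from the paper:

```latex
\min_G \max_D \; V(D, G) =
  \mathbb{E}_{x \sim p_{\text{data}}}\big[\log D(x)\big]
  + \mathbb{E}_{z \sim p_z}\big[\log\big(1 - D(G(z))\big)\big]
```

At the optimum, G reproduces the data distribution and D outputs 1/2 everywhere.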
Proceedings Article · DOI

Fully convolutional networks for semantic segmentation

TL;DR: The key insight is to build “fully convolutional” networks that take input of arbitrary size and produce correspondingly-sized output with efficient inference and learning.
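
Concretely, "fully convolutional" means the fixed-size fully connected classifier is replaced by 1x1 convolutions, and the coarse score map is upsampled back to input resolution. A minimal head as a sketch; the paper uses learnable deconvolution (initialized to bilinear upsampling), whereas this sketch uses fixed bilinear interpolation for brevity:

```python
import torch.nn as nn
import torch.nn.functional as F

class FCNHead(nn.Module):
    """1x1 conv replaces the fully connected classifier, so any input size
    yields a correspondingly-sized per-pixel score map."""
    def __init__(self, in_channels, num_classes):
        super().__init__()
        self.score = nn.Conv2d(in_channels, num_classes, kernel_size=1)

    def forward(self, features, out_size):
        scores = self.score(features)                 # (B, classes, h, w)
        return F.interpolate(scores, size=out_size,   # back to full resolution
                             mode="bilinear", align_corners=False)
```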
Book

Multiple view geometry in computer vision

TL;DR: In this article, the authors provide comprehensive background material and explain how to apply the methods and implement the algorithms directly in a unified framework, including geometric principles and how to represent objects algebraically so they can be computed and applied.
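
The book's central object is the projective camera; in homogeneous coordinates the pinhole model reads:

```latex
\mathbf{x} \,\simeq\, P\,\mathbf{X}, \qquad P = K\,[\,R \mid t\,]
```

where X is a world point, x its image, K the intrinsic calibration matrix, (R, t) the camera pose, and the relation holds up to scale.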
Posted Content

Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks

TL;DR: This work introduces a class of CNNs called deep convolutional generative adversarial networks (DCGANs), that have certain architectural constraints, and demonstrates that they are a strong candidate for unsupervised learning.
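
The "architectural constraints" are concrete: no pooling (strided and fractionally-strided convolutions instead), batch normalization in both networks, and ReLU in the generator with a tanh output. A minimal generator for 64x64 RGB images in that style:

```python
import torch.nn as nn

def dcgan_generator(z_dim=100, ngf=64):
    """DCGAN-style generator: transposed convs upsample a latent vector
    (shaped z_dim x 1 x 1) to a 64x64 image; batch norm + ReLU in hidden
    layers, tanh at the output."""
    return nn.Sequential(
        nn.ConvTranspose2d(z_dim, ngf * 8, 4, 1, 0, bias=False),   # 1 -> 4
        nn.BatchNorm2d(ngf * 8), nn.ReLU(True),
        nn.ConvTranspose2d(ngf * 8, ngf * 4, 4, 2, 1, bias=False), # 4 -> 8
        nn.BatchNorm2d(ngf * 4), nn.ReLU(True),
        nn.ConvTranspose2d(ngf * 4, ngf * 2, 4, 2, 1, bias=False), # 8 -> 16
        nn.BatchNorm2d(ngf * 2), nn.ReLU(True),
        nn.ConvTranspose2d(ngf * 2, ngf, 4, 2, 1, bias=False),     # 16 -> 32
        nn.BatchNorm2d(ngf), nn.ReLU(True),
        nn.ConvTranspose2d(ngf, 3, 4, 2, 1, bias=False),           # 32 -> 64
        nn.Tanh(),
    )
```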