MoFA: Model-Based Deep Convolutional Face Autoencoder for Unsupervised Monocular Reconstruction

doi:10.1109/ICCV.2017.401

Open Access Proceedings Article

Ayush Tewari, +6 more

- pp 3735-3744

TLDR

A novel model-based deep convolutional autoencoder that addresses the highly challenging problem of reconstructing a 3D human face from a single in-the-wild color image and can be trained end-to-end in an unsupervised manner, which renders training on very large real world data feasible.

Abstract:

In this work we propose a novel model-based deep convolutional autoencoder that addresses the highly challenging problem of reconstructing a 3D human face from a single in-the-wild color image. To this end, we combine a convolutional encoder network with an expert-designed generative model that serves as decoder. The core innovation is the differentiable parametric decoder that encapsulates image formation analytically based on a generative model. Our decoder takes as input a code vector with exactly defined semantic meaning that encodes detailed face pose, shape, expression, skin reflectance and scene illumination. Due to this new way of combining CNN-based with model-based face reconstruction, the CNN-based encoder learns to extract semantically meaningful parameters from a single monocular input image. For the first time, a CNN encoder and an expert-designed generative model can be trained end-to-end in an unsupervised manner, which renders training on very large (unlabeled) real world data feasible. The obtained reconstructions compare favorably to current state-of-the-art approaches in terms of quality and richness of representation.
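
The decoder idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the group sizes in `CODE_LAYOUT`, the helper names, the linear geometry model, and the mean-squared reconstruction loss are all assumptions chosen for illustration. It shows only how a flat code vector can be given a fixed semantic layout and how a linear parametric model can turn part of that code into geometry.

```python
# Hypothetical layout of the semantic code vector.
# The group sizes are illustrative assumptions, not the paper's values.
CODE_LAYOUT = [
    ("pose", 6),          # e.g. 3 rotation + 3 translation parameters
    ("shape", 4),
    ("expression", 3),
    ("reflectance", 4),
    ("illumination", 9),
]
CODE_SIZE = sum(size for _, size in CODE_LAYOUT)


def split_code(code):
    """Slice a flat code vector into named semantic parameter groups."""
    if len(code) != CODE_SIZE:
        raise ValueError(f"expected {CODE_SIZE} values, got {len(code)}")
    params, start = {}, 0
    for name, size in CODE_LAYOUT:
        params[name] = list(code[start:start + size])
        start += size
    return params


def decode_geometry(params, mean, shape_basis, expr_basis):
    """Assumed linear model: mean + sum_k shape_k * S_k + sum_k expr_k * E_k.

    Each basis is a list of basis vectors with the same length as `mean`.
    """
    coords = list(mean)
    groups = ((shape_basis, params["shape"]), (expr_basis, params["expression"]))
    for basis, coeffs in groups:
        for k, c in enumerate(coeffs):
            for i in range(len(coords)):
                coords[i] += c * basis[k][i]
    return coords


def reconstruction_loss(rendered, target):
    """Mean squared difference between a rendered image and the input image."""
    return sum((r - t) ** 2 for r, t in zip(rendered, target)) / len(target)
```

Because every step from code vector to output is a differentiable arithmetic operation, a reconstruction loss of this kind could in principle be backpropagated through the decoder into an encoder, which is what makes training without labels possible.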

Citations

Journal Article

Image-Based 3D Object Reconstruction: State-of-the-Art and Trends in the Deep Learning Era

Xian-Feng Han, +2 more

- 01 May 2021 -

IEEE Transactions on Pattern Analysis an...

TL;DR: A comprehensive survey of the recent developments in 3D reconstruction using convolutional neural networks, focusing on the works which use deep learning techniques to estimate the 3D shape of generic objects either from a single or multiple RGB images.

Proceedings Article

Unsupervised Training for 3D Morphable Model Regression

Kyle Genova, +5 more

TL;DR: In this paper, a method for training a regression network from image pixels to 3D morphable model coordinates using only unlabeled photographs is presented. The training loss is based on features from a facial recognition network, computed on-the-fly by rendering the predicted faces with a differentiable renderer.

Proceedings Article

SfSNet: Learning Shape, Reflectance and Illuminance of Faces 'in the Wild'

Soumyadip Sengupta, +3 more

TL;DR: SfSNet is designed to reflect a physical Lambertian rendering model and produces significantly better quantitative and qualitative results than state-of-the-art methods for inverse rendering and for independent normal and illumination estimation.

Proceedings Article

Accurate 3D Face Reconstruction With Weakly-Supervised Learning: From Single Image to Image Set

Yu Deng, +5 more

TL;DR: Deep3DFaceReconstruction leverages a robust, hybrid loss function for weakly supervised learning that takes into account both low-level and perception-level information for supervision, and performs multi-image face reconstruction by exploiting complementary information from different images for shape aggregation.

Proceedings Article

Nonlinear 3D Face Morphable Model

Luan Tran, +1 more

TL;DR: This paper proposes an innovative framework to learn a nonlinear 3DMM from a large set of unconstrained face images, without collecting 3D face scans, and demonstrates the superior representation power of the nonlinear 3D Morphable Model over its linear counterpart.

References

Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

TL;DR: A deep convolutional neural network consisting of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax achieved state-of-the-art ImageNet classification performance.

Journal Article

Reducing the Dimensionality of Data with Neural Networks

Geoffrey E. Hinton, +1 more

- 28 Jul 2006 -

Science

TL;DR: In this article, an effective way of initializing the weights that allows deep autoencoder networks to learn low-dimensional codes that work much better than principal components analysis as a tool to reduce the dimensionality of data is described.

Posted Content

Caffe: Convolutional Architecture for Fast Feature Embedding

Yangqing Jia, +7 more

- 20 Jun 2014 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: Caffe is a BSD-licensed C++ library with Python and MATLAB bindings for training and deploying general-purpose convolutional neural networks and other deep models efficiently on commodity architectures.

Proceedings Article

Caffe: Convolutional Architecture for Fast Feature Embedding

Yangqing Jia, +7 more

TL;DR: Caffe provides multimedia scientists and practitioners with a clean and modifiable framework for state-of-the-art deep learning algorithms and a collection of reference models for training and deploying general-purpose convolutional neural networks and other deep models efficiently on commodity architectures.

Proceedings Article

Deep Learning Face Attributes in the Wild

Ziwei Liu, +3 more

TL;DR: A novel deep learning framework for attribute prediction in the wild that cascades two CNNs, LNet and ANet, which are fine-tuned jointly with attribute tags, but pre-trained differently.

Related Papers (5)

A morphable model for the synthesis of 3D faces

A 3D Face Model for Pose and Illumination Invariant Face Recognition

FaceWarehouse: A 3D Facial Expression Database for Visual Computing

Face Alignment Across Large Poses: A 3D Solution

Deep Residual Learning for Image Recognition