Predicting Forward & Backward Facial Depth Maps From a Single RGB Image For Mobile 3d AR Application

doi:10.1109/IC3D48390.2019.8975899

Proceedings ArticleDOI

Predicting Forward & Backward Facial Depth Maps From a Single RGB Image For Mobile 3d AR Application

P Avinash, +1 more

- pp 1-8

Chats0

TLDR

A novel deep learning based solution to predict robust depth maps of a face, one forward facing and the other backward facing, from a single image from the wild, by training a fully convolutional neural network to learn the dual depth maps.

Abstract:

Cheap and fast 3D asset creation to enable AR/VR applications is a fast growing domain. This paper addresses a significant problem of reconstructing complete 3D information of a face in near real-time speed on a mobile phone. We propose a novel deep learning based solution to predict robust depth maps of a face, one forward facing and the other backward facing, from a single image from the wild. A critical contribution is that the proposed network is capable of learning the depths of the occluded part of the face too. This is achieved by training a fully convolutional neural network to learn the dual (forward and backward) depth maps, with a common encoder and two separate decoders. The 300W-LP, a cloud point dataset, is used to compute the required dual depth maps from the training data. The code and results will be made available at project page.

Citations

PDF

Open Access

More filters

Proceedings Article

A morphable model for the synthesis of 3D faces

Matthew Turk

Journal ArticleDOI

Pano-SfMLearner: Self-Supervised Multi-Task Learning of Depth and Semantics in Panoramic Videos

Mengyi Liu, +4 more

- 16 Apr 2021 -

IEEE Signal Processing Letters

TL;DR: Li et al. as discussed by the authors proposed a self-supervised framework for multi-task learning on depth, camera motion and semantics from panoramic videos, which is based on differentiable warping of adjacent views to the target.

...read moreread less

Journal ArticleDOI

Geometry Sampling-Based Adaption to DCGAN for 3D Face Generation

Guoliang Luo, +8 more

- 01 Feb 2023 -

Sensors

TL;DR: In this article , a geometric sampling method for the structured representation of 3D faces based on the intersection of iso-geodesic curves and radial curves, and a depth-like map sampling method using the average depth of grid cells on the front surface are proposed.

...read moreread less

Journal ArticleDOI

2T-UNET: A Two-Tower UNet with Depth Clues for Robust Stereo Depth Estimation

Rohit Choudhary, +2 more

- 27 Oct 2022 -

arXiv.org

TL;DR: The depth estimation problem is revisits, avoiding the explicit stereo matching step using a simple two-tower convolutional neural network, and the proposed algorithm is entitled 2T-UNet, which surpasses state-of-the-art monocular and stereo depth estimation methods on the challenging Scene dataset.

...read moreread less

Journal ArticleDOI

Facial Expression Recognition Based on Depth Fusion and Discriminative Association Learning

Xing Jin, +3 more

- 29 Jan 2022 -

Neural Processing Letters

References

PDF

Open Access

More filters

Proceedings Article

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.

...read moreread less

Proceedings ArticleDOI

Fully convolutional networks for semantic segmentation

Jonathan Long, +2 more

TL;DR: The key insight is to build “fully convolutional” networks that take input of arbitrary size and produce correspondingly-sized output with efficient inference and learning.

...read moreread less

Posted Content

MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications

Andrew Howard, +7 more

- 17 Apr 2017 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: This work introduces two simple global hyper-parameters that efficiently trade off between latency and accuracy and demonstrates the effectiveness of MobileNets across a wide range of applications and use cases including object detection, finegrain classification, face attributes and large scale geo-localization.

...read moreread less

Journal ArticleDOI

DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs

Liang-Chieh Chen, +4 more

- 01 Apr 2018 -

IEEE Transactions on Pattern Analysis an...

TL;DR: This work addresses the task of semantic image segmentation with Deep Learning and proposes atrous spatial pyramid pooling (ASPP), which is proposed to robustly segment objects at multiple scales, and improves the localization of object boundaries by combining methods from DCNNs and probabilistic graphical models.

...read moreread less

Posted Content

DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs

Liang-Chieh Chen, +4 more

- 02 Jun 2016 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: DeepLab as discussed by the authors proposes atrous spatial pyramid pooling (ASPP) to segment objects at multiple scales by probing an incoming convolutional feature layer with filters at multiple sampling rates and effective fields-of-views.

...read moreread less

arXiv: Computer Vision and Pattern Recog...

Connecting the Dots: Learning Representations for Active Monocular Depth Estimation

Gernot Riegler, +4 more

Near laser-scan quality 3-D face reconstruction from a low-quality depth stream

Matthias Hernandez, +2 more

- 01 Apr 2015 -

Image and Vision Computing

Predicting Forward & Backward Facial Depth Maps From a Single RGB Image For Mobile 3d AR Application

Citations

A morphable model for the synthesis of 3D faces

Pano-SfMLearner: Self-Supervised Multi-Task Learning of Depth and Semantics in Panoramic Videos

Geometry Sampling-Based Adaption to DCGAN for 3D Face Generation

2T-UNET: A Two-Tower UNet with Depth Clues for Robust Stereo Depth Estimation

Facial Expression Recognition Based on Depth Fusion and Discriminative Association Learning

References

Adam: A Method for Stochastic Optimization

Fully convolutional networks for semantic segmentation

MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications

DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs

DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs

Related Papers (5)

3D Aware Correction and Completion of Depth Maps in Piecewise Planar Scenes

Face Reconstruction on Mobile Devices Using a Height Map Shape Model and Fast Regularization

Video Depth Estimation by Fusing Flow-to-Depth Proposals

Connecting the Dots: Learning Representations for Active Monocular Depth Estimation

Near laser-scan quality 3-D face reconstruction from a low-quality depth stream