scispace - formally typeset
Open AccessPosted Content

Real-time single image depth perception in the wild with handheld devices

Reads0
Chats0
TLDR
A thorough evaluation of real-time, depth-aware augmented reality networks highlights the ability of such fast networks to generalize well to new environments, a crucial feature required to tackle the extremely varied contexts faced in real applications.
Abstract
Depth perception is paramount to tackle real-world problems, ranging from autonomous driving to consumer applications For the latter, depth estimation from a single image represents the most versatile solution, since a standard camera is available on almost any handheld device Nonetheless, two main issues limit its practical deployment: i) the low reliability when deployed in-the-wild and ii) the demanding resource requirements to achieve real-time performance, often not compatible with such devices Therefore, in this paper, we deeply investigate these issues showing how they are both addressable adopting appropriate network design and training strategies -- also outlining how to map the resulting networks on handheld devices to achieve real-time performance Our thorough evaluation highlights the ability of such fast networks to generalize well to new environments, a crucial feature required to tackle the extremely varied contexts faced in real applications Indeed, to further support this evidence, we report experimental results concerning real-time depth-aware augmented reality and image blurring with smartphones in-the-wild

read more

Citations
More filters
Posted Content

HR-Depth: High Resolution Self-Supervised Monocular Depth Estimation

TL;DR: An improved DepthNet, HR-Depth, is presented with two effective strategies: (1) re-design the skip-connection in DepthNet to get better high-resolution features and (2) propose feature fusion Squeeze-and-Excitation module to fuse feature more efficiently.
Journal ArticleDOI

A review of computer graphics approaches to urban modeling from a machine learning perspective

TL;DR: This serves as an overview of the current state of research on urban modeling from a machine learning perspective and presents a review of approaches to urban modeling in computer graphics using machine learning in the literature published between 2010 and 2019.
Proceedings ArticleDOI

Lightweight Monocular Depth Estimation through Guided Decoding

TL;DR: A lightweight encoder-decoder architecture for monocular depth estimation, specifically designed for embedded platforms, based on the Guided Upsampling Block (GUB) for building the decoder of this model, achieving high resolution results with fine-grained details.
Journal ArticleDOI

URCDC-Depth: Uncertainty Rectified Cross-Distillation with CutFlip for Monocular Depth Estimation

TL;DR: Shao et al. as mentioned in this paper proposed an uncertainty rectified cross-distillation between Transformer and convolutional neural network (CNN) to learn a unified depth estimator by using the depth estimates from the Transformer branch and the CNN branch as pseudo labels to teach each other.
References
More filters
Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.
Proceedings Article

Adam: A Method for Stochastic Optimization

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
Proceedings ArticleDOI

ImageNet: A large-scale hierarchical image database

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.
Journal ArticleDOI

Image quality assessment: from error visibility to structural similarity

TL;DR: In this article, a structural similarity index is proposed for image quality assessment based on the degradation of structural information, which can be applied to both subjective ratings and objective methods on a database of images compressed with JPEG and JPEG2000.
Book ChapterDOI

Microsoft COCO: Common Objects in Context

TL;DR: A new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of scene understanding by gathering images of complex everyday scenes containing common objects in their natural context.
Related Papers (5)