Real-time single image depth perception in the wild with handheld devices

Open AccessPosted Content

Real-time single image depth perception in the wild with handheld devices

Filippo Aleotti, +5 more

- 10 Jun 2020 -

arXiv: Computer Vision and Pattern Recog...

Chats0

TLDR

A thorough evaluation of real-time, depth-aware augmented reality networks highlights the ability of such fast networks to generalize well to new environments, a crucial feature required to tackle the extremely varied contexts faced in real applications.

Abstract:

Depth perception is paramount to tackle real-world problems, ranging from autonomous driving to consumer applications For the latter, depth estimation from a single image represents the most versatile solution, since a standard camera is available on almost any handheld device Nonetheless, two main issues limit its practical deployment: i) the low reliability when deployed in-the-wild and ii) the demanding resource requirements to achieve real-time performance, often not compatible with such devices Therefore, in this paper, we deeply investigate these issues showing how they are both addressable adopting appropriate network design and training strategies -- also outlining how to map the resulting networks on handheld devices to achieve real-time performance Our thorough evaluation highlights the ability of such fast networks to generalize well to new environments, a crucial feature required to tackle the extremely varied contexts faced in real applications Indeed, to further support this evidence, we report experimental results concerning real-time depth-aware augmented reality and image blurring with smartphones in-the-wild

Citations

PDF

Open Access

More filters

Posted Content

HR-Depth: High Resolution Self-Supervised Monocular Depth Estimation

Xiaoyang Lyu, +7 more

- 14 Dec 2020 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: An improved DepthNet, HR-Depth, is presented with two effective strategies: (1) re-design the skip-connection in DepthNet to get better high-resolution features and (2) propose feature fusion Squeeze-and-Excitation module to fuse feature more efficiently.

...read moreread less

Journal ArticleDOI

A review of computer graphics approaches to urban modeling from a machine learning perspective

Tian Feng, +3 more

- 22 May 2021 -

Journal of Zhejiang University Science C

TL;DR: This serves as an overview of the current state of research on urban modeling from a machine learning perspective and presents a review of approaches to urban modeling in computer graphics using machine learning in the literature published between 2010 and 2019.

...read moreread less

Proceedings ArticleDOI

Lightweight Monocular Depth Estimation through Guided Decoding

Michael Bernard Rudolph, +4 more

TL;DR: A lightweight encoder-decoder architecture for monocular depth estimation, specifically designed for embedded platforms, based on the Guided Upsampling Block (GUB) for building the decoder of this model, achieving high resolution results with fine-grained details.

...read moreread less

Journal ArticleDOI

Bayesian cue integration of structure from motion and CNN-based monocular depth estimation for autonomous robot navigation

Fuseini Mumuni, +1 more

- 02 Mar 2022 -

International journal of intelligent rob...

Journal ArticleDOI

URCDC-Depth: Uncertainty Rectified Cross-Distillation with CutFlip for Monocular Depth Estimation

Shuwei Shao, +5 more

- 16 Feb 2023 -

arXiv.org

TL;DR: Shao et al. as mentioned in this paper proposed an uncertainty rectified cross-distillation between Transformer and convolutional neural network (CNN) to learn a unified depth estimator by using the depth estimates from the Transformer branch and the CNN branch as pseudo labels to teach each other.

...read moreread less

References

PDF

Open Access

More filters

Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.

...read moreread less

Proceedings Article

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.

...read moreread less

Proceedings ArticleDOI

ImageNet: A large-scale hierarchical image database

Jia Deng, +5 more

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.

...read moreread less

Journal ArticleDOI

Image quality assessment: from error visibility to structural similarity

Zhou Wang, +3 more

- 01 Apr 2004 -

IEEE Transactions on Image Processing

TL;DR: In this article, a structural similarity index is proposed for image quality assessment based on the degradation of structural information, which can be applied to both subjective ratings and objective methods on a database of images compressed with JPEG and JPEG2000.

...read moreread less

Book ChapterDOI

Microsoft COCO: Common Objects in Context

Tsung-Yi Lin, +7 more

TL;DR: A new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of scene understanding by gathering images of complex everyday scenes containing common objects in their natural context.

...read moreread less