UC-Net: Uncertainty Inspired RGB-D Saliency Detection via Conditional Variational Autoencoders

doi:10.1109/CVPR42600.2020.00861

Open Access Proceedings Article

- pp 8582-8591

TLDR

Zhang et al. propose a probabilistic RGB-D saliency detection network based on conditional variational autoencoders that models human annotation uncertainty and generates multiple saliency maps for each input image by sampling in the latent space.

Abstract:

In this paper, we propose the first framework (UCNet) to employ uncertainty for RGB-D saliency detection by learning from the data labeling process. Existing RGB-D saliency detection methods treat the saliency detection task as a point estimation problem and produce a single saliency map following a deterministic learning pipeline. Inspired by the saliency data labeling process, we propose a probabilistic RGB-D saliency detection network via conditional variational autoencoders to model human annotation uncertainty and generate multiple saliency maps for each input image by sampling in the latent space. With the proposed saliency consensus process, we are able to generate an accurate saliency map based on these multiple predictions. Quantitative and qualitative evaluations on six challenging benchmark datasets against 18 competing algorithms demonstrate the effectiveness of our approach in learning the distribution of saliency maps, leading to a new state-of-the-art in RGB-D saliency detection.
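The sample-then-agree idea in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: `toy_decoder` is a hypothetical stand-in for a trained decoder, latent samples are drawn from a standard normal rather than from a learned prior, and the consensus rule is a simple per-pixel majority vote over thresholded maps, which may differ from the paper's actual saliency consensus process.

```python
import random

def toy_decoder(image, z):
    """Hypothetical stand-in for a trained decoder: maps an input and a
    scalar latent sample z to a saliency map with values clipped to [0, 1]."""
    return [[min(1.0, max(0.0, px + 0.2 * z)) for px in row] for row in image]

def sample_saliency_maps(image, num_samples, rng):
    """Draw one latent sample per prediction (assumed z ~ N(0, 1)) and decode it."""
    return [toy_decoder(image, rng.gauss(0.0, 1.0)) for _ in range(num_samples)]

def majority_vote(maps, threshold=0.5):
    """Binarize each map, then mark a pixel salient only if more than
    half of the maps mark it salient."""
    height, width = len(maps[0]), len(maps[0][0])
    consensus = []
    for i in range(height):
        row = []
        for j in range(width):
            votes = sum(1 for m in maps if m[i][j] >= threshold)
            row.append(1 if 2 * votes > len(maps) else 0)
        consensus.append(row)
    return consensus

rng = random.Random(0)            # fixed seed so the sketch is repeatable
image = [[0.9, 0.1], [0.8, 0.2]]  # toy 2x2 input; a real input would be an RGB-D pair
maps = sample_saliency_maps(image, num_samples=5, rng=rng)
consensus = majority_vote(maps)
```

Each call to `sample_saliency_maps` yields a different set of plausible maps, and `majority_vote` collapses them into a single binary map. The threshold of 0.5 and the use of five samples are arbitrary choices for this sketch.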

Citations

Journal Article

A Review of Uncertainty Quantification in Deep Learning: Techniques, Applications and Challenges

Moloud Abdar, +13 more

- 12 Nov 2020 -

arXiv: Learning

TL;DR: This study reviews recent advances in UQ methods used in deep learning, investigates the application of these methods in reinforcement learning (RL), and outlines a few important applications of UQ methods.

Journal Article

Rethinking RGB-D Salient Object Detection: Models, Data Sets, and Large-Scale Benchmarks

Deng-Ping Fan, +4 more

- 01 May 2021 -

IEEE Transactions on Neural Networks and Learning Systems

TL;DR: It is demonstrated that D3Net can efficiently extract salient object masks from real scenes, enabling an effective background-changing application at a speed of 65 frames/s on a single GPU.

Posted Content

Inf-Net: Automatic COVID-19 Lung Infection Segmentation from CT Scans

Deng-Ping Fan, +7 more

- 22 Apr 2020 -

medRxiv

TL;DR: A novel COVID-19 Lung Infection Segmentation Deep Network (Inf-Net) is proposed to automatically identify infected regions from chest CT scans and outperforms most cutting-edge segmentation models and advances the state-of-the-art technology.

Posted Content

PraNet: Parallel Reverse Attention Network for Polyp Segmentation

Deng-Ping Fan, +6 more

- 13 Jun 2020 -

arXiv: Image and Video Processing

TL;DR: Quantitative and qualitative evaluations on five challenging datasets across six metrics show that PraNet significantly improves segmentation accuracy and offers advantages in generalizability and real-time segmentation efficiency.

Proceedings Article

Camouflaged Object Detection

Deng-Ping Fan, +5 more

TL;DR: A simple but effective framework for COD, termed Search Identification Network (SINet), which outperforms various state-of-the-art object detection baselines on all datasets tested, making it a robust, general framework that can help facilitate future research in COD.

References

Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.

Proceedings Article

Auto-Encoding Variational Bayes

Diederik P. Kingma, +1 more

TL;DR: A stochastic variational inference and learning algorithm that scales to large datasets and, under some mild differentiability conditions, even works in the intractable case is introduced.

Journal Article

A model of saliency-based visual attention for rapid scene analysis

Laurent Itti, +2 more

- 01 Nov 1998 -

IEEE Transactions on Pattern Analysis an...

TL;DR: In this article, a visual attention system inspired by the behavior and the neuronal architecture of the early primate visual system is presented, where multiscale image features are combined into a single topographical saliency map.

Proceedings Article

Frequency-tuned salient region detection

Radhakrishna Achanta, +3 more

TL;DR: This paper introduces a method for salient region detection that outputs full-resolution saliency maps with well-defined boundaries of salient objects. It outperforms the five compared algorithms on both the ground-truth evaluation and the segmentation task, achieving higher precision and better recall.

Journal Article

A saliency-based search mechanism for overt and covert shifts of visual attention.

Laurent Itti, +1 more

- 01 Jun 2000 -

Vision Research

TL;DR: A detailed computer implementation of a saliency map scheme is described, focusing on the problem of combining information across modalities, here orientation, intensity and color information, in a purely stimulus-driven manner, which is applied to common psychophysical stimuli as well as to a very demanding visual search task.

Related Papers (5)

Structure-Measure: A New Way to Evaluate Foreground Maps

EGNet: Edge Guidance Network for Salient Object Detection

Deep Residual Learning for Image Recognition

Enhanced-alignment Measure for Binary Foreground Map Evaluation

Cascaded Partial Decoder for Fast and Accurate Salient Object Detection