Progressive Attention Guided Recurrent Network for Salient Object Detection

doi:10.1109/CVPR.2018.00081

Proceedings Article

Xiaoning Zhang, +4 more

pp. 714-722

TLDR

The paper proposes a novel attention-guided network that selectively integrates multi-level contextual information in a progressive manner, and introduces multi-path recurrent feedback to enhance this progressive attention-driven framework.

Abstract:

Effective convolutional features play an important role in saliency estimation, but learning powerful features for saliency remains a challenging task. FCN-based methods apply multi-level convolutional features directly and without distinction, which leads to sub-optimal results because redundant details distract the model. In this paper, we propose a novel attention-guided network that selectively integrates multi-level contextual information in a progressive manner. The attentive features generated by our network alleviate background distraction and thus achieve better performance. In addition, most existing algorithms perform salient object detection by exploiting side-output features of the backbone feature extraction network. However, the shallower layers of the backbone lack the ability to capture global semantic information, which limits effective feature learning. To address this problem, we introduce multi-path recurrent feedback to enhance the proposed progressive attention-driven framework. Through multi-path recurrent connections, global semantic information from the top convolutional layer is transferred to shallower layers, which intrinsically refines the entire network. Experimental results on six benchmark datasets demonstrate that our algorithm performs favorably against state-of-the-art approaches.
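The two ideas in the abstract, progressive attention-guided fusion and recurrent feedback from the top layer, can be illustrated with a toy sketch. This is not the authors' architecture: it assumes 1-D feature vectors of equal length at every level, a softmax used as a stand-in for the attention mechanism (the abstract does not specify its form), and simple addition for fusion and feedback. The names `attend`, `progressive_fuse`, and `recurrent_refine` are hypothetical.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def attend(feature, context):
    """Reweight `feature` with attention weights derived from `context`.

    Assumption: attention is a softmax over positions of the context.
    The weights are scaled by the vector length so that uniform
    attention leaves the feature unchanged.
    """
    weights = softmax(context)
    n = len(feature)
    return [f * w * n for f, w in zip(feature, weights)]


def progressive_fuse(levels):
    """Fuse features from the deepest level down to the shallowest.

    `levels` is ordered shallow -> deep. At each step the running
    context guides attention over the next shallower level, and the
    attended feature is added to the context.
    """
    context = levels[-1]
    for feature in reversed(levels[:-1]):
        attended = attend(feature, context)
        context = [a + c for a, c in zip(attended, context)]
    return context


def recurrent_refine(levels, steps=2):
    """Toy recurrent feedback: add the top-level feature to every
    shallower level, then fuse again. `steps=1` means no feedback."""
    fused = progressive_fuse(levels)
    for _ in range(steps - 1):
        top = levels[-1]
        shallower = [[f + t for f, t in zip(level, top)] for level in levels[:-1]]
        levels = shallower + [top]
        fused = progressive_fuse(levels)
    return fused
```

For example, `progressive_fuse([[1.0, 1.0], [0.0, 10.0]])` concentrates the shallow feature at the second position, because the deep context is strongly peaked there. In a real network the levels would be convolutional feature maps at different resolutions, and the attention and fusion operators would be learned.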

Citations

Proceedings Article

BASNet: Boundary-Aware Salient Object Detection

Xuebin Qin, +5 more

TL;DR: Experimental results on six public datasets show that the proposed predict-refine architecture, BASNet, outperforms the state-of-the-art methods both in terms of regional and boundary evaluation measures.

Proceedings Article

EGNet: Edge Guidance Network for Salient Object Detection

Jiaxing Zhao, +5 more

TL;DR: An edge guidance network (EGNet) is proposed for salient object detection. It uses three steps to model two kinds of complementary information, salient edge information and salient object information, simultaneously in a single network, which helps locate salient objects, especially their boundaries, more accurately.

Proceedings Article

A Simple Pooling-Based Design for Real-Time Salient Object Detection

Jiang-Jiang Liu, +4 more

TL;DR: This work addresses salient object detection by investigating how to expand the role of pooling in convolutional neural networks. It builds a global guidance module (GGM) and designs a feature aggregation module (FAM) that fuses the coarse-level semantic information with the fine-level features from the top-down pathway.

Proceedings Article

Cascaded Partial Decoder for Fast and Accurate Salient Object Detection

Zhe Wu, +2 more

TL;DR: This work proposes a novel Cascaded Partial Decoder (CPD) framework for fast and accurate salient object detection, and applies the framework to existing multi-level feature aggregation models to significantly improve their efficiency and accuracy.

Journal Article

U2-Net: Going deeper with nested U-structure for salient object detection

Xuebin Qin, +5 more

- 01 Oct 2020 -

Pattern Recognition

TL;DR: This work presents U2-Net, a simple yet powerful deep network architecture for salient object detection (SOD). Its two-level nested U-structure makes it possible to train a deep network from scratch without using backbones from image classification tasks.

References

Proceedings Article

Fully convolutional networks for semantic segmentation

Jonathan Long, +2 more

TL;DR: The key insight is to build “fully convolutional” networks that take input of arbitrary size and produce correspondingly-sized output with efficient inference and learning.

Posted Content

Caffe: Convolutional Architecture for Fast Feature Embedding

Yangqing Jia, +7 more

- 20 Jun 2014 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: Caffe is a BSD-licensed C++ library with Python and MATLAB bindings for training and deploying general-purpose convolutional neural networks and other deep models efficiently on commodity architectures.

Proceedings Article

Caffe: Convolutional Architecture for Fast Feature Embedding

Yangqing Jia, +7 more

TL;DR: Caffe provides multimedia scientists and practitioners with a clean and modifiable framework for state-of-the-art deep learning algorithms and a collection of reference models for training and deploying general-purpose convolutional neural networks and other deep models efficiently on commodity architectures.

Posted Content

Fully Convolutional Networks for Semantic Segmentation

Jonathan Long, +2 more

- 14 Nov 2014 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: It is shown that convolutional networks by themselves, trained end-to-end, pixels-to-pixels, improve on the previous best result in semantic segmentation.

Proceedings Article

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

Kelvin Xu, +10 more

TL;DR: An attention-based model that automatically learns to describe the content of images is introduced. It can be trained deterministically using standard backpropagation techniques, or stochastically by maximizing a variational lower bound.

Related Papers (5)

Deep Residual Learning for Image Recognition

Saliency Detection via Graph-Based Manifold Ranking

The Secrets of Salient Object Segmentation

Learning to Detect Salient Objects with Image-Level Supervision

Global contrast based salient region detection