Amulet: Aggregating Multi-level Convolutional Features for Salient Object Detection

doi:10.1109/ICCV.2017.31

Open AccessProceedings ArticleDOI

Amulet: Aggregating Multi-level Convolutional Features for Salient Object Detection

- pp 202-211

TLDR

Amulet is presented, a generic aggregating multi-level convolutional feature framework for salient object detection that provides accurate salient object labeling and performs favorably against state-of-the-art approaches in terms of near all compared evaluation metrics.

Abstract:

Fully convolutional neural networks (FCNs) have shown outstanding performance in many dense labeling problems. One key pillar of these successes is mining relevant information from features in convolutional layers. However, how to better aggregate multi-level convolutional feature maps for salient object detection is underexplored. In this work, we present Amulet, a generic aggregating multi-level convolutional feature framework for salient object detection. Our framework first integrates multi-level feature maps into multiple resolutions, which simultaneously incorporate coarse semantics and fine details. Then it adaptively learns to combine these feature maps at each resolution and predict saliency maps with the combined features. Finally, the predicted results are efficiently fused to generate the final saliency map. In addition, to achieve accurate boundary inference and semantic enhancement, edge-aware feature maps in low-level layers and the predicted results of low resolution features are recursively embedded into the learning framework. By aggregating multi-level convolutional features in this efficient and flexible manner, the proposed saliency model provides accurate salient object labeling. Comprehensive experiments demonstrate that our method performs favorably against state-of-the-art approaches in terms of near all compared evaluation metrics.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

VSA-CGAN: An Intelligent Generation Model for Deep Learning Sample Database Construction

Peng Zhang, +6 more

- 27 Jul 2020 -

IEEE Access

TL;DR: A conditional generative adversarial network model (VSA-CGAN) is proposed, which integrates the self-attention mechanism of visual perception to optimize the inference of object attention feature maps, so as to learn the global information of the image and the detailed features of the object.

...read moreread less

Journal ArticleDOI

Saliency-aware inter-image color transfer for image manipulation

Xiuwen Liu, +4 more

- 29 Mar 2019 -

Multimedia Tools and Applications

TL;DR: Experimental results show that the proposed saliency-aware inter-image color transfer method not only highlights objects effectively but also preserves the naturalness of images well, and consistently outperforms other image manipulation methods when viewing the manipulated images with or without the source image as the reference.

...read moreread less

Journal ArticleDOI

Quality-Driven Dual-Branch Feature Integration Network for Video Salient Object Detection

Xiaofei Zhou, +4 more

- 29 Jan 2023 -

Electronics

TL;DR: Wang et al. as discussed by the authors proposed a quality-driven dual-branch feature integration network majoring in the adaptive fusion of multi-modal cues and sufficient aggregation of multilevel spatio-temporal features.

...read moreread less

Book ChapterDOI

Bi-directional Features Reuse Network for Salient Object Detection

Fengwei Jia, +5 more

TL;DR: A novel bi-directional features reuse network (BDFRN) for salient object detection, which consists of two subnets: forward-skip subnet and reverse-connect subnet, which can transmit the location features from top blocks to bottom blocks, such that these features can be reused and communicated between different blocks.

...read moreread less

Journal ArticleDOI

Saliency detection network with two-stream encoder and interactive decoder

Aiping Yang, +6 more

- 01 Aug 2022 -

Neurocomputing

TL;DR: Zhang et al. as discussed by the authors proposed a two-stream encoder consisting of the region extraction branch and the edge extraction branch to balance the feature domain differences between the regions and edges.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.

...read moreread less

Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.

...read moreread less

Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.

...read moreread less

Book ChapterDOI

U-Net: Convolutional Networks for Biomedical Image Segmentation

Olaf Ronneberger, +2 more

TL;DR: Neber et al. as discussed by the authors proposed a network and training strategy that relies on the strong use of data augmentation to use the available annotated samples more efficiently, which can be trained end-to-end from very few images and outperforms the prior best method (a sliding-window convolutional network) on the ISBI challenge for segmentation of neuronal structures in electron microscopic stacks.

...read moreread less

Proceedings ArticleDOI

Fully convolutional networks for semantic segmentation

Jonathan Long, +2 more

TL;DR: The key insight is to build “fully convolutional” networks that take input of arbitrary size and produce correspondingly-sized output with efficient inference and learning.

...read moreread less

Collapse

Amulet: Aggregating Multi-level Convolutional Features for Salient Object Detection

Citations

VSA-CGAN: An Intelligent Generation Model for Deep Learning Sample Database Construction

Saliency-aware inter-image color transfer for image manipulation

Quality-Driven Dual-Branch Feature Integration Network for Video Salient Object Detection

Bi-directional Features Reuse Network for Salient Object Detection

Saliency detection network with two-stream encoder and interactive decoder

References

Deep Residual Learning for Image Recognition

Very Deep Convolutional Networks for Large-Scale Image Recognition

Very Deep Convolutional Networks for Large-Scale Image Recognition

U-Net: Convolutional Networks for Biomedical Image Segmentation

Fully convolutional networks for semantic segmentation

Related Papers (5)

Saliency Detection via Graph-Based Manifold Ranking

Global contrast based salient region detection

Hierarchical Saliency Detection

Deep Residual Learning for Image Recognition

The Secrets of Salient Object Segmentation