scispace - formally typeset
Open AccessProceedings ArticleDOI

Amulet: Aggregating Multi-level Convolutional Features for Salient Object Detection

TLDR
Amulet is presented, a generic aggregating multi-level convolutional feature framework for salient object detection that provides accurate salient object labeling and performs favorably against state-of-the-art approaches in terms of near all compared evaluation metrics.
Abstract
Fully convolutional neural networks (FCNs) have shown outstanding performance in many dense labeling problems. One key pillar of these successes is mining relevant information from features in convolutional layers. However, how to better aggregate multi-level convolutional feature maps for salient object detection is underexplored. In this work, we present Amulet, a generic aggregating multi-level convolutional feature framework for salient object detection. Our framework first integrates multi-level feature maps into multiple resolutions, which simultaneously incorporate coarse semantics and fine details. Then it adaptively learns to combine these feature maps at each resolution and predict saliency maps with the combined features. Finally, the predicted results are efficiently fused to generate the final saliency map. In addition, to achieve accurate boundary inference and semantic enhancement, edge-aware feature maps in low-level layers and the predicted results of low resolution features are recursively embedded into the learning framework. By aggregating multi-level convolutional features in this efficient and flexible manner, the proposed saliency model provides accurate salient object labeling. Comprehensive experiments demonstrate that our method performs favorably against state-of-the-art approaches in terms of near all compared evaluation metrics.

read more

Citations
More filters
Book ChapterDOI

Multi-level Features Selection Network Based on Multi-attention for Salient Object Detection.

TL;DR: Zhang et al. as discussed by the authors proposed a pyramid attention module that integrates channel and spatial attention to extract semantic information for multi-scale high-level features and a self-interaction attention module to extract detailed information for multiscale low-level feature.
Proceedings ArticleDOI

Fine-grained EEG classification using convolution neural network

TL;DR: In this paper , fine-grained EEG convolutional neural networks were used for low-contrast oddball classification, classification and recognition of EEG signals, and the results showed that finegrained CNN had a better performance in small-difference EEG signal classification compared with the classical EEG CNN.
Posted Content

Unifying Global-Local Representations in Salient Object Detection with Transformer.

TL;DR: Zhang et al. as discussed by the authors introduced a new attention-based encoder, vision transformer, into salient object detection to ensure the globalization of the representations from shallow to deep layers.
Journal ArticleDOI

Progressive Attention-Based Feature Recovery With Scribble Supervision for Saliency Detection in Optical Remote Sensing Image

TL;DR: Zhang et al. as discussed by the authors proposed a novel end-to-end progressive attention-based feature recovery framework with scribble supervision, where an object position module (OPM) is proposed to capture and enhance the long-range semantic dependence of objects' position information, which depends on the complementary attention mechanism.
Posted Content

CMA-Net: A Cascaded Mutual Attention Network for Light Field Salient Object Detection

TL;DR: This work proposes CMA-Net, which consists of two novel cascaded mutual attention modules aiming at fusing the high level features from the modalities of all-in-focus and depth, and outperforms 30 SOD methods on two widely applied light field benchmark datasets.
References
More filters
Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.
Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.
Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.
Book ChapterDOI

U-Net: Convolutional Networks for Biomedical Image Segmentation

TL;DR: Neber et al. as discussed by the authors proposed a network and training strategy that relies on the strong use of data augmentation to use the available annotated samples more efficiently, which can be trained end-to-end from very few images and outperforms the prior best method (a sliding-window convolutional network) on the ISBI challenge for segmentation of neuronal structures in electron microscopic stacks.
Proceedings ArticleDOI

Fully convolutional networks for semantic segmentation

TL;DR: The key insight is to build “fully convolutional” networks that take input of arbitrary size and produce correspondingly-sized output with efficient inference and learning.
Related Papers (5)