Spatial Pyramid Based Graph Reasoning for Semantic Segmentation

doi:10.1109/CVPR42600.2020.00897

Open Access Proceedings Article

Xia Li, +5 more

- pp 8950-8959

TLDR

Li et al. apply graph convolution to semantic segmentation and propose an improved Laplacian that is data-dependent and uses an attention diagonal matrix to learn a better distance metric.

Abstract:

The convolution operation suffers from a limited receptive field, while global modeling is fundamental to dense prediction tasks such as semantic segmentation. In this paper, we apply graph convolution to the semantic segmentation task and propose an improved Laplacian. The graph reasoning is performed directly in the original feature space, organized as a spatial pyramid. Unlike existing methods, our Laplacian is data-dependent, and we introduce an attention diagonal matrix to learn a better distance metric. This removes the projecting and re-projecting processes, making the proposed method a lightweight module that can be easily plugged into current computer vision architectures. More importantly, performing graph reasoning directly in the feature space retains spatial relationships and allows the spatial pyramid to explore multiple long-range contextual patterns at different scales. Experiments on Cityscapes, COCO Stuff, PASCAL Context and PASCAL VOC demonstrate the effectiveness of the proposed method on semantic segmentation. We achieve comparable performance with advantages in computational and memory overhead.
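The idea in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the abstract does not give the exact formula, so the sketch assumes a similarity matrix of the form A = φ(X) Λ φ(X)ᵀ, a symmetrically normalized Laplacian L = I − D^(−1/2) A D^(−1/2), a sigmoid over globally pooled features for the attention diagonal Λ, and ReLU nonlinearities. The function name `graph_reasoning` and the weights `w_phi`, `w_attn` and `w_theta` are hypothetical stand-ins for learned parameters.

```python
import numpy as np


def graph_reasoning(x, w_phi, w_attn, w_theta, eps=1e-6):
    """One graph-reasoning step on N nodes (pixels) with C channels.

    x: (N, C) features; w_phi, w_attn: (C, M); w_theta: (C, C_out).
    All weights are hypothetical stand-ins for learned parameters.
    """
    # Non-negative node embeddings, so every pairwise similarity is >= 0.
    phi = np.maximum(x @ w_phi, 0.0)                        # (N, M)
    # Attention diagonal computed from globally pooled features (assumed form).
    lam = 1.0 / (1.0 + np.exp(-(x.mean(axis=0) @ w_attn)))  # (M,)
    # Row sums of A = phi @ diag(lam) @ phi.T, without building A.
    deg = phi @ (lam * phi.sum(axis=0))                     # (N,)
    d = 1.0 / np.sqrt(deg + eps)
    xt = x @ w_theta                                        # (N, C_out)
    # Evaluate D^-1/2 A D^-1/2 @ xt right to left, so no N x N matrix is formed.
    msg = phi.T @ (d[:, None] * xt)                         # (M, C_out)
    msg = d[:, None] * (phi @ (lam[:, None] * msg))         # (N, C_out)
    # Laplacian L = I - D^-1/2 A D^-1/2, followed by a ReLU.
    return np.maximum(xt - msg, 0.0)


rng = np.random.default_rng(0)
x = rng.normal(size=(16 * 16, 32))       # a 16x16 feature map, flattened
y = graph_reasoning(x, rng.normal(size=(32, 8)), rng.normal(size=(32, 8)),
                    rng.normal(size=(32, 32)))
print(y.shape)
```

Because the similarity matrix is built from the features themselves, the nodes stay in the original pixel grid. In a spatial pyramid, the same step could therefore be applied to downsampled copies of the feature map and the results merged back, although the sketch covers a single scale only.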

Citations

Journal Article

Attention mechanisms in computer vision: A survey

- 15 Mar 2022 -

Computational Visual Media

TL;DR: Guo et al. provide a comprehensive review of attention mechanisms in computer vision and categorize them by approach, such as channel attention, spatial attention, temporal attention, and branch attention.

Posted Content

Attention Mechanisms in Computer Vision: A Survey.

Meng-Hao Guo, +9 more

- 15 Nov 2021 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: A comprehensive review of attention mechanisms in computer vision can be found in this article, which categorizes them according to approach, such as channel attention, spatial attention, temporal attention and branch attention.

Posted Content

Semantic Flow for Fast and Accurate Scene Parsing

Xiangtai Li, +6 more

- 24 Feb 2020 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: This paper proposes a Flow Alignment Module (FAM) that learns Semantic Flow between feature maps of adjacent levels and broadcasts high-level features to high-resolution features effectively and efficiently. The method exhibits superior performance over other real-time methods, even on light-weight backbone networks.

Journal Article

Scene Segmentation With Dual Relation-Aware Attention Network

Jun Fu, +5 more

- 01 Jun 2021 -

IEEE Transactions on Neural Networks

TL;DR: A Dual Relation-aware Attention Network (DRANet) is proposed for scene segmentation, with two types of compact attention modules that model contextual dependencies in the spatial and channel dimensions, respectively.

Posted Content

Exploring Cross-Image Pixel Contrast for Semantic Segmentation

Wenguan Wang, +5 more

- 28 Jan 2021 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: In this article, a pixel-wise contrastive framework is proposed that enforces pixel embeddings belonging to the same semantic class to be more similar than embeddings from different classes.

References

Proceedings Article

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

TL;DR: In this article, the authors propose a residual learning framework that eases the training of networks substantially deeper than those used previously; the resulting networks won 1st place on the ILSVRC 2015 classification task.

Proceedings Article

ImageNet: A large-scale hierarchical image database

Jia Deng, +5 more

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.

Book Chapter

U-Net: Convolutional Networks for Biomedical Image Segmentation

Olaf Ronneberger, +2 more

TL;DR: Ronneberger et al. propose a network and training strategy that relies on heavy use of data augmentation to use the available annotated samples more efficiently. The network can be trained end-to-end from very few images and outperforms the prior best method (a sliding-window convolutional network) on the ISBI challenge for segmentation of neuronal structures in electron microscopic stacks.

Proceedings Article

Fully convolutional networks for semantic segmentation

Jonathan Long, +2 more

TL;DR: The key insight is to build “fully convolutional” networks that take input of arbitrary size and produce correspondingly-sized output with efficient inference and learning.

Proceedings Article

Feature Pyramid Networks for Object Detection

Tsung-Yi Lin, +5 more

TL;DR: This paper exploits the inherent multi-scale, pyramidal hierarchy of deep convolutional networks to construct feature pyramids with marginal extra cost and achieves state-of-the-art single-model results on the COCO detection benchmark without bells and whistles.

Related Papers (5)

Pyramid Scene Parsing Network

Dual Attention Network for Scene Segmentation

Deep Residual Learning for Image Recognition

Fully convolutional networks for semantic segmentation

DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs