CGNet: A Light-weight Context Guided Network for Semantic Segmentation

Open AccessPosted Content

CGNet: A Light-weight Context Guided Network for Semantic Segmentation

Tianyi Wu, +3 more

- 20 Nov 2018 -

arXiv: Computer Vision and Pattern Recog...

Chats0

TLDR

This work proposes a novel Context Guided Network (CGNet), which is a light-weight and efficient network for semantic segmentation, and develops CGNet which captures contextual information in all stages of the network.

Abstract:

The demand of applying semantic segmentation model on mobile devices has been increasing rapidly. Current state-of-the-art networks have enormous amount of parameters hence unsuitable for mobile devices, while other small memory footprint models follow the spirit of classification network and ignore the inherent characteristic of semantic segmentation. To tackle this problem, we propose a novel Context Guided Network (CGNet), which is a light-weight and efficient network for semantic segmentation. We first propose the Context Guided (CG) block, which learns the joint feature of both local feature and surrounding context, and further improves the joint feature with the global context. Based on the CG block, we develop CGNet which captures contextual information in all stages of the network and is specially tailored for increasing segmentation accuracy. CGNet is also elaborately designed to reduce the number of parameters and save memory footprint. Under an equivalent number of parameters, the proposed CGNet significantly outperforms existing segmentation networks. Extensive experiments on Cityscapes and CamVid datasets verify the effectiveness of the proposed approach. Specifically, without any post-processing and multi-scale testing, the proposed CGNet achieves 64.8% mean IoU on Cityscapes with less than 0.5 M parameters. The source code for the complete system can be found at this https URL.

Citations

PDF

Open Access

More filters

Posted Content

Few-shot Video-to-Video Synthesis

Ting-Chun Wang, +5 more

- 28 Oct 2019 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: In this article, a few-shot vid2vid framework is proposed to synthesize videos of previously unseen subjects or scenes by leveraging few example images of the target at test time.

...read moreread less

Posted Content

MiniSeg: An Extremely Minimum Network for Efficient COVID-19 Segmentation

Yu Qiu, +2 more

- 21 Apr 2020 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: MiniSeg, a lightweight deep learning model for efficient COVID-19 segmentation, is proposed, which has several significant strengths: i) it only has 472K parameters and is thus not easy to overfit; ii) it has high computational efficiency and is therefore convenient for practical deployment; iii) it can be fast retrained by other users using their private CO VID-19 data for further improving performance.

...read moreread less

Journal ArticleDOI

AGLNet: Towards real-time semantic segmentation of self-driving images via attention-guided lightweight network

Quan Zhou, +6 more

- 01 Nov 2020 -

Applied Soft Computing

TL;DR: An attention-guided lightweight network, namely AGLNet, which employs an encoder–decoder architecture for real-time semantic segmentation, and achieves state-of-the-art results in terms of speed and accuracy on three self-driving datasets.

...read moreread less

Book ChapterDOI

Segmenting Transparent Objects in the Wild

Enze Xie, +5 more

TL;DR: A novel boundary-aware segmentation method, termed TransLab, is proposed, which exploits boundary as the clue to improve segmentation of transparent objects and significantly outperforms 20 recent object segmentation methods based on deep learning.

...read moreread less

Journal ArticleDOI

DSANet: Dilated spatial attention for real-time semantic segmentation in urban street scenes

Mohammed A. M. Elhassan, +3 more

- 30 Nov 2021 -

Expert Systems With Applications

TL;DR: A computationally efficient network named DSANet is presented, which follows a two-branch strategy to tackle the problem of real-time semantic segmentation in urban scenes and introduces a Simple Feature Fusion Module to better combine context information and spatial information.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.

...read moreread less

Proceedings Article

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.

...read moreread less

Proceedings ArticleDOI

Going deeper with convolutions

Christian Szegedy, +8 more

TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).

...read moreread less

Proceedings ArticleDOI

Densely Connected Convolutional Networks

Gao Huang, +3 more

TL;DR: DenseNet as mentioned in this paper proposes to connect each layer to every other layer in a feed-forward fashion, which can alleviate the vanishing gradient problem, strengthen feature propagation, encourage feature reuse, and substantially reduce the number of parameters.

...read moreread less

Proceedings Article

Neural Machine Translation by Jointly Learning to Align and Translate

Dzmitry Bahdanau, +2 more

TL;DR: It is conjecture that the use of a fixed-length vector is a bottleneck in improving the performance of this basic encoder-decoder architecture, and it is proposed to extend this by allowing a model to automatically (soft-)search for parts of a source sentence that are relevant to predicting a target word, without having to form these parts as a hard segment explicitly.

...read moreread less