scispace - formally typeset
Open AccessPosted Content

CGNet: A Light-weight Context Guided Network for Semantic Segmentation

Reads0
Chats0
TLDR
This work proposes a novel Context Guided Network (CGNet), which is a light-weight and efficient network for semantic segmentation, and develops CGNet which captures contextual information in all stages of the network.
Abstract
The demand of applying semantic segmentation model on mobile devices has been increasing rapidly. Current state-of-the-art networks have enormous amount of parameters hence unsuitable for mobile devices, while other small memory footprint models follow the spirit of classification network and ignore the inherent characteristic of semantic segmentation. To tackle this problem, we propose a novel Context Guided Network (CGNet), which is a light-weight and efficient network for semantic segmentation. We first propose the Context Guided (CG) block, which learns the joint feature of both local feature and surrounding context, and further improves the joint feature with the global context. Based on the CG block, we develop CGNet which captures contextual information in all stages of the network and is specially tailored for increasing segmentation accuracy. CGNet is also elaborately designed to reduce the number of parameters and save memory footprint. Under an equivalent number of parameters, the proposed CGNet significantly outperforms existing segmentation networks. Extensive experiments on Cityscapes and CamVid datasets verify the effectiveness of the proposed approach. Specifically, without any post-processing and multi-scale testing, the proposed CGNet achieves 64.8% mean IoU on Cityscapes with less than 0.5 M parameters. The source code for the complete system can be found at this https URL.

read more

Citations
More filters
Posted Content

Few-shot Video-to-Video Synthesis

TL;DR: In this article, a few-shot vid2vid framework is proposed to synthesize videos of previously unseen subjects or scenes by leveraging few example images of the target at test time.
Posted Content

MiniSeg: An Extremely Minimum Network for Efficient COVID-19 Segmentation

TL;DR: MiniSeg, a lightweight deep learning model for efficient COVID-19 segmentation, is proposed, which has several significant strengths: i) it only has 472K parameters and is thus not easy to overfit; ii) it has high computational efficiency and is therefore convenient for practical deployment; iii) it can be fast retrained by other users using their private CO VID-19 data for further improving performance.
Journal ArticleDOI

AGLNet: Towards real-time semantic segmentation of self-driving images via attention-guided lightweight network

TL;DR: An attention-guided lightweight network, namely AGLNet, which employs an encoder–decoder architecture for real-time semantic segmentation, and achieves state-of-the-art results in terms of speed and accuracy on three self-driving datasets.
Book ChapterDOI

Segmenting Transparent Objects in the Wild

TL;DR: A novel boundary-aware segmentation method, termed TransLab, is proposed, which exploits boundary as the clue to improve segmentation of transparent objects and significantly outperforms 20 recent object segmentation methods based on deep learning.
Journal ArticleDOI

DSANet: Dilated spatial attention for real-time semantic segmentation in urban street scenes

TL;DR: A computationally efficient network named DSANet is presented, which follows a two-branch strategy to tackle the problem of real-time semantic segmentation in urban scenes and introduces a Simple Feature Fusion Module to better combine context information and spatial information.
References
More filters
Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.
Proceedings Article

Adam: A Method for Stochastic Optimization

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
Proceedings ArticleDOI

Going deeper with convolutions

TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).
Proceedings ArticleDOI

Densely Connected Convolutional Networks

TL;DR: DenseNet as mentioned in this paper proposes to connect each layer to every other layer in a feed-forward fashion, which can alleviate the vanishing gradient problem, strengthen feature propagation, encourage feature reuse, and substantially reduce the number of parameters.
Proceedings Article

Neural Machine Translation by Jointly Learning to Align and Translate

TL;DR: It is conjecture that the use of a fixed-length vector is a bottleneck in improving the performance of this basic encoder-decoder architecture, and it is proposed to extend this by allowing a model to automatically (soft-)search for parts of a source sentence that are relevant to predicting a target word, without having to form these parts as a hard segment explicitly.
Related Papers (5)