Fully convolutional networks for semantic segmentation

doi:10.1109/CVPR.2015.7298965

Open AccessProceedings ArticleDOI

Fully convolutional networks for semantic segmentation

- pp 3431-3440

TLDR

The key insight is to build “fully convolutional” networks that take input of arbitrary size and produce correspondingly-sized output with efficient inference and learning.

Abstract:

Convolutional networks are powerful visual models that yield hierarchies of features. We show that convolutional networks by themselves, trained end-to-end, pixels-to-pixels, exceed the state-of-the-art in semantic segmentation. Our key insight is to build “fully convolutional” networks that take input of arbitrary size and produce correspondingly-sized output with efficient inference and learning. We define and detail the space of fully convolutional networks, explain their application to spatially dense prediction tasks, and draw connections to prior models. We adapt contemporary classification networks (AlexNet [20], the VGG net [31], and GoogLeNet [32]) into fully convolutional networks and transfer their learned representations by fine-tuning [3] to the segmentation task. We then define a skip architecture that combines semantic information from a deep, coarse layer with appearance information from a shallow, fine layer to produce accurate and detailed segmentations. Our fully convolutional network achieves state-of-the-art segmentation of PASCAL VOC (20% relative improvement to 62.2% mean IU on 2012), NYUDv2, and SIFT Flow, while inference takes less than one fifth of a second for a typical image.

Citations

PDF

Open Access

More filters

Book ChapterDOI

Saliency Detection with Recurrent Fully Convolutional Networks

Linzhao Wang, +4 more

TL;DR: This paper develops a new saliency model using recurrent fully convolutional networks (RFCNs) that is able to incorporate saliency prior knowledge for more accurate inference and enables the network to capture generic representations of objects for saliency detection.

...read moreread less

Book ChapterDOI

ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation

Sachin Mehta, +4 more

TL;DR: ESPapernot et al. as discussed by the authors introduced a fast and efficient convolutional neural network, ESPNet, for semantic segmentation of high resolution images under resource constraints, which is efficient in terms of computation, memory, and power.

...read moreread less

Journal ArticleDOI

Video Salient Object Detection via Fully Convolutional Networks

Wenguan Wang, +2 more

- 01 Jan 2018 -

IEEE Transactions on Image Processing

TL;DR: Wang et al. as discussed by the authors proposed a deep video saliency network consisting of two modules, for capturing the spatial and temporal saliency information, respectively, which can directly produce spatio-temporal saliency inference without time-consuming optical flow computation.

...read moreread less

Posted Content

The History Began from AlexNet: A Comprehensive Survey on Deep Learning Approaches.

Md. Zahangir Alom, +8 more

- 03 Mar 2018 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: This report presents a brief survey on development of DL approaches, including Deep Neural Network (DNN), Convolutional neural network (CNN), Recurrent Neural network (RNN) including Long Short Term Memory (LSTM) and Gated Recurrent Units (GRU), Auto-Encoder (AE), Deep Belief Network (DBN), Generative Adversarial Network (GAN), and Deep Reinforcement Learning (DRL).

...read moreread less

Proceedings ArticleDOI

Competitive Collaboration: Joint Unsupervised Learning of Depth, Camera Motion, Optical Flow and Motion Segmentation

Anurag Ranjan, +6 more

TL;DR: In this article, the authors propose a competitive collaboration framework that facilitates the coordinated training of multiple specialized neural networks to solve complex low-level vision problems, such as single view depth prediction, camera motion estimation, optical flow, and segmentation of a video into the static scene and moving regions.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

...read moreread less

Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.

...read moreread less

Book

Pattern Recognition and Machine Learning

Christopher M. Bishop

TL;DR: Probability Distributions, linear models for Regression, Linear Models for Classification, Neural Networks, Graphical Models, Mixture Models and EM, Sampling Methods, Continuous Latent Variables, Sequential Data are studied.

...read moreread less

Proceedings ArticleDOI

Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation

Ross Girshick, +3 more

TL;DR: RCNN as discussed by the authors combines CNNs with bottom-up region proposals to localize and segment objects, and when labeled training data is scarce, supervised pre-training for an auxiliary task, followed by domain-specific fine-tuning, yields a significant performance boost.

...read moreread less

Book

A wavelet tour of signal processing

Stéphane Mallat

TL;DR: An introduction to a Transient World and an Approximation Tour of Wavelet Packet and Local Cosine Bases.

...read moreread less

Collapse

Fully convolutional networks for semantic segmentation

Citations

Saliency Detection with Recurrent Fully Convolutional Networks

ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation

Video Salient Object Detection via Fully Convolutional Networks

The History Began from AlexNet: A Comprehensive Survey on Deep Learning Approaches.

Competitive Collaboration: Joint Unsupervised Learning of Depth, Camera Motion, Optical Flow and Motion Segmentation

References

ImageNet Classification with Deep Convolutional Neural Networks

Very Deep Convolutional Networks for Large-Scale Image Recognition

Pattern Recognition and Machine Learning

Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation

A wavelet tour of signal processing

Related Papers (5)

Deep Residual Learning for Image Recognition

U-Net: Convolutional Networks for Biomedical Image Segmentation

ImageNet Classification with Deep Convolutional Neural Networks

Very Deep Convolutional Networks for Large-Scale Image Recognition

Going deeper with convolutions