Open Access Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

TLDR
This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.
Abstract
In this work we investigate the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting. Our main contribution is a thorough evaluation of networks of increasing depth using an architecture with very small (3x3) convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers. These findings were the basis of our ImageNet Challenge 2014 submission, where our team secured the first and the second places in the localisation and classification tracks respectively. We also show that our representations generalise well to other datasets, where they achieve state-of-the-art results. We have made our two best-performing ConvNet models publicly available to facilitate further research on the use of deep visual representations in computer vision.
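To make the depth claim concrete, here is a minimal PyTorch sketch (not the authors' released models) of the 16-weight-layer configuration described in the abstract: thirteen 3x3 convolutional layers arranged in stacks separated by 2x2 max-pooling, followed by three fully connected layers. Channel counts follow the paper; dropout, initialisation and training details are omitted.

```python
import torch
import torch.nn as nn

# 16-weight-layer configuration: stacks of 3x3 convolutions separated by
# 2x2 max-pooling ("M"), followed by three fully connected layers.
cfg_16 = [64, 64, "M", 128, 128, "M", 256, 256, 256, "M",
          512, 512, 512, "M", 512, 512, 512, "M"]

def make_features(cfg, in_channels=3):
    layers = []
    for v in cfg:
        if v == "M":
            layers.append(nn.MaxPool2d(kernel_size=2, stride=2))
        else:
            layers += [nn.Conv2d(in_channels, v, kernel_size=3, padding=1),
                       nn.ReLU(inplace=True)]
            in_channels = v
    return nn.Sequential(*layers)

vgg16_like = nn.Sequential(
    make_features(cfg_16),
    nn.Flatten(),
    nn.Linear(512 * 7 * 7, 4096), nn.ReLU(inplace=True),
    nn.Linear(4096, 4096), nn.ReLU(inplace=True),
    nn.Linear(4096, 1000),   # 1000 ImageNet classes
)

logits = vgg16_like(torch.randn(1, 3, 224, 224))   # -> shape (1, 1000)
```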


Citations
Book Chapter

U-Net: Convolutional Networks for Biomedical Image Segmentation

TL;DR: Ronneberger et al. proposed a network and training strategy that rely on the strong use of data augmentation to use the available annotated samples more efficiently; the network can be trained end-to-end from very few images and outperforms the prior best method (a sliding-window convolutional network) on the ISBI challenge for segmentation of neuronal structures in electron microscopic stacks.
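The summary above describes the U-Net design: a contracting path, an expanding path, and skip connections that concatenate encoder features into the decoder. A minimal PyTorch sketch follows, with one down/up step instead of the paper's four, padded convolutions so input and output sizes match (the paper uses unpadded convolutions), and illustrative channel counts.

```python
import torch
import torch.nn as nn

def conv_block(c_in, c_out):
    # Two 3x3 convolutions with ReLU, as used throughout U-Net.
    return nn.Sequential(
        nn.Conv2d(c_in, c_out, 3, padding=1), nn.ReLU(inplace=True),
        nn.Conv2d(c_out, c_out, 3, padding=1), nn.ReLU(inplace=True),
    )

class TinyUNet(nn.Module):
    def __init__(self, n_classes=2):
        super().__init__()
        self.enc = conv_block(1, 64)
        self.pool = nn.MaxPool2d(2)
        self.mid = conv_block(64, 128)
        self.up = nn.ConvTranspose2d(128, 64, 2, stride=2)
        self.dec = conv_block(128, 64)        # 128 = 64 (skip) + 64 (upsampled)
        self.out = nn.Conv2d(64, n_classes, 1)

    def forward(self, x):
        e = self.enc(x)
        m = self.mid(self.pool(e))
        u = self.up(m)
        d = self.dec(torch.cat([e, u], dim=1))   # skip connection
        return self.out(d)

seg = TinyUNet()(torch.randn(1, 1, 64, 64))       # -> (1, 2, 64, 64)
```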
Journal Article

ImageNet Large Scale Visual Recognition Challenge

TL;DR: The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) is a benchmark in object category classification and detection on hundreds of object categories and millions of images; it has been run annually from 2010 to the present, attracting participation from more than fifty institutions.
Proceedings Article

Rethinking the Inception Architecture for Computer Vision

TL;DR: In this article, the authors explore ways to scale up networks that aim to utilize the added computation as efficiently as possible, through suitably factorized convolutions and aggressive regularization.
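The factorisations mentioned above can be illustrated with two standalone PyTorch snippets (illustrative layer shapes, not the authors' code): a 5x5 convolution replaced by two stacked 3x3 convolutions, and a convolution replaced by an asymmetric 1x3 followed by a 3x1 pair. Both preserve the receptive field while using fewer parameters.

```python
import torch
import torch.nn as nn

five_by_five = nn.Conv2d(64, 64, kernel_size=5, padding=2)
two_three_by_three = nn.Sequential(                 # same 5x5 receptive field
    nn.Conv2d(64, 64, kernel_size=3, padding=1), nn.ReLU(inplace=True),
    nn.Conv2d(64, 64, kernel_size=3, padding=1),
)
asymmetric = nn.Sequential(                          # 3x3 factorised into 1x3 + 3x1
    nn.Conv2d(64, 64, kernel_size=(1, 3), padding=(0, 1)),
    nn.Conv2d(64, 64, kernel_size=(3, 1), padding=(1, 0)),
)

x = torch.randn(1, 64, 35, 35)
print(five_by_five(x).shape, two_three_by_three(x).shape, asymmetric(x).shape)

# Parameter comparison: the factorised versions are markedly cheaper.
def n_params(m): return sum(p.numel() for p in m.parameters())
print(n_params(five_by_five), n_params(two_three_by_three), n_params(asymmetric))
```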
Posted Content

MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications

TL;DR: This work introduces two simple global hyper-parameters that efficiently trade off between latency and accuracy, and demonstrates the effectiveness of MobileNets across a wide range of applications and use cases including object detection, fine-grained classification, face attributes and large-scale geo-localization.
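A hedged sketch of the MobileNets building block, a depthwise separable convolution, with the width multiplier (called alpha here) scaling channel counts; the second global hyper-parameter, the resolution multiplier, is simply the choice of input size. PyTorch and the channel counts below are illustrative assumptions; the real model also uses batch normalisation after each convolution.

```python
import torch
import torch.nn as nn

def depthwise_separable(c_in, c_out, stride=1, alpha=1.0):
    # Width multiplier alpha thins every layer uniformly.
    c_in, c_out = int(c_in * alpha), int(c_out * alpha)
    return nn.Sequential(
        nn.Conv2d(c_in, c_in, 3, stride=stride, padding=1, groups=c_in),  # depthwise
        nn.ReLU(inplace=True),
        nn.Conv2d(c_in, c_out, 1),                                         # pointwise
        nn.ReLU(inplace=True),
    )

block = depthwise_separable(32, 64, stride=2, alpha=0.5)   # alpha=0.5 halves channels
y = block(torch.randn(1, 16, 112, 112))                    # 16 = int(32 * 0.5)
print(y.shape)                                             # (1, 32, 56, 56)
```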
Journal Article

SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation

TL;DR: Quantitative assessments show that SegNet provides good performance, with competitive inference time and the most efficient inference memory usage compared to other architectures, including FCN and DeconvNet.
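SegNet's memory efficiency comes from upsampling with the max-pooling indices stored by the corresponding encoder layer rather than learning deconvolution filters. A minimal PyTorch illustration of that pooling/unpooling pairing (not the full architecture; tensor sizes are illustrative):

```python
import torch
import torch.nn.functional as F

# Encoder: max-pool and remember which locations held the maxima.
x = torch.randn(1, 64, 32, 32)
pooled, indices = F.max_pool2d(x, kernel_size=2, stride=2, return_indices=True)

# Decoder: place values back at the remembered locations (zeros elsewhere),
# instead of learning an upsampling filter.
upsampled = F.max_unpool2d(pooled, indices, kernel_size=2, stride=2)
print(pooled.shape, upsampled.shape)   # (1, 64, 16, 16) (1, 64, 32, 32)
```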
References
Proceedings Article

Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps

TL;DR: In this paper, the gradient of the class score with respect to the input image is used to compute a class saliency map, which can be employed for weakly supervised object segmentation with classification ConvNets.
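A short sketch of the saliency-map computation described above, assuming PyTorch/torchvision (0.13+ API); the random tensor stands in for a preprocessed image, the class index is arbitrary, and pretrained weight loading is left as a comment.

```python
import torch
from torchvision import models

model = models.vgg16(weights=None).eval()   # in practice, load ImageNet-trained weights
image = torch.randn(1, 3, 224, 224, requires_grad=True)   # stand-in for a real image

# Gradient of the (unnormalised) class score with respect to the input pixels.
score = model(image)[0, 281]                # 281 is an arbitrary ImageNet class index
score.backward()

# Per-pixel saliency: maximum absolute gradient over the colour channels.
saliency = image.grad.abs().max(dim=1).values   # -> (1, 224, 224)
```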
Posted Content

CNN Features off-the-shelf: an Astounding Baseline for Recognition

TL;DR: A series of experiments conducted for different recognition tasks using the publicly available code and model of the OverFeat network which was trained to perform object classification on ILSVRC13 suggest that features obtained from deep learning with convolutional nets should be the primary candidate in most visual recognition tasks.
Book Chapter

Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition

TL;DR: This work equips the networks with another pooling strategy, "spatial pyramid pooling", to eliminate the requirement of a fixed input image size, and develops a new network structure, called SPP-net, which can generate a fixed-length representation regardless of image size/scale.
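The fixed-length property can be sketched with adaptive max pooling over a small pyramid of grids; the pyramid levels and channel count below are illustrative assumptions, not the exact SPP-net configuration.

```python
import torch
import torch.nn as nn

def spatial_pyramid_pool(features, levels=(1, 2, 4)):
    # Pool the convolutional feature map onto fixed grids (1x1, 2x2, 4x4 here)
    # and concatenate, so the output length depends only on the channel count
    # and the pyramid levels, never on the input image size.
    pooled = [nn.functional.adaptive_max_pool2d(features, l).flatten(1)
              for l in levels]
    return torch.cat(pooled, dim=1)

for size in (180, 224, 300):                            # different input resolutions
    fmap = torch.randn(1, 256, size // 16, size // 16)  # stand-in conv features
    print(spatial_pyramid_pool(fmap).shape)             # always (1, 256 * (1 + 4 + 16))
```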
Posted Content

DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition

TL;DR: DeCAF, an open-source implementation of deep convolutional activation features, is released along with all associated network parameters, to enable vision researchers to conduct experiments with deep representations across a range of visual concept learning paradigms.
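Both this entry and the off-the-shelf baseline above follow the same recipe: freeze an ImageNet-trained ConvNet, read out an intermediate activation as a generic feature vector, and train only a simple classifier on top for the new task. A hedged PyTorch sketch, using torchvision's VGG-16 as a stand-in for DeCAF/OverFeat; the choice of the first fully connected activations as the feature is an assumption for illustration.

```python
import torch
from torchvision import models

backbone = models.vgg16(weights=None).eval()   # in practice, load ImageNet-trained weights
for p in backbone.parameters():
    p.requires_grad = False                    # the ConvNet is used as a fixed extractor

def extract_features(images):
    with torch.no_grad():
        x = backbone.features(images)          # convolutional activations
        x = backbone.avgpool(x).flatten(1)
        return backbone.classifier[:4](x)      # activations up to the second FC layer

feats = extract_features(torch.randn(8, 3, 224, 224))   # -> (8, 4096) feature vectors
# feats can now be fed to a linear SVM or logistic regression for the target task.
```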
Proceedings Article

Return of the Devil in the Details: Delving Deep into Convolutional Nets

TL;DR: It is shown that the data augmentation techniques commonly applied to CNN-based methods can also be applied to shallow methods and yield an analogous performance boost, and that the dimensionality of the CNN output layer can be reduced significantly without an adverse effect on performance.