Very Deep Convolutional Networks for Large-Scale Image Recognition

Open Access, Proceedings Article


TLDR

This paper investigates the effect of convolutional network depth on accuracy in the large-scale image recognition setting and shows that pushing the depth to 16-19 weight layers yields a significant improvement over prior-art configurations.

Abstract:

In this work we investigate the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting. Our main contribution is a thorough evaluation of networks of increasing depth using an architecture with very small (3x3) convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers. These findings were the basis of our ImageNet Challenge 2014 submission, where our team secured the first and the second places in the localisation and classification tracks respectively. We also show that our representations generalise well to other datasets, where they achieve state-of-the-art results. We have made our two best-performing ConvNet models publicly available to facilitate further research on the use of deep visual representations in computer vision.
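The abstract does not spell out why very small filters make greater depth practical. One commonly cited argument is that a stack of stride-1 3x3 convolutions covers the same receptive field as a single larger filter while using fewer weights. The following is a minimal sketch of that arithmetic, assuming stride 1, equal input and output channel counts, and no biases. The function names and the channel count of 64 are illustrative and are not taken from the abstract.

```python
def stacked_receptive_field(kernel_size: int, num_layers: int) -> int:
    """Effective receptive field (in pixels per side) of `num_layers`
    stacked stride-1 convolutions that all use `kernel_size` filters."""
    # Each extra stride-1 layer widens the field by (kernel_size - 1).
    return 1 + num_layers * (kernel_size - 1)


def conv_weights(kernel_size: int, channels: int, num_layers: int = 1) -> int:
    """Weight count (biases ignored) for `num_layers` convolutions that
    each map `channels` input channels to `channels` output channels."""
    return num_layers * kernel_size * kernel_size * channels * channels


if __name__ == "__main__":
    channels = 64  # illustrative assumption, not from the abstract

    # Three stacked 3x3 layers see the same 7x7 window as one 7x7 layer.
    field = stacked_receptive_field(kernel_size=3, num_layers=3)
    stacked = conv_weights(kernel_size=3, channels=channels, num_layers=3)
    single = conv_weights(kernel_size=7, channels=channels)

    print(f"receptive field of three 3x3 layers: {field}x{field}")
    print(f"weights, three 3x3 layers: {stacked}")
    print(f"weights, one 7x7 layer:    {single}")
```

Under these assumptions the stacked version needs 27C² weights against 49C² for the single 7x7 layer (C being the channel count), and it interleaves additional non-linearities between the layers.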

Citations


Journal Article

Recent Advances in Deep Learning for Object Detection

Xiongwei Wu, +3 more

- 05 Jul 2020 -

Neurocomputing

TL;DR: This article surveys recent advances in visual object detection with deep learning, systematically analyzing existing detection frameworks and organizing the survey into three major parts: detection components, learning strategies, and applications and benchmarks.


Posted Content

Contrastive Learning for Unpaired Image-to-Image Translation

Taesung Park, +3 more

- 30 Jul 2020 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: The framework enables one-sided translation in the unpaired image-to-image translation setting, while improving quality and reducing training time, and can be extended to the training setting where each "domain" is only a single image.


Proceedings Article

Rethinking ImageNet Pre-Training

Kaiming He, +2 more

TL;DR: In this paper, the authors report competitive results on object detection and instance segmentation on the COCO dataset using standard models trained from random initialization, with the sole exception of increasing the number of training iterations so the randomly initialized models may converge.


Proceedings Article

Learning Deep Object Detectors from 3D Models

Xingchao Peng, +3 more

TL;DR: This work shows that augmenting the training data of contemporary Deep Convolutional Neural Net (DCNN) models with synthetic data generated from 3D models can be effective, especially when real training data is limited or not well matched to the target domain.


Proceedings Article

Shallow and Deep Convolutional Networks for Saliency Prediction

Junting Pan, +4 more

TL;DR: In this paper, the authors propose a completely data-driven approach to saliency prediction: they train a convolutional neural network (convnet) by minimizing a loss function that measures the Euclidean distance between the predicted saliency map and the provided ground truth.



References


Book Chapter

I and J

William Marsden

Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

TL;DR: This paper presents a deep convolutional neural network for ImageNet classification that achieves state-of-the-art performance. The network consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.


Proceedings Article

ImageNet: A large-scale hierarchical image database

Jia Deng, +5 more

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.


Journal Article

A and V.

Robert W. Stephenson

- 01 Nov 1962 -

British Journal of Ophthalmology

Proceedings Article

Going deeper with convolutions

Christian Szegedy, +8 more

TL;DR: Inception is a deep convolutional neural network architecture that set a new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).



Related Papers (5)

Deep Residual Learning for Image Recognition

ImageNet Classification with Deep Convolutional Neural Networks

Going deeper with convolutions

ImageNet: A large-scale hierarchical image database

Adam: A Method for Stochastic Optimization