Very Deep Convolutional Networks for Large-Scale Image Recognition

Open Access Proceedings Article

TLDR

In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.

Abstract:

In this work we investigate the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting. Our main contribution is a thorough evaluation of networks of increasing depth using an architecture with very small (3x3) convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers. These findings were the basis of our ImageNet Challenge 2014 submission, where our team secured the first and the second places in the localisation and classification tracks respectively. We also show that our representations generalise well to other datasets, where they achieve state-of-the-art results. We have made our two best-performing ConvNet models publicly available to facilitate further research on the use of deep visual representations in computer vision.
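As a rough illustration of why an architecture built from very small filters can be pushed to 16-19 weight layers, the sketch below compares a stack of three 3x3 convolutions with a single 7x7 convolution: both cover the same receptive field, but the stack uses fewer weights. It also counts the weight layers in two layer lists. The names `CONFIG_16` and `CONFIG_19`, the channel width of 64, and all function names are assumptions introduced here for illustration; the layer lists follow the commonly cited 16- and 19-layer layouts and are not given on this page.

```python
# Illustrative arithmetic only. The layer lists and helper names below are
# assumptions for this sketch, not content taken from this page.

def stacked_receptive_field(num_layers: int, kernel: int = 3) -> int:
    """Receptive field of `num_layers` stacked stride-1 convolutions."""
    return 1 + num_layers * (kernel - 1)


def conv_weights(kernel: int, channels: int, num_layers: int = 1) -> int:
    """Weight count (biases ignored) for conv layers with `channels` in and out."""
    return num_layers * kernel * kernel * channels * channels


# Integers are conv output channels; "M" marks max-pooling (no weights).
CONFIG_16 = [64, 64, "M", 128, 128, "M", 256, 256, 256, "M",
             512, 512, 512, "M", 512, 512, 512, "M"]
CONFIG_19 = [64, 64, "M", 128, 128, "M", 256, 256, 256, 256, "M",
             512, 512, 512, 512, "M", 512, 512, 512, 512, "M"]
NUM_FC_LAYERS = 3  # assumed: three fully-connected layers after the conv stack


def count_weight_layers(config) -> int:
    """Count layers that carry weights: conv layers plus fully-connected layers."""
    convs = sum(1 for item in config if item != "M")
    return convs + NUM_FC_LAYERS


if __name__ == "__main__":
    channels = 64  # assumed width, chosen only to make the comparison concrete
    print("receptive field of three 3x3 convs:", stacked_receptive_field(3))
    print("weights, three 3x3 convs:", conv_weights(3, channels, num_layers=3))
    print("weights, one 7x7 conv:   ", conv_weights(7, channels))
    print("weight layers:", count_weight_layers(CONFIG_16),
          count_weight_layers(CONFIG_19))
```

With equal input and output channels C, the stack costs 27C² weights against 49C² for the single 7x7 layer, and it interleaves more nonlinearities over the same receptive field.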

Citations

Journal Article

Image De-Raining Using a Conditional Generative Adversarial Network

He Zhang, +2 more

- 01 Nov 2020 -

IEEE Transactions on Circuits and System...

TL;DR: This work attempts to leverage powerful generative modeling capabilities of the recently introduced conditional generative adversarial networks (CGAN) by enforcing an additional constraint that the de-rained image must be indistinguishable from its corresponding ground truth clean image.

Proceedings Article

Reliable Crowdsourcing and Deep Locality-Preserving Learning for Expression Recognition in the Wild

Shan Li, +2 more

TL;DR: A new DLP-CNN (Deep Locality-Preserving CNN) method, which aims to enhance the discriminative power of deep features by preserving the locality closeness while maximizing the inter-class scatters, is proposed.

Proceedings Article

Switching Convolutional Neural Network for Crowd Counting

Deepak Babu Sam, +2 more

TL;DR: In this paper, the authors propose a switching convolutional neural network that leverages variation of crowd density within an image to improve the accuracy and localization of the predicted crowd count, and provide interpretable representations of the multichotomy of space of crowd scene patches inferred from the switch.

Posted Content

Bridging Nonlinearities and Stochastic Regularizers with Gaussian Error Linear Units

Dan Hendrycks, +1 more

- 27 Jun 2016 -

arXiv: Learning

TL;DR: An empirical evaluation of the GELU nonlinearity against the ReLU and ELU activations finds performance improvements across all tasks, suggesting a new probabilistic understanding of nonlinearities.

Book Chapter

The Visual Object Tracking VOT2016 Challenge Results

Matej Kristan, +140 more

TL;DR: The Visual Object Tracking challenge VOT2016 goes beyond its predecessors by introducing a new semi-automatic ground truth bounding box annotation methodology and extending the evaluation system with the no-reset experiment.

References

Book Chapter

I and J

William Marsden

Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

TL;DR: A deep convolutional neural network consisting of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax achieved state-of-the-art performance on ImageNet classification.

Proceedings Article

ImageNet: A large-scale hierarchical image database

Jia Deng, +5 more

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.

Journal Article

A and V.

Robert W. Stephenson

- 01 Nov 1962 -

British Journal of Ophthalmology

Proceedings Article

Going deeper with convolutions

Christian Szegedy, +8 more

TL;DR: Inception is a deep convolutional neural network architecture that achieved a new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).

Related Papers (5)

Deep Residual Learning for Image Recognition

ImageNet Classification with Deep Convolutional Neural Networks

Going deeper with convolutions

ImageNet: A large-scale hierarchical image database

Adam: A Method for Stochastic Optimization