Very Deep Convolutional Networks for Large-Scale Image Recognition

Open AccessProceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Chats0

TLDR

In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.

Abstract:

In this work we investigate the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting. Our main contribution is a thorough evaluation of networks of increasing depth using an architecture with very small (3x3) convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers. These findings were the basis of our ImageNet Challenge 2014 submission, where our team secured the first and the second places in the localisation and classification tracks respectively. We also show that our representations generalise well to other datasets, where they achieve state-of-the-art results. We have made our two best-performing ConvNet models publicly available to facilitate further research on the use of deep visual representations in computer vision.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Temporal Segment Networks for Action Recognition in Videos

Limin Wang, +6 more

- 01 Nov 2019 -

IEEE Transactions on Pattern Analysis an...

TL;DR: Temporal Segment Networks (TSN) as discussed by the authors is proposed to model long-range temporal structure with a new segment-based sampling and aggregation scheme, which enables the TSN framework to efficiently learn action models by using the whole video.

...read moreread less

Journal ArticleDOI

NetVLAD: CNN Architecture for Weakly Supervised Place Recognition

Relja Arandjelovic, +4 more

- 01 Jun 2018 -

IEEE Transactions on Pattern Analysis an...

TL;DR: A convolutional neural network architecture that is trainable in an end-to-end manner directly for the place recognition task, and significantly outperforms non-learnt image representations and off-the-shelf CNN descriptors on two challenging place recognition benchmarks.

...read moreread less

Journal ArticleDOI

Survey of the state of the art in natural language generation: core tasks, applications and evaluation

Albert Gatt, +1 more

- 01 Jan 2018 -

Journal of Artificial Intelligence Resea...

TL;DR: A survey of the state of the art in natural language generation can be found in this article, with an up-to-date synthesis of research on the core tasks in NLG and the architectures adopted in which such tasks are organized.

...read moreread less

Proceedings ArticleDOI

Men Also Like Shopping: Reducing Gender Bias Amplification using Corpus-level Constraints

Jieyu Zhao, +4 more

TL;DR: The authors proposed to inject corpus-level constraints for calibrating existing structured prediction models and design an algorithm based on Lagrangian relaxation for collective inference, which results in almost no performance loss for the underlying recognition task but decreases the magnitude of bias amplification.

...read moreread less

Journal ArticleDOI

Deep Learning for Anomaly Detection: A Review

Guansong Pang, +3 more

- 05 Mar 2021 -

ACM Computing Surveys

TL;DR: A comprehensive survey of deep anomaly detection with a comprehensive taxonomy is presented in this paper, covering advancements in 3 high-level categories and 11 fine-grained categories of the methods.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Book ChapterDOI

I and J

William Marsden

Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

...read moreread less

Proceedings ArticleDOI

ImageNet: A large-scale hierarchical image database

Jia Deng, +5 more

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.

...read moreread less

Journal ArticleDOI

A and V.

Robert W. Stephenson

- 01 Nov 1962 -

British Journal of Ophthalmology

Proceedings ArticleDOI

Going deeper with convolutions

Christian Szegedy, +8 more

TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).

...read moreread less

Collapse

Very Deep Convolutional Networks for Large-Scale Image Recognition

Citations

Temporal Segment Networks for Action Recognition in Videos

NetVLAD: CNN Architecture for Weakly Supervised Place Recognition

Survey of the state of the art in natural language generation: core tasks, applications and evaluation

Men Also Like Shopping: Reducing Gender Bias Amplification using Corpus-level Constraints

Deep Learning for Anomaly Detection: A Review

References

I and J

ImageNet Classification with Deep Convolutional Neural Networks

ImageNet: A large-scale hierarchical image database

A and V.

Going deeper with convolutions

Related Papers (5)

Deep Residual Learning for Image Recognition

ImageNet Classification with Deep Convolutional Neural Networks

Going deeper with convolutions

ImageNet: A large-scale hierarchical image database

Adam: A Method for Stochastic Optimization