ImageNet Classification with Deep Convolutional Neural Networks

Open AccessProceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

- Vol. 25, pp 1097-1105

TLDR

The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

Abstract:

We trained a large, deep convolutional neural network to classify the 1.2 million high-resolution images in the ImageNet LSVRC-2010 contest into the 1000 different classes. On the test data, we achieved top-1 and top-5 error rates of 37.5% and 17.0% which is considerably better than the previous state-of-the-art. The neural network, which has 60 million parameters and 650,000 neurons, consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax. To make training faster, we used non-saturating neurons and a very efficient GPU implementation of the convolution operation. To reduce overriding in the fully-connected layers we employed a recently-developed regularization method called "dropout" that proved to be very effective. We also entered a variant of this model in the ILSVRC-2012 competition and achieved a winning top-5 test error rate of 15.3%, compared to 26.2% achieved by the second-best entry.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.

...read moreread less

Proceedings Article

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.

...read moreread less

Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.

...read moreread less

Book ChapterDOI

U-Net: Convolutional Networks for Biomedical Image Segmentation

Olaf Ronneberger, +2 more

TL;DR: Neber et al. as discussed by the authors proposed a network and training strategy that relies on the strong use of data augmentation to use the available annotated samples more efficiently, which can be trained end-to-end from very few images and outperforms the prior best method (a sliding-window convolutional network) on the ISBI challenge for segmentation of neuronal structures in electron microscopic stacks.

...read moreread less

Posted Content

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 10 Dec 2015 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: This work presents a residual learning framework to ease the training of networks that are substantially deeper than those used previously, and provides comprehensive empirical evidence showing that these residual networks are easier to optimize, and can gain accuracy from considerably increased depth.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

A High-Throughput Screening Approach to Discovering Good Forms of Biologically Inspired Visual Representation

Nicolas Pinto, +8 more

- 26 Nov 2009 -

PLOS Computational Biology

TL;DR: This approach can yield significant, reproducible gains in performance across an array of basic object recognition tasks, consistently outperforming a variety of state-of-the-art purpose-built vision systems from the literature.

...read moreread less

Posted Content

High-Performance Neural Networks for Visual Object Classification

Dan Claudio Ciresan, +4 more

- 01 Feb 2011 -

arXiv: Artificial Intelligence

TL;DR: A fast, fully parameterizable GPU implementation of Convolutional Neural Network variants and their feature extractors are neither carefully designed nor pre-wired, but rather learned in a supervised way.

...read moreread less

Journal ArticleDOI

Taylor expansion of the accumulated rounding error

Seppo Linnainmaa

- 01 Jun 1976 -

Bit Numerical Mathematics

TL;DR: In this paper, analytic and algorithmic methods for determining the coefficients of the Taylor expansion of an accumulated rounding error with respect to the local rounding errors, and hence determining the influence of the local errors on the accumulated error second and higher order coefficients are also discussed.

...read moreread less

Une procedure d'apprentissage pour reseau a seuil asymmetrique (A learning scheme for asymmetric threshold networks)

Yann LeCun

Journal ArticleDOI

Artificial neural networks applied to cancer detection in a breast screening programme

L. ÁLvarez MenéNdez, +3 more

- 01 Oct 2010 -

Mathematical and Computer Modelling

TL;DR: A neural network based approach to breast cancer diagnosis is described; the model developed is able to determine which women are more likely to suffer from a particular kind of tumour before they undergo a mammography.

...read moreread less

Collapse

ImageNet Classification with Deep Convolutional Neural Networks

Citations

Deep Residual Learning for Image Recognition

Adam: A Method for Stochastic Optimization

Very Deep Convolutional Networks for Large-Scale Image Recognition

U-Net: Convolutional Networks for Biomedical Image Segmentation

Deep Residual Learning for Image Recognition

References

A High-Throughput Screening Approach to Discovering Good Forms of Biologically Inspired Visual Representation

High-Performance Neural Networks for Visual Object Classification

Taylor expansion of the accumulated rounding error

Une procedure d'apprentissage pour reseau a seuil asymmetrique (A learning scheme for asymmetric threshold networks)

Artificial neural networks applied to cancer detection in a breast screening programme

Related Papers (5)

Deep Residual Learning for Image Recognition

Very Deep Convolutional Networks for Large-Scale Image Recognition

Going deeper with convolutions

Gradient-based learning applied to document recognition

ImageNet: A large-scale hierarchical image database