Deep learning

doi:10.1038/NATURE14539

Journal ArticleDOI

Deep learning

Yann LeCun, +4 more

- 28 May 2015 -

Nature

- Vol. 521, Iss: 7553, pp 436-444

TLDR

Deep learning is making major advances in solving problems that have resisted the best attempts of the artificial intelligence community for many years, and will have many more successes in the near future because it requires very little engineering by hand and can easily take advantage of increases in the amount of available computation and data.

Abstract:

Deep learning allows computational models that are composed of multiple processing layers to learn representations of data with multiple levels of abstraction. These methods have dramatically improved the state-of-the-art in speech recognition, visual object recognition, object detection and many other domains such as drug discovery and genomics. Deep learning discovers intricate structure in large data sets by using the backpropagation algorithm to indicate how a machine should change its internal parameters that are used to compute the representation in each layer from the representation in the previous layer. Deep convolutional nets have brought about breakthroughs in processing images, video, speech and audio, whereas recurrent nets have shone light on sequential data such as text and speech.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

VoxResNet: Deep voxelwise residual networks for brain segmentation from 3D MR images

Hao Chen, +4 more

- 23 Apr 2017 -

NeuroImage

TL;DR: An auto‐context version of the VoxResNet is proposed by combining the low‐level image appearance features, implicit shape information, and high‐level context together for further improving the segmentation performance, and achieved the best performance in the 2013 MICCAI MRBrainS challenge.

...read moreread less

Journal ArticleDOI

A deep learning framework for neuroscience

Blake A. Richards, +38 more

- 28 Oct 2019 -

Nature Neuroscience

TL;DR: It is argued that a deep network is best understood in terms of components used to design it—objective functions, architecture and learning rules—rather than unit-by-unit computation.

...read moreread less

Proceedings ArticleDOI

STAMP: Short-Term Attention/Memory Priority Model for Session-based Recommendation

Qiao Liu, +3 more

TL;DR: It is argued that a long-term memory model may be insufficient for modeling long sessions that usually contain user interests drift caused by unintended clicks, and a novel short-term attention/memory priority model is proposed as a remedy, which is capable of capturing users' general interests from the long- Term memory of a session context, whilst taking into account users' current interest from the short- term memory of the last-clicks.

...read moreread less

Journal ArticleDOI

Deep learning in environmental remote sensing: Achievements and challenges

Qiangqiang Yuan, +11 more

- 01 May 2020 -

Remote Sensing of Environment

TL;DR: The potential of DL in environmental remote sensing, including land cover mapping, environmental parameter retrieval, data fusion and downscaling, and information reconstruction and prediction, will be analyzed and a typical network structure will be introduced.

...read moreread less

Proceedings ArticleDOI

Deep & Cross Network for Ad Click Predictions

Ruoxi Wang, +3 more

TL;DR: This paper proposes the Deep & Cross Network (DCN), which keeps the benefits of a DNN model, and beyond that, it introduces a novel cross network that is more efficient in learning certain bounded-degree feature interactions.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Long short-term memory

Sepp Hochreiter, +1 more

- 01 Nov 1997 -

Neural Computation

TL;DR: A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.

...read moreread less

Journal ArticleDOI

Gradient-based learning applied to document recognition

Yann LeCun, +6 more

TL;DR: In this article, a graph transformer network (GTN) is proposed for handwritten character recognition, which can be used to synthesize a complex decision surface that can classify high-dimensional patterns, such as handwritten characters.

...read moreread less

Journal ArticleDOI

Learning representations by back-propagating errors

David E. Rumelhart, +2 more

- 01 Jan 1988 -

Nature

TL;DR: Back-propagation repeatedly adjusts the weights of the connections in the network so as to minimize a measure of the difference between the actual output vector of the net and the desired output vector, which helps to represent important features of the task domain.

...read moreread less

Journal ArticleDOI

Reducing the Dimensionality of Data with Neural Networks

Geoffrey E. Hinton, +1 more

- 28 Jul 2006 -

Science

TL;DR: In this article, an effective way of initializing the weights that allows deep autoencoder networks to learn low-dimensional codes that work much better than principal components analysis as a tool to reduce the dimensionality of data is described.

...read moreread less

Collapse

Neural Computation

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

Deep learning

Citations

VoxResNet: Deep voxelwise residual networks for brain segmentation from 3D MR images

A deep learning framework for neuroscience

STAMP: Short-Term Attention/Memory Priority Model for Session-based Recommendation

Deep learning in environmental remote sensing: Achievements and challenges

Deep & Cross Network for Ad Click Predictions

References

Long short-term memory

Gradient-based learning applied to document recognition

Learning representations by back-propagating errors

Human-level control through deep reinforcement learning

Reducing the Dimensionality of Data with Neural Networks

Related Papers (5)

ImageNet Classification with Deep Convolutional Neural Networks

Deep Residual Learning for Image Recognition

Gradient-based learning applied to document recognition

Long short-term memory

Very Deep Convolutional Networks for Large-Scale Image Recognition