Recent advances in convolutional neural networks
Jiuxiang Gu, Zhenhua Wang, Jason Kuen, Lianyang Ma, Amir Shahroudy, Bing Shuai, Ting Liu, Xingxing Wang, Gang Wang, Jianfei Cai, Tsuhan Chen +10 more
TLDR
A broad survey of the recent advances in convolutional neural networks can be found in this article, where the authors discuss the improvements of CNNs on different aspects, namely layer design, activation function, loss function, regularization, optimization and fast computation.
About:
This article is published in Pattern Recognition. The article was published on 2018-05-01 and is currently open access. It has received 3125 citations till now. The article focuses on the topics: Deep learning & Convolutional neural network.
Citations
Journal ArticleDOI
Deep-Learning-Based Precipitation Observation Quality Control
TL;DR: This automated QC approach uses Convolutional Neural Networks to classify bad observation values, incorporating a multi-classifier ensemble to achieve better QC performance; it offers an option for eliminating bad observations in various applications, including the pre-processing of training datasets for machine learning.
Proceedings ArticleDOI
Optimizing Relevance Maps of Vision Transformers Improves Robustness
Hila Chefer, I. Schwartz, L. Wolf +2 more
TL;DR: It is proposed to monitor the model’s relevancy signal and manipulate it so that the model focuses on the foreground object, resulting in a marked improvement in robustness to domain shifts.
Proceedings ArticleDOI
Performance Analysis of Machine Learning and Deep Learning Models for Text Classification
C. M. Suneera, Jay Prakash +1 more
TL;DR: In this paper, the performance of different machine learning and deep learning algorithms for text classification was evaluated on the 20 Newsgroups dataset; the results indicate that logistic regression outperforms the other ML algorithms, and a bi-channel convolutional neural network model achieves promising results compared to other deep learning models.
Journal ArticleDOI
IDMIL: an alignment-free Interpretable Deep Multiple Instance Learning (MIL) for predicting disease from whole-metagenomic data
TL;DR: The proposed alignment-free approach achieves higher prediction accuracy by harnessing the capability of a deep convolutional neural network (CNN) within a MIL framework, and provides interpretability via a neural attention mechanism that identifies groups of sequences likely to be correlated with diseases.
Journal ArticleDOI
Reconfigurable nonlinear photonic activation function for photonic neural network based on non-volatile opto-resistive RAM switch
Zefeng Xu, Baoshan Tang, XiangYu Zhang, Jin Feng Leong, Jieming Pan, Sonu Hooda, E. G. Zamburg, A. Thean +7 more
TL;DR: In this article, an efficient in-situ nonlinear accelerator is proposed, comprising a unique solution-processed two-dimensional (2D) MoS2 Opto-Resistive RAM Switch (ORS) that exhibits tunable nonlinear resistance switching, introducing nonlinearity to the photonic neuron and overcoming the linear voltage-power relationship of typical photonic components.
References
Proceedings ArticleDOI
Deep Residual Learning for Image Recognition
TL;DR: In this article, the authors proposed a residual learning framework that eases the training of networks substantially deeper than those used previously; the resulting networks won 1st place on the ILSVRC 2015 classification task.
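The core idea summarized above — learning a residual F(x) and adding the input back through an identity shortcut — can be sketched in a few lines. This is a minimal illustration, not the paper's actual architecture; the `transform` argument stands in for the block's stacked convolutional layers:

```python
def residual_block(x, transform):
    """Identity-shortcut residual block: y = F(x) + x.

    Because the input is added back unchanged, the block only needs to
    learn the residual F(x); if the optimal mapping is the identity,
    F can simply learn to output zero.
    """
    return transform(x) + x

# With a zero residual the block reduces to the identity mapping.
identity_out = residual_block(2.0, lambda v: 0.0)  # → 2.0
```

The additive shortcut is also what makes very deep stacks trainable: the gradient of the output with respect to the input contains a direct identity term, so it cannot vanish through the shortcut path.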
Proceedings Article
Adam: A Method for Stochastic Optimization
Diederik P. Kingma, Jimmy Ba +1 more
TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
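The update rule summarized above — adaptive estimates of the first and second moments of the gradient, with bias correction — can be sketched for a single scalar parameter. The hyperparameter defaults below follow the commonly cited values from the paper; the function name is illustrative:

```python
import math

def adam_step(theta, grad, m, v, t, lr=0.001, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update for a scalar parameter.

    m and v are exponential moving averages of the gradient and the
    squared gradient; t is the 1-based step count used for bias correction.
    """
    m = beta1 * m + (1 - beta1) * grad            # first-moment estimate
    v = beta2 * v + (1 - beta2) * grad ** 2       # second-moment estimate
    m_hat = m / (1 - beta1 ** t)                  # bias-corrected first moment
    v_hat = v / (1 - beta2 ** t)                  # bias-corrected second moment
    theta -= lr * m_hat / (math.sqrt(v_hat) + eps)
    return theta, m, v

# Toy usage: minimizing f(theta) = theta^2 (gradient 2*theta) from theta = 1.0.
theta, m, v = 1.0, 0.0, 0.0
for t in range(1, 201):
    theta, m, v = adam_step(theta, 2 * theta, m, v, t, lr=0.1)
```

Dividing by the square root of the second-moment estimate gives each parameter its own effective step size, which is what makes the method robust across sparse and noisy gradients.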
Journal ArticleDOI
Long short-term memory
TL;DR: A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.
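The "constant error carousel" mentioned above is the additive update of the cell state: the cell is modified by gated addition rather than repeated multiplication, so error can flow across long time lags without vanishing. Below is a minimal scalar sketch; note it includes a forget gate, which was added to LSTM after the original paper, and all weight names are illustrative:

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def lstm_step(x, h, c, w):
    """One scalar LSTM step; w maps illustrative weight names to floats."""
    i = sigmoid(w["wi"] * x + w["ui"] * h)     # input gate
    f = sigmoid(w["wf"] * x + w["uf"] * h)     # forget gate
    o = sigmoid(w["wo"] * x + w["uo"] * h)     # output gate
    g = math.tanh(w["wg"] * x + w["ug"] * h)   # candidate cell value
    c = f * c + i * g    # additive cell update: the constant error carousel
    h = o * math.tanh(c)
    return h, c

# A single step from a zero state with all weights set to 1.0 (toy values):
w = {k: 1.0 for k in ("wi", "ui", "wf", "uf", "wo", "uo", "wg", "ug")}
h, c = lstm_step(1.0, 0.0, 0.0, w)
```

Because the cell update is `c = f * c + i * g`, the gradient through `c` is scaled by the forget gate rather than squashed through a nonlinearity at every step, which is what lets the network bridge time lags of over 1000 steps.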
Proceedings Article
Very Deep Convolutional Networks for Large-Scale Image Recognition
Karen Simonyan, Andrew Zisserman +1 more
TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.
Journal ArticleDOI
Gradient-based learning applied to document recognition
Yann LeCun, Léon Bottou, Yoshua Bengio, Patrick Haffner +6 more
TL;DR: In this article, a graph transformer network (GTN) is proposed for handwritten character recognition; it can synthesize a complex decision surface capable of classifying high-dimensional patterns such as handwritten characters.