Recent advances in convolutional neural networks
Jiuxiang Gu,Zhenhua Wang,Jason Kuen,Lianyang Ma,Amir Shahroudy,Bing Shuai,Ting Liu,Xingxing Wang,Gang Wang,Jianfei Cai,Tsuhan Chen +10 more
TLDR
A broad survey of the recent advances in convolutional neural networks can be found in this article, where the authors discuss the improvements of CNN on different aspects, namely, layer design, activation function, loss function, regularization, optimization and fast computation.About:
This article is published in Pattern Recognition.The article was published on 2018-05-01 and is currently open access. It has received 3125 citations till now. The article focuses on the topics: Deep learning & Convolutional neural network.read more
Citations
More filters
Journal ArticleDOI
Deep Learning for Generic Object Detection: A Survey
Li Liu,Li Liu,Wanli Ouyang,Xiaogang Wang,Paul Fieguth,Jie Chen,Xinwang Liu,Matti Pietikäinen +7 more
TL;DR: A comprehensive survey of the recent achievements in this field brought about by deep learning techniques, covering many aspects of generic object detection: detection frameworks, object feature representation, object proposal generation, context modeling, training strategies, and evaluation metrics.
Journal ArticleDOI
A survey of the recent architectures of deep convolutional neural networks
TL;DR: Deep Convolutional Neural Networks (CNNs) as mentioned in this paper are a special type of Neural Networks, which has shown exemplary performance on several competitions related to Computer Vision and Image Processing.
Journal ArticleDOI
Applications of machine learning to machine fault diagnosis: A review and roadmap
TL;DR: A review and roadmap to systematically cover the development of IFD following the progress of machine learning theories and offer a future perspective is presented.
Journal ArticleDOI
Review of deep learning: concepts, CNN architectures, challenges, applications, future directions
Laith Alzubaidi,Jinglan Zhang,Amjad J. Humaidi,Ayad Q. Al-Dujaili,Ye Duan,Omran Al-Shamma,José Santamaría,Mohammed A. Fadhel,Muthana Al-Amidie,Laith Farhan +9 more
TL;DR: In this paper, a comprehensive survey of the most important aspects of DL and including those enhancements recently added to the field is provided, and the challenges and suggested solutions to help researchers understand the existing research gaps.
Journal ArticleDOI
Albumentations: fast and flexible image augmentations
Alexander Buslaev,Vladimir Iglovikov,Eugene Khvedchenya,Alex Parinov,Mikhail Druzhinin,Alexandr A. Kalinin +5 more
TL;DR: Albumentations as mentioned in this paper is a fast and flexible open source library for image augmentation with many various image transform operations available that is also an easy-to-use wrapper around other augmentation libraries.
References
More filters
Proceedings ArticleDOI
Speech acoustic modeling from raw multichannel waveforms
TL;DR: A convolutional neural network - deep neural network (CNN-DNN) acoustic model which takes raw multichannel waveforms as input, and learns a similar feature representation through supervised training and outperforms a DNN that uses log-mel filterbank magnitude features under noisy and reverberant conditions.
Journal ArticleDOI
Category-Independent Object Proposals with Diverse Ranking
Ian Endres,Derek Hoiem +1 more
TL;DR: A category-independent method to produce a bag of regions and rank them, such that top-ranked regions are likely to be good segmentations of different objects, demonstrating the ability to find most objects within a small bag of proposed regions.
Journal ArticleDOI
A survey on still image based human action recognition
Guodong Guo,Alice Lai +1 more
TL;DR: A detailed overview of the state-of-the-art methods for still image-based action recognition is presented, and various high-level cues and low-level features for action analysis in still images are described.
Proceedings ArticleDOI
Adaptation of context-dependent deep neural networks for automatic speech recognition
TL;DR: On a large vocabulary speech recognition task, a stochastic gradient ascent implementation of the fDLR and the top hidden layer adaptation is shown to reduce word error rates (WERs) by 17% and 14%, respectively, compared to the baseline DNN performances.
Journal ArticleDOI
RGB-D-based action recognition datasets
TL;DR: In this article, a comprehensive review of the most commonly used action recognition related RGB-D video datasets, including 27 single-view, 10 multi-view and 7 multi-person datasets, is presented.