scispace - formally typeset
Search or ask a question

Showing papers in "Pattern Recognition in 2018"


Journal ArticleDOI
TL;DR: A broad survey of the recent advances in convolutional neural networks can be found in this article, where the authors discuss the improvements of CNN on different aspects, namely, layer design, activation function, loss function, regularization, optimization and fast computation.

3,125 citations


Journal ArticleDOI
TL;DR: The background of deep visual tracking is introduced, including the fundamental concepts of visual tracking and related deep learning algorithms, and the existing deep-learning-based trackers are categorize into three classes according to network structure, network function and network training.

473 citations


Journal ArticleDOI
Yanghao Li1, Naiyan Wang, Jianping Shi, Xiaodi Hou, Jiaying Liu1 
TL;DR: This paper proposes a simple yet powerful remedy, called Adaptive Batch Normalization (AdaBN) to increase the generalization ability of a DNN, and demonstrates that the method is complementary with other existing methods and may further improve model performance.

453 citations


Journal ArticleDOI
TL;DR: The survey provides an overview on deep learning and the popular architectures used for cancer detection and diagnosis and presents four popular deep learning architectures, including convolutional neural networks, fully Convolutional networks, auto-encoders, and deep belief networks in the survey.

356 citations


Journal ArticleDOI
TL;DR: This article revisit Multiple Instance Neural Networks (MINNs) that the neural networks aim at solving the MIL problems and proposes a new type of MINN to learn bag representations, which is different from the existing MINNs that focus on estimating instance label.

343 citations


Journal ArticleDOI
TL;DR: This task challenges state-of-the-art methods from a variety of research fields to applications including fraud detection, intrusion detection, medical diagnoses and data cleaning.

341 citations


Journal ArticleDOI
TL;DR: This work presents a novel background subtraction from video sequences algorithm that uses a deep Convolutional Neural Network (CNN) to perform the segmentation, and it outperforms the existing algorithms with respect to the average ranking over different evaluation metrics announced in CDnet 2014.

331 citations


Journal ArticleDOI
TL;DR: An approach to multi-view subspace clustering that learns a joint subspace representation by constructing affinity matrix shared among all views is presented, relying on the importance of both low-rank and sparsity constraints in the construction of the affinity matrix.

297 citations


Journal ArticleDOI
TL;DR: A deep learning-based approach for temporal 3D pose recognition problems based on a combination of a Convolutional Neural Network (CNN) and a Long Short-Term Memory (LSTM) recurrent network and a data augmentation method that has also been validated experimentally is proposed.

294 citations


Journal ArticleDOI
TL;DR: It is proved that selecting useful deep descriptors contributes well to fine-grained image recognition, and a novel Mask-CNN model without the fully connected layers is proposed, which has a small feature dimensionality and efficient inference speed by comparing with other fine- grained approaches.

261 citations


Journal ArticleDOI
TL;DR: This paper first introduces fully convolutional auto-encoders for image feature learning and then proposes a unified clustering framework to learn image representations and cluster centers jointly based on a fully Convolutional Auto-encoder and soft k-means scores.

Journal ArticleDOI
TL;DR: IDNet is the first system that exploits a deep learning approach as universal feature extractors for gait recognition, and that combines classification results from subsequent walking cycles into a multi-stage decision making framework.

Journal ArticleDOI
TL;DR: A survey about recent methods that localize a visual acquisition system according to a known environment by categorizing VBL methods into two distinct families: indirect and direct localization systems.

Journal ArticleDOI
TL;DR: An overview of current LDW system is provided, describing in particular pre-processing, lane models, lane de Ntection techniques and departure warning system.

Journal ArticleDOI
TL;DR: A novel objective function is proposed to jointly optimize similarity metric learning, local positive mining and robust deep feature embedding for person re-id by proposing a novel sampling to mine suitable positives within a local range to improve the deep embedding in the context of large intra-class variations.

Journal ArticleDOI
TL;DR: Commentary on the proposed revision of the IASP definition of pain of 1979 summarizes, why this proposal is useful for guiding assessment of pain, but not its definition.

Journal ArticleDOI
TL;DR: A human-related multi-stream CNN (HR-MSCNN) architecture that encodes appearance, motion, and the captured tubes of the human- related regions is introduced that achieves state-of-the-art results on these four datasets.

Journal ArticleDOI
TL;DR: Quantization-based Hashing (QBH) is a generic framework which incorporates the advantages of quantization error reduction methods into conventional property preserving hashing methods and can be applied to both unsupervised and supervised hashing methods.

Journal ArticleDOI
TL;DR: This paper develops a novel medical image fusion, denoising, and enhancement method based on low-rank sparse component decomposition and dictionary learning that consistently outperforms existing state-of-the-art methods in terms of both visual and quantitative evaluations.

Journal ArticleDOI
TL;DR: A novel distance metric optimization driven learning approach that integrates these traditional steps via a deep convolutional neural network, which learns feature representations and the decision function in an end-to-end way, and can be optimized simultaneously by backward propagation.

Journal ArticleDOI
Zhicheng Jiao1, Xinbo Gao1, Ying Wang1, Jie Li1, Haojun Xu1 
TL;DR: Inspired by the recent breakthrough via deep recurrent convolutional neural networks (CNNs) on classifying mental load, improved CNNs methods for this task are proposed, which contain less parameters than state-of-the-art ones, making it be more competitive in further practical application.

Journal ArticleDOI
TL;DR: The basic ideas, theories, pros and cons of the approaches, group them into categories, and extensively review each category in depth by discussing the principles, application issues, and advantages/disadvantages are studied.

Journal ArticleDOI
TL;DR: A novel deep multiplicative integration gating function is proposed, which answers the question of what-and-where to match for effective person re-id and is designed to be end-to-end trainable to characterize local pairwise feature interactions in a spatially aligned manner.

Journal ArticleDOI
TL;DR: A fingerprint and finger-vein based cancelable multi-biometric system, which provides template protection and revocability and security is strengthened, thanks to the enhanced partial discrete Fourier transform based non-invertible transformation.

Journal ArticleDOI
TL;DR: This article provides a bird's eye view of data irregularities, beginning with a taxonomy and characterization of various distribution-based and feature-based irregularities, and discusses the notable and recent approaches that have been taken to make the existing stand-alone as well as ensemble classifiers robust against such irregularities.

Journal ArticleDOI
TL;DR: This article introduces a regularized ensemble framework of deep learning to address the imbalanced, multi-class learning problems in medical diagnosis and demonstrates the superior performance of the method compared to several state-of-the-art algorithms.

Journal ArticleDOI
TL;DR: This work treats the small-dim targets as a special sparse noise component of the complex background noise and adopt Mixture of Gaussians (MoG) with Markov random field (MRF) with MRF to model the small target detection problem.

Journal ArticleDOI
TL;DR: An integrated framework consisting of bottom-up and top-down attention mechanisms that enable attention to be computed at the level of salient objects and/or regions is proposed.

Journal ArticleDOI
TL;DR: A novel regression-based Convolutional Neural Network (CNN) pipeline is presented for polyp detection during Colonoscopy and has great potential to be used to assist endoscopists in tracking polyps during colonoscopy.

Journal ArticleDOI
TL;DR: The International Association for the Study of Pain (IASP) definition of pain has been widely accepted as a pragmatic characterisation of human experience as mentioned in this paper, but it fails to sufficiently integrate phenomenological aspects of pain.