Journal ArticleDOI

Hierarchical adaptive deep convolution neural network and its application to bearing fault diagnosis

01 Nov 2016 - Measurement (Elsevier) - Vol. 93, pp. 490-502
TL;DR: A novel hierarchical learning-rate-adaptive deep convolution neural network based on an improved algorithm is proposed; it is well suited to the fault-diagnosis task and superior to other existing methods.
About: This article was published in Measurement on 2016-11-01 and has received 592 citations to date. It focuses on the topics: Convolutional neural network & Support vector machine.
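
The title refers to layer-wise ("hierarchical") adaptation of the learning rate during training. The exact adaptation rule is not reproduced on this page; purely as an illustration of the general idea, the sketch below assigns a separate learning rate to each layer of a small 1-D CNN in PyTorch via optimizer parameter groups. The architecture and rate values are assumptions for illustration, not the authors' settings.

```python
# Illustrative only: per-layer learning rates for a small 1-D CNN.
# The architecture and the rate values are assumptions, not the
# settings used in the Measurement (2016) paper.
import torch
import torch.nn as nn

class SmallCNN(nn.Module):
    def __init__(self, n_classes=10):
        super().__init__()
        self.conv1 = nn.Conv1d(1, 16, kernel_size=9, padding=4)
        self.conv2 = nn.Conv1d(16, 32, kernel_size=5, padding=2)
        self.pool = nn.MaxPool1d(2)
        self.fc = nn.Linear(32 * 256, n_classes)  # assumes 1024-sample input segments

    def forward(self, x):
        x = self.pool(torch.relu(self.conv1(x)))  # 1024 -> 512
        x = self.pool(torch.relu(self.conv2(x)))  # 512 -> 256
        return self.fc(x.flatten(1))

model = SmallCNN()

# One parameter group per layer, each with its own learning rate.
# A learning-rate-adaptive scheme would update these values during
# training; here they are simply fixed to different initial values.
optimizer = torch.optim.SGD(
    [
        {"params": model.conv1.parameters(), "lr": 1e-2},
        {"params": model.conv2.parameters(), "lr": 5e-3},
        {"params": model.fc.parameters(), "lr": 1e-3},
    ],
    lr=1e-3,
    momentum=0.9,
)
```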
Citations
Journal ArticleDOI
TL;DR: The applications of deep learning in machine health monitoring systems are reviewed mainly from the following aspects: Auto-encoder and its variants, Restricted Boltzmann Machines, Convolutional Neural Networks, and Recurrent Neural Networks.

1,569 citations

Journal ArticleDOI
TL;DR: A new CNN based on LeNet-5 is proposed for fault diagnosis; it extracts features from converted 2-D images, eliminates the dependence on handcrafted features, and achieves significant improvements.
Abstract: Fault diagnosis is vital in manufacturing systems, since early detection of emerging problems can save invaluable time and cost. With the development of smart manufacturing, data-driven fault diagnosis has become a hot topic. However, traditional data-driven fault diagnosis methods rely on features extracted by experts; this feature extraction is exhausting work and greatly impacts the final result. Deep learning (DL) provides an effective way to extract features from raw data automatically, and the convolutional neural network (CNN) is an effective DL method. In this study, a new CNN based on LeNet-5 is proposed for fault diagnosis. Through a conversion method that turns signals into two-dimensional (2-D) images, the proposed method can extract the features of the converted 2-D images and eliminate the effect of handcrafted features. The proposed method, tested on three famous datasets (motor bearing, self-priming centrifugal pump, and axial piston hydraulic pump), achieved prediction accuracies of 99.79%, 99.481%, and 100%, respectively. The results have been compared with other DL and traditional methods, including the adaptive deep CNN, sparse filter, deep belief network, and support vector machine. The comparisons show that the proposed CNN-based data-driven fault diagnosis method achieves significant improvements.
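
The conversion step described in the abstract (turning a 1-D signal segment into a 2-D grayscale image) is a simple reshaping scheme. Below is a minimal sketch of one common variant, assuming row-wise filling of an M x M image and min-max scaling to [0, 255]; the exact normalization and image size used in the cited paper may differ.

```python
import numpy as np

def signal_to_image(signal, m=64):
    """Convert M*M consecutive 1-D samples into an M x M grayscale image.

    A sketch of the commonly used row-wise filling scheme: values are
    min-max normalized to [0, 255]. The exact normalization in the cited
    paper may differ.
    """
    segment = np.asarray(signal[: m * m], dtype=np.float64)
    lo, hi = segment.min(), segment.max()
    scaled = (segment - lo) / (hi - lo + 1e-12) * 255.0
    return scaled.reshape(m, m).astype(np.uint8)

# Example with a synthetic vibration-like signal.
t = np.linspace(0.0, 1.0, 64 * 64)
x = np.sin(2 * np.pi * 150 * t) + 0.3 * np.random.randn(t.size)
img = signal_to_image(x, m=64)
print(img.shape, img.dtype)  # (64, 64) uint8
```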

1,240 citations


Cites background or methods from "Hierarchical adaptive deep convolut..."

  • ...Guo et al. [30] investigated the LeNet-5-based adaptive deep CNN for bearing fault diagnosis....

  • ...[30] investigated a hierarchical adaptive deep CNN for bearing fault diagnosis....

  • ...Guo et al. [30] investigated a hierarchical adaptive deep CNN for bearing fault diagnosis....

  • ...They are sparse filter [16], DBN [12], DBN-based HDN [27], SVM [43], ANN [12], and adaptive deep convolution neural network (ADCNN) [30]....

  • ...D features of the converted images automatically; these results outperform other DL and traditional methods, such as ADCNN, sparse filter, DBN, SVM, ANN, and SURF-based PNN, which shows the potential of the proposed method....

Journal ArticleDOI
TL;DR: A comprehensive survey of commonly used deep learning algorithms is presented, discussing their applications toward making manufacturing "smart", including computational methods based on deep learning that aim to improve system performance in manufacturing.

1,025 citations

Journal ArticleDOI
TL;DR: A new data-driven approach for prognostics using deep convolutional neural networks (DCNN) is proposed; a time window approach is employed for sample preparation to enable better feature extraction by the DCNN.
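
The "time window approach" mentioned above amounts to cutting multivariate sensor sequences into fixed-length, possibly overlapping windows that the network consumes as samples. A minimal sketch under assumed window length and stride is shown below; the actual values used in the cited work are not given here.

```python
import numpy as np

def sliding_windows(series, window=30, stride=1):
    """Cut a (time, channels) sensor series into overlapping windows.

    Returns an array of shape (num_windows, window, channels).
    Window length and stride are illustrative assumptions.
    """
    series = np.asarray(series)
    n = series.shape[0]
    starts = range(0, n - window + 1, stride)
    return np.stack([series[s : s + window] for s in starts])

# Example: 200 time steps of 14 sensor channels.
data = np.random.randn(200, 14)
samples = sliding_windows(data, window=30, stride=1)
print(samples.shape)  # (171, 30, 14)
```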

948 citations


Cites background from "Hierarchical adaptive deep convolut..."

  • ...It should be noted that the selection of batch size affects the network training performance [45]....

Journal ArticleDOI
22 Feb 2017-Sensors
TL;DR: A novel method named Deep Convolutional Neural Networks with Wide First-layer Kernels (WDCNN) is proposed, which not only achieves 100% classification accuracy on normal signals, but also outperforms the state-of-the-art DNN model based on frequency features under different working loads and noisy environment conditions.
Abstract: Intelligent fault diagnosis techniques have replaced time-consuming and unreliable human analysis, increasing the efficiency of fault diagnosis. Deep learning models can improve the accuracy of intelligent fault diagnosis with the help of their multilayer nonlinear mapping ability. This paper proposes a novel method named Deep Convolutional Neural Networks with Wide First-layer Kernels (WDCNN). The proposed method uses raw vibration signals as input (data augmentation is used to generate more inputs), and uses the wide kernels in the first convolutional layer for extracting features and suppressing high frequency noise. Small convolutional kernels in the preceding layers are used for multilayer nonlinear mapping. AdaBN is implemented to improve the domain adaptation ability of the model. The proposed model addresses the problem that currently, the accuracy of CNN applied to fault diagnosis is not very high. WDCNN can not only achieve 100% classification accuracy on normal signals, but also outperform the state-of-the-art DNN model which is based on frequency features under different working load and noisy environment conditions.
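
The architectural idea described in the abstract (a wide kernel in the first convolutional layer acting directly on raw vibration signals, followed by small kernels for the nonlinear mapping) can be sketched as follows in PyTorch. The layer widths, kernel sizes, and input length below are plausible choices rather than the published WDCNN configuration, and the AdaBN domain-adaptation step is not implemented.

```python
import torch
import torch.nn as nn

class WideFirstKernelCNN(nn.Module):
    """Sketch of a 1-D CNN with a wide first-layer kernel.

    Raw vibration segments go in as (batch, 1, length); the wide first
    kernel acts as a learned band-limiting filter, and the later small
    kernels perform the multilayer nonlinear mapping. Hyperparameters
    are illustrative, not the published WDCNN settings.
    """
    def __init__(self, n_classes=10):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv1d(1, 16, kernel_size=64, stride=16, padding=24),  # wide first kernel
            nn.BatchNorm1d(16), nn.ReLU(), nn.MaxPool1d(2),
            nn.Conv1d(16, 32, kernel_size=3, padding=1),              # small kernels
            nn.BatchNorm1d(32), nn.ReLU(), nn.MaxPool1d(2),
            nn.Conv1d(32, 64, kernel_size=3, padding=1),
            nn.BatchNorm1d(64), nn.ReLU(), nn.AdaptiveMaxPool1d(4),
        )
        self.classifier = nn.Linear(64 * 4, n_classes)

    def forward(self, x):
        return self.classifier(self.features(x).flatten(1))

model = WideFirstKernelCNN()
logits = model(torch.randn(8, 1, 2048))  # batch of 8 raw signal segments
print(logits.shape)                      # torch.Size([8, 10])
```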

876 citations


Cites background from "Hierarchical adaptive deep convolut..."

  • ...Many papers overlook the effects of this simple operation, and most of these works use hundreds of training samples without any overlap [23,24]....

  • ...In [23], a hierarchical adaptive deep convolutional neural network....

References
Proceedings Article
03 Dec 2012
TL;DR: State-of-the-art ImageNet classification performance was achieved with a deep convolutional neural network consisting of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.
Abstract: We trained a large, deep convolutional neural network to classify the 1.2 million high-resolution images in the ImageNet LSVRC-2010 contest into the 1000 different classes. On the test data, we achieved top-1 and top-5 error rates of 37.5% and 17.0% which is considerably better than the previous state-of-the-art. The neural network, which has 60 million parameters and 650,000 neurons, consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax. To make training faster, we used non-saturating neurons and a very efficient GPU implementation of the convolution operation. To reduce overfitting in the fully-connected layers we employed a recently-developed regularization method called "dropout" that proved to be very effective. We also entered a variant of this model in the ILSVRC-2012 competition and achieved a winning top-5 test error rate of 15.3%, compared to 26.2% achieved by the second-best entry.
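
For reference, the five-convolutional-layer plus three-fully-connected-layer architecture with dropout described above is available off the shelf; the usage sketch below relies on torchvision's AlexNet, which differs from the 2012 network in minor details (for example, it omits local response normalization and the two-GPU split).

```python
import torch
from torchvision.models import alexnet

# torchvision's AlexNet mirrors the 5-conv + 3-FC layout with dropout in
# the fully connected layers and a final 1000-way output.
model = alexnet()  # randomly initialized, 1000 output classes by default

x = torch.randn(1, 3, 224, 224)  # one RGB image at the expected resolution
logits = model(x)
print(logits.shape)              # torch.Size([1, 1000])
```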

73,978 citations

Journal ArticleDOI
01 Jan 1998
TL;DR: Convolutional neural networks trained with gradient-based learning are shown to synthesize complex decision surfaces that classify high-dimensional patterns such as handwritten characters, and a new learning paradigm, graph transformer networks (GTN), is proposed for globally training multi-module recognition systems.
Abstract: Multilayer neural networks trained with the back-propagation algorithm constitute the best example of a successful gradient-based learning technique. Given an appropriate network architecture, gradient-based learning algorithms can be used to synthesize a complex decision surface that can classify high-dimensional patterns, such as handwritten characters, with minimal preprocessing. This paper reviews various methods applied to handwritten character recognition and compares them on a standard handwritten digit recognition task. Convolutional neural networks, which are specifically designed to deal with the variability of 2D shapes, are shown to outperform all other techniques. Real-life document recognition systems are composed of multiple modules including field extraction, segmentation, recognition, and language modeling. A new learning paradigm, called graph transformer networks (GTN), allows such multimodule systems to be trained globally using gradient-based methods so as to minimize an overall performance measure. Two systems for online handwriting recognition are described. Experiments demonstrate the advantage of global training, and the flexibility of graph transformer networks. A graph transformer network for reading a bank cheque is also described. It uses convolutional neural network character recognizers combined with global training techniques to provide record accuracy on business and personal cheques. It is deployed commercially and reads several million cheques per day.
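
As a concrete reference for the convolutional architecture discussed above (and for the LeNet-5 variants adapted in several of the citing fault-diagnosis papers), a minimal LeNet-5-style network for 32 x 32 single-channel inputs can be sketched as follows. The ReLU activations and max pooling are modern simplifications, not the original units.

```python
import torch
import torch.nn as nn

class LeNet5Like(nn.Module):
    """LeNet-5-style network for 32 x 32 single-channel images.

    A simplified sketch: ReLU and max pooling stand in for the original
    sigmoid/tanh units and average-pooling subsampling layers.
    """
    def __init__(self, n_classes=10):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 6, kernel_size=5), nn.ReLU(), nn.MaxPool2d(2),   # 32 -> 28 -> 14
            nn.Conv2d(6, 16, kernel_size=5), nn.ReLU(), nn.MaxPool2d(2),  # 14 -> 10 -> 5
        )
        self.classifier = nn.Sequential(
            nn.Linear(16 * 5 * 5, 120), nn.ReLU(),
            nn.Linear(120, 84), nn.ReLU(),
            nn.Linear(84, n_classes),
        )

    def forward(self, x):
        return self.classifier(self.features(x).flatten(1))

model = LeNet5Like()
print(model(torch.randn(4, 1, 32, 32)).shape)  # torch.Size([4, 10])
```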

42,067 citations

Book
01 Jan 2009
TL;DR: The motivations and principles of learning algorithms for deep architectures are discussed, in particular those exploiting, as building blocks, unsupervised learning of single-layer models such as Restricted Boltzmann Machines, which are used to construct deeper models such as Deep Belief Networks.
Abstract: Can machine learning deliver AI? Theoretical results, inspiration from the brain and cognition, as well as machine learning experiments suggest that in order to learn the kind of complicated functions that can represent high-level abstractions (e.g. in vision, language, and other AI-level tasks), one would need deep architectures. Deep architectures are composed of multiple levels of non-linear operations, such as in neural nets with many hidden layers, graphical models with many levels of latent variables, or in complicated propositional formulae re-using many sub-formulae. Each level of the architecture represents features at a different level of abstraction, defined as a composition of lower-level features. Searching the parameter space of deep architectures is a difficult task, but new algorithms have been discovered and a new sub-area has emerged in the machine learning community since 2006, following these discoveries. Learning algorithms such as those for Deep Belief Networks and other related unsupervised learning algorithms have recently been proposed to train deep architectures, yielding exciting results and beating the state-of-the-art in certain areas. Learning Deep Architectures for AI discusses the motivations for and principles of learning algorithms for deep architectures. By analyzing and comparing recent results with different learning algorithms for deep architectures, explanations for their success are proposed and discussed, highlighting challenges and suggesting avenues for future explorations in this area.
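
The building block highlighted above, the Restricted Boltzmann Machine trained layer by layer, can be illustrated with a single-step contrastive divergence (CD-1) update in NumPy. This is a bare-bones sketch with binary units and no momentum or weight decay, not a faithful reproduction of any particular published training recipe.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def cd1_update(v0, W, b_vis, b_hid, lr=0.05):
    """One CD-1 step for a binary RBM on a batch of visible vectors v0.

    Shapes: v0 (batch, n_vis), W (n_vis, n_hid), b_vis (n_vis,), b_hid (n_hid,).
    Returns updated parameters (a minimal sketch, no momentum or decay).
    """
    # Positive phase: sample hidden units given the data.
    p_h0 = sigmoid(v0 @ W + b_hid)
    h0 = (rng.random(p_h0.shape) < p_h0).astype(float)
    # Negative phase: one Gibbs step back to the visible layer, then hidden probabilities.
    p_v1 = sigmoid(h0 @ W.T + b_vis)
    p_h1 = sigmoid(p_v1 @ W + b_hid)
    # Gradient approximation: data correlations minus model correlations.
    batch = v0.shape[0]
    W += lr * (v0.T @ p_h0 - p_v1.T @ p_h1) / batch
    b_vis += lr * (v0 - p_v1).mean(axis=0)
    b_hid += lr * (p_h0 - p_h1).mean(axis=0)
    return W, b_vis, b_hid

# Toy usage: 6 visible units, 3 hidden units, a batch of 8 binary vectors.
W = 0.01 * rng.standard_normal((6, 3))
b_vis, b_hid = np.zeros(6), np.zeros(3)
batch = (rng.random((8, 6)) < 0.5).astype(float)
W, b_vis, b_hid = cd1_update(batch, W, b_vis, b_hid)
```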

7,767 citations

Journal ArticleDOI
Feng Jia, Yaguo Lei, Jing Lin, Xin Zhou, Na Lu
TL;DR: The diagnosis results show that the proposed method is able to not only adaptively mine available fault characteristics from the measured signals, but also obtain superior diagnosis accuracy compared with the existing methods.

1,289 citations

Proceedings Article
07 Dec 2009
TL;DR: In this paper, the authors apply convolutional deep belief networks to audio data and empirically evaluate them on various audio classification tasks and show that the learned features correspond to phones/phonemes.
Abstract: In recent years, deep learning approaches have gained significant interest as a way of building hierarchical representations from unlabeled data. However, to our knowledge, these deep learning approaches have not been extensively studied for auditory data. In this paper, we apply convolutional deep belief networks to audio data and empirically evaluate them on various audio classification tasks. In the case of speech data, we show that the learned features correspond to phones/phonemes. In addition, our feature representations learned from unlabeled audio data show very good performance for multiple audio classification tasks. We hope that this paper will inspire more research on deep learning approaches applied to a wide range of audio recognition tasks.
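
The audio experiments above operate on time-frequency representations rather than raw waveforms. As a small sketch of the kind of 2-D input such convolutional models consume, the code below computes a log-magnitude spectrogram with SciPy and applies one fixed 2-D filter convolutionally; the unsupervised, layer-wise CDBN training itself is not reproduced here.

```python
import numpy as np
from scipy.signal import spectrogram, correlate2d

# Synthetic 1-second "audio" signal at 16 kHz: two tones plus noise.
fs = 16000
t = np.arange(fs) / fs
x = np.sin(2 * np.pi * 440 * t) + 0.5 * np.sin(2 * np.pi * 880 * t)
x += 0.05 * np.random.randn(x.size)

# Log-magnitude spectrogram: the 2-D input a convolutional model would see.
f, times, Sxx = spectrogram(x, fs=fs, nperseg=256, noverlap=128)
log_spec = np.log(Sxx + 1e-10)

# One hand-fixed 2-D filter applied convolutionally over the spectrogram,
# standing in for a learned first-layer feature detector.
kernel = np.ones((5, 5)) / 25.0
feature_map = correlate2d(log_spec, kernel, mode="valid")
print(log_spec.shape, feature_map.shape)
```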

1,094 citations