Journal ArticleDOI

Artificial intelligence for fault diagnosis of rotating machinery: A review

01 Aug 2018-Mechanical Systems and Signal Processing (Academic Press)-Vol. 108, pp 33-47
TL;DR: This paper presents a comprehensive review of AI algorithms in rotating machinery fault diagnosis, from the perspectives of both theoretical background and industrial applications.
About: This article was published in Mechanical Systems and Signal Processing on 2018-08-01 and has received 1,287 citations to date. The article focuses on the topic: literature survey.
Citations
Journal ArticleDOI
TL;DR: A review and roadmap is presented that systematically covers the development of intelligent fault diagnosis (IFD) following the progress of machine learning theories and offers a future perspective.

1,173 citations

Journal ArticleDOI
TL;DR: New deep learning methods, namely deep residual shrinkage networks, are developed to improve the ability to learn features from highly noisy vibration signals and to achieve high fault-diagnosis accuracy.
Abstract: This article develops new deep learning methods, namely deep residual shrinkage networks, to improve the ability to learn features from highly noisy vibration signals and to achieve high fault-diagnosis accuracy. Soft thresholding is inserted as a nonlinear transformation layer into the deep architectures to eliminate unimportant features. Moreover, because it is generally challenging to set proper threshold values, the developed deep residual shrinkage networks integrate a few specialized neural networks as trainable modules that determine the thresholds automatically, so that professional expertise in signal processing is not required. The efficacy of the developed methods is validated through experiments with various types of noise.

520 citations
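The soft-thresholding operation described in the abstract above can be sketched in a few lines. This is a minimal numpy illustration, not the authors' implementation: `soft_threshold` is the standard shrinkage function, and `learned_threshold` is an assumed stand-in for the paper's trainable threshold modules, here modeled as a sigmoid-scaled fraction of the mean absolute feature value.

```python
import numpy as np

def soft_threshold(x, tau):
    """Shrink each element toward zero; magnitudes below tau become zero."""
    return np.sign(x) * np.maximum(np.abs(x) - tau, 0.0)

def learned_threshold(features, alpha):
    """Illustrative DRSN-style rule (assumption, not the paper's exact module):
    the threshold is sigmoid(alpha) times the mean absolute feature value,
    so it adapts to the overall noise level; alpha would be learned."""
    return (1.0 / (1.0 + np.exp(-alpha))) * np.mean(np.abs(features))
```

Features whose magnitude falls below the threshold (presumed noise) are zeroed, while larger features are shrunk but preserved, which is the mechanism the abstract credits for robustness to noisy vibration signals.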

Journal ArticleDOI
TL;DR: This paper aims to fill the gap by presenting the highlights of traditional methods and providing a comprehensive review of the most recent applications of ML and DL algorithms for vibration-based structural damage detection in civil structures.

440 citations


Cites background from "Artificial intelligence for fault d..."

  • ...The high efficiency of vibration-based SDD systems via SHM of rotating machines [35] is based on the fact that the vibration response of such machines is minimally affected by the operational and environmental conditions [36–38]....


Journal ArticleDOI
TL;DR: Wang et al. propose a new intelligent fault diagnosis framework, the deep transfer network (DTN), which generalizes deep learning models to the domain adaptation scenario by extending marginal distribution adaptation (MDA) to joint distribution adaptation (JDA).
Abstract: In recent years, deep learning models have become increasingly popular for intelligent condition monitoring, diagnosis, and prognostics of mechanical systems and structures. In previous studies, however, a major assumption accepted by default is that the training and testing data are drawn from the same feature distribution. Unfortunately, this assumption is mostly invalid in real applications, limiting the applicability of traditional diagnosis approaches. Inspired by transfer learning, which leverages knowledge learned from rich labeled data in a source domain to facilitate a new but similar target task, this paper proposes a new intelligent fault diagnosis framework, the deep transfer network (DTN), which generalizes deep learning models to the domain adaptation scenario. By extending marginal distribution adaptation (MDA) to joint distribution adaptation (JDA), the proposed framework exploits the discriminative structure of the labeled source-domain data to adapt the conditional distribution of the unlabeled target data, guaranteeing more accurate distribution matching. Extensive empirical evaluations on three fault datasets validate the applicability and practicability of DTN, which achieves state-of-the-art transfer results across diverse operating conditions, fault severities, and fault types.

321 citations
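The MDA-to-JDA extension described above can be sketched with a simple linear-kernel MMD: the marginal term matches overall source and target feature means, and JDA adds per-class terms using pseudo-labels on the unlabeled target data. This is a minimal illustrative sketch under those assumptions, not the DTN authors' implementation; the function names are hypothetical.

```python
import numpy as np

def mmd_linear(Xs, Xt):
    """Linear-kernel MMD estimate: squared distance between feature means."""
    return float(np.sum((Xs.mean(axis=0) - Xt.mean(axis=0)) ** 2))

def jda_loss(Xs, ys, Xt, yt_pseudo, n_classes):
    """Joint distribution adaptation loss (sketch): marginal MMD (the MDA
    part) plus class-conditional MMD terms computed with pseudo-labels
    for the unlabeled target data (the JDA extension)."""
    loss = mmd_linear(Xs, Xt)                       # marginal term (MDA)
    for c in range(n_classes):                      # conditional terms (JDA)
        s, t = Xs[ys == c], Xt[yt_pseudo == c]
        if len(s) > 0 and len(t) > 0:
            loss += mmd_linear(s, t)
    return loss
```

In a full DTN-style pipeline this loss would be added to the classification loss on deep features, so that minimizing it pulls the source and target distributions together both marginally and per class.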

Journal ArticleDOI
TL;DR: A new TCNN(ResNet-50) with a depth of 51 convolutional layers is proposed for fault diagnosis; it achieves state-of-the-art results, demonstrating that TCNN(ResNet-50) outperforms other DL models and traditional methods.
Abstract: With the rapid development of smart manufacturing, data-driven fault diagnosis has attracted increasing attention. As one of the most popular methods applied in fault diagnosis, deep learning (DL) has achieved remarkable results. However, because the volume of labeled samples in fault diagnosis is small, the DL models used for fault diagnosis are shallow compared with convolutional neural networks in other areas (including ImageNet), which limits their final prediction accuracy. In this research, a new TCNN(ResNet-50) with a depth of 51 convolutional layers is proposed for fault diagnosis. Combined with transfer learning, TCNN(ResNet-50) applies a ResNet-50 trained on ImageNet as a feature extractor for fault diagnosis. First, a signal-to-image method is developed to convert time-domain fault signals to RGB image format as the input datatype of ResNet-50. Then, a new structure of TCNN(ResNet-50) is proposed. Finally, the proposed TCNN(ResNet-50) is tested on three datasets: the bearing damage dataset provided by the KAT data center, the motor bearing dataset provided by Case Western Reserve University (CWRU), and a self-priming centrifugal pump dataset. It achieves state-of-the-art results, with prediction accuracies of 98.95% ± 0.0074, 99.99% ± 0, and 99.20% ± 0, demonstrating that TCNN(ResNet-50) outperforms other DL models and traditional methods.

319 citations


Cites background or methods from "Artificial intelligence for fault d..."

  • ...In generally, these machine learning methods are combined with the feature extraction processes [11]....


  • ...With the rapid development of smart manufacturing, data-driven fault diagnosis has become a hot research topic [11, 18]....

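The signal-to-image step in the TCNN(ResNet-50) abstract above can be sketched as folding a window of the 1-D vibration signal into a square grid and replicating it across three channels so an ImageNet-pretrained network accepts it. This is a minimal sketch of one plausible mapping, not the paper's exact method; the function name and default size are illustrative assumptions.

```python
import numpy as np

def signal_to_image(sig, size=64):
    """Fold the first size*size samples of a 1-D signal into a size x size
    grid, min-max normalize to [0, 1], and replicate to three channels so
    an RGB-input CNN such as ResNet-50 can consume it."""
    seg = np.asarray(sig, dtype=float)[:size * size].reshape(size, size)
    seg = (seg - seg.min()) / (seg.max() - seg.min() + 1e-12)
    return (np.stack([seg] * 3, axis=-1) * 255).astype(np.uint8)
```

In practice the image would also be resized to the pretrained network's expected input resolution and normalized with its training statistics before being passed through the frozen feature extractor.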

References
Journal ArticleDOI
28 May 2015-Nature
TL;DR: Deep learning is making major advances in solving problems that have resisted the best attempts of the artificial intelligence community for many years, and will have many more successes in the near future because it requires very little engineering by hand and can easily take advantage of increases in the amount of available computation and data.
Abstract: Deep learning allows computational models that are composed of multiple processing layers to learn representations of data with multiple levels of abstraction. These methods have dramatically improved the state-of-the-art in speech recognition, visual object recognition, object detection and many other domains such as drug discovery and genomics. Deep learning discovers intricate structure in large data sets by using the backpropagation algorithm to indicate how a machine should change its internal parameters that are used to compute the representation in each layer from the representation in the previous layer. Deep convolutional nets have brought about breakthroughs in processing images, video, speech and audio, whereas recurrent nets have shone light on sequential data such as text and speech.

46,982 citations

Journal ArticleDOI
01 Jan 1998
TL;DR: In this article, convolutional neural networks are shown to synthesize complex decision surfaces that classify high-dimensional patterns such as handwritten characters, and graph transformer networks (GTNs) are proposed to train multi-module recognition systems globally with gradient-based methods.
Abstract: Multilayer neural networks trained with the back-propagation algorithm constitute the best example of a successful gradient based learning technique. Given an appropriate network architecture, gradient-based learning algorithms can be used to synthesize a complex decision surface that can classify high-dimensional patterns, such as handwritten characters, with minimal preprocessing. This paper reviews various methods applied to handwritten character recognition and compares them on a standard handwritten digit recognition task. Convolutional neural networks, which are specifically designed to deal with the variability of 2D shapes, are shown to outperform all other techniques. Real-life document recognition systems are composed of multiple modules including field extraction, segmentation recognition, and language modeling. A new learning paradigm, called graph transformer networks (GTN), allows such multimodule systems to be trained globally using gradient-based methods so as to minimize an overall performance measure. Two systems for online handwriting recognition are described. Experiments demonstrate the advantage of global training, and the flexibility of graph transformer networks. A graph transformer network for reading a bank cheque is also described. It uses convolutional neural network character recognizers combined with global training techniques to provide record accuracy on business and personal cheques. It is deployed commercially and reads several million cheques per day.

42,067 citations

Book
Vladimir Vapnik1
01 Jan 1995
TL;DR: Topics covered include: the setting of the learning problem; consistency of learning processes; bounds on the rate of convergence of learning processes; controlling the generalization ability of learning processes; constructing learning algorithms; and what is important in learning theory.
Abstract: The setting of the learning problem; consistency of learning processes; bounds on the rate of convergence of learning processes; controlling the generalization ability of learning processes; constructing learning algorithms; what is important in learning theory?

40,147 citations

Book
18 Nov 2016
TL;DR: Deep learning is a form of machine learning that enables computers to learn from experience and understand the world in terms of a hierarchy of concepts; it is used in many applications such as natural language processing, speech recognition, computer vision, online recommendation systems, bioinformatics, and videogames.
Abstract: Deep learning is a form of machine learning that enables computers to learn from experience and understand the world in terms of a hierarchy of concepts. Because the computer gathers knowledge from experience, there is no need for a human computer operator to formally specify all the knowledge that the computer needs. The hierarchy of concepts allows the computer to learn complicated concepts by building them out of simpler ones; a graph of these hierarchies would be many layers deep. This book introduces a broad range of topics in deep learning. The text offers mathematical and conceptual background, covering relevant concepts in linear algebra, probability theory and information theory, numerical computation, and machine learning. It describes deep learning techniques used by practitioners in industry, including deep feedforward networks, regularization, optimization algorithms, convolutional networks, sequence modeling, and practical methodology; and it surveys such applications as natural language processing, speech recognition, computer vision, online recommendation systems, bioinformatics, and videogames. Finally, the book offers research perspectives, covering such theoretical topics as linear factor models, autoencoders, representation learning, structured probabilistic models, Monte Carlo methods, the partition function, approximate inference, and deep generative models. Deep Learning can be used by undergraduate or graduate students planning careers in either industry or research, and by software engineers who want to begin using deep learning in their products or platforms. A website offers supplementary material for both readers and instructors.

38,208 citations

Journal ArticleDOI
28 Jul 2006-Science
TL;DR: This article describes an effective way of initializing the weights that allows deep autoencoder networks to learn low-dimensional codes that work much better than principal components analysis as a tool for reducing the dimensionality of data.
Abstract: High-dimensional data can be converted to low-dimensional codes by training a multilayer neural network with a small central layer to reconstruct high-dimensional input vectors. Gradient descent can be used for fine-tuning the weights in such "autoencoder" networks, but this works well only if the initial weights are close to a good solution. We describe an effective way of initializing the weights that allows deep autoencoder networks to learn low-dimensional codes that work much better than principal components analysis as a tool to reduce the dimensionality of data.

16,717 citations
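The autoencoder idea in the reference above — a network with a small central layer trained by gradient descent to reconstruct its input — can be sketched with a toy linear autoencoder in numpy. This is an illustrative sketch only: it omits the nonlinearities and the layer-wise pretraining that the paper shows are needed for deep networks, and all names and sizes are arbitrary.

```python
import numpy as np

rng = np.random.default_rng(42)
X = rng.normal(size=(200, 8))           # toy high-dimensional data
W1 = 0.1 * rng.normal(size=(8, 2))      # encoder: 8 -> 2 (small central layer)
W2 = 0.1 * rng.normal(size=(2, 8))      # decoder: 2 -> 8
lr, losses = 0.01, []
for _ in range(300):
    H = X @ W1                  # low-dimensional codes
    E = H @ W2 - X              # reconstruction error
    losses.append(float(np.mean(E ** 2)))
    dW2 = 2 * H.T @ E / len(X)  # gradient of mean squared error w.r.t. W2
    dW1 = 2 * X.T @ (E @ W2.T) / len(X)
    W1 -= lr * dW1              # gradient-descent fine-tuning of the weights
    W2 -= lr * dW2
```

The paper's point is that for deep, nonlinear versions of this network plain gradient descent from random weights works poorly, which is why it introduces a pretraining scheme to place the initial weights near a good solution.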