Author

Shaobo Li

Bio: Shaobo Li is an academic researcher from Guizhou University. The author has contributed to research in topics: Deep learning & Convolutional neural network. The author has an h-index of 20 and has co-authored 102 publications receiving 1238 citations. Previous affiliations of Shaobo Li include Hebei University of Engineering & Chinese Academy of Sciences.


Papers
Journal ArticleDOI
TL;DR: A deep neural network based few-shot learning approach for rolling bearing fault diagnosis with limited data, built on a Siamese neural network that learns by exploiting sample pairs of the same or different categories.
Abstract: This paper focuses on bearing fault diagnosis with limited training data. A major challenge in fault diagnosis is the infeasibility of obtaining sufficient training samples for every fault type under all working conditions. Recently, deep learning based fault diagnosis methods have achieved promising results. However, most of these methods require large amounts of training data. In this study, we propose a deep neural network based few-shot learning approach for rolling bearing fault diagnosis with limited data. Our model is based on the Siamese neural network, which learns by exploiting sample pairs of the same or different categories. Experimental results on the standard Case Western Reserve University (CWRU) bearing fault diagnosis benchmark dataset showed that our few-shot learning approach is more effective for fault diagnosis with limited data availability. When tested in different noise environments with a minimal amount of training data, the performance of our few-shot learning model surpasses that of the baseline at reasonable noise levels. When evaluated on test sets with new fault types or new working conditions, few-shot models work better than the baseline trained with all fault types. All our models and datasets in this study are open sourced and can be downloaded from https://mekhub.cn/as/fault_diagnosis_with_few-shot_learning/ .

207 citations
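The few-shot approach above trains on pairs of vibration segments rather than on single labeled samples. As a rough, hedged illustration of that idea (not the authors' released code, which is linked from the abstract), the sketch below embeds two 1-D vibration segments with a shared CNN encoder and scores whether they belong to the same fault category from the distance between embeddings; the layer sizes, segment length, and pair-label loss are illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class Encoder(nn.Module):
    """Shared 1-D CNN that embeds a vibration segment (illustrative sizes)."""
    def __init__(self, emb_dim=64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv1d(1, 16, kernel_size=64, stride=8), nn.ReLU(),
            nn.Conv1d(16, 32, kernel_size=3, stride=2), nn.ReLU(),
            nn.AdaptiveAvgPool1d(1),
        )
        self.fc = nn.Linear(32, emb_dim)

    def forward(self, x):               # x: (batch, 1, signal_len)
        h = self.net(x).squeeze(-1)     # (batch, 32)
        return self.fc(h)               # (batch, emb_dim)

class SiameseNet(nn.Module):
    """Scores whether two segments belong to the same fault category."""
    def __init__(self):
        super().__init__()
        self.encoder = Encoder()
        self.head = nn.Linear(1, 1)     # maps embedding distance to a logit

    def forward(self, x1, x2):
        e1, e2 = self.encoder(x1), self.encoder(x2)
        dist = torch.norm(e1 - e2, dim=1, keepdim=True)  # L2 distance
        return self.head(dist)          # logit: same (1) vs. different (0)

# One illustrative training step on random stand-in data.
model = SiameseNet()
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
x1, x2 = torch.randn(8, 1, 2048), torch.randn(8, 1, 2048)  # fake segments
same = torch.randint(0, 2, (8, 1)).float()                  # fake pair labels
loss = F.binary_cross_entropy_with_logits(model(x1, x2), same)
opt.zero_grad()
loss.backward()
opt.step()
```

In the few-shot setting, a query segment is then classified by comparing it against one or a few labeled support segments per fault type and picking the most similar category.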

Journal ArticleDOI
28 Jul 2017-Sensors
TL;DR: This paper proposes IDSCNN, a novel bearing fault diagnosis algorithm based on ensemble deep convolutional neural networks and an improved Dempster–Shafer theory based evidence fusion, which can achieve better fault diagnosis performance than existing machine learning methods.
Abstract: Intelligent machine health monitoring and fault diagnosis are becoming increasingly important for modern manufacturing industries. Current fault diagnosis approaches mostly depend on expert-designed features for building prediction models. In this paper, we propose IDSCNN, a novel bearing fault diagnosis algorithm based on ensemble deep convolutional neural networks and an improved Dempster–Shafer theory based evidence fusion. The convolutional neural networks take the root mean square (RMS) maps from the FFT (Fast Fourier Transform) features of the vibration signals from two sensors as inputs. The improved D–S evidence theory is implemented via a distance matrix computed from the evidence and a modified Gini index. Extensive evaluations of IDSCNN on the Case Western Reserve University (CWRU) dataset showed that our IDSCNN algorithm can achieve better fault diagnosis performance than existing machine learning methods by fusing complementary or conflicting evidence from different models and sensors and by adapting to different load conditions.

155 citations
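IDSCNN fuses the class outputs of CNNs fed by different sensors with an improved Dempster–Shafer scheme. The sketch below shows only classical Dempster's rule over singleton fault hypotheses, as a hedged baseline for what "evidence fusion" means here; the paper's improved variant (distance matrix and modified Gini index) is not reproduced, and the example masses are made up.

```python
import numpy as np

def dempster_combine(m1, m2):
    """Combine two basic probability assignments over singleton hypotheses
    with classical Dempster's rule (the paper uses an improved variant)."""
    m1, m2 = np.asarray(m1, float), np.asarray(m2, float)
    joint = np.outer(m1, m2)                  # mass of every hypothesis pair
    agreement = np.diag(joint).copy()         # masses where both sources agree
    conflict = joint.sum() - agreement.sum()  # total conflicting mass K
    if np.isclose(conflict, 1.0):
        raise ValueError("total conflict: evidence cannot be combined")
    return agreement / (1.0 - conflict)       # normalize by 1 - K

# Two CNNs (e.g., fed by two sensors) disagree on a 4-class fault decision.
m_sensor1 = [0.70, 0.10, 0.10, 0.10]
m_sensor2 = [0.40, 0.40, 0.10, 0.10]
print(dempster_combine(m_sensor1, m_sensor2))  # fused belief, favors class 0
```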

Journal ArticleDOI
Jing Yang, Shaobo Li, Zheng Wang, Dong Hao, Wang Jun, Shihao Tang
TL;DR: In this paper, a survey of state-of-the-art deep learning methods for defect detection is presented, focusing on methods and experimental results and summarizing the core ideas and code of studies related to high precision, high positioning, rapid detection, small objects, complex backgrounds, occluded object detection, and object association.
Abstract: The detection of product defects is essential for quality control in manufacturing. This study surveys state-of-the-art deep-learning methods for defect detection. First, we classify the defects of products, such as electronic components, pipes, welded parts, and textile materials, into categories. Second, recent mainstream techniques and deep-learning methods for defect detection are reviewed, with their characteristics, strengths, and shortcomings described. Third, we summarize and analyze the application of ultrasonic testing, filtering, deep learning, machine vision, and other technologies used for defect detection, focusing on methods and experimental results. To further understand the difficulties in the field of defect detection, we investigate the functions and characteristics of existing equipment used for defect detection. The core ideas and code of studies related to high precision, high positioning, rapid detection, small objects, complex backgrounds, occluded object detection, and object association are summarized. Lastly, we outline the current achievements and limitations of the existing methods, along with the current research challenges, to assist the research community on defect detection in setting an agenda for future studies.

147 citations

Journal ArticleDOI
26 Jun 2020
TL;DR: A generative machine learning model (MatGAN) based on a generative adversarial network (GAN) is proposed for the efficient generation of new hypothetical inorganic materials and is expected to greatly expand the design space for inverse design and large-scale computational screening of inorganic materials.
Abstract: A major challenge in materials design is how to efficiently search the vast chemical design space to find the materials with desired properties. One effective strategy is to develop sampling algorithms that can exploit both explicit chemical knowledge and implicit composition rules embodied in the large materials database. Here, we propose a generative machine learning model (MatGAN) based on a generative adversarial network (GAN) for efficient generation of new hypothetical inorganic materials. Trained with materials from the ICSD database, our GAN model can generate hypothetical materials not existing in the training dataset, reaching a novelty of 92.53% when generating 2 million samples. The percentage of chemically valid (charge-neutral and electronegativity-balanced) samples out of all generated ones reaches 84.5% when generated by our GAN trained with such samples screened from ICSD, even though no such chemical rules are explicitly enforced in our GAN model, indicating its capability to learn implicit chemical composition rules to form compounds. Our algorithm is expected to be used to greatly expand the range of the design space for inverse design and large-scale computational screening of inorganic materials.

123 citations
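The abstract defines chemical validity as charge neutrality plus electronegativity balance. As a hedged illustration of the charge-neutrality half of that screen, the sketch below searches for any assignment of common oxidation states that sums to zero for a generated composition; the small oxidation-state table is a stand-in for illustration only, not the data used in the paper.

```python
from itertools import product

# Small stand-in table of common oxidation states (illustrative only).
OXIDATION_STATES = {
    "Li": [1], "Na": [1], "Mg": [2], "Al": [3],
    "Fe": [2, 3], "Cu": [1, 2], "O": [-2], "S": [-2], "Cl": [-1],
}

def is_charge_neutral(composition):
    """True if some combination of common oxidation states sums to zero.
    composition: dict element -> atom count, e.g. {"Fe": 2, "O": 3}."""
    elements = list(composition)
    try:
        choices = [OXIDATION_STATES[el] for el in elements]
    except KeyError:
        return False                        # element missing from the toy table
    counts = [composition[el] for el in elements]
    for states in product(*choices):        # try every oxidation-state combo
        if sum(s * n for s, n in zip(states, counts)) == 0:
            return True
    return False

print(is_charge_neutral({"Fe": 2, "O": 3}))   # True  (Fe3+ x2, O2- x3)
print(is_charge_neutral({"Na": 1, "O": 1}))   # False (no neutral assignment)
```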

Journal ArticleDOI
TL;DR: A transfer learning algorithm based on Bi-directional Long Short-Term Memory (BLSTM) recurrent neural networks for RUL estimation is proposed, in which the models can be first trained on different but related datasets and then fine-tuned on the target dataset.
Abstract: Prognostics, such as remaining useful life (RUL) prediction, is a crucial task in condition-based maintenance. A major challenge in data-driven prognostics is the difficulty of obtaining a sufficient number of samples of failure progression. However, for traditional machine learning methods and deep neural networks, enough training data is a prerequisite for training good prediction models. In this work, we propose a transfer learning algorithm based on Bi-directional Long Short-Term Memory (BLSTM) recurrent neural networks for RUL estimation, in which the models are first trained on different but related datasets and then fine-tuned on the target dataset. Extensive experimental results show that transfer learning can in general improve the prediction models on datasets with a small number of samples. One exception is that when transferring from multiple operating conditions to a single operating condition, transfer learning led to a worse result.

117 citations
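The transfer scheme above is pre-train then fine-tune. A minimal sketch of that recipe with a bi-directional LSTM RUL regressor in PyTorch is shown below; the layer sizes, input features, and the choice to freeze the recurrent layers during fine-tuning are assumptions of this illustration, not the paper's configuration.

```python
import torch
import torch.nn as nn

class BLSTMRegressor(nn.Module):
    """Bi-directional LSTM that maps a sensor sequence to a RUL estimate
    (layer sizes are illustrative, not the paper's configuration)."""
    def __init__(self, n_features=14, hidden=64):
        super().__init__()
        self.blstm = nn.LSTM(n_features, hidden, batch_first=True,
                             bidirectional=True)
        self.head = nn.Linear(2 * hidden, 1)

    def forward(self, x):                 # x: (batch, time, n_features)
        out, _ = self.blstm(x)
        return self.head(out[:, -1, :])   # RUL estimate from the last step

def fine_tune(pretrained, freeze_recurrent=True, lr=1e-4):
    """Fine-tuning step of the transfer recipe: reuse weights learned on a
    related (source) dataset and adapt them to the target dataset."""
    if freeze_recurrent:                  # optionally keep BLSTM weights fixed
        for p in pretrained.blstm.parameters():
            p.requires_grad = False
    params = [p for p in pretrained.parameters() if p.requires_grad]
    return torch.optim.Adam(params, lr=lr)

# Pretrain on the source dataset (omitted), then fine-tune on the target.
model = BLSTMRegressor()
opt = fine_tune(model)
x = torch.randn(4, 30, 14)               # fake target-domain sensor windows
y = torch.rand(4, 1)                     # fake normalized RUL labels
loss = nn.functional.mse_loss(model(x), y)
opt.zero_grad()
loss.backward()
opt.step()
```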


Cited by
01 May 1993
TL;DR: Comparing the results to the fastest reported vectorized Cray Y-MP and C90 algorithm shows that the current generation of parallel machines is competitive with conventional vector supercomputers even for small problems.
Abstract: Three parallel algorithms for classical molecular dynamics are presented. The first assigns each processor a fixed subset of atoms; the second assigns each a fixed subset of inter-atomic forces to compute; the third assigns each a fixed spatial region. The algorithms are suitable for molecular dynamics models which can be difficult to parallelize efficiently—those with short-range forces where the neighbors of each atom change rapidly. They can be implemented on any distributed-memory parallel machine which allows for message-passing of data between independently executing processors. The algorithms are tested on a standard Lennard-Jones benchmark problem for system sizes ranging from 500 to 100,000,000 atoms on several parallel supercomputers: the nCUBE 2, Intel iPSC/860 and Paragon, and Cray T3D. Comparing the results to the fastest reported vectorized Cray Y-MP and C90 algorithm shows that the current generation of parallel machines is competitive with conventional vector supercomputers even for small problems. For large problems, the spatial algorithm achieves parallel efficiencies of 90%, and an 1840-node Intel Paragon performs up to 165 times faster than a single Cray C90 processor. Trade-offs between the three algorithms and guidelines for adapting them to more complex molecular dynamics simulations are also discussed.

29,323 citations
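Of the three decompositions described above, the spatial one relies on binning atoms into cells at least as large as the force cutoff, so that each atom's neighbors can only sit in the same or adjacent cells. The serial sketch below illustrates that cell-list idea; it is a hedged toy, not the paper's message-passing implementation, and in the actual spatial decomposition each processor owns a block of cells and exchanges boundary atoms with its neighbors.

```python
import numpy as np

def build_cells(positions, box, cutoff):
    """Bin atoms into cells no smaller than the cutoff, so short-range
    neighbors can only live in the 27 surrounding cells (3-D)."""
    n_cells = max(1, int(box // cutoff))
    size = box / n_cells
    cells = {}
    for i, p in enumerate(positions):
        key = tuple((p // size).astype(int) % n_cells)
        cells.setdefault(key, []).append(i)
    return cells, n_cells

def neighbor_pairs(positions, box, cutoff):
    """Return the set of atom-index pairs within the cutoff, scanning only
    adjacent cells instead of all N^2 pairs."""
    cells, n_cells = build_cells(positions, box, cutoff)
    offsets = [(dx, dy, dz) for dx in (-1, 0, 1)
               for dy in (-1, 0, 1) for dz in (-1, 0, 1)]
    pairs = set()
    for key, atoms in cells.items():
        for off in offsets:
            nkey = tuple((k + o) % n_cells for k, o in zip(key, off))
            for i in atoms:
                for j in cells.get(nkey, []):
                    if i >= j:
                        continue
                    d = positions[i] - positions[j]
                    d -= box * np.round(d / box)      # periodic minimum image
                    if d @ d < cutoff ** 2:
                        pairs.add((i, j))
    return pairs

rng = np.random.default_rng(0)
pos = rng.uniform(0.0, 10.0, size=(200, 3))           # toy atom positions
print(len(neighbor_pairs(pos, box=10.0, cutoff=2.5))) # pairs within the cutoff
```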

Journal ArticleDOI
TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.
Abstract: Machine Learning is the study of methods for programming computers to learn. Computers are applied to a wide range of tasks, and for most of these it is relatively easy for programmers to design and implement the necessary software. However, there are many tasks for which this is difficult or impossible. These can be divided into four general categories. First, there are problems for which there exist no human experts. For example, in modern automated manufacturing facilities, there is a need to predict machine failures before they occur by analyzing sensor readings. Because the machines are new, there are no human experts who can be interviewed by a programmer to provide the knowledge necessary to build a computer system. A machine learning system can study recorded data and subsequent machine failures and learn prediction rules. Second, there are problems where human experts exist, but where they are unable to explain their expertise. This is the case in many perceptual tasks, such as speech recognition, hand-writing recognition, and natural language understanding. Virtually all humans exhibit expert-level abilities on these tasks, but none of them can describe the detailed steps that they follow as they perform them. Fortunately, humans can provide machines with examples of the inputs and correct outputs for these tasks, so machine learning algorithms can learn to map the inputs to the outputs. Third, there are problems where phenomena are changing rapidly. In finance, for example, people would like to predict the future behavior of the stock market, of consumer purchases, or of exchange rates. These behaviors change frequently, so that even if a programmer could construct a good predictive computer program, it would need to be rewritten frequently. A learning program can relieve the programmer of this burden by constantly modifying and tuning a set of learned prediction rules. Fourth, there are applications that need to be customized for each computer user separately. Consider, for example, a program to filter unwanted electronic mail messages. Different users will need different filters. It is unreasonable to expect each user to program his or her own rules, and it is infeasible to provide every user with a software engineer to keep the rules up-to-date. A machine learning system can learn which mail messages the user rejects and maintain the filtering rules automatically. Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis. Statistics focuses on understanding the phenomena that have generated the data, often with the goal of testing different hypotheses about those phenomena. Data mining seeks to find patterns in the data that are understandable by people. Psychological studies of human learning aspire to understand the mechanisms underlying the various learning behaviors exhibited by people (concept learning, skill acquisition, strategy change, etc.).

13,246 citations
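The fourth category above, per-user customization, can be made concrete with a tiny learned mail filter. The sketch below is a hedged illustration only: it assumes scikit-learn and a handful of made-up messages, and simply fits a bag-of-words Naive Bayes model to the messages one user kept or rejected.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB

# Toy training data: messages one particular user kept (0) or rejected (1).
messages = [
    "meeting moved to 3pm, agenda attached",
    "quarterly report draft for your review",
    "win a free cruise, claim your prize now",
    "limited offer: cheap loans, act today",
]
rejected = [0, 0, 1, 1]

# Learn word-count features and a Naive Bayes filter from this user's choices.
vectorizer = CountVectorizer()
X = vectorizer.fit_transform(messages)
filter_model = MultinomialNB().fit(X, rejected)

# The learned filter is then applied to new mail for the same user.
new_mail = ["free prize offer just for you", "draft agenda for tomorrow"]
print(filter_model.predict(vectorizer.transform(new_mail)))  # e.g. [1 0]
```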

Posted Content
TL;DR: The process of innovation must be viewed as a series of changes in a complete system not only of hardware, but also of market environment, production facilities and knowledge, and the social contexts of the innovation organization as discussed by the authors.
Abstract: Models that depict innovation as a smooth, well-behaved linear process badly misspecify the nature and direction of the causal factors at work. Innovation is complex, uncertain, somewhat disorderly, and subject to changes of many sorts. Innovation is also difficult to measure and demands close coordination of adequate technical knowledge and excellent market judgment in order to satisfy economic, technological, and other types of constraints—all simultaneously. The process of innovation must be viewed as a series of changes in a complete system not only of hardware, but also of market environment, production facilities and knowledge, and the social contexts of the innovation organization.

2,154 citations

Journal ArticleDOI
TL;DR: The applications of deep learning in machine health monitoring systems are reviewed mainly from the following aspects: Auto-encoder and its variants, Restricted Boltzmann Machines, Convolutional Neural Networks, and Recurrent Neural Networks.

1,569 citations

Journal ArticleDOI
TL;DR: A review and roadmap to systematically cover the development of IFD following the progress of machine learning theories and offer a future perspective is presented.

1,173 citations