Home
/
Authors
/
Dan Li

Author

Dan Li

Other affiliations: University of California, Berkeley, National University of Singapore

Bio: Dan Li is an academic researcher from Nanyang Technological University. The author has contributed to research in topics: Fault detection and isolation & Anomaly detection. The author has an hindex of 11, co-authored 19 publications receiving 726 citations. Previous affiliations of Dan Li include University of California, Berkeley & National University of Singapore.

Papers

PDF

Open Access

More filters

Posted Content•

MAD-GAN: Multivariate Anomaly Detection for Time Series Data with Generative Adversarial Networks

[...]

Dan Li, Dacheng Chen, Lei Shi, Baihong Jin, Jonathan Goh, See-Kiong Ng - Show less +2 more

15 Jan 2019-arXiv: Learning

TL;DR: The proposed MAD-GAN framework considers the entire variable set concurrently to capture the latent interactions amongst the variables and is effective in reporting anomalies caused by various cyber-intrusions compared in these complex real-world systems.

...read moreread less

Abstract: The prevalence of networked sensors and actuators in many real-world systems such as smart buildings, factories, power plants, and data centers generate substantial amounts of multivariate time series data for these systems. The rich sensor data can be continuously monitored for intrusion events through anomaly detection. However, conventional threshold-based anomaly detection methods are inadequate due to the dynamic complexities of these systems, while supervised machine learning methods are unable to exploit the large amounts of data due to the lack of labeled data. On the other hand, current unsupervised machine learning approaches have not fully exploited the spatial-temporal correlation and other dependencies amongst the multiple variables (sensors/actuators) in the system for detecting anomalies. In this work, we propose an unsupervised multivariate anomaly detection method based on Generative Adversarial Networks (GANs). Instead of treating each data stream independently, our proposed MAD-GAN framework considers the entire variable set concurrently to capture the latent interactions amongst the variables. We also fully exploit both the generator and discriminator produced by the GAN, using a novel anomaly score called DR-score to detect anomalies by discrimination and reconstruction. We have tested our proposed MAD-GAN using two recent datasets collected from real-world CPS: the Secure Water Treatment (SWaT) and the Water Distribution (WADI) datasets. Our experimental results showed that the proposed MAD-GAN is effective in reporting anomalies caused by various cyber-intrusions compared in these complex real-world systems.

...read moreread less

462 citations

Book Chapter•DOI•

MAD-GAN: Multivariate Anomaly Detection for Time Series Data with Generative Adversarial Networks

[...]

Dan Li¹, Dacheng Chen¹, Baihong Jin², Lei Shi¹, Jonathan Goh, See-Kiong Ng¹ - Show less +2 more•Institutions (2)

National University of Singapore¹, University of California, Berkeley²

17 Sep 2019

TL;DR: In this article, an unsupervised multivariate anomaly detection method based on Generative Adversarial Networks (GANs), using the Long Short-Term-Memory Recurrent Neural Networks (LSTM-RNN) as the base models (namely, the generator and discriminator) in the GAN framework, was proposed.

...read moreread less

Abstract: Many real-world cyber-physical systems (CPSs) are engineered for mission-critical tasks and usually are prime targets for cyber-attacks. The rich sensor data in CPSs can be continuously monitored for intrusion events through anomaly detection. On one hand, conventional supervised anomaly detection methods are unable to exploit the large amounts of data due to the lack of labelled data. On the other hand, current unsupervised machine learning approaches have not fully exploited the spatial-temporal correlation and other dependencies amongst the multiple variables (sensors/actuators) in the system when detecting anomalies. In this work, we propose an unsupervised multivariate anomaly detection method based on Generative Adversarial Networks (GANs), using the Long-Short-Term-Memory Recurrent Neural Networks (LSTM-RNN) as the base models (namely, the generator and discriminator) in the GAN framework to capture the temporal correlation of time series distributions. Instead of treating each data stream independently, our proposed Multivariate Anomaly Detection with GAN (MAD-GAN) framework considers the entire variable set concurrently to capture the latent interactions amongst the variables. We also fully exploit both the generator and discriminator produced by the GAN, using a novel anomaly score called DR-score to detect anomalies through discrimination and reconstruction. We have tested our proposed MAD-GAN using two recent datasets collected from real-world CPSs: the Secure Water Treatment (SWaT) and the Water Distribution (WADI) datasets. Our experimental results show that the proposed MAD-GAN is effective in reporting anomalies caused by various cyber-attacks inserted in these complex real-world systems.

...read moreread less

230 citations

Posted Content•

Anomaly Detection with Generative Adversarial Networks for Multivariate Time Series

[...]

Dan Li, Dacheng Chen, Jonathan Goh, See-Kiong Ng

13 Sep 2018-arXiv: Learning

TL;DR: This work proposed a novel Generative Adversarial Networks-based Anomaly Detection (GAN-AD) method that was used to distinguish abnormal attacked situations from normal working conditions for a complex six-stage Secure Water Treatment (SWaT) system.

...read moreread less

Abstract: Today's Cyber-Physical Systems (CPSs) are large, complex, and affixed with networked sensors and actuators that are targets for cyber-attacks. Conventional detection techniques are unable to deal with the increasingly dynamic and complex nature of the CPSs. On the other hand, the networked sensors and actuators generate large amounts of data streams that can be continuously monitored for intrusion events. Unsupervised machine learning techniques can be used to model the system behaviour and classify deviant behaviours as possible attacks. In this work, we proposed a novel Generative Adversarial Networks-based Anomaly Detection (GAN-AD) method for such complex networked CPSs. We used LSTM-RNN in our GAN to capture the distribution of the multivariate time series of the sensors and actuators under normal working conditions of a CPS. Instead of treating each sensor's and actuator's time series independently, we model the time series of multiple sensors and actuators in the CPS concurrently to take into account of potential latent interactions between them. To exploit both the generator and the discriminator of our GAN, we deployed the GAN-trained discriminator together with the residuals between generator-reconstructed data and the actual samples to detect possible anomalies in the complex CPS. We used our GAN-AD to distinguish abnormal attacked situations from normal working conditions for a complex six-stage Secure Water Treatment (SWaT) system. Experimental results showed that the proposed strategy is effective in identifying anomalies caused by various attacks with high detection rate and low false positive rate as compared to existing methods.

...read moreread less

199 citations

Journal Article•DOI•

A data-driven strategy for detection and diagnosis of building chiller faults using linear discriminant analysis

[...]

Dan Li¹, Guoqiang Hu¹, Costas J. Spanos²•Institutions (2)

Nanyang Technological University¹, University of California, Berkeley²

15 Sep 2016-Energy and Buildings

TL;DR: In this article, a two-stage data-driven FDD strategy is proposed to detect and diagnose chiller faults in order to save energy and improve the performance of building automation systems, which formulates the chiller detection and diagnosis task as a multi-class classification problem.

...read moreread less

121 citations

Journal Article•DOI•

Fault detection and diagnosis for building cooling system with a tree-structured learning method

[...]

Dan Li¹, Yuxun Zhou², Guoqiang Hu¹, Costas J. Spanos²•Institutions (2)

Nanyang Technological University¹, University of California, Berkeley²

01 Sep 2016-Energy and Buildings

TL;DR: Experimental results show that compared to previous data-driven methods, TFDK can greatly improve the FDD performance as well as recognize the fault severity levels with high accuracy.

...read moreread less

79 citations

1
2
3
4
…
5

Cited by

PDF

Open Access

More filters

Posted Content•

Deep Learning for Anomaly Detection: A Survey.

[...]

Raghavendra Chalapathy¹, Sanjay Chawla²•Institutions (2)

Cooperative Research Centre¹, Qatar Computing Research Institute²

10 Jan 2019-arXiv: Learning

TL;DR: A structured and comprehensive overview of research methods in deep learning-based anomaly detection, grouped state-of-the-art research techniques into different categories based on the underlying assumptions and approach adopted.

...read moreread less

Abstract: Anomaly detection is an important problem that has been well-studied within diverse research areas and application domains. The aim of this survey is two-fold, firstly we present a structured and comprehensive overview of research methods in deep learning-based anomaly detection. Furthermore, we review the adoption of these methods for anomaly across various application domains and assess their effectiveness. We have grouped state-of-the-art research techniques into different categories based on the underlying assumptions and approach adopted. Within each category we outline the basic anomaly detection technique, along with its variants and present key assumptions, to differentiate between normal and anomalous behavior. For each category, we present we also present the advantages and limitations and discuss the computational complexity of the techniques in real application domains. Finally, we outline open issues in research and challenges faced while adopting these techniques.

...read moreread less

522 citations

Posted Content•

MAD-GAN: Multivariate Anomaly Detection for Time Series Data with Generative Adversarial Networks

[...]

Dan Li, Dacheng Chen, Lei Shi, Baihong Jin, Jonathan Goh, See-Kiong Ng - Show less +2 more

15 Jan 2019-arXiv: Learning

...read moreread less

462 citations

Journal Article•DOI•

Machine Learning in IoT Security: Current Solutions and Future Challenges

[...]

Fatima Hussain¹, Rasheed Hussain, Syed Ali Hassan², Ekram Hossain³•Institutions (3)

Royal Bank of Canada¹, University of the Sciences², University of Manitoba³

08 Apr 2020-IEEE Communications Surveys and Tutorials

TL;DR: This paper systematically review the security requirements, attack vectors, and the current security solutions for the IoT networks, and sheds light on the gaps in these security solutions that call for ML and DL approaches.

...read moreread less

Abstract: The future Internet of Things (IoT) will have a deep economical, commercial and social impact on our lives. The participating nodes in IoT networks are usually resource-constrained, which makes them luring targets for cyber attacks. In this regard, extensive efforts have been made to address the security and privacy issues in IoT networks primarily through traditional cryptographic approaches. However, the unique characteristics of IoT nodes render the existing solutions insufficient to encompass the entire security spectrum of the IoT networks. Machine Learning (ML) and Deep Learning (DL) techniques, which are able to provide embedded intelligence in the IoT devices and networks, can be leveraged to cope with different security problems. In this paper, we systematically review the security requirements, attack vectors, and the current security solutions for the IoT networks. We then shed light on the gaps in these security solutions that call for ML and DL approaches. Finally, we discuss in detail the existing ML and DL solutions for addressing different security problems in IoT networks. We also discuss several future research directions for ML- and DL-based IoT security.

...read moreread less

407 citations

Posted Content•

A Review on Generative Adversarial Networks: Algorithms, Theory, and Applications

[...]

Jie Gui¹, Zhenan Sun, Yonggang Wen², Dacheng Tao³, Jieping Ye⁴ - Show less +1 more•Institutions (4)

Southeast University¹, Nanyang Technological University², University of Sydney³, University of Michigan⁴

20 Jan 2020-arXiv: Learning

TL;DR: This paper attempts to provide a review on various GANs methods from the perspectives of algorithms, theory, and applications, and compares the commonalities and differences of these GAns methods.

...read moreread less

Abstract: Generative adversarial networks (GANs) are a hot research topic recently. GANs have been widely studied since 2014, and a large number of algorithms have been proposed. However, there is few comprehensive study explaining the connections among different GANs variants, and how they have evolved. In this paper, we attempt to provide a review on various GANs methods from the perspectives of algorithms, theory, and applications. Firstly, the motivations, mathematical representations, and structure of most GANs algorithms are introduced in details. Furthermore, GANs have been combined with other machine learning algorithms for specific applications, such as semi-supervised learning, transfer learning, and reinforcement learning. This paper compares the commonalities and differences of these GANs methods. Secondly, theoretical issues related to GANs are investigated. Thirdly, typical applications of GANs in image processing and computer vision, natural language processing, music, speech and audio, medical field, and data science are illustrated. Finally, the future open research problems for GANs are pointed out.

...read moreread less

344 citations

Journal Article•DOI•

Random Forest based hourly building energy prediction

[...]

Zeyu Wang¹, Yueren Wang², Ruochen Zeng³, Ravi S. Srinivasan³, Sherry Ahrentzen³ - Show less +1 more•Institutions (3)

Guangzhou University¹, Microsoft², University of Florida³

15 Jul 2018-Energy and Buildings

TL;DR: In this article, the authors proposed a homogeneous ensemble approach, i.e., use of Random Forest (RF), for hourly building energy prediction, which was adopted to predict the hourly electricity usage of two educational buildings in North Central Florida.

...read moreread less

331 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse