Home
/
Authors
/
Vivekanand Gopalkrishnan

Author

Vivekanand Gopalkrishnan

Other affiliations: City University of Hong Kong, Deloitte, IBM

Bio: Vivekanand Gopalkrishnan is an academic researcher from Nanyang Technological University. The author has contributed to research in topics: Cluster analysis & Relational database. The author has an hindex of 21, co-authored 51 publications receiving 1362 citations. Previous affiliations of Vivekanand Gopalkrishnan include City University of Hong Kong & Deloitte.

Papers published on a yearly basis

2014
2013
2012
2011
2010
2009
2008
2007
2006
2002
2001
2000
1999
1998

Papers

PDF

Open Access

More filters

Journal Article•DOI•

PAMR: Passive aggressive mean reversion strategy for portfolio selection

[...]

Bin Li¹, Peilin Zhao¹, Steven C. H. Hoi¹, Vivekanand Gopalkrishnan²•Institutions (2)

Nanyang Technological University¹, Deloitte²

01 May 2012-Machine Learning

TL;DR: By analyzing PAMR’s update scheme, it is found that it nicely trades off between portfolio return and volatility risk and reflects the mean reversion trading principle.

...read moreread less

Abstract: This article proposes a novel online portfolio selection strategy named "Passive Aggressive Mean Reversion" (PAMR). Unlike traditional trend following approaches, the proposed approach relies upon the mean reversion relation of financial markets. Equipped with online passive aggressive learning technique from machine learning, the proposed portfolio selection strategy can effectively exploit the mean reversion property of markets. By analyzing PAMR's update scheme, we find that it nicely trades off between portfolio return and volatility risk and reflects the mean reversion trading principle. We also present several variants of PAMR algorithm, including a mixture algorithm which mixes PAMR and other strategies. We conduct extensive numerical experiments to evaluate the empirical performance of the proposed algorithms on various real datasets. The encouraging results show that in most cases the proposed PAMR strategy outperforms all benchmarks and almost all state-of-the-art portfolio selection strategies under various performance metrics. In addition to its superior performance, the proposed PAMR runs extremely fast and thus is very suitable for real-life online trading applications. The experimental testbed including source codes and data sets is available at http://www.cais.ntu.edu.sg/~chhoi/PAMR/ .

...read moreread less

171 citations

Journal Article•DOI•

A survey on enhanced subspace clustering

[...]

Kelvin Sim¹, Vivekanand Gopalkrishnan², Arthur Zimek³, Gao Cong⁴•Institutions (4)

Agency for Science, Technology and Research¹, IBM², Ludwig Maximilian University of Munich³, Nanyang Technological University⁴

01 Mar 2013-Data Mining and Knowledge Discovery

TL;DR: This survey presents enhanced approaches to subspace clustering by discussing the problems they are solving, their cluster definitions and algorithms, and the related works in high-dimensional clustering.

...read moreread less

Abstract: Subspace clustering finds sets of objects that are homogeneous in subspaces of high-dimensional datasets, and has been successfully applied in many domains. In recent years, a new breed of subspace clustering algorithms, which we denote as enhanced subspace clustering algorithms, have been proposed to (1) handle the increasing abundance and complexity of data and to (2) improve the clustering results. In this survey, we present these enhanced approaches to subspace clustering by discussing the problems they are solving, their cluster definitions and algorithms. Besides enhanced subspace clustering, we also present the basic subspace clustering and the related works in high-dimensional clustering.

...read moreread less

157 citations

Journal Article•DOI•

CORN: Correlation-driven nonparametric learning approach for portfolio selection

[...]

Bin Li¹, Steven C. H. Hoi¹, Vivekanand Gopalkrishnan¹•Institutions (1)

Nanyang Technological University¹

06 May 2011-ACM Transactions on Intelligent Systems and Technology

TL;DR: In this article, a learning-to-trade algorithm termed CORrelation-driven nonparametric learning strategy (CORN) was proposed for actively trading stocks. But, the performance of CORN was evaluated on several large historical and latest real stock markets, and showed that it can easily beat both the market index and the best stock in the market substantially.

...read moreread less

Abstract: Machine learning techniques have been adopted to select portfolios from financial markets in some emerging intelligent business applications. In this article, we propose a novel learning-to-trade algorithm termed CORrelation-driven Nonparametric learning strategy (CORN) for actively trading stocks. CORN effectively exploits statistical relations between stock market windows via a nonparametric learning approach. We evaluate the empirical performance of our algorithm extensively on several large historical and latest real stock markets, and show that it can easily beat both the market index and the best stock in the market substantially (without or with small transaction costs), and also surpass a variety of state-of-the-art techniques significantly.

...read moreread less

113 citations

Book Chapter•DOI•

Mining outliers with ensemble of heterogeneous detectors on random subspaces

[...]

Hoang Vu Nguyen¹, Hock Hee Ang¹, Vivekanand Gopalkrishnan¹•Institutions (1)

Nanyang Technological University¹

01 Apr 2010

TL;DR: This paper proposes a unified framework for combining different outlier detection algorithms that is very effective in detecting outliers in the real-world context compared to other ensemble and individual approaches.

...read moreread less

Abstract: Outlier detection has many practical applications, especially in domains that have scope for abnormal behavior. Despite the importance of detecting outliers, defining outliers in fact is a nontrivial task which is normally application-dependent. On the other hand, detection techniques are constructed around the chosen definitions. As a consequence, available detection techniques vary significantly in terms of accuracy, performance and issues of the detection problem which they address. In this paper, we propose a unified framework for combining different outlier detection algorithms. Unlike existing work, our approach combines non-compatible techniques of different types to improve the outlier detection accuracy compared to other ensemble and individual approaches. Through extensive empirical studies, our framework is shown to be very effective in detecting outliers in the real-world context.

...read moreread less

110 citations

Journal Article•DOI•

Confidence Weighted Mean Reversion Strategy for Online Portfolio Selection

[...]

Bin Li¹, Steven C. H. Hoi¹, Peilin Zhao¹, Vivekanand Gopalkrishnan²•Institutions (2)

Nanyang Technological University¹, Deloitte²

01 Mar 2013-ACM Transactions on Knowledge Discovery From Data

TL;DR: Zhang et al. as discussed by the authors modeled the portfolio vector as a Gaussian distribution, and sequentially updated the distribution by following the mean reversion trading principle, which has not been fully exploited by existing strategies.

...read moreread less

Abstract: Online portfolio selection has been attracting increasing attention from the data mining and machine learning communities. All existing online portfolio selection strategies focus on the first order information of a portfolio vector, though the second order information may also be beneficial to a strategy. Moreover, empirical evidence shows that relative stock prices may follow the mean reversion property, which has not been fully exploited by existing strategies. This article proposes a novel online portfolio selection strategy named Confidence Weighted Mean Reversion (CWMR). Inspired by the mean reversion principle in finance and confidence weighted online learning technique in machine learning, CWMR models the portfolio vector as a Gaussian distribution, and sequentially updates the distribution by following the mean reversion trading principle. CWMR’s closed-form updates clearly reflect the mean reversion trading idea. We also present several variants of CWMR algorithms, including a CWMR mixture algorithm that is theoretical universal. Empirically, CWMR strategy is able to effectively exploit the power of mean reversion for online portfolio selection. Extensive experiments on various real markets show that the proposed strategy is superior to the state-of-the-art techniques. The experimental testbed including source codes and data sets is available online.

...read moreread less

97 citations

1
2
3
4
…
5
6
7
8
9
10
11

Collapse

Cited by

PDF

Open Access

More filters

Data Mining - Concepts and Techniques.

[...]

Petra Perner

01 Jan 2002

9,314 citations

Journal Article•

Data Mining Practical Machine Learning Tools and Techniques

[...]

อนิรุธ สืบสิงห์

01 Jan 2014-Journal of management science

9,185 citations

Journal Article•DOI•

A survey on concept drift adaptation

[...]

João Gama¹, Indrė Žliobaitė², Albert Bifet, Mykola Pechenizkiy³, Abdelhamid Bouchachia⁴ - Show less +1 more•Institutions (4)

University of Porto¹, Aalto University², Eindhoven University of Technology³, Bournemouth University⁴

01 Mar 2014-ACM Computing Surveys

TL;DR: The survey covers the different facets of concept drift in an integrated way to reflect on the existing scattered state of the art and aims at providing a comprehensive introduction to the concept drift adaptation for researchers, industry analysts, and practitioners.

...read moreread less

Abstract: Concept drift primarily refers to an online supervised learning scenario when the relation between the input data and the target variable changes over time. Assuming a general knowledge of supervised learning in this article, we characterize adaptive learning processes; categorize existing strategies for handling concept drift; overview the most representative, distinct, and popular techniques and algorithms; discuss evaluation methodology of adaptive algorithms; and present a set of illustrative applications. The survey covers the different facets of concept drift in an integrated way to reflect on the existing scattered state of the art. Thus, it aims at providing a comprehensive introduction to the concept drift adaptation for researchers, industry analysts, and practitioners.

...read moreread less

2,374 citations

Journal Article•

When is nearest neighbor meaningful

[...]

Kevin S. Beyer, Jonathan Goldstein, Raghu Ramakrishnan, Uri Shaft

01 Jan 1999-Lecture Notes in Computer Science

TL;DR: In this article, the authors explore the effect of dimensionality on the nearest neighbor problem and show that under a broad set of conditions (much broader than independent and identically distributed dimensions), as dimensionality increases, the distance to the nearest data point approaches the distance of the farthest data point.

...read moreread less

Abstract: We explore the effect of dimensionality on the nearest neighbor problem. We show that under a broad set of conditions (much broader than independent and identically distributed dimensions), as dimensionality increases, the distance to the nearest data point approaches the distance to the farthest data point. To provide a practical perspective, we present empirical results on both real and synthetic data sets that demonstrate that this effect can occur for as few as 10-15 dimensions. These results should not be interpreted to mean that high-dimensional indexing is never meaningful; we illustrate this point by identifying some high-dimensional workloads for which this effect does not occur. However, our results do emphasize that the methodology used almost universally in the database literature to evaluate high-dimensional indexing techniques is flawed, and should be modified. In particular, most such techniques proposed in the literature are not evaluated versus simple linear scan, and are evaluated over workloads for which nearest neighbor is not meaningful. Often, even the reported experiments, when analyzed carefully, show that linear scan would outperform the techniques being proposed on the workloads studied in high (10-15) dimensionality!.

...read moreread less

1,992 citations

Book•

Outlier Analysis

[...]

Charu C. Aggarwal

11 Jan 2013

TL;DR: Outlier Analysis is a comprehensive exposition, as understood by data mining experts, statisticians and computer scientists, and emphasis was placed on simplifying the content, so that students and practitioners can also benefit.

...read moreread less

Abstract: With the increasing advances in hardware technology for data collection, and advances in software technology (databases) for data organization, computer scientists have increasingly participated in the latest advancements of the outlier analysis field. Computer scientists, specifically, approach this field based on their practical experiences in managing large amounts of data, and with far fewer assumptions the data can be of any type, structured or unstructured, and may be extremely large. Outlier Analysisis a comprehensive exposition, as understood by data mining experts, statisticians and computer scientists. The book has been organized carefully, and emphasis was placed on simplifying the content, so that students and practitioners can also benefit. Chapters will typically cover one of three areas: methods and techniques commonly used in outlier analysis, such as linear methods, proximity-based methods, subspace methods, and supervised methods; data domains, such as, text, categorical, mixed-attribute, time-series, streaming, discrete sequence, spatial and network data; and key applications of these methods as applied to diverse domains such as credit card fraud detection, intrusion detection, medical diagnosis, earth science, web log analytics, and social network analysis are covered.

...read moreread less

1,278 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse