Home
/
Authors
/
Amineh Amini

Author

Amineh Amini

Other affiliations: Islamic Azad University, University of Malaya

Bio: Amineh Amini is an academic researcher from Information Technology University. The author has contributed to research in topics: Cluster analysis & Data stream clustering. The author has an hindex of 12, co-authored 23 publications receiving 640 citations. Previous affiliations of Amineh Amini include Islamic Azad University & University of Malaya.

Papers

PDF

Open Access

More filters

Journal Article•DOI•

On Density-Based Data Streams Clustering Algorithms: A Survey

[...]

Amineh Amini¹, Teh Ying Wah¹, Hadi Saboohi¹•Institutions (1)

Information Technology University¹

07 Jan 2014-Journal of Computer Science and Technology

TL;DR: This paper summarizes the main density-based clustering algorithms on data streams, discusses their uniqueness and limitations, but also explains how they address the challenges in clustering data streams and investigates the evaluation metrics used in validating cluster quality and measuring algorithms’ performance.

...read moreread less

Abstract: Clustering data streams has drawn lots of attention in the last few years due to their ever-growing presence. Data streams put additional challenges on clustering such as limited time and memory and one pass clustering. Furthermore, discovering clusters with arbitrary shapes is very important in data stream applications. Data streams are infinite and evolving over time, and we do not have any knowledge about the number of clusters. In a data stream environment due to various factors, some noise appears occasionally. Density-based method is a remarkable class in clustering data streams, which has the ability to discover arbitrary shape clusters and to detect noise. Furthermore, it does not need the number of clusters in advance. Due to data stream characteristics, the traditional density-based clustering is not applicable. Recently, a lot of density-based clustering algorithms are extended for data streams. The main idea in these algorithms is using density-based methods in the clustering process and at the same time overcoming the constraints, which are put out by data stream’s nature. The purpose of this paper is to shed light on some algorithms in the literature on density-based clustering over data streams. We not only summarize the main density-based clustering algorithms on data streams, discuss their uniqueness and limitations, but also explain how they address the challenges in clustering data streams. Moreover, we investigate the evaluation metrics used in validating cluster quality and measuring algorithms’ performance. It is hoped that this survey will serve as a steppingstone for researchers studying data streams clustering, particularly density-based algorithms.

...read moreread less

183 citations

Journal Article•DOI•

Support vector regression methodology for wind turbine reaction torque prediction with power-split hydrostatic continuous variable transmission

[...]

Shahaboddin Shamshirband¹, Dalibor Petković², Amineh Amini³, Nor Badrul Anuar³, Vlastimir Nikolić², Žarko Ćojbašić², Miss Laiha Mat Kiah³, Abdullah Gani³ - Show less +4 more•Institutions (3)

Islamic Azad University¹, University of Niš², Information Technology University³

01 Apr 2014-Energy

TL;DR: In this paper, the polynomial and radial basis function (RBF) are applied as the kernel function of Support Vector Regression (SVR) for prediction of wind turbine reaction torque.

...read moreread less

113 citations

Journal Article•DOI•

D-FICCA: A density-based fuzzy imperialist competitive clustering algorithm for intrusion detection in wireless sensor networks

[...]

Shahaboddin Shamshirband¹, Shahaboddin Shamshirband², Amineh Amini², Nor Badrul Anuar², Miss Laiha Mat Kiah², Ying Wah Teh², Steven Furnell³ - Show less +3 more•Institutions (3)

Islamic Azad University¹, Information Technology University², University of Plymouth³

01 Sep 2014-Measurement

TL;DR: The imperialist competitive algorithm (ICA) is modified with a density-based algorithm and fuzzy logic for optimum clustering in WSNs and achieves higher detection accuracy 87% and clustering quality 0.99 compared to existing approaches.

...read moreread less

106 citations

Proceedings Article•DOI•

A study of density-grid based clustering algorithms on data streams

[...]

Amineh Amini¹, Teh Ying Wah¹, Mahmoud Reza Saybani¹, Saeed Reza Aghabozorgi Sahaf Yazdi¹•Institutions (1)

Information Technology University¹

26 Jul 2011

TL;DR: This paper reviews the grid based clustering algorithms that use density-based algorithms or density concept for the clustering and discusses about how well the algorithms address the challenging issues in the clustered data streams.

...read moreread less

Abstract: Clustering data streams attracted many researchers since the applications that generate data streams have become more popular. Several clustering algorithms have been introduced for data streams based on distance which are incompetent to find clusters of arbitrary shapes and cannot handle the outliers. Density-based clustering algorithms are remarkable not only to find arbitrarily shaped clusters but also to deal with noise in data. In density-based clustering algorithms, dense areas of objects in the data space are considered as clusters which are segregated by low-density area. Another group of the clustering methods for data streams is grid-based clustering where the data space is quantized into finite number of cells which form the grid structure and perform clustering on the grids. Grid-based clustering maps the infinite number of data records in data streams to finite numbers of grids. In this paper we review the grid based clustering algorithms that use density-based algorithms or density concept for the clustering. We called them density-grid clustering algorithms. We explore the algorithms in details and the merits and limitations of them. The algorithms are also summarized in a table based on the important features. Besides that, we discuss about how well the algorithms address the challenging issues in the clustering data streams.

...read moreread less

79 citations

Journal Article•DOI•

MuDi-Stream

[...]

Amineh Amini¹, Hadi Saboohi¹, Tutut Herawan¹, Teh Ying Wah¹•Institutions (1)

Information Technology University¹

01 Jan 2016-Journal of Network and Computer Applications

TL;DR: The proposed MuDi-Stream algorithm improves clustering quality in multi-density environments and is evaluated on various synthetic and real-world datasets using different quality metrics and further, scalability results are compared.

...read moreread less

64 citations

1
2
3
4
…
5

Cited by

PDF

Open Access

More filters

Data Mining - Concepts and Techniques.

[...]

Petra Perner

01 Jan 2002

9,314 citations

Journal Article•DOI•

Time-series clustering - A decade review

[...]

Saeed Aghabozorgi¹, Ali Seyed Shirkhorshidi¹, Teh Ying Wah¹•Institutions (1)

Information Technology University¹

01 Oct 2015-Information Systems

TL;DR: This review will expose four main components of time-series clustering and is aimed to represent an updated investigation on the trend of improvements in efficiency, quality and complexity of clustering time- series approaches during the last decade and enlighten new paths for future works.

...read moreread less

1,235 citations

Journal Article•DOI•

Data stream clustering: A survey

[...]

Jonathan de Andrade Silva¹, Elaine R. Faria¹, Rodrigo C. Barros¹, Eduardo R. Hruschka¹, André C. P. L. F. de Carvalho¹, João Gama² - Show less +2 more•Institutions (2)

University of São Paulo¹, University of Porto²

11 Jul 2013-ACM Computing Surveys

TL;DR: A survey of data stream clustering algorithms is presented, providing a thorough discussion of the main design components of state-of-the-art algorithms and an overview of the usually employed experimental methodologies.

...read moreread less

Abstract: Data stream mining is an active research area that has recently emerged to discover knowledge from large amounts of continuously generated data. In this context, several data stream clustering algorithms have been proposed to perform unsupervised learning. Nevertheless, data stream clustering imposes several challenges to be addressed, such as dealing with nonstationary, unbounded data that arrive in an online fashion. The intrinsic nature of stream data requires the development of algorithms capable of performing fast and incremental processing of data objects, suitably addressing time and memory limitations. In this article, we present a survey of data stream clustering algorithms, providing a thorough discussion of the main design components of state-of-the-art algorithms. In addition, this work addresses the temporal aspects involved in data stream clustering, and presents an overview of the usually employed experimental methodologies. A number of references are provided that describe applications of data stream clustering in different domains, such as network intrusion detection, sensor networks, and stock market analysis. Information regarding software packages and data repositories are also available for helping researchers and practitioners. Finally, some important issues and open questions that can be subject of future research are discussed.

...read moreread less

479 citations

Journal Article•DOI•

Adaptive random forests for evolving data stream classification

[...]

Heitor Murilo Gomes¹, Albert Bifet², Jesse Read², Jean Paul Barddal¹, Fabrício Enembreck¹, Bernhard Pfharinger³, Geoff Holmes³, Talel Abdessalem⁴ - Show less +4 more•Institutions (4)

Pontifícia Universidade Católica do Paraná¹, Université Paris-Saclay², University of Waikato³, National University of Singapore⁴

01 Oct 2017-Machine Learning

TL;DR: This work presents the adaptive random forest (ARF) algorithm, which includes an effective resampling method and adaptive operators that can cope with different types of concept drifts without complex optimizations for different data sets.

...read moreread less

Abstract: Random forests is currently one of the most used machine learning algorithms in the non-streaming (batch) setting. This preference is attributable to its high learning performance and low demands with respect to input preparation and hyper-parameter tuning. However, in the challenging context of evolving data streams, there is no random forests algorithm that can be considered state-of-the-art in comparison to bagging and boosting based algorithms. In this work, we present the adaptive random forest (ARF) algorithm for classification of evolving data streams. In contrast to previous attempts of replicating random forests for data stream learning, ARF includes an effective resampling method and adaptive operators that can cope with different types of concept drifts without complex optimizations for different data sets. We present experiments with a parallel implementation of ARF which has no degradation in terms of classification performance in comparison to a serial implementation, since trees and adaptive operators are independent from one another. Finally, we compare ARF with state-of-the-art algorithms in a traditional test-then-train evaluation and a novel delayed labelling evaluation, and show that ARF is accurate and uses a feasible amount of resources.

...read moreread less

442 citations

Journal Article•

Ranking outliers using symmetric neighborhood relationship

[...]

Wen Jin, Anthony K. H. Tung, Jiawei Han, Wei Wang

01 Jan 2006-Lecture Notes in Computer Science

TL;DR: In this article, the authors proposed a measure on local outliers based on a symmetric neighborhood relationship, which considers both neighbors and reverse neighbors of an object when estimating its density distribution.

...read moreread less

Abstract: Mining outliers in database is to find exceptional objects that deviate from the rest of the data set. Besides classical outlier analysis algorithms, recent studies have focused on mining local outliers, i.e., the outliers that have density distribution significantly different from their neighborhood. The estimation of density distribution at the location of an object has so far been based on the density distribution of its k-nearest neighbors [2,11]. However, when outliers are in the location where the density distributions in the neighborhood are significantly different, for example, in the case of objects from a sparse cluster close to a denser cluster, this may result in wrong estimation. To avoid this problem, here we propose a simple but effective measure on local outliers based on a symmetric neighborhood relationship. The proposed measure considers both neighbors and reverse neighbors of an object when estimating its density distribution. As a result, outliers so discovered are more meaningful. To compute such local outliers efficiently, several mining algorithms are developed that detects top-n outliers based on our definition. A comprehensive performance evaluation and analysis shows that our methods are not only efficient in the computation but also more effective in ranking outliers.

...read moreread less

321 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132

Collapse