Author

Alp Kut

Bio: Alp Kut is an academic researcher from Dokuz Eylül University. The author has contributed to research in topics: Data warehouse & Cluster analysis. The author has an h-index of 7 and has co-authored 31 publications receiving 1,085 citations.

Papers
Journal ArticleDOI
01 Jan 2007
TL;DR: A new density-based clustering algorithm based on DBSCAN is presented that can discover clusters according to the non-spatial, spatial, and temporal values of objects; an implementation of the algorithm using a spatial-temporal data warehouse is shown, and the data mining results are presented.
Abstract: This paper presents a new density-based clustering algorithm, ST-DBSCAN, which is based on DBSCAN. We propose three marginal extensions to DBSCAN related to the identification of (i) core objects, (ii) noise objects, and (iii) adjacent clusters. In contrast to the existing density-based clustering algorithms, our algorithm can discover clusters according to the non-spatial, spatial, and temporal values of the objects. In this paper, we also present a spatial-temporal data warehouse system designed for storing and clustering a wide range of spatial-temporal data. We show an implementation of our algorithm using this data warehouse and present the data mining results.

1,081 citations
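The extension is easiest to see next to plain DBSCAN: instead of a single distance threshold, a point's neighborhood is constrained by a spatial radius and a temporal radius at once. The sketch below is a minimal illustration of that idea, not the authors' implementation; the function names, the `(x, y, t)` record layout, and the parameter names are assumptions.

```python
import math

def st_neighbors(points, i, eps_spatial, eps_temporal):
    """Indices of points within BOTH a spatial and a temporal radius of point i."""
    x, y, t = points[i]
    out = []
    for j, (xj, yj, tj) in enumerate(points):
        if j == i:
            continue
        if math.hypot(x - xj, y - yj) <= eps_spatial and abs(t - tj) <= eps_temporal:
            out.append(j)
    return out

def st_dbscan(points, eps_spatial, eps_temporal, min_pts):
    """Simplified density-based clustering over (x, y, t) tuples.
    Returns one label per point: a cluster id >= 0, or -1 for noise."""
    labels = [None] * len(points)
    cluster = 0
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        seeds = st_neighbors(points, i, eps_spatial, eps_temporal)
        if len(seeds) < min_pts:
            labels[i] = -1  # provisionally noise; may be claimed as a border point later
            continue
        labels[i] = cluster
        queue = list(seeds)
        while queue:
            j = queue.pop()
            if labels[j] == -1:
                labels[j] = cluster  # former noise point becomes a border point
            if labels[j] is not None:
                continue
            labels[j] = cluster
            more = st_neighbors(points, j, eps_spatial, eps_temporal)
            if len(more) >= min_pts:  # j is itself a core point: keep expanding
                queue.extend(more)
        cluster += 1
    return labels
```

Because the neighborhood test uses two thresholds, two events that are close in space but far apart in time end up in different clusters, which is the behavior the abstract describes.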

Journal ArticleDOI
09 Oct 2006
TL;DR: A new outlier detection algorithm is introduced to detect spatio-temporal outliers in large databases by finding small groups of data objects that are exceptional when compared with the rest of the data.
Abstract: Outlier detection is one of the major data mining methods. This paper proposes a three-step approach to detect spatio-temporal outliers in large databases. These steps are clustering, checking spatial neighbors, and checking temporal neighbors. In this paper, we introduce a new outlier detection algorithm to find small groups of data objects that are exceptional when compared with the rest of the data. In contrast to the existing outlier detection algorithms, the new algorithm can discover outliers according to the non-spatial, spatial, and temporal values of the objects. In order to demonstrate the new algorithm, this paper also presents an example application using a data warehouse.

108 citations
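The three steps in the abstract (cluster, check spatial neighbors, check temporal neighbors) can be sketched as a candidate-then-confirm pipeline. The snippet below is a loose illustration only: it stands in for the clustering step with a simple global deviation test, and the record layout, thresholds, and the k-sigma rule are assumptions rather than the paper's method.

```python
import math
from statistics import mean, pstdev

def _deviates(value, pool, k):
    """True if value lies more than k population standard deviations from the pool mean."""
    if len(pool) < 2:
        return False
    return abs(value - mean(pool)) > k * pstdev(pool)

def detect_st_outliers(records, spatial_radius, time_radius, k=2.0):
    """Sketch of a three-step scheme over (x, y, t, value) records:
    1) candidates: values that deviate from the whole dataset
       (a stand-in for the clustering step in the paper),
    2) confirm against spatially near records,
    3) confirm against temporally near records.
    Returns the indices of confirmed outliers."""
    values = [v for (_, _, _, v) in records]
    out = []
    for i, (x, y, t, v) in enumerate(records):
        if not _deviates(v, values, k):
            continue  # step 1: not even a global candidate
        spatial = [vj for j, (xj, yj, _, vj) in enumerate(records)
                   if j != i and math.hypot(x - xj, y - yj) <= spatial_radius]
        temporal = [vj for j, (_, _, tj, vj) in enumerate(records)
                    if j != i and abs(t - tj) <= time_radius]
        if _deviates(v, spatial, k) and _deviates(v, temporal, k):
            out.append(i)  # steps 2 and 3: exceptional among both neighbor sets
    return out
```

The point of the two confirmation steps is to keep only objects that are exceptional relative to their local spatial and temporal context, not just globally unusual values.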

Journal ArticleDOI
TL;DR: Experimental results show that the incremental genetic algorithm considerably decreases the time needed to train a new classifier on an updated dataset; because of the very large volumes of data involved, performing these updates incrementally is highly desirable.
Abstract: Traditionally, data mining tasks such as classification and clustering are performed on data warehouses. Usually, updates are collected and applied to the data warehouse at frequent time intervals. For this reason, all patterns derived from the data warehouse have to be updated frequently as well. Due to the very large volumes of data, it is highly desirable to perform these updates incrementally. This study proposes a new incremental genetic algorithm for classification that efficiently handles new transactions. It presents comparison results for the traditional and incremental genetic algorithms for classification. Experimental results show that our incremental genetic algorithm considerably decreases the time needed for training to construct a new classifier with the new dataset. This study also includes a sensitivity analysis of the incremental genetic algorithm's parameters, such as crossover probability, mutation probability, elitism, and population size. In this analysis, many specific models were created using the same training dataset but with different parameter values, and then the performances of the models were compared.

25 citations
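The core idea of the incremental variant is that when new transactions arrive, the GA is re-run with its population seeded from the previously evolved classifier instead of from random individuals, so it needs fewer generations to converge. The toy sketch below evolves a linear decision rule with a mutation-only GA; the representation, operators, and parameter values are illustrative assumptions, not the paper's design.

```python
import random

def accuracy(ind, data):
    """Fraction of (x1, x2, label) rows the linear rule classifies correctly."""
    w1, w2, b = ind
    return sum((w1 * x1 + w2 * x2 + b > 0) == y for x1, x2, y in data) / len(data)

def random_population(n, rng):
    """n random (w1, w2, b) individuals."""
    return [tuple(rng.uniform(-1, 1) for _ in range(3)) for _ in range(n)]

def evolve(data, population, generations=30, elite=2, mut=0.3, rng=None):
    """One GA run: truncation selection plus Gaussian mutation, with elitism
    so the best individual is never lost (a deliberately tiny operator set)."""
    rng = rng or random.Random(0)
    for _ in range(generations):
        population.sort(key=lambda ind: accuracy(ind, data), reverse=True)
        parents = population[:max(elite, len(population) // 2)]
        children = []
        while len(children) < len(population) - elite:
            p = list(rng.choice(parents))
            for i in range(3):
                if rng.random() < mut:
                    p[i] += rng.gauss(0, 0.5)
            children.append(tuple(p))
        population = population[:elite] + children
    return max(population, key=lambda ind: accuracy(ind, data))

def incremental_evolve(new_data, previous_best, pop_size=20, rng=None):
    """Incremental run: seed the population with the previously evolved
    classifier instead of starting from scratch."""
    rng = rng or random.Random(1)
    population = [previous_best] + random_population(pop_size - 1, rng)
    return evolve(new_data, population, rng=rng)
```

Because the old champion enters the new population and elitism preserves it, the incremental run can never end up worse than the previous classifier on the new data, and in practice it starts much closer to a good solution.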

Book ChapterDOI
19 Jul 2013
TL;DR: A new clustering algorithm, SOM++, is introduced, which first uses the K-Means++ method to determine the initial weight values and starting points, and then uses a Self-Organizing Map (SOM) to find the final clustering solution.
Abstract: Data clustering is an important and widely used task of data mining that groups similar items together into subsets. This paper introduces a new clustering algorithm, SOM++, which first uses the K-Means++ method to determine the initial weight values and starting points, and then uses a Self-Organizing Map (SOM) to find the final clustering solution. The purpose of this algorithm is to provide a useful technique for improving the solution of data clustering and data mining in terms of runtime, the rate of unstable data points, and internal error. This paper also presents a comparison of our algorithm with simple SOM and K-Means + SOM using real-world data. The results show that SOM++ performs well in terms of stability and significantly outperforms the other methods in training time.

20 citations
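SOM++'s recipe, as the abstract describes it, is K-Means++ seeding followed by ordinary SOM training. A compact sketch of both stages for a 1-D map is given below; the decay schedules, neighborhood function, and function names are assumptions made for illustration, not the paper's exact formulation.

```python
import math
import random

def kmeanspp_init(data, k, rng):
    """K-Means++ seeding: the first centre is uniform, later centres are sampled
    with probability proportional to squared distance to the nearest chosen centre."""
    centres = [list(rng.choice(data))]
    while len(centres) < k:
        d2 = [min(sum((a - c) ** 2 for a, c in zip(x, cen)) for cen in centres)
              for x in data]
        r = rng.uniform(0, sum(d2))
        acc = 0.0
        for x, w in zip(data, d2):
            acc += w
            if acc >= r:
                centres.append(list(x))
                break
    return centres

def som_train(data, weights, epochs=20, lr0=0.5, rng=None):
    """1-D SOM: pull the best-matching unit (BMU) and its line neighbors toward
    each sample, with learning rate and neighborhood radius decaying over epochs."""
    rng = rng or random.Random(0)
    k = len(weights)
    for epoch in range(epochs):
        lr = lr0 * (1 - epoch / epochs)
        radius = max(0.5, (k / 2) * (1 - epoch / epochs))
        for x in rng.sample(data, len(data)):  # one shuffled pass per epoch
            bmu = min(range(k),
                      key=lambda i: sum((a - w) ** 2 for a, w in zip(x, weights[i])))
            for i in range(k):
                h = math.exp(-((i - bmu) ** 2) / (2 * radius ** 2))
                for d in range(len(x)):
                    weights[i][d] += lr * h * (x[d] - weights[i][d])
    return weights
```

The motivation for the combination is that K-Means++ spreads the initial weights across the data, so the SOM stage starts near a good configuration instead of from random weights.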

Proceedings ArticleDOI
02 May 2018
TL;DR: The application of four fundamental ensemble learning methods with five different classification algorithms, using optimal parameter values, on signal datasets; the best classification performance was obtained with the Random Forest algorithm, a Bagging-based method.
Abstract: In recent years, machine learning algorithms have come to be used widely in signal classification, as in many other areas. Ensemble learning has become one of the most popular machine learning approaches due to the high classification performance it provides. In this study, the application of four fundamental ensemble learning methods (Bagging, Boosting, Stacking, and Voting) with five different classification algorithms (Neural Network, Support Vector Machines, k-Nearest Neighbor, Naive Bayes, and C4.5), using optimal parameter values, on signal datasets is presented. In the experimental studies, the ensemble learning methods were applied to 14 different signal datasets and the results were compared in terms of classification accuracy. According to the results, the best classification performance was obtained with the Random Forest algorithm, which is a Bagging-based method.

12 citations
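Of the four ensemble methods compared, Bagging (the family the winning Random Forest belongs to) is the simplest to illustrate: train each base learner on a bootstrap resample of the training set and combine predictions by majority vote. The sketch below uses decision stumps as base learners purely for brevity; it is not the study's experimental setup, and all names are illustrative.

```python
import random
from collections import Counter

def train_stump(data):
    """Best single-feature threshold classifier (a decision stump) for
    rows of (features, label) with boolean labels."""
    best = None
    for f in range(len(data[0][0])):
        for xs, _ in data:
            t = xs[f]
            for polarity in (True, False):
                acc = sum(((x[f] > t) == polarity) == y for x, y in data) / len(data)
                if best is None or acc > best[0]:
                    best = (acc, f, t, polarity)
    _, f, t, polarity = best
    return lambda x: (x[f] > t) == polarity

def bagging(data, n_models=25, rng=None):
    """Bagging sketch: train each stump on a bootstrap resample of the data,
    then predict by majority vote over all stumps."""
    rng = rng or random.Random(0)
    models = [train_stump([rng.choice(data) for _ in data]) for _ in range(n_models)]
    def predict(x):
        votes = Counter(m(x) for m in models)
        return votes.most_common(1)[0][0]
    return predict
```

The vote averages away the variance of the individual resample-trained stumps, which is the usual argument for why Bagging-based methods such as Random Forest perform well.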


Cited by
01 Jan 2002

9,314 citations

Book
11 Jan 2013
TL;DR: Outlier Analysis is a comprehensive exposition, as understood by data mining experts, statisticians, and computer scientists, with emphasis placed on simplifying the content so that students and practitioners can also benefit.
Abstract: With the increasing advances in hardware technology for data collection, and advances in software technology (databases) for data organization, computer scientists have increasingly participated in the latest advancements of the outlier analysis field. Computer scientists, specifically, approach this field based on their practical experience in managing large amounts of data, and with far fewer assumptions: the data can be of any type, structured or unstructured, and may be extremely large. Outlier Analysis is a comprehensive exposition, as understood by data mining experts, statisticians, and computer scientists. The book has been organized carefully, and emphasis was placed on simplifying the content so that students and practitioners can also benefit. Chapters typically cover one of three areas: methods and techniques commonly used in outlier analysis, such as linear methods, proximity-based methods, subspace methods, and supervised methods; data domains, such as text, categorical, mixed-attribute, time-series, streaming, discrete-sequence, spatial, and network data; and key applications of these methods to diverse domains such as credit card fraud detection, intrusion detection, medical diagnosis, earth science, web log analytics, and social network analysis.

1,278 citations

Journal ArticleDOI
TL;DR: This review paper begins with the definition of clustering, takes into consideration the basic elements involved in the clustering process, such as distance or similarity measurements and evaluation indicators, and analyzes clustering algorithms from two perspectives: the traditional ones and the modern ones.
Abstract: Data analysis is used as a common method in modern science research, spanning communication science, computer science, and biology. Clustering, as a basic component of data analysis, plays a significant role. On one hand, many tools for cluster analysis have been created, along with the increase in information and the intersection of subjects. On the other hand, each clustering algorithm has its own strengths and weaknesses, due to the complexity of information. In this review paper, we begin with the definition of clustering, take into consideration the basic elements involved in the clustering process, such as the distance or similarity measurement and evaluation indicators, and analyze the clustering algorithms from two perspectives, the traditional ones and the modern ones. All the discussed clustering algorithms are compared in detail and comprehensively shown in Appendix Table 22.

1,234 citations

Journal ArticleDOI
TL;DR: Deep learning has started to be used; this method has the benefit that it does not require image feature identification and calculation as a first step; rather, features are identified as part of the learning process.
Abstract: Machine learning is a technique for recognizing patterns that can be applied to medical images. Although it is a powerful tool that can help in rendering medical diagnoses, it can be misapplied. Machine learning typically begins with the machine learning algorithm system computing the image features that are believed to be of importance in making the prediction or diagnosis of interest. The machine learning algorithm system then identifies the best combination of these image features for classifying the image or computing some metric for the given image region. There are several methods that can be used, each with different strengths and weaknesses. There are open-source versions of most of these machine learning methods that make them easy to try and apply to images. Several metrics for measuring the performance of an algorithm exist; however, one must be aware of the possible associated pitfalls that can result in misleading metrics. More recently, deep learning has started to be used; this method has the benefit that it does not require image feature identification and calculation as a first step; rather, features are identified as part of the learning process. Machine learning has been used in medical imaging and will have a greater influence in the future. Those working in medical imaging must be aware of how machine learning works. ©RSNA, 2017.

870 citations