Home
/
Authors
/
Ioannis Katakis

Author

Ioannis Katakis

National and Kapodistrian University of Athens

Other affiliations: Aristotle University of Thessaloniki, University of Cyprus, University of Nicosia ...read more

Bio: Ioannis Katakis is an academic researcher from National and Kapodistrian University of Athens. The author has contributed to research in topics: Sentiment analysis & Voting. The author has an hindex of 17, co-authored 49 publications receiving 5465 citations. Previous affiliations of Ioannis Katakis include Aristotle University of Thessaloniki & University of Cyprus.

Topics: Sentiment analysis, Voting, Cluster analysis, Social media, Data management ...read more

Papers published on a yearly basis

2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004

Papers

PDF

Open Access

More filters

Journal Article•DOI•

An adaptive personalized news dissemination system

[...]

Ioannis Katakis¹, Grigorios Tsoumakas¹, Evangelos Banos¹, Nick Bassiliades¹, Ioannis Vlahavas¹ - Show less +1 more•Institutions (1)

Aristotle University of Thessaloniki¹

01 Apr 2009

TL;DR: This work implements a web-based news reader enhanced with a specifically designed machine learning framework for dynamic content personalization, and discusses the effectiveness of machine learning methods for the classification of real-world text streams.

...read moreread less

Abstract: With the explosive growth of the Word Wide Web, information overload became a crucial concern. In a data-rich information-poor environment like the Web, the discrimination of useful or desirable information out of tons of mostly worthless data became a tedious task. The role of Machine Learning in tackling this problem is thoroughly discussed in the literature, but few systems are available for public use. In this work, we bridge theory to practice, by implementing a web-based news reader enhanced with a specifically designed machine learning framework for dynamic content personalization. This way, we get the chance to examine applicability and implementation issues and discuss the effectiveness of machine learning methods for the classification of real-world text streams. The main features of our system named PersoNews are: (a) the aggregation of many different news sources that offer an RSS version of their content, (b) incremental filtering, offering dynamic personalization of the content not only per user but also per each feed a user is subscribed to, and (c) the ability for every user to watch a more abstracted topic of interest by filtering through a taxonomy of topics. PersoNews is freely available for public use on the WWW ( http://news.csd.auth.gr ).

...read moreread less

85 citations

Book Chapter•DOI•

Detecting Events in Online Social Networks: Definitions, Trends and Challenges

[...]

Nikolaos Panagiotou¹, Ioannis Katakis¹, Dimitrios Gunopulos¹•Institutions (1)

National and Kapodistrian University of Athens¹

01 Jan 2016

TL;DR: A wide range of event detection algorithms, architectures and evaluation methodologies are presented and a compact representation of the recent developments in the field is provided to aid the reader in understanding the main challenges tackled so far as well as identifying interesting future research directions.

...read moreread less

Abstract: Event detection is a research area that attracted attention during the last years due to the widespread availability of social media data. The problem of event detection has been examined in multiple social media sources like Twitter, Flickr, YouTube and Facebook. The task comprises many challenges including the processing of large volumes of data and high levels of noise. In this article, we present a wide range of event detection algorithms, architectures and evaluation methodologies. In addition, we extensively discuss on available datasets, potential applications and open research issues. The main objective is to provide a compact representation of the recent developments in the field and aid the reader in understanding the main challenges tackled so far as well as identifying interesting future research directions.

...read moreread less

80 citations

Book Chapter•DOI•

Effective voting of heterogeneous classifiers

[...]

Grigorios Tsoumakas¹, Ioannis Katakis¹, Ioannis Vlahavas¹•Institutions (1)

Aristotle University of Thessaloniki¹

20 Sep 2004

TL;DR: In this paper, the authors focus on the Classifier Evaluation and Selection (ES) method, that evaluates each of the models (typically using 10-fold cross-validation) and selects the best one.

...read moreread less

Abstract: This paper deals with the combination of classification models that have been derived from running different (heterogeneous) learning algorithms on the same data set. We focus on the Classifier Evaluation and Selection (ES) method, that evaluates each of the models (typically using 10-fold cross-validation) and selects the best one. We examine the performance of this method in comparison with the Oracle selecting the best classifier for the test set and show that 10-fold cross-validation has problems in detecting the best classifier. We then extend ES by applying a statistical test to the 10-fold accuracies of the models and combining through voting the most significant ones. Experimental results show that the proposed method, Effective Voting, performs comparably with the state-of-the-art method of Stacking with Multi-Response Model Trees without the additional computational cost of meta-training.

...read moreread less

65 citations

Book Chapter•DOI•

On the utility of incremental feature selection for the classification of textual data streams

[...]

Ioannis Katakis¹, Grigorios Tsoumakas¹, Ioannis Vlahavas¹•Institutions (1)

Aristotle University of Thessaloniki¹

11 Nov 2005

TL;DR: This paper proposes the coupling of an incremental feature ranking method and an incremental learning algorithm that can consider different subsets of the feature vector during prediction (what they call a feature based classifier), in order to deal with the above problem.

...read moreread less

Abstract: In this paper we argue that incrementally updating the features that a text classification algorithm considers is very important for real-world textual data streams, because in most applications the distribution of data and the description of the classification concept changes over time. We propose the coupling of an incremental feature ranking method and an incremental learning algorithm that can consider different subsets of the feature vector during prediction (what we call a feature based classifier), in order to deal with the above problem. Experimental results with a longitudinal database of real spam and legitimate emails shows that our approach can adapt to the changing nature of streaming data and works much better than classical incremental learning algorithms.

...read moreread less

63 citations

Proceedings Article•DOI•

An Ensemble of Classifiers for coping with Recurring Contexts in Data Streams

[...]

Ioannis Katakis¹, Grigorios Tsoumakas¹, Ioannis Vlahavas¹•Institutions (1)

Aristotle University of Thessaloniki¹

27 Jun 2008

TL;DR: A transformation function that maps batches of examples into a new conceptual feature space is proposed to achieve this, and the clustering algorithm is applied in order to group different concepts and identify recurring contexts.

...read moreread less

Abstract: This paper proposes a general framework for classifying data streams by exploiting incremental clustering in order to dynamically build and update an ensemble of incremental classifiers. To achieve this, a transformation function that maps batches of examples into a new conceptual feature space is proposed. The clustering algorithm is then applied in order to group different concepts and identify recurring contexts. The ensemble is produced by maintaining an classifier for every concept discovered in the stream The full version of this paper as well as the datasets used for evaluation can be found at: http://mlkd.csd.auth.gr/concept_drift.html

...read moreread less

60 citations

1
2
3
4
5
…
6
7
8
9
10
11

Collapse

Cited by

PDF

Open Access

More filters

[신간의 별자리x] 우리/미술, 그리고 ‘슬픔의 박물관’

[...]

이화영

01 Jan 2015

12,972 citations

Data Mining - Concepts and Techniques.

[...]

Petra Perner

01 Jan 2002

9,314 citations

Journal Article•

Data Mining Practical Machine Learning Tools and Techniques

[...]

อนิรุธ สืบสิงห์

01 Jan 2014-Journal of management science

9,185 citations

Proceedings Article•DOI•

node2vec: Scalable Feature Learning for Networks

[...]

Aditya Grover¹, Jure Leskovec¹•Institutions (1)

Stanford University¹

13 Aug 2016

TL;DR: Node2vec as mentioned in this paper learns a mapping of nodes to a low-dimensional space of features that maximizes the likelihood of preserving network neighborhoods of nodes by using a biased random walk procedure.

...read moreread less

Abstract: Prediction tasks over nodes and edges in networks require careful effort in engineering features used by learning algorithms. Recent research in the broader field of representation learning has led to significant progress in automating prediction by learning the features themselves. However, present feature learning approaches are not expressive enough to capture the diversity of connectivity patterns observed in networks. Here we propose node2vec, an algorithmic framework for learning continuous feature representations for nodes in networks. In node2vec, we learn a mapping of nodes to a low-dimensional space of features that maximizes the likelihood of preserving network neighborhoods of nodes. We define a flexible notion of a node's network neighborhood and design a biased random walk procedure, which efficiently explores diverse neighborhoods. Our algorithm generalizes prior work which is based on rigid notions of network neighborhoods, and we argue that the added flexibility in exploring neighborhoods is the key to learning richer representations. We demonstrate the efficacy of node2vec over existing state-of-the-art techniques on multi-label classification and link prediction in several real-world networks from diverse domains. Taken together, our work represents a new way for efficiently learning state-of-the-art task-independent representations in complex networks.

...read moreread less

7,072 citations

Journal Article•DOI•

A Review On Multi-Label Learning Algorithms

[...]

Min-Ling Zhang¹, Zhi-Hua Zhou²•Institutions (2)

Southeast University¹, Nanjing University²

01 Aug 2014-IEEE Transactions on Knowledge and Data Engineering

TL;DR: This paper aims to provide a timely review on this area with emphasis on state-of-the-art multi-label learning algorithms with relevant analyses and discussions.

...read moreread less

Abstract: Multi-label learning studies the problem where each example is represented by a single instance while associated with a set of labels simultaneously. During the past decade, significant amount of progresses have been made toward this emerging machine learning paradigm. This paper aims to provide a timely review on this area with emphasis on state-of-the-art multi-label learning algorithms. Firstly, fundamentals on multi-label learning including formal definition and evaluation metrics are given. Secondly and primarily, eight representative multi-label learning algorithms are scrutinized under common notations with relevant analyses and discussions. Thirdly, several related learning settings are briefly summarized. As a conclusion, online resources and open research problems on multi-label learning are outlined for reference purposes.

...read moreread less

2,495 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse