scispace - formally typeset
Open AccessJournal ArticleDOI

Swarm intelligence for self-organized clustering

Michael C. Thrun, +1 more
- Vol. 290, pp 103237
Reads0
Chats0
TLDR
The Databionic swarm, called DBS, is introduced which is able to adapt itself to structures of high-dimensional data characterized by distance and/or density-based structures in the data space and can outperform common clustering methods such as K-means, PAM, single linkage, spectral clustering, model-based clustering and Ward.
Abstract
Algorithms implementing populations of agents which interact with one another and sense their environment may exhibit emergent behavior such as self-organization and swarm intelligence. Here a swarm system, called Databionic swarm (DBS), is introduced which is able to adapt itself to structures of high-dimensional data characterized by distance and/or density-based structures in the data space. By exploiting the interrelations of swarm intelligence, self-organization and emergence, DBS serves as an alternative approach to the optimization of a global objective function in the task of clustering. The swarm omits the usage of a global objective function and is parameter-free because it searches for the Nash equilibrium during its annealing process. To our knowledge, DBS is the first swarm combining these approaches. Its clustering can outperform common clustering methods such as K-means, PAM, single linkage, spectral clustering, model-based clustering, and Ward, if no prior knowledge about the data is available. A central problem in clustering is the correct estimation of the number of clusters. This is addressed by a DBS visualization called topographic map which allows assessing the number of clusters. It is known that all clustering algorithms construct clusters, irrespective of the data set contains clusters or not. In contrast to most other clustering algorithms, the topographic map identifies, that clustering of the data is meaningless if the data contains no (natural) clusters. The performance of DBS is demonstrated on a set of benchmark data, which are constructed to pose difficult clustering problems and in two real-world applications.

read more

Citations
More filters

Swarm Intelligence From Natural To Artificial Systems

TL;DR: Swarm intelligence from natural to artificial systems, where people have search hundreds of times for their chosen books, but end up in malicious downloads instead of reading a good book with a cup of coffee in the afternoon.
Journal ArticleDOI

Analyzing the fine structure of distributions.

TL;DR: A new visualization tool called the mirrored density plot (MD plot), which is specifically designed to discover interesting structures in continuous features, is proposed, and it is shown that when statistical testing poses a great difficulty, only the MD plots can identify the structure of their PDFs.
Journal ArticleDOI

Using Projection-Based Clustering to Find Distance- and Density-Based Clusters in High-Dimensional Data

TL;DR: A comparison showed that PBC is always able to find the correct cluster structure, while the performance of the best of the 32 clustering algorithms varies depending on the dataset.
References
More filters
Journal Article

R: A language and environment for statistical computing.

R Core Team
- 01 Jan 2014 - 
TL;DR: Copyright (©) 1999–2012 R Foundation for Statistical Computing; permission is granted to make and distribute verbatim copies of this manual provided the copyright notice and permission notice are preserved on all copies.

Some methods for classification and analysis of multivariate observations

TL;DR: The k-means algorithm as mentioned in this paper partitions an N-dimensional population into k sets on the basis of a sample, which is a generalization of the ordinary sample mean, and it is shown to give partitions which are reasonably efficient in the sense of within-class variance.
Journal ArticleDOI

A note on two problems in connexion with graphs

TL;DR: A tree is a graph with one and only one path between every two nodes, where at least one path exists between any two nodes and the length of each branch is given.
Journal ArticleDOI

Particle swarm optimization

TL;DR: A snapshot of particle swarming from the authors’ perspective, including variations in the algorithm, current and ongoing research, applications and open problems, is included.
Related Papers (5)