A Comparison Study of Validity Indices on Swarm-Intelligence-Based Clustering

doi:10.1109/TSMCB.2012.2188509

Journal ArticleDOI

A Comparison Study of Validity Indices on Swarm-Intelligence-Based Clustering

Rui Xu, +2 more

- Vol. 42, Iss: 4, pp 1243-1256

Chats0

TLDR

This work compares the performances of eight well-known and widely used clustering validity indices and finds that the silhouette statistic index stands out in most of the data sets that are examined.

Abstract:

Swarm intelligence has emerged as a worthwhile class of clustering methods due to its convenient implementation, parallel capability, ability to avoid local minima, and other advantages. In such applications, clustering validity indices usually operate as fitness functions to evaluate the qualities of the obtained clusters. However, as the validity indices are usually data dependent and are designed to address certain types of data, the selection of different indices as the fitness functions may critically affect cluster quality. Here, we compare the performances of eight well-known and widely used clustering validity indices, namely, the Calinski-Harabasz index, the CS index, the Davies-Bouldin index, the Dunn index with two of its generalized versions, the I index, and the silhouette statistic index, on both synthetic and real data sets in the framework of differential-evolution-particle-swarm-optimization (DEPSO)-based clustering. DEPSO is a hybrid evolutionary algorithm of the stochastic optimization approach (differential evolution) and the swarm intelligence method (particle swarm optimization) that further increases the search capability and achieves higher flexibility in exploring the problem space. According to the experimental results, we find that the silhouette statistic index stands out in most of the data sets that we examined. Meanwhile, we suggest that users reach their conclusions not just based on only one index, but after considering the results of several indices to achieve reliable clustering structures.

A Comparison Study of Validity Indices on Swarm-Intelligence-Based Clustering

Citations

A Comprehensive Survey of Clustering Algorithms

A survey on nature inspired metaheuristic algorithms for partitional clustering

Simulation and Hardware Implementation of New Maximum Power Point Tracking Technique for Partially Shaded PV System Using Hybrid DEPSO Method

SIMULATED ANNEALING AND BOLTZMANN MACHINES A Stochastic Approach to Combinatorial Optimization and Neural Computing

Dynamic clustering with improved binary artificial bee colony algorithm

References

Some methods for classification and analysis of multivariate observations

Differential Evolution – A Simple and Efficient Heuristic for Global Optimization over Continuous Spaces

Algorithms for clustering data

Finding Groups in Data: An Introduction to Chster Analysis

A Cluster Separation Measure

Related Papers (5)

A Cluster Separation Measure

A dendrite method for cluster analysis

Silhouettes: a graphical aid to the interpretation and validation of cluster analysis

Particle swarm optimization

A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters