A Survey of Evolutionary Algorithms for Clustering
read more
Citations
A survey on nature inspired metaheuristic algorithms for partitional clustering
Efficient agglomerative hierarchical clustering
Relative clustering validity criteria: A comparative overview
Relative clustering validity criteria: A comparative overview: Relative Clustering Validity Criteria
Automatic clustering using nature-inspired metaheuristics
References
Genetic algorithms in search, optimization, and machine learning
Maximum likelihood from incomplete data via the EM algorithm
Adaptation in natural and artificial systems
Neural Networks: A Comprehensive Foundation
Some methods for classification and analysis of multivariate observations
Related Papers (5)
Frequently Asked Questions (11)
Q2. What are the future works in this paper?
The authors believe that this issue is an important research area for future work. To the best of their knowledge, only Lozano and Larrañaga [ 93 ] address this topic, which is a possible venue for future research. The book by Falkenauer [ 43 ] can be considered as a pioneering work in this research direction. Doing so, the author argues that the premises of the schema theorem can be satisfied, and valid approaches can be derived to tackle grouping problems by using genetic algorithms.
Q3. What is the advantage of density-based clustering methods?
Density-based clustering methods usually have the advantage of being flexible enough to discover clusters of arbitrary shape [38].
Q4. What is the way to avoid empty clusters?
Of course, empty clusters could be avoided by enforcing the objects closer to multiple identical medoids to be shared among the corresponding clusters.
Q5. What is the purpose of a density-based clustering algorithm?
In such criteria, a cluster is essentially a group of objects in the same dense region in the data space, and the goal of a density-based clustering algorithm is to find high-density regions (each region corresponding to a cluster) that are separated by low-density regions.
Q6. Why is the evolutionary algorithm used as a wrapper around a clustering algorithm?
This is because the evolutionary algorithm is used as a wrapper around a clustering algorithm, so that the fitness of an individual (i.e., a candidate set of selected attributes) is computed by running a clustering algorithm with the selected attributes and measuring the corresponding clustering validity criteria.
Q7. What is the mechanism used to control the rate of application of the individual mutation operators?
It consists of the use of a self-adjusting procedure that automatically controls the rates of application of the individual mutation operators based upon their relative success/failure averaged over past generations.
Q8. What is the way to represent a partition by means of an integer encoding scheme?
Another way of representing a partition by means of an integer encoding scheme involves using an array of k elements to provide a medoid-based representation of the data set.
Q9. What is the choice among the evaluated variants?
Based upon an experimental evaluation, the authors argue that the pairwise nearest neighbor operator is the best choice among the assessed variants.
Q10. How can one evaluate the clustering solutions?
In order to do so, repeated runs of the evolutionary algorithm might be performed for different values of k, and the obtained clustering solutions could be comparatively assessed by some measure that reflects the partition quality6.
Q11. What is the fitness function used to assess partitions containing a given number of clusters?
3) Fitness Function: Many clustering validity criteria can be used for assessing partitions containing a given number (k) of clusters (e.g., see [72], [40], [75]).