Journal ArticleDOI
Validity index for crisp and fuzzy clusters
TL;DR
A cluster validity index and its fuzzification are described, which provide a measure of goodness of clustering on different partitions of a data set; results demonstrating the superiority of the PBM-index in appropriately determining the number of clusters are provided.
About:
This article is published in Pattern Recognition. The article was published on 2004-03-01. It has received 710 citations to date. The article focuses on the topics: Fuzzy clustering & Correlation clustering.
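The summary above names the PBM-index but does not state its formula. As a rough illustration only, the crisp form of the index is commonly written as PBM(K) = ((1/K) · (E_1/E_K) · D_K)^2, where E_1 is the summed distance of all points to the global centroid, E_K is the summed distance of points to their own cluster centroids, and D_K is the largest distance between two cluster centroids. The sketch below assumes that form with Euclidean distance; the function name `pbm_index` and the toy data are hypothetical and are not taken from the article.

```python
import numpy as np


def pbm_index(X, labels):
    """Crisp PBM index under the assumed form ((1/K) * (E_1 / E_K) * D_K) ** 2.

    X      : (n_samples, n_features) array of points
    labels : length-n_samples array of cluster assignments
    Larger values are taken to indicate a better partition.
    """
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels)
    cluster_ids = np.unique(labels)
    k = len(cluster_ids)

    # E_1: summed distance of every point to the centroid of the whole data set
    e1 = np.linalg.norm(X - X.mean(axis=0), axis=1).sum()

    # One centroid per cluster
    centroids = np.array([X[labels == c].mean(axis=0) for c in cluster_ids])

    # E_K: summed distance of every point to the centroid of its own cluster
    ek = sum(
        np.linalg.norm(X[labels == c] - centroids[i], axis=1).sum()
        for i, c in enumerate(cluster_ids)
    )

    # D_K: largest distance between any two cluster centroids
    pairwise = centroids[:, None, :] - centroids[None, :, :]
    dk = np.linalg.norm(pairwise, axis=2).max()

    return ((1.0 / k) * (e1 / ek) * dk) ** 2


# Toy data (hypothetical): two well-separated groups of three points each
X = np.array([[0, 0], [0, 1], [1, 0], [10, 10], [10, 11], [11, 10]])
good = np.array([0, 0, 0, 1, 1, 1])  # matches the visible grouping
bad = np.array([0, 1, 0, 1, 0, 1])   # mixes the two groups

print(pbm_index(X, good) > pbm_index(X, bad))
```

To pick a number of clusters with such an index, one would typically compute it for partitions obtained at several values of K and keep the K with the largest value. The fuzzified variant mentioned in the summary is not covered by this sketch.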
Citations
Proceedings ArticleDOI
A Cluster Validity Index for Fuzzy Clustering Based on Non-distance
Jiashun Chen,Dechang Pi +1 more
TL;DR: A new non-distance cluster index is proposed that not only recognizes overlapping clusters but is also insensitive to noisy data and more efficient.
Proceedings ArticleDOI
Data-driven feature word selection for clustering online news comments
Heeryon Cho,Jong-Seok Lee +1 more
TL;DR: This paper presents a data-driven feature word selection method which realizes structurally superior clustering of online comments, and finds that online comments clustered using distinct nouns produced structurally superior clusters when compared to the other types of nouns, local and global.
Journal ArticleDOI
A Genetic K-means Clustering Algorithm Based on the Optimized Initial Centers
Min Feng,Zhenyan Wang +1 more
TL;DR: To obtain effective and accurate clusters, the optimized K-means algorithm and a genetic algorithm are combined into a hybrid algorithm (PGKM), which not only improves the compactness and separation of the clustering but also automatically searches for the best cluster number k and then clusters after optimizing the k centers.
Posted Content
The Area Under the ROC Curve as a Measure of Clustering Quality.
TL;DR: This work elaborates on the use of AUC as an internal/relative measure of clustering quality, referred to as Area Under the Curve for Clustering (AUCC), and demonstrates that the AUCC of a given candidate clustering solution has an expected value under a null model of random clustering solutions, regardless of the size of the dataset.
References
Book
Genetic algorithms in search, optimization, and machine learning
TL;DR: In this article, the authors present the computer techniques, mathematical tools, and research results that will enable both students and practitioners to apply genetic algorithms to problems in many fields, including computer programming and mathematics.
Genetic algorithms in search, optimization and machine learning
TL;DR: This book brings together the computer techniques, mathematical tools, and research results that will enable both students and practitioners to apply genetic algorithms to problems in many fields.
Book
Applied Multivariate Statistical Analysis
R. A. Johnson,Dean W. Wichern +1 more
TL;DR: In this article, the authors present an overview of the basic concepts of multivariate analysis, including matrix algebra and random vectors, as well as a strategy for analyzing multivariate models.
Journal ArticleDOI
Applied Multivariate Statistical Analysis.
TL;DR: In this article, the authors present an overview of the basic concepts of multivariate analysis, including matrix algebra and random vectors, as well as a strategy for analyzing multivariate models.