scispace - formally typeset
Search or ask a question
Topic

Dunn index

About: Dunn index is a research topic. Over the lifetime, 150 publications have been published within this topic receiving 24021 citations.


Papers
More filters
Journal ArticleDOI
TL;DR: This paper proposes combining WOA with TS (WOATS) for data clustering, a meta-heuristic method which uses memory components to explore and exploit search space and uses an objective function inspired by partitional clustering to maintain the quality of clustering solutions.

20 citations

Journal ArticleDOI
TL;DR: In this paper, a wavelet transform based multiscale entropy (WME) and wavelet-based multi-scale relative entropy (WMRE) approach was used to analyze and gage the complexity of the precipitation series and spatially classify the raingauges in Iran.
Abstract: The hydrologic process and dynamic system of precipitation is influenced by many physical factors which are excessively complex and variable. Present study used a wavelet transform based multiscale entropy (WME) and wavelet-based multiscale relative entropy (WMRE) approach in order to analyze and gage the complexity of the precipitation series and spatially classify the raingauges in Iran. For this end, historical annual precipitation data of 51 years (1960–2010) from 31 raingauges was decomposed using WT in which smooth Daubechies (db) mother wavelet (db5–db10), optimal level of decomposition and boundary extensions were considered. Next, entropy concept was applied for components obtained from WT to measure of dispersion, uncertainty, disorderliness and diversification in a multi-scale form. Spatial classification of raingauges was performed using WME and WMRE values as input data to SOM and k-means approaches. Three validity indices namely Davis Bouldin (DB), Silhouette coefficient (SC) and Dunn index were used to validate the proposed model’s efficiency. Based on results, it was observed that k-means approach had better performance in determining homogenous areas with SC = 0.337, DB = 0.769 and Dunn = 1.42. Finally, spatial structure of precipitation variation in latitude and longitude directions demonstrated that WME and WMRE values had a decreasing trend with latitude, however, it was seen that WME and WMRE had an increasing relationship with longitude in Iran.

20 citations

Journal ArticleDOI
TL;DR: The results using this methodology showed a high classification accuracy and proved that both learning frameworks can be combined to optimize the selection of classification features.

19 citations

Journal ArticleDOI
TL;DR: The optimizing factor in the threshold function is computed via interpolation and found to be effective in forming better quality clusters as verified by visual assessment and various standard validation indices like the global silhouette index, partition index, separation index, and Dunn index.
Abstract: This paper presents a self-optimal clustering (SOC) technique which is an advanced version of improved mountain clustering (IMC) technique. The proposed clustering technique is equipped with major changes and modifications in its previous versions of algorithm. SOC is compared with some of the widely used clustering techniques such as K-means, fuzzy C-means, Expectation and Maximization, and K-medoid. Also, the comparison of the proposed technique is shown with IMC and its last updated version. The quantitative and qualitative performances of all these well-known clustering techniques are presented and compared with the aid of case studies and examples on various benchmarked validation indices. SOC has been evaluated via cluster compactness within itself and separation with other clusters. The optimizing factor in the threshold function is computed via interpolation and found to be effective in forming better quality clusters as verified by visual assessment and various standard validation indices like the global silhouette index, partition index, separation index, and Dunn index.

19 citations

Journal ArticleDOI
TL;DR: This work has attempted to make clustering algorithms, which could find reasonable number of representative conformer ensembles automatically with asymmetric dissimilarity matrix generated from openeye tool kit with respect to the compactness and the explanatory power.
Abstract: The accuracy of any 3D-QSAR, Pharmacophore and 3D-similarity based chemometric target fishing models are highly dependent on a reasonable sample of active conformations. Since a number of diverse conformational sampling algorithm exist, which exhaustively generate enough conformers, however model building methods relies on explicit number of common conformers. In this work, we have attempted to make clustering algorithms, which could find reasonable number of representative conformer ensembles automatically with asymmetric dissimilarity matrix generated from openeye tool kit. RMSD was the important descriptor (variable) of each column of the N × N matrix considered as N variables describing the relationship (network) between the conformer (in a row) and the other N conformers. This approach used to evaluate the performance of the well-known clustering algorithms by comparison in terms of generating representative conformer ensembles and test them over different matrix transformation functions considering the stability. In the network, the representative conformer group could be resampled for four kinds of algorithms with implicit parameters. The directed dissimilarity matrix becomes the only input to the clustering algorithms. Dunn index, Davies–Bouldin index, Eta-squared values and omega-squared values were used to evaluate the clustering algorithms with respect to the compactness and the explanatory power. The evaluation includes the reduction (abstraction) rate of the data, correlation between the sizes of the population and the samples, the computational complexity and the memory usage as well. Every algorithm could find representative conformers automatically without any user intervention, and they reduced the data to 14–19% of the original values within 1.13 s per sample at the most. The clustering methods are simple and practical as they are fast and do not ask for any explicit parameters. RCDTC presented the maximum Dunn and omega-squared values of the four algorithms in addition to consistent reduction rate between the population size and the sample size. The performance of the clustering algorithms was consistent over different transformation functions. Moreover, the clustering method can also be applied to molecular dynamics sampling simulation results.

19 citations


Network Information
Related Topics (5)
Feature selection
41.4K papers, 1M citations
70% related
Support vector machine
73.6K papers, 1.7M citations
69% related
Genetic algorithm
67.5K papers, 1.2M citations
68% related
Cluster analysis
146.5K papers, 2.9M citations
68% related
Web service
57.6K papers, 989K citations
66% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
202120
202028
201917
201813
201710
201611