An Efficient K -Medoids-Based Algorithm Using Previous Medoid Index, Triangular Inequality Elimination Criteria, and Partial Distance Search

doi:10.1007/3-540-46145-0_7

Book ChapterDOI

An Efficient K -Medoids-Based Algorithm Using Previous Medoid Index, Triangular Inequality Elimination Criteria, and Partial Distance Search

Shu-Chuan Chu, +2 more

- pp 63-72

Chats0

TLDR

A novel and efficient approach is proposed to reduce the computational complexity of such k-medoids-based algorithms by using previous medoid index, triangular inequality elimination criteria and partial distance search.

Abstract:

Clustering in data mining is a discovery process that groups similar objects into the same cluster. Various clustering algorithms have been designed to fit various requirements and constraints of application. In this paper, we study several k-medoids-based algorithms including the PAM, CLARA and CLARANS algorithms. A novel and efficient approach is proposed to reduce the computational complexity of such k-medoids-based algorithms by using previous medoid index, triangular inequality elimination criteria and partial distance search. Experimental results based on elliptic, curve and Gauss-Markov databases demonstrate that the proposed algorithm applied to CLARANS may reduce the number of distance calculations by 67% to 92% while retaining the same average distance per object. In terms of the running time, the proposed algorithm may reduce computation time by 38% to 65% compared with the CLARANS algorithm.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Survey: Multi-criterion Pareto based particle swarm optimized polynomial neural network for classification: A review and state-of-the-art

S. Dehuri, +1 more

- 01 Feb 2009 -

Computer Science Review

TL;DR: An extensive experimental study has been carried out to compare the importance and effectiveness of the proposed MOPPSO method with the chosen state-of-the-art multi-objective particle swarm optimization (MOPSO) algorithm using several benchmark datasets.

...read moreread less

Journal ArticleDOI

Accelerating k-medoid-based algorithms through metric access methods

Maria Camila Nardini Barioni, +3 more

- 01 Mar 2008 -

Journal of Systems and Software

TL;DR: This paper addresses the task of scaling up k-medoid-based algorithms through the utilization of metric access methods, allowing clustering algorithms to be executed by database management systems in a fraction of the time usually required by the traditional approaches.

...read moreread less

Journal ArticleDOI

An Efficient Density based Improved K- Medoids Clustering algorithm

Raghuvira Pratap A, +3 more

- 01 Jan 2011 -

International Journal of Advanced Comput...

TL;DR: This paper proposes an efficient density based k-medoids clustering algorithm that will perform better than DBSCAN while handling clusters of circularly distributed data points and slightly overlapped clusters.

...read moreread less

Journal ArticleDOI

A hybrid spatial data clustering method for site selection: The data driven approach of GIS mining

Bo Fan

- 01 Mar 2009 -

Expert Systems With Applications

TL;DR: A new data model for recording all the information of customer management is given, which transforms the traditional model-driven strategy to data-oriented method and a hybrid spatial clustering method named OETTC-MEANS-CLASA algorithm is proposed to solve site selection of the service center.

...read moreread less

Proceedings ArticleDOI

Placement in Clouds for Application-Level Latency Requirements

Fangzhe Chang, +2 more

TL;DR: This paper presents placement algorithms that exploit the Euclidean triangular inequality property of network topologies, and shows that Message Sequence Charts (MSCs), a widely-used mechanism for describing the execution of application procedures, can be naturally translated into the formalism of collective latency expressions.

...read moreread less

References

PDF

Open Access

More filters

Some methods for classification and analysis of multivariate observations

James B. MacQueen

TL;DR: The k-means algorithm as mentioned in this paper partitions an N-dimensional population into k sets on the basis of a sample, which is a generalization of the ordinary sample mean, and it is shown to give partitions which are reasonably efficient in the sense of within-class variance.

...read moreread less

Proceedings Article

A density-based algorithm for discovering clusters a density-based algorithm for discovering clusters in large spatial databases with noise

Martin Ester, +3 more

TL;DR: In this paper, a density-based notion of clusters is proposed to discover clusters of arbitrary shape, which can be used for class identification in large spatial databases and is shown to be more efficient than the well-known algorithm CLAR-ANS.

...read moreread less

Proceedings Article

A density-based algorithm for discovering clusters in large spatial Databases with Noise

Martin Ester, +3 more

TL;DR: DBSCAN, a new clustering algorithm relying on a density-based notion of clusters which is designed to discover clusters of arbitrary shape, is presented which requires only one input parameter and supports the user in determining an appropriate value for it.

...read moreread less

Book

Finding Groups in Data: An Introduction to Cluster Analysis

Leonard Kaufman, +1 more

TL;DR: An electrical signal transmission system, applicable to the transmission of signals from trackside hot box detector equipment for railroad locomotives and rolling stock, wherein a basic pulse train is transmitted whereof the pulses are of a selected first amplitude and represent a train axle count.

...read moreread less

BookDOI

Finding Groups in Data

Leonard Kaufman, +1 more

TL;DR: In this article, an electrical signal transmission system for railway locomotives and rolling stock is proposed, where a basic pulse train is transmitted whereof the pulses are of a selected first amplitude and represent a train axle count, and a spike pulse of greater selected amplitude is transmitted, occurring immediately after the axle count pulse to which it relates, whenever an overheated axle box is detected.

...read moreread less

Collapse

IEEE Transactions on Knowledge and Data ...

Finding Groups in Data: An Introduction to Cluster Analysis

Leonard Kaufman, +1 more

Knowledge Discovery in Large Spatial Databases: Focusing Techniques for Efficient Class Identification

Martin Ester, +2 more

An Efficient K -Medoids-Based Algorithm Using Previous Medoid Index, Triangular Inequality Elimination Criteria, and Partial Distance Search

Citations

Survey: Multi-criterion Pareto based particle swarm optimized polynomial neural network for classification: A review and state-of-the-art

Accelerating k-medoid-based algorithms through metric access methods

An Efficient Density based Improved K- Medoids Clustering algorithm

A hybrid spatial data clustering method for site selection: The data driven approach of GIS mining

Placement in Clouds for Application-Level Latency Requirements

References

Some methods for classification and analysis of multivariate observations

A density-based algorithm for discovering clusters a density-based algorithm for discovering clusters in large spatial databases with noise

A density-based algorithm for discovering clusters in large spatial Databases with Noise

Finding Groups in Data: An Introduction to Cluster Analysis

Finding Groups in Data

Related Papers (5)

BIRCH: an efficient data clustering method for very large databases

Efficient and Effective Clustering Methods for Spatial Data Mining

CLARANS: a method for clustering objects for spatial data mining

Finding Groups in Data: An Introduction to Cluster Analysis

Knowledge Discovery in Large Spatial Databases: Focusing Techniques for Efficient Class Identification