A simple and fast algorithm for K-medoids clustering

doi:10.1016/J.ESWA.2008.01.039

Journal Article

Hae-Sang Park, +1 more

- 01 Mar 2009 -

Expert Systems With Applications

- Vol. 36, Iss: 2, pp 3336-3341

TLDR

Experimental results show that the proposed algorithm takes significantly less computation time than partitioning around medoids (PAM) while achieving comparable performance.

Abstract:

This paper proposes a new algorithm for K-medoids clustering which runs like the K-means algorithm and tests several methods for selecting initial medoids. The proposed algorithm calculates the distance matrix once and uses it for finding new medoids at every iterative step. To evaluate the proposed algorithm, we use some real and artificial data sets and compare with the results of other algorithms in terms of the adjusted Rand index. Experimental results show that the proposed algorithm takes a significantly reduced time in computation with comparable performance against the partitioning around medoids.
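The idea in the abstract, a K-means-style loop that reuses one precomputed distance matrix, can be sketched roughly as follows. This is a minimal illustration and not the authors' exact procedure: the function names, the Euclidean metric, the caller-supplied initial medoids, and the "medoids unchanged" stopping rule are all assumptions made for the example.

```python
import math


def pairwise_distances(points):
    """Build the full Euclidean distance matrix once, up front."""
    n = len(points)
    dist = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            d = math.dist(points[i], points[j])
            dist[i][j] = dist[j][i] = d
    return dist


def assign(dist, medoids):
    """Label every object with the position of its nearest medoid."""
    return [
        min(range(len(medoids)), key=lambda c: dist[i][medoids[c]])
        for i in range(len(dist))
    ]


def k_medoids(points, initial_medoids, max_iter=100):
    """K-means-like K-medoids sketch (hypothetical helper, see assumptions above)."""
    dist = pairwise_distances(points)  # computed once, reused in every iteration
    medoids = list(initial_medoids)
    labels = assign(dist, medoids)
    for _ in range(max_iter):
        new_medoids = []
        for c, old in enumerate(medoids):
            members = [i for i, lab in enumerate(labels) if lab == c]
            if not members:
                # Assumption: keep the old medoid if a cluster ends up empty.
                new_medoids.append(old)
                continue
            # New medoid: the member with the smallest total distance
            # to the other members of its cluster (matrix lookups only).
            best = min(members, key=lambda i: sum(dist[i][j] for j in members))
            new_medoids.append(best)
        if new_medoids == medoids:
            break  # medoids stopped moving
        medoids = new_medoids
        labels = assign(dist, medoids)
    return medoids, labels


points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
print(k_medoids(points, initial_medoids=[0, 1]))
# prints ([0, 3], [0, 0, 0, 1, 1, 1])
```

Because the update step only reads entries of the precomputed matrix, each iteration costs lookups and sums rather than fresh distance evaluations. The choice of initial medoids is left to the caller here; the paper tests several selection methods that this sketch does not reproduce.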

Citations

Journal Article

Learning best views of 3D shapes from sketch contour

Long Zhao, +3 more

- 01 Jun 2015 -

The Visual Computer

TL;DR: This paper introduces a novel learning-based approach to automatically select the best views of 3D shapes using a new prior, and reveals the connection between sketches and viewpoints by taking context information of their contours into account.

Book

Sentiment Analysis in the Bio-Medical Domain

Ranjan Satapathy, +2 more

TL;DR: This introductory chapter reviews the general area of sentiment analysis research and posits a case for incorporating commonsense knowledge in machines, as a means to better understand natural language.

Journal Article

PLS regression-based chemometric modeling of odorant properties of diverse chemical constituents of black tea and coffee

Probir Kumar Ojha, +1 more

- 09 Jan 2018 -

RSC Advances

TL;DR: In this article, the authors used regression-based chemometric models to investigate the key structural features that regulate the odorant properties of constituents present in black tea and coffee, as well as the structural properties that create the odor difference between black tea and coffee.

Posted Content

A fast and recursive algorithm for clustering large datasets with $k$-medians

Hervé Cardot, +2 more

- 21 Jan 2011 -

arXiv: Computation

TL;DR: In this paper, a recursive stochastic gradient algorithm designed for the $k$-medians loss criterion is proposed, which is very fast and is well adapted to deal with large samples of data that are allowed to arrive sequentially.

Proceedings Article

Beyond Value Perturbation: Local Differential Privacy in the Temporal Setting

Qingqing Ye, +5 more

TL;DR: This paper proposes local differential privacy in the temporal setting (TLDP) as the privacy notion for time series data, and quantifies the utility of a temporal perturbation mechanism in terms of the costs of a missing, repeated, empty, or delayed value.

References

Some methods for classification and analysis of multivariate observations

James B. MacQueen

TL;DR: This paper describes the k-means procedure, which partitions an N-dimensional population into k sets on the basis of a sample. The procedure generalizes the ordinary sample mean and is shown to give partitions that are reasonably efficient in the sense of within-class variance.

Journal Article

Silhouettes: a graphical aid to the interpretation and validation of cluster analysis

Peter J. Rousseeuw

- 01 Nov 1987 -

Journal of Computational and Applied Mat...

TL;DR: A new graphical display is proposed for partitioning techniques, where each cluster is represented by a so-called silhouette, which is based on the comparison of its tightness and separation, and provides an evaluation of clustering validity.

Book

Finding Groups in Data: An Introduction to Cluster Analysis

Leonard Kaufman, +1 more

TL;DR: An introduction to cluster analysis that presents partitioning methods, including partitioning around medoids (PAM), alongside hierarchical and fuzzy clustering methods.

Book

Finding Groups in Data

Leonard Kaufman, +1 more

TL;DR: This book introduces methods of cluster analysis, among them partitioning around medoids (PAM), the method against which the present paper compares its algorithm.

Journal Article