scispace - formally typeset
Topic

Rand index

About: The Rand index is a measure of agreement between two clusterings of the same data set. Over the lifetime, 630 publications have been published within this topic, receiving 20,373 citations.


Papers
Journal ArticleDOI
TL;DR: This article proposes several criteria which isolate specific aspects of the performance of a method, such as its retrieval of inherent structure, its sensitivity to resampling and the stability of its results in the light of new data.
Abstract: Many intuitively appealing methods have been suggested for clustering data, however, interpretation of their results has been hindered by the lack of objective criteria. This article proposes several criteria which isolate specific aspects of the performance of a method, such as its retrieval of inherent structure, its sensitivity to resampling and the stability of its results in the light of new data. These criteria depend on a measure of similarity between two different clusterings of the same set of data; the measure essentially considers how each pair of data points is assigned in each clustering.

6,179 citations
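The pair-counting idea described in the abstract above can be sketched in a few lines of Python. This is a minimal illustration, not Rand's original code; the function name `rand_index` is mine, and the sketch assumes the standard definition of the index as the fraction of point pairs on which the two clusterings agree:

```python
from itertools import combinations

def rand_index(labels_a, labels_b):
    """Rand index: fraction of point pairs on which two clusterings agree.

    A pair agrees if both clusterings place the two points in the same
    cluster, or both place them in different clusters.
    """
    assert len(labels_a) == len(labels_b)
    pairs = list(combinations(range(len(labels_a)), 2))
    agree = 0
    for i, j in pairs:
        same_a = labels_a[i] == labels_a[j]
        same_b = labels_b[i] == labels_b[j]
        if same_a == same_b:
            agree += 1
    return agree / len(pairs)

# Identical clusterings (up to relabeling) score 1.0.
print(rand_index([0, 0, 1, 1], [1, 1, 0, 0]))  # 1.0
```

Note that the index depends only on pair co-membership, so the actual cluster labels are irrelevant, which is exactly what makes it suitable for comparing independently produced clusterings.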

Journal Article
TL;DR: In this paper, Hubert and Arabie corrected the Rand Index for chance (Adjusted Rand Index) and presented some alternative indices, which do not assume one set of units for two partitions.
Abstract: Rand (1971) proposed the Rand Index to measure the stability of two partitions of one set of units. Hubert and Arabie (1985) corrected the Rand Index for chance (Adjusted Rand Index). In this paper, we present some alternative indices. The proposed indices do not assume one set of units for two partitions. Here, one set of units can be a subset of the other set of units. Depending on the purpose of the comparison of two partitions, the merging and splitting of clusters in the two partitions can have a different impact on the value of the indices; we therefore propose several modified Rand Indices.

2,417 citations
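The Hubert-Arabie chance correction mentioned above can be computed from the contingency table of the two partitions. A minimal sketch, assuming the standard Adjusted Rand Index formula under the hypergeometric model (the function name is illustrative):

```python
from collections import Counter
from math import comb

def adjusted_rand_index(labels_a, labels_b):
    """Hubert-Arabie Adjusted Rand Index: the Rand index corrected for
    chance, so that two random labelings score near 0 on average."""
    n = len(labels_a)
    # Contingency counts n_ij, and row/column sums via the marginals.
    contingency = Counter(zip(labels_a, labels_b))
    sum_ij = sum(comb(c, 2) for c in contingency.values())
    sum_a = sum(comb(c, 2) for c in Counter(labels_a).values())
    sum_b = sum(comb(c, 2) for c in Counter(labels_b).values())
    expected = sum_a * sum_b / comb(n, 2)   # expected index under chance
    max_index = (sum_a + sum_b) / 2
    return (sum_ij - expected) / (max_index - expected)

print(adjusted_rand_index([0, 0, 1, 1], [1, 1, 0, 0]))  # 1.0
```

Unlike the raw Rand index, this quantity can be negative when two partitions agree less than chance would predict.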

Journal ArticleDOI
TL;DR: Experimental results show that the proposed algorithm requires significantly less computation time while achieving performance comparable to partitioning around medoids.

Abstract: This paper proposes a new algorithm for K-medoids clustering which runs like the K-means algorithm and tests several methods for selecting initial medoids. The proposed algorithm calculates the distance matrix once and uses it for finding new medoids at every iterative step. To evaluate the proposed algorithm, we use some real and artificial data sets and compare with the results of other algorithms in terms of the adjusted Rand index. Experimental results show that the proposed algorithm requires significantly less computation time while achieving performance comparable to partitioning around medoids.

1,629 citations
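The K-means-style update described in the abstract can be sketched as follows. This is an illustrative simplification, not the paper's algorithm: the paper tests several initialization methods, while this sketch simply assumes random initialization; the caller supplies the precomputed distance matrix, which is the paper's key efficiency point:

```python
import random

def k_medoids(dist, k, max_iter=100, seed=0):
    """K-medoids in the K-means style: the full n x n distance matrix
    `dist` is computed once by the caller; each iteration reassigns
    points to the nearest medoid, then picks as the new medoid of each
    cluster the member minimizing total within-cluster distance."""
    n = len(dist)
    rng = random.Random(seed)
    medoids = rng.sample(range(n), k)  # assumption: random initial medoids
    clusters = {}
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest medoid.
        clusters = {m: [] for m in medoids}
        for p in range(n):
            clusters[min(medoids, key=lambda m: dist[p][m])].append(p)
        # Update step: new medoid minimizes total in-cluster distance.
        new_medoids = sorted(
            min(members, key=lambda c: sum(dist[c][p] for p in members))
            for members in clusters.values()
        )
        if new_medoids == sorted(medoids):
            break  # medoids stable: converged
        medoids = new_medoids
    return medoids, clusters
```

On well-separated data this converges in a handful of iterations; each iteration only indexes into the precomputed matrix rather than recomputing distances.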

Proceedings ArticleDOI
14 Jun 2009
TL;DR: This paper derives the analytical formula for the expected mutual information value between a pair of clusterings, and proposes the adjusted version for several popular information theoretic based measures.
Abstract: Information theoretic based measures form a fundamental class of similarity measures for comparing clusterings, besides the class of pair-counting based and set-matching based measures. In this paper, we discuss the necessity of correction for chance for information theoretic based measures for clustering comparison. We observe that the baseline for such measures, i.e., the average value between random partitions of a data set, does not take on a constant value, and tends to have larger variation when the ratio between the number of data points and the number of clusters is small. This effect is similar in some other non-information theoretic based measures such as the well-known Rand Index. Assuming a hypergeometric model of randomness, we derive the analytical formula for the expected mutual information value between a pair of clusterings, and then propose the adjusted version for several popular information theoretic based measures. Some examples are given to demonstrate the need and usefulness of the adjusted measures.

748 citations
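The paper's central observation, that the chance baseline of mutual information is not constant and grows as points-per-cluster shrinks, can be checked empirically. A small simulation sketch (function names are mine; the paper derives the expectation analytically rather than by simulation), assuming uniformly random labelings:

```python
import random
from collections import Counter
from math import log

def mutual_information(a, b):
    """Mutual information (in nats) between two labelings of the same points."""
    n = len(a)
    pa, pb, pab = Counter(a), Counter(b), Counter(zip(a, b))
    return sum(
        (c / n) * log((c / n) / ((pa[x] / n) * (pb[y] / n)))
        for (x, y), c in pab.items()
    )

def mean_mi_random(n_points, k, trials=2000, seed=0):
    """Average MI between pairs of uniformly random k-way labelings:
    an estimate of the chance baseline the paper argues must be subtracted."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(trials):
        a = [rng.randrange(k) for _ in range(n_points)]
        b = [rng.randrange(k) for _ in range(n_points)]
        total += mutual_information(a, b)
    return total / trials

# With k fixed, the baseline is noticeably larger for small n than for
# large n, rather than sitting at a constant 0.
```

Subtracting this expectation (and normalizing) is what turns raw mutual information into the adjusted measures the paper proposes.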

Proceedings ArticleDOI
07 Aug 2005
TL;DR: This paper views clusterings as elements of a lattice and gives an axiomatic characterization of some criteria for comparing clusterings, including the variation of information and the unadjusted Rand index, and proves an impossibility result: there is no "sensible" criterion for comparing clusterings that is simultaneously aligned with the lattice of partitions, convexly additive, and bounded.

Abstract: This paper views clusterings as elements of a lattice. Distances between clusterings are analyzed in their relationship to the lattice. From this vantage point, we first give an axiomatic characterization of some criteria for comparing clusterings, including the variation of information and the unadjusted Rand index. Then we study other distances between partitions w.r.t. these axioms and prove an impossibility result: there is no "sensible" criterion for comparing clusterings that is simultaneously (1) aligned with the lattice of partitions, (2) convexly additive, and (3) bounded.

655 citations
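The variation of information mentioned above can be written purely in terms of entropies, VI(A, B) = 2H(A, B) - H(A) - H(B), since I(A; B) = H(A) + H(B) - H(A, B). A minimal sketch of that identity (function names are illustrative):

```python
from collections import Counter
from math import log

def entropy(labels):
    """Shannon entropy (in nats) of the empirical label distribution."""
    n = len(labels)
    return -sum((c / n) * log(c / n) for c in Counter(labels).values())

def variation_of_information(a, b):
    """VI(A, B) = 2*H(A, B) - H(A) - H(B): a true metric on partitions,
    equal to 0 iff the two partitions coincide."""
    joint = list(zip(a, b))  # joint labeling, one (a_i, b_i) pair per point
    return 2 * entropy(joint) - entropy(a) - entropy(b)
```

VI is unbounded (it grows with the number of clusters), which is precisely one of the trade-offs the impossibility result formalizes.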


Network Information
Related Topics (5)
Cluster analysis: 146.5K papers, 2.9M citations, 83% related
Support vector machine: 73.6K papers, 1.7M citations, 80% related
Feature (computer vision): 128.2K papers, 1.7M citations, 78% related
Deep learning: 79.8K papers, 2.1M citations, 78% related
Feature extraction: 111.8K papers, 2.1M citations, 78% related
Performance
Metrics
No. of papers in the topic in previous years
Year  Papers
2023       8
2022      22
2021      70
2020      64
2019      45
2018      42