Variable selection in clustering

doi:10.1007/BF01897164

Journal Article

E. B. Fowlkes, +2 more

- 01 Sep 1988 -

Journal of Classification

- Vol. 5, Iss: 2, pp 205-228

TLDR

A forward selection procedure for identifying the subset of variables that carries the cluster structure is proposed and studied in the context of complete linkage hierarchical clustering; the basic approach can also be applied to other clustering methods.

Abstract:

Standard clustering algorithms can completely fail to identify clear cluster structure if that structure is confined to a subset of the variables. A forward selection procedure for identifying the subset is proposed and studied in the context of complete linkage hierarchical clustering. The basic approach can be applied to other clustering methods, too.
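
The idea in the abstract can be illustrated with a minimal sketch. The abstract does not state the selection criterion or stopping rule, so everything specific here is an assumption made for illustration and not the authors' procedure: the score is the between-cluster share of the total sum of squares on standardized variables, variables are added greedily, and selection stops when the best candidate scores below a hypothetical threshold `min_score`. The function names (`standardize`, `complete_linkage`, `separation`, `forward_select`) and the toy data are likewise invented for this example.

```python
from itertools import combinations
from math import dist


def standardize(rows):
    """Scale each column to mean 0 and unit (population) standard deviation."""
    cols = list(zip(*rows))
    means = [sum(c) / len(c) for c in cols]
    sds = [(sum((x - m) ** 2 for x in c) / len(c)) ** 0.5 or 1.0
           for c, m in zip(cols, means)]
    return [[(x - m) / s for x, m, s in zip(r, means, sds)] for r in rows]


def complete_linkage(points, k):
    """Naive agglomerative clustering into k clusters (lists of row indices).

    The distance between two clusters is the largest pairwise distance
    between their members, which is the complete linkage rule.
    """
    clusters = [[i] for i in range(len(points))]
    while len(clusters) > k:
        a, b = min(
            combinations(range(len(clusters)), 2),
            key=lambda ab: max(dist(points[i], points[j])
                               for i in clusters[ab[0]]
                               for j in clusters[ab[1]]),
        )
        merged = clusters.pop(b)  # a < b, so index a is unaffected
        clusters[a].extend(merged)
    return clusters


def separation(points, clusters):
    """Assumed criterion: between-cluster share of the total sum of squares."""
    def ss(idx):
        centre = [sum(points[i][j] for i in idx) / len(idx)
                  for j in range(len(points[0]))]
        return sum(dist(points[i], centre) ** 2 for i in idx)

    total = ss(range(len(points)))
    within = sum(ss(c) for c in clusters)
    return 1.0 - within / total


def forward_select(rows, k=2, min_score=0.9):
    """Greedily add the variable whose inclusion gives the best-separated clustering."""
    data = standardize(rows)
    remaining = list(range(len(data[0])))
    selected = []
    while remaining:
        scored = []
        for v in remaining:
            sub = [[r[j] for j in selected + [v]] for r in data]
            scored.append((separation(sub, complete_linkage(sub, k)), v))
        best_score, best_v = max(scored)
        if best_score < min_score:  # hypothetical stopping rule
            break
        selected.append(best_v)
        remaining.remove(best_v)
    return selected


# Toy data: columns 0 and 1 separate two groups of 10 rows each;
# columns 2 and 3 have the same spread of values in both groups (pure noise).
rows = [[100 * (i // 10) + i % 3,
         100 * (i // 10) + i % 4,
         (7 * i) % 10,
         (3 * i) % 10] for i in range(20)]

print(forward_select(rows))  # [0, 1]
```

On this toy data the two group-defining columns are selected and the noise columns are rejected, because adding a noise column dilutes the explained share of variance below the threshold. A real application would need a criterion and stopping rule justified for the data at hand.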

Citations

Monographs on statistics and applied probability

V. Isham, +5 more

Journal Article

K‐means clustering: A half‐century synthesis

Douglas Steinley

- 01 May 2006 -

British Journal of Mathematical and Stat...

TL;DR: This paper synthesizes the results, methodology, and research conducted concerning the K-means clustering method over the last fifty years, leading to a unifying treatment of K-means and some of its extensions.

Journal Article

Automated variable weighting in k-means type clustering

Joshua Zhexue Huang, +3 more

- 01 May 2005 -

IEEE Transactions on Pattern Analysis an...

TL;DR: A new step is introduced to the k-means clustering process to iteratively update variable weights based on the current partition of the data; a formula for weight calculation is proposed, and a convergence theorem for the new clustering process is given.

Journal Article

Subspace clustering

Hans-Peter Kriegel, +2 more

TL;DR: The problems motivating subspace clustering are sketched, different definitions and usages of subspaces for clustering are described, and exemplary algorithmic solutions are discussed.

Journal Article

Clustering objects on subsets of attributes (with discussion)

Jerome H. Friedman, +1 more

- 01 Nov 2004 -

Journal of The Royal Statistical Society...

TL;DR: A new procedure is proposed for clustering attribute-value data; used with conventional distance-based clustering algorithms, it encourages them to automatically detect subgroups of objects that preferentially cluster on subsets of the attribute variables rather than on all of them simultaneously.

References

Some methods for classification and analysis of multivariate observations

James B. MacQueen

TL;DR: This paper describes the k-means process, which partitions an N-dimensional population into k sets on the basis of a sample; the procedure generalizes the ordinary sample mean and is shown to give partitions that are reasonably efficient in the sense of within-class variance.

Book

Clustering Algorithms

John A. Hartigan

Journal Article

Direct Clustering of a Data Matrix

J. A. Hartigan

- 01 Mar 1972 -

Journal of the American Statistical Asso...

TL;DR: This article presents a model and a technique for clustering cases and variables simultaneously; the principal advantage of this approach is the direct interpretation of the clusters on the data.

Book

Methods for Statistical Data Analysis of Multivariate Observations

R. Gnanadesikan

TL;DR: In this paper, the authors present an assessment of specific aspects of multivariate statistical models, including reduction of dimensionality, reduction of dependence, and clustering of multidimensional dependencies.

Journal Article

A study of standardization of variables in cluster analysis

Glenn W. Milligan, +1 more

- 01 Sep 1988 -

Journal of Classification

TL;DR: The present simulation study examined the standardization problem and found that those approaches which standardize by division by the range of the variable gave consistently superior recovery of the underlying cluster structure.

Related Papers (5)

An examination of the effect of six types of error perturbation on fifteen clustering algorithms

Variable Selection for Model-Based Clustering

Some methods for classification and analysis of multivariate observations

An examination of procedures for determining the number of clusters in a data set

Objective Criteria for the Evaluation of Clustering Methods