Density-based clustering

doi:10.1002/WIDM.30

Journal Article

Hans-Peter Kriegel, +3 more

- 01 May 2011 -

Wiley Interdisciplinary Reviews-Data Min...

- Vol. 1, Iss: 3, pp 231-240

TLDR

This article treats clustering as the task of identifying groups or clusters in a data set. In density-based clustering, a cluster is a set of data objects spread in the data space over a contiguous region of high density of objects.

Abstract:

Clustering refers to the task of identifying groups or clusters in a data set. In density-based clustering, a cluster is a set of data objects spread in the data space over a contiguous region of high density of objects. Density-based clusters are separated from each other by contiguous regions of low density of objects. Data objects located in low-density regions are typically considered noise or outliers.

© 2011 John Wiley & Sons, Inc. WIREs Data Mining Knowl Discov 2011 1 231–240 DOI: 10.1002/widm.30

This article is categorized under: Technologies > Structure Discovery and Clustering
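The definition in the abstract can be illustrated with a minimal sketch in the style of DBSCAN. This is not taken from the article: the function names, the `eps` radius, and the `min_pts` threshold are assumptions chosen for illustration, and the neighborhood search is a naive scan over all points rather than an indexed query.

```python
from math import dist

NOISE = -1  # label for objects in low-density regions (assumed convention)


def region_query(points, i, eps):
    """Return indices of all points within distance eps of points[i], including i."""
    return [j for j, q in enumerate(points) if dist(points[i], q) <= eps]


def density_cluster(points, eps, min_pts):
    """Assign each point a cluster id (0, 1, ...) or NOISE.

    A point is treated as dense if at least min_pts points lie within eps of it.
    A cluster grows by following dense points, so it covers a contiguous
    high-density region.
    """
    labels = [None] * len(points)
    cluster_id = 0
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        neighbors = region_query(points, i, eps)
        if len(neighbors) < min_pts:
            labels[i] = NOISE  # may later be claimed as a border point
            continue
        labels[i] = cluster_id
        frontier = list(neighbors)
        while frontier:
            j = frontier.pop()
            if labels[j] == NOISE:
                labels[j] = cluster_id  # border point: reachable but not dense
            if labels[j] is not None:
                continue
            labels[j] = cluster_id
            j_neighbors = region_query(points, j, eps)
            if len(j_neighbors) >= min_pts:
                frontier.extend(j_neighbors)  # dense point: keep expanding
        cluster_id += 1
    return labels


# Two dense groups separated by a sparse region, plus one isolated object.
points = [(0, 0), (0, 1), (1, 0), (1, 1),
          (10, 10), (10, 11), (11, 10), (11, 11),
          (5, 5)]
labels = density_cluster(points, eps=1.5, min_pts=3)
print(labels)  # [0, 0, 0, 0, 1, 1, 1, 1, -1]
```

In this toy data set the two groups are separated by a low-density gap, and the isolated object at (5, 5) is labeled as noise, matching the description above.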

Citations

Book Chapter

Density-Based Clustering Based on Hierarchical Density Estimates

Ricardo J. G. B. Campello, +2 more

TL;DR: This work proposes a theoretically and practically improved density-based, hierarchical clustering method, providing a clustering hierarchy from which a simplified tree of significant clusters can be constructed, and proposes a novel cluster stability measure.

Journal Article

A survey on unsupervised outlier detection in high-dimensional numerical data

Arthur Zimek, +2 more

- 01 Oct 2012 -

Statistical Analysis and Data Mining

TL;DR: This survey article discusses some important aspects of the ‘curse of dimensionality’ in detail and surveys specialized algorithms for outlier detection from both categories.

Journal Article

Machine Learning for Internet of Things Data Analysis: A Survey

Mohammad Saeid Mahdavinejad, +7 more

- 12 Oct 2017 -

Digital Communications and Networks

TL;DR: This article assesses the different machine learning methods that deal with the challenges in IoT data by considering smart cities as the main use case and presents a taxonomy of machine learning algorithms explaining how different techniques are applied to the data in order to extract higher level information.

Journal Article

Subspace clustering

Hans-Peter Kriegel, +2 more

TL;DR: The problems motivating subspace clustering are sketched, different definitions and usages of subspaces for clustering are described, and exemplary algorithmic solutions are discussed.

Journal Article

Hierarchical Density Estimates for Data Clustering, Visualization, and Outlier Detection

Ricardo J. G. B. Campello, +3 more

- 22 Jul 2015 -

ACM Transactions on Knowledge Discovery ...

TL;DR: An integrated framework for density-based cluster analysis, outlier detection, and data visualization is introduced, consisting of an algorithm to compute hierarchical estimates of the level sets of a density, following Hartigan’s classic model of density-contour clusters and trees.

References

Journal Article

Maximum likelihood from incomplete data via the EM algorithm

Arthur P. Dempster, +2 more

- 01 Sep 1977 -

Journal of the Royal Statistical Society...

Journal Article

The WEKA data mining software: an update

Mark Hall, +5 more

- 16 Nov 2009 -

SIGKDD Explorations

TL;DR: This paper provides an introduction to the WEKA workbench, reviews the history of the project, and, in light of the recent 3.6 stable release, briefly discusses what has been added since the last stable version (Weka 3.4) released in 2003.

Proceedings Article

A density-based algorithm for discovering clusters in large spatial databases with noise

Martin Ester, +3 more

TL;DR: This paper presents DBSCAN, a new clustering algorithm that relies on a density-based notion of clusters and is designed to discover clusters of arbitrary shape. It requires only one input parameter and supports the user in determining an appropriate value for it.
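The summary above does not say how the parameter value is chosen. One heuristic commonly associated with DBSCAN is to sort every object's distance to its k-th nearest neighbor and look for a sharp drop in the sorted values. The sketch below assumes that heuristic; the function name and the choice of `k` are illustrative, not taken from this page.

```python
from math import dist


def sorted_k_distances(points, k):
    """Distance from each point to its k-th nearest neighbor, sorted descending.

    The point itself is excluded from its own neighbor list. A sharp drop in
    the returned sequence suggests a candidate neighborhood radius: objects
    before the drop are likely noise, objects after it likely lie in clusters.
    """
    k_dists = []
    for i, p in enumerate(points):
        others = sorted(dist(p, q) for j, q in enumerate(points) if j != i)
        k_dists.append(others[k - 1])
    return sorted(k_dists, reverse=True)


# Four tightly grouped objects and one isolated object.
points = [(0, 0), (0, 1), (1, 0), (1, 1), (5, 5)]
for d in sorted_k_distances(points, k=2):
    print(round(d, 3))
```

For this toy input the isolated object has a much larger 2-distance than the grouped objects, so a radius chosen just below that gap would leave it outside every cluster.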