Open Access Proceedings Article
A density-based algorithm for discovering clusters in large spatial Databases with Noise
Martin Ester, Hans-Peter Kriegel, Jörg Sander, Xiaowei Xu
- pp 226-231
TLDR
DBSCAN is a new clustering algorithm that relies on a density-based notion of clusters and is designed to discover clusters of arbitrary shape. It requires only one input parameter and supports the user in determining an appropriate value for it.
Abstract:
Clustering algorithms are attractive for the task of class identification in spatial databases. However, the application to large spatial databases raises the following requirements for clustering algorithms: minimal requirements of domain knowledge to determine the input parameters, discovery of clusters with arbitrary shape, and good efficiency on large databases. The well-known clustering algorithms offer no solution to the combination of these requirements. In this paper, we present the new clustering algorithm DBSCAN, which relies on a density-based notion of clusters and is designed to discover clusters of arbitrary shape. DBSCAN requires only one input parameter and supports the user in determining an appropriate value for it. We performed an experimental evaluation of the effectiveness and efficiency of DBSCAN using synthetic data and real data of the SEQUOIA 2000 benchmark. The results of our experiments demonstrate that (1) DBSCAN is significantly more effective in discovering clusters of arbitrary shape than the well-known algorithm CLARANS, and that (2) DBSCAN outperforms CLARANS by a factor of more than 100 in terms of efficiency.
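The abstract does not spell out the procedure, so the following is only a minimal sketch of density-based clustering in the spirit of DBSCAN, not the authors' implementation. It assumes the density notion is expressed as a neighborhood radius and a minimum neighbor count; the names eps, min_pts, region_query and sorted_k_dist are illustrative choices, and the linear-scan neighborhood query stands in for a spatial index.

```python
from math import dist

NOISE = -1  # label for points that belong to no cluster


def region_query(points, i, eps):
    """Indices of all points within distance eps of points[i], including i."""
    return [j for j, q in enumerate(points) if dist(points[i], q) <= eps]


def dbscan(points, eps, min_pts):
    """Label each point with a cluster id (0, 1, ...) or NOISE.

    eps and min_pts are assumed parameter names: a point is treated as
    dense if at least min_pts points lie within distance eps of it.
    """
    labels = [None] * len(points)
    cluster = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        neighbors = region_query(points, i, eps)
        if len(neighbors) < min_pts:
            labels[i] = NOISE  # may later be claimed as a border point
            continue
        cluster += 1
        labels[i] = cluster
        seeds = [j for j in neighbors if j != i]
        while seeds:
            j = seeds.pop()
            if labels[j] == NOISE:
                labels[j] = cluster  # border point: reachable but not dense
            if labels[j] is not None:
                continue
            labels[j] = cluster
            j_neighbors = region_query(points, j, eps)
            if len(j_neighbors) >= min_pts:
                seeds.extend(j_neighbors)  # dense point: keep expanding
    return labels


def sorted_k_dist(points, k):
    """Distance from each point to its k-th nearest neighbor, sorted descending.

    Plotting this list is one way to look for a suitable eps: noise points
    show up as the large values at the start of the list.
    """
    out = []
    for i, p in enumerate(points):
        d = sorted(dist(p, q) for j, q in enumerate(points) if j != i)
        out.append(d[k - 1])
    return sorted(out, reverse=True)


points = [(0, 0), (0, 1), (1, 0), (1, 1),
          (10, 10), (10, 11), (11, 10), (11, 11),
          (50, 50)]
labels = dbscan(points, eps=1.5, min_pts=3)
k_dist = sorted_k_dist(points, k=1)
```

In this toy input the two squares come out as separate clusters and the isolated point is labeled as noise. Each neighborhood query here scans all points, so the sketch is quadratic overall; on a large spatial database the query would normally be answered by an index structure instead.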
Citations
Journal Article
A new approach for semi-automatic rock mass joints recognition from 3D point clouds
TL;DR: This work was partially funded by the University of Alicante (vigrob-157, uausti11–11, and gre09–40 projects), the Swiss National Science Foundation (FNS-138015 and FNS-144040 projects) and by the Generalitat Valenciana (project GV/2011/044).
Journal Article
How to find an appropriate clustering for mixed-type variables with application to socio-economic stratification
Christian Hennig, Tim Futing Liao
TL;DR: The application of a philosophy of cluster analysis to economic data from the 2007 US Survey of Consumer Finances demonstrates techniques and decisions required to obtain an interpretable clustering, and the clustering is shown to be significantly more structured than a suitable null model.
Journal Article
Evolution of Extensively Drug-Resistant Tuberculosis over Four Decades: Whole Genome Sequencing and Dating Analysis of Mycobacterium tuberculosis Isolates from KwaZulu-Natal
Keira A. Cohen, Thomas Abeel, Abigail Manson McGuire, Christopher A. Desjardins, Vanisha Munsamy, Terrance Shea, Bruce J. Walker, Nonkqubela Bantubani, Deepak V. Almeida, Lucia Alvarado, Sinéad B. Chapman, Nomonde R. Mvelase, Eamon Y. Duffy, Michael Fitzgerald, Pamla Govender, Sharvari Gujja, Susanna Hamilton, Clinton Howarth, Jeffrey D. Larimer, Kashmeel Maharaj, Matthew D. Pearson, Margaret Priest, Qiandong Zeng, Nesri Padayatchi, Jacques H. Grosset, Sarah Young, Jennifer R. Wortman, Koleka Mlisana, Max R. O'Donnell, Bruce W. Birren, William R. Bishai, Alexander S. Pym, Ashlee M. Earl
TL;DR: The first whole genome-based analysis of the emergence of drug resistance among clinical isolates of M. tuberculosis shows that the ancestral precursor of the LAM4 XDR outbreak strain in Tugela Ferry gained mutations to first-line drugs at the beginning of the antibiotic era.
Journal Article
Mining Travel Patterns from Geotagged Photos
TL;DR: This study aims to leverage the wealth of these enriched online photos to analyze people’s travel patterns at the local level of a tour destination by building a statistically reliable database of travel paths from a noisy pool of community-contributed geotagged photos on the Internet.
Journal Article
Live-cell superresolution microscopy reveals the organization of RNA polymerase in the bacterial nucleoid
Mathew Stracy, Christian Lesterlin, Federico Garza de Leon, Stephan Uphoff, Pawel Zawadzki, Achillefs N. Kapanidis
TL;DR: This work characterizes how RNA polymerase accesses transcription sites on DNA, and shows that active transcription can cause spatial reorganization of the nucleoid, with movement of gene loci out of the bulk of DNA as levels of transcription increase.
References
Book
Finding Groups in Data: An Introduction to Cluster Analysis
TL;DR: An introductory text on cluster analysis, that is, on methods for finding groups in data.
Proceedings Article
The R*-tree: an efficient and robust access method for points and rectangles
TL;DR: The R*-tree is designed to incorporate a combined optimization of area, margin and overlap of each enclosing rectangle in the directory, and it clearly outperforms the existing R-tree variants.
Proceedings Article
Efficient and Effective Clustering Methods for Spatial Data Mining
Raymond T. Ng, Jiawei Han
TL;DR: The analysis and experiments show that with the assistance of CLARANS, these two algorithms are very effective and can lead to discoveries that are difficult to find with current spatial data mining algorithms.
Journal Article
An introduction to spatial database systems
TL;DR: This work surveys data modeling, querying, data structures and algorithms, and system architecture for spatial database systems, with the emphasis on describing known technology in a coherent manner, rather than listing open problems.