scispace - formally typeset
Open AccessProceedings ArticleDOI

Arrays of (locality-sensitive) Count Estimators (ACE): Anomaly Detection on the Edge

Chen Luo, +1 more
- pp 1439-1448
Reads0
Chats0
TLDR
This paper proposes ACE (Arrays of (locality-sensitive) Count Estimators) algorithm that can be 60x faster than most state-of-the-art unsupervised anomaly detection algorithms and has appealing privacy properties.
Abstract
Anomaly detection is one of the frequent and important subroutines deployed in large-scale data processing applications. Even being a well-studied topic, existing techniques for unsupervised anomaly detection require storing significant amounts of data, which is prohibitive from memory, latency and privacy perspectives, especially for small mobile devices which has ultra-low memory budget and limited computational power. In this paper, we propose ACE (Arrays of (locality-sensitive) Count Estimators) algorithm that can be 60x faster than most state-of-the-art unsupervised anomaly detection algorithms. In addition, ACE has appealing privacy properties. Our experiments show that ACE algorithm has significantly smaller memory footprints (∠ 4MB in our experiments) which can exploit Level 3 cache of any modern processor. At the core of the ACE algorithm, there is a novel statistical estimator which is derived from the sampling view of Locality Sensitive Hashing (LSH). This view is significantly different and efficient than the widely popular view of LSH for near-neighbor search. We show the superiority of ACE algorithm over 11 popular baselines on 3 benchmark datasets, including the KDD-Cup99 data which is the largest available public benchmark comprising of more than half a million entries with ground truth anomaly labels.

read more

Citations
More filters
Journal Article

Ranking outliers using symmetric neighborhood relationship

TL;DR: In this article, the authors proposed a measure on local outliers based on a symmetric neighborhood relationship, which considers both neighbors and reverse neighbors of an object when estimating its density distribution.
Journal ArticleDOI

A neural data structure for novelty detection.

TL;DR: This work found that the fruit fly olfactory circuit evolved a variant of a Bloom filter to assess the novelty of odors, and develops a class of distance- and time-sensitive Bloom filters that outperform prior filters when evaluated on several biological and computational datasets.
Proceedings Article

Space and Time Efficient Kernel Density Estimation in High Dimensions

TL;DR: This work instantiate their framework with the Laplacian and Exponential kernels, two popular kernels which possess the aforementioned property, and presents an improvement to their framework that retains the same query time, while requiring only linear space and linear preprocessing time.
Journal ArticleDOI

Review and State of Art of Fog Computing

TL;DR: This paper describes fog computing technology, infrastructure, and applications, and presents the latest development of fog networking, quality of experience, cloud at the edge, platforms, security, and privacy.
Proceedings Article

Rehashing Kernel Evaluation in High Dimensions

TL;DR: This paper proposes and implements provable and practical procedures for adaptive sample size selection, preprocessing time reduction, and refined variance bounds that quantify the datadependent performance of random sampling and hashing-based kernel evaluation methods.
References
More filters
Journal ArticleDOI

Anomaly detection: A survey

TL;DR: This survey tries to provide a structured and comprehensive overview of the research on anomaly detection by grouping existing techniques into different categories based on the underlying approach adopted by each technique.
Journal ArticleDOI

LOF: identifying density-based local outliers

TL;DR: This paper contends that for many scenarios, it is more meaningful to assign to each object a degree of being an outlier, called the local outlier factor (LOF), and gives a detailed formal analysis showing that LOF enjoys many desirable properties.
Proceedings ArticleDOI

Approximate nearest neighbors: towards removing the curse of dimensionality

TL;DR: In this paper, the authors present two algorithms for the approximate nearest neighbor problem in high-dimensional spaces, for data sets of size n living in R d, which require space that is only polynomial in n and d.
Proceedings Article

Similarity Search in High Dimensions via Hashing

TL;DR: Experimental results indicate that the novel scheme for approximate similarity search based on hashing scales well even for a relatively large number of dimensions, and provides experimental evidence that the method gives improvement in running time over other methods for searching in highdimensional spaces based on hierarchical tree decomposition.
Related Papers (5)