scispace - formally typeset
Search or ask a question

Showing papers by "Jeff Calder published in 2016"


Journal ArticleDOI
TL;DR: A method for similarity-based anomaly detection using a novel multicriteria dissimilarity measure, the Pareto depth, which is provably better than using linear combinations of the criteria, and shows superior performance on experiments with synthetic and real data sets.
Abstract: We consider the problem of identifying patterns in a data set that exhibits anomalous behavior, often referred to as anomaly detection. Similarity-based anomaly detection algorithms detect abnormally large amounts of similarity or dissimilarity, e.g., as measured by the nearest neighbor Euclidean distances between a test sample and the training samples. In many application domains, there may not exist a single dissimilarity measure that captures all possible anomalous patterns. In such cases, multiple dissimilarity measures can be defined, including nonmetric measures, and one can test for anomalies by scalarizing using a nonnegative linear combination of them. If the relative importance of the different dissimilarity measures are not known in advance, as in many anomaly detection applications, the anomaly detection algorithm may need to be executed multiple times with different choices of weights in the linear combination. In this paper, we propose a method for similarity-based anomaly detection using a novel multicriteria dissimilarity measure, the Pareto depth. The proposed Pareto depth analysis (PDA) anomaly detection algorithm uses the concept of Pareto optimality to detect anomalies under multiple criteria without having to run an algorithm multiple times with different choices of weights. The proposed PDA approach is provably better than using linear combinations of the criteria, and shows superior performance on experiments with synthetic and real data sets.

25 citations


Journal ArticleDOI
TL;DR: In this article, it was shown that nondominated sorting of random points has a Hamilton-Jacobi equation (HJE) continuum limit, which completely avoids the variational interpretation.
Abstract: Nondominated sorting is a combinatorial algorithm that sorts points in Euclidean space into layers according to a partial order. It was recently shown that nondominated sorting of random points has a Hamilton–Jacobi equation continuum limit. The original proof, given in Calder et al. (2014), relies on a continuum variational problem. In this paper, we give a new proof using a direct verification argument that completely avoids the variational interpretation. We believe this may be generalized to apply to other stochastic homogenization problems for which there is no obvious underlying variational principle.

5 citations


Posted Content
TL;DR: A fast real-time streaming version of the PDA algorithm for anomaly detection and classification that exploits the computational advantages of partial differential equation (PDE) continuum limits is proposed.
Abstract: Nondominated sorting, or Pareto Depth Analysis (PDA), is widely used in multi-objective optimization and has recently found important applications in multi-criteria anomaly detection. We propose in this paper a fast real-time streaming version of the PDA algorithm for anomaly detection and classification that exploits the computational advantages of partial differential equation (PDE) continuum limits. We prove convergence rates for the continuum approximations and present the results of numerical experiments.

2 citations


Journal ArticleDOI
TL;DR: A fast real-time streaming version of the PDA algorithm for anomaly detection that exploits the computational advantages of PDE continuum limits is proposed and statistical convergence rates for PDE-based ranking are proved.
Abstract: Nondominated sorting, also called Pareto Depth Analysis (PDA), is widely used in multi-objective optimization and has recently found important applications in multi-criteria anomaly detection. Recently, a partial differential equation (PDE) continuum limit was discovered for nondominated sorting leading to a very fast approximate sorting algorithm called PDE-based ranking. We propose in this paper a fast real-time streaming version of the PDA algorithm for anomaly detection that exploits the computational advantages of PDE continuum limits. Furthermore, we derive new PDE continuum limits for sorting points within their nondominated layers and show how the new PDEs can be used to classify anomalies based on which criterion was more significantly violated. We also prove statistical convergence rates for PDE-based ranking, and present the results of numerical experiments with both synthetic and real data.

1 citations