Graph based anomaly detection and description: a survey

doi:10.1007/S10618-014-0365-Y

Open Access, Journal Article

Leman Akoglu, +2 more

01 May 2015

Data Mining and Knowledge Discovery

Vol. 29, Iss. 3, pp. 626-688

TLDR

This survey aims to provide a general, comprehensive, and structured overview of the state-of-the-art methods for anomaly detection in data represented as graphs, and gives a general framework for the algorithms categorized under various settings.

Abstract:

Detecting anomalies in data is a vital task, with numerous high-impact applications in areas such as security, finance, health care, and law enforcement. While numerous techniques have been developed in past years for spotting outliers and anomalies in unstructured collections of multi-dimensional points, with graph data becoming ubiquitous, techniques for structured graph data have been of focus recently. As objects in graphs have long-range correlations, a suite of novel technology has been developed for anomaly detection in graph data. This survey aims to provide a general, comprehensive, and structured overview of the state-of-the-art methods for anomaly detection in data represented as graphs. As a key contribution, we give a general framework for the algorithms categorized under various settings: unsupervised versus (semi-)supervised approaches, for static versus dynamic graphs, for attributed versus plain graphs. We highlight the effectiveness, scalability, generality, and robustness aspects of the methods. What is more, we stress the importance of anomaly attribution and highlight the major techniques that facilitate digging out the root cause, or the "why", of the detected anomalies for further analysis and sense-making. Finally, we present several real-world applications of graph-based anomaly detection in diverse domains, including financial, auction, computer traffic, and social networks. We conclude our survey with a discussion on open theoretical and practical challenges in the field.
