scispace - formally typeset
Open AccessJournal ArticleDOI

Graph based anomaly detection and description: a survey

TLDR
This survey aims to provide a general, comprehensive, and structured overview of the state-of-the-art methods for anomaly detection in data represented as graphs, and gives a general framework for the algorithms categorized under various settings.
Abstract
Detecting anomalies in data is a vital task, with numerous high-impact applications in areas such as security, finance, health care, and law enforcement. While numerous techniques have been developed in past years for spotting outliers and anomalies in unstructured collections of multi-dimensional points, with graph data becoming ubiquitous, techniques for structured graph data have been of focus recently. As objects in graphs have long-range correlations, a suite of novel technology has been developed for anomaly detection in graph data. This survey aims to provide a general, comprehensive, and structured overview of the state-of-the-art methods for anomaly detection in data represented as graphs. As a key contribution, we give a general framework for the algorithms categorized under various settings: unsupervised versus (semi-)supervised approaches, for static versus dynamic graphs, for attributed versus plain graphs. We highlight the effectiveness, scalability, generality, and robustness aspects of the methods. What is more, we stress the importance of anomaly attribution and highlight the major techniques that facilitate digging out the root cause, or the `why', of the detected anomalies for further analysis and sense-making. Finally, we present several real-world applications of graph-based anomaly detection in diverse domains, including financial, auction, computer traffic, and social networks. We conclude our survey with a discussion on open theoretical and practical challenges in the field.

read more

Citations
More filters
Proceedings ArticleDOI

An Efficient Approach to Event Detection and Forecasting in Dynamic Multivariate Social Media Networks

TL;DR: This paper generalizes traditional nonparametric statistics, and proposes a new class of scan statistic functions for measuring the joint significance of evolving subgraphs and subsets of attributes to indicate the ongoing or forthcoming event in dynamic multivariate networks.
Journal ArticleDOI

Estimation of Graphlet Counts in Massive Networks

TL;DR: An unbiased graphlet estimation framework that is fast with large speedups compared to the state of the art; parallel with nearly linear speedups; accurate with less than 1% relative error; scalable and space efficient for massive networks with billions of edges; and effective for a variety of real-world settings.
Journal ArticleDOI

Fake news outbreak 2021: Can we stop the viral spread?

TL;DR: This survey paper extensively analyse a wide range of different solutions for the early detection of fake news in the existing literature and examines Machine Learning (ML) models for the identification and classification offake news, online fake news detection competitions, statistical outputs as well as the advantages and disadvantages of some of the available data sets.
Journal ArticleDOI

A Structural Evolution-Based Anomaly Detection Method for Generalized Evolving Social Networks

TL;DR: Experimental results on real-world evolving social networks with artificial and real anomalies show that the proposed novel structural evolution-based anomaly detection method, $SeaDM$ outperforms the state-of-the-art anomaly detection methods.
Proceedings ArticleDOI

Error-Bounded Graph Anomaly Loss for GNNs

TL;DR: This work proposes a novel loss function to train GNNs for anomaly-detectable node representations that evaluates node similarity using global grouping patterns discovered from graph mining algorithms and proves that the prediction error is bounded given the proposed loss function.
References
More filters
Journal ArticleDOI

Collective dynamics of small-world networks

TL;DR: Simple models of networks that can be tuned through this middle ground: regular networks ‘rewired’ to introduce increasing amounts of disorder are explored, finding that these systems can be highly clustered, like regular lattices, yet have small characteristic path lengths, like random graphs.
Book

Matrix computations

Gene H. Golub
Journal ArticleDOI

Emergence of Scaling in Random Networks

TL;DR: A model based on these two ingredients reproduces the observed stationary scale-free distributions, which indicates that the development of large networks is governed by robust self-organizing phenomena that go beyond the particulars of the individual systems.
Book

Time series analysis, forecasting and control

TL;DR: In this article, a complete revision of a classic, seminal, and authoritative book that has been the model for most books on the topic written since 1970 is presented, focusing on practical techniques throughout, rather than a rigorous mathematical treatment of the subject.