scispace - formally typeset
Open AccessJournal ArticleDOI

Graph based anomaly detection and description: a survey

TLDR
This survey aims to provide a general, comprehensive, and structured overview of the state-of-the-art methods for anomaly detection in data represented as graphs, and gives a general framework for the algorithms categorized under various settings.
Abstract
Detecting anomalies in data is a vital task, with numerous high-impact applications in areas such as security, finance, health care, and law enforcement. While numerous techniques have been developed in past years for spotting outliers and anomalies in unstructured collections of multi-dimensional points, with graph data becoming ubiquitous, techniques for structured graph data have been of focus recently. As objects in graphs have long-range correlations, a suite of novel technology has been developed for anomaly detection in graph data. This survey aims to provide a general, comprehensive, and structured overview of the state-of-the-art methods for anomaly detection in data represented as graphs. As a key contribution, we give a general framework for the algorithms categorized under various settings: unsupervised versus (semi-)supervised approaches, for static versus dynamic graphs, for attributed versus plain graphs. We highlight the effectiveness, scalability, generality, and robustness aspects of the methods. What is more, we stress the importance of anomaly attribution and highlight the major techniques that facilitate digging out the root cause, or the `why', of the detected anomalies for further analysis and sense-making. Finally, we present several real-world applications of graph-based anomaly detection in diverse domains, including financial, auction, computer traffic, and social networks. We conclude our survey with a discussion on open theoretical and practical challenges in the field.

read more

Citations
More filters
Proceedings ArticleDOI

Monitor: An Abnormality Detection Approach in Buildings Energy Consumption

TL;DR: This paper presents a novel approach, called Monitor, which first identifies patterns in past consumption data and then uses these patterns to detect abnormalities, which reduces the rate of false positive alarms significantly and makes it more suitable for real-world deployments.
Journal ArticleDOI

Internet Financial Fraud Detection Based on a Distributed Big Data Approach With Node2vec

TL;DR: In this paper, an intelligent and distributed Big Data approach for Internet financial fraud detections is proposed to implement graph embedding algorithm Node2Vec to learn and represent the topological features in the financial network graph into low-dimensional dense vectors, so as to intelligently and efficiently classify and predict the data samples of the large-scale dataset with the deep neural network.
Journal ArticleDOI

Insights into Antidepressant Prescribing Using Open Health Data

TL;DR: This study examines how data science and machine learning methods can be applied to a variety of open health datasets, including GP prescribing data, disease prevalence data and economic deprivation data, and suggests that the depression prevalence hypothesis is flawed.
Proceedings ArticleDOI

Outlier Detection and Trend Detection: Two Sides of the Same Coin

TL;DR: This paper discusses the close relationship of trends and outliers, and calls to action to investigate this further, to carry over insights, ideas, and algorithms from one domain to the other.
Posted Content

FairJudge: Trustworthy User Prediction in Rating Platforms.

TL;DR: The paper proposes an iterative algorithm, FairJudge, a system to identify fraudulent users, and proposes three metrics: the fairness of a user that quantifies how trustworthy the user is in rating the products, the reliability of a rating that measures how reliable the rating is, and the goodness of a product that measures the quality of the product.
References
More filters
Journal ArticleDOI

Collective dynamics of small-world networks

TL;DR: Simple models of networks that can be tuned through this middle ground: regular networks ‘rewired’ to introduce increasing amounts of disorder are explored, finding that these systems can be highly clustered, like regular lattices, yet have small characteristic path lengths, like random graphs.
Book

Matrix computations

Gene H. Golub
Journal ArticleDOI

Emergence of Scaling in Random Networks

TL;DR: A model based on these two ingredients reproduces the observed stationary scale-free distributions, which indicates that the development of large networks is governed by robust self-organizing phenomena that go beyond the particulars of the individual systems.
Book

Time series analysis, forecasting and control

TL;DR: In this article, a complete revision of a classic, seminal, and authoritative book that has been the model for most books on the topic written since 1970 is presented, focusing on practical techniques throughout, rather than a rigorous mathematical treatment of the subject.