Foundations and Modeling of Dynamic Networks Using Dynamic Graph Neural Networks: A Survey

doi:10.1109/ACCESS.2021.3082932

Home
/
Papers
/
Foundations and Modeling of Dynamic Networks Using Dynamic Graph Neural Networks: A Survey

Journal Article•DOI•

Foundations and Modeling of Dynamic Networks Using Dynamic Graph Neural Networks: A Survey

Joakim Skarding¹, Bogdan Gabrys¹, Katarzyna Musial¹•Institutions (1)

University of Technology, Sydney¹

25 May 2021-IEEE Access (IEEE)-Vol. 9, pp 79143-79168

TL;DR: This work establishes a foundation of dynamic networks with consistent, detailed terminology and notation and presents a comprehensive survey of dynamic graph neural network models using the proposed terminology.

read less

Abstract: Dynamic networks are used in a wide range of fields, including social network analysis, recommender systems and epidemiology. Representing complex networks as structures changing over time allow network models to leverage not only structural but also temporal patterns. However, as dynamic network literature stems from diverse fields and makes use of inconsistent terminology, it is challenging to navigate. Meanwhile, graph neural networks (GNNs) have gained a lot of attention in recent years for their ability to perform well on a range of network science tasks, such as link prediction and node classification. Despite the popularity of graph neural networks and the proven benefits of dynamic network models, there has been little focus on graph neural networks for dynamic networks. To address the challenges resulting from the fact that this research crosses diverse fields as well as to survey dynamic graph neural networks, this work is split into two main parts. First, to address the ambiguity of the dynamic network terminology we establish a foundation of dynamic networks with consistent, detailed terminology and notation. Second, we present a comprehensive survey of dynamic graph neural network models using the proposed terminology.

...read moreread less

Citations

PDF

Open Access

More filters

Journal Article•DOI•

Survival and Event History Analysis: A Process Point of View by AALEN, O. O., BORGAN, O., and GJESSING, H. K.

[...]

Patricia Grambsch¹•Institutions (1)

University of Minnesota¹

01 Jun 2009-Biometrics

193 citations

Journal Article•DOI•

Towards multi-modal causability with Graph Neural Networks enabling information fusion for explainable AI

[...]

Andreas Holzinger¹, Bernd Malle¹, Anna Saranti¹, Bastian Pfeifer¹•Institutions (1)

University of Graz¹

01 Jul 2021-Information Fusion

TL;DR: In this article, the authors argue for using Graph Neural Networks as a method-of-choice, enabling information fusion for multi-modal causability (causability is the measurable extent to which an explanation to a human expert achieves a specified level of causal understanding).

...read moreread less

182 citations

Journal Article•DOI•

Gated Graph Recurrent Neural Networks

[...]

Luana Ruiz¹, Fernando Gama¹, Alejandro Ribeiro¹•Institutions (1)

University of Pennsylvania¹

26 Oct 2020-IEEE Transactions on Signal Processing

TL;DR: In this paper, the authors introduce Graph Recurrent Neural Networks (GRNNs) as a general learning framework that achieves this goal by leveraging the notion of a recurrent hidden state together with graph signal processing (GSP).

...read moreread less

Abstract: Graph processes exhibit a temporal structure determined by the sequence index and and a spatial structure determined by the graph support. To learn from graph processes, an information processing architecture must then be able to exploit both underlying structures. We introduce Graph Recurrent Neural Networks (GRNNs) as a general learning framework that achieves this goal by leveraging the notion of a recurrent hidden state together with graph signal processing (GSP). In the GRNN, the number of learnable parameters is independent of the length of the sequence and of the size of the graph, guaranteeing scalability. We prove that GRNNs are permutation equivariant and that they are stable to perturbations of the underlying graph support. To address the problem of vanishing gradients, we also put forward gated GRNNs with three different gating mechanisms: time, node and edge gates. In numerical experiments involving both synthetic and real datasets, time-gated GRNNs are shown to improve upon GRNNs in problems with long term dependencies, while node and edge gates help encode long range dependencies present in the graph. The numerical results also show that GRNNs outperform GNNs and RNNs, highlighting the importance of taking both the temporal and graph structures of a graph process into account.

...read moreread less

79 citations

Proceedings Article•DOI•

Discrete-time Temporal Network Embedding via Implicit Hierarchical Learning in Hyperbolic Space

[...]

Menglin Yang¹, Min Zhou², Marcus Kalander², Zengfeng Huang³, Irwin King¹ - Show less +1 more•Institutions (3)

The Chinese University of Hong Kong¹, Huawei², Fudan University³

08 Jul 2021-arXiv: Social and Information Networks

TL;DR: Zhang et al. as discussed by the authors proposed a hyperbolic temporal graph network (HTGN) to capture the inherent complex and hierarchical properties in many real-world temporal networks, leading to sub-optimal embeddings.

...read moreread less

Abstract: Representation learning over temporal networks has drawn considerable attention in recent years. Efforts are mainly focused on modeling structural dependencies and temporal evolving regularities in Euclidean space which, however, underestimates the inherent complex and hierarchical properties in many real-world temporal networks, leading to sub-optimal embeddings. To explore these properties of a complex temporal network, we propose a hyperbolic temporal graph network (HTGN) that fully takes advantage of the exponential capacity and hierarchical awareness of hyperbolic geometry. More specially, HTGN maps the temporal graph into hyperbolic space, and incorporates hyperbolic graph neural network and hyperbolic gated recurrent neural network, to capture the evolving behaviors and implicitly preserve hierarchical information simultaneously. Furthermore, in the hyperbolic space, we propose two important modules that enable HTGN to successfully model temporal networks: (1) hyperbolic temporal contextual self-attention (HTA) module to attend to historical states and (2) hyperbolic temporal consistency (HTC) module to ensure stability and generalization. Experimental results on multiple real-world datasets demonstrate the superiority of HTGN for temporal graph embedding, as it consistently outperforms competing methods by significant margins in various temporal link prediction tasks. Specifically, HTGN achieves AUC improvement up to 9.98% for link prediction and 11.4% for new link prediction. Moreover, the ablation study further validates the representational ability of hyperbolic geometry and the effectiveness of the proposed HTA and HTC modules.

...read moreread less

44 citations

Journal Article•DOI•

A Survey on Embedding Dynamic Graphs

[...]

D T BarrosClaudio, R F MendonçaMatheus, B VieiraAlex, ZivianiArtur

23 Nov 2021-ACM Computing Surveys

TL;DR: In this paper, the authors propose to embed static graphs in low-dimensional vector spaces, which plays a key role in network analytics and inference, supporting applications like node classification, link prediction, and graph visualization.

...read moreread less

Abstract: Embedding static graphs in low-dimensional vector spaces plays a key role in network analytics and inference, supporting applications like node classification, link prediction, and graph visualizat...

...read moreread less

37 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29

Collapse

References

PDF

Open Access

More filters

Journal Article•DOI•

Long short-term memory

[...]

Sepp Hochreiter¹, Jürgen Schmidhuber²•Institutions (2)

Technische Universität München¹, Dalle Molle Institute for Artificial Intelligence Research²

01 Nov 1997-Neural Computation

TL;DR: A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.

...read moreread less

Abstract: Learning to store information over extended time intervals by recurrent backpropagation takes a very long time, mostly because of insufficient, decaying error backflow. We briefly review Hochreiter's (1991) analysis of this problem, then address it by introducing a novel, efficient, gradient based method called long short-term memory (LSTM). Truncating the gradient where this does not do harm, LSTM can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units. Multiplicative gate units learn to open and close access to the constant error flow. LSTM is local in space and time; its computational complexity per time step and weight is O. 1. Our experiments with artificial data involve local, distributed, real-valued, and noisy pattern representations. In comparisons with real-time recurrent learning, back propagation through time, recurrent cascade correlation, Elman nets, and neural sequence chunking, LSTM leads to many more successful runs, and learns much faster. LSTM also solves complex, artificial long-time-lag tasks that have never been solved by previous recurrent network algorithms.

...read moreread less

72,897 citations

"Foundations and Modeling of Dynamic..." refers background in this paper

...where LSTM is a normal LSTM [87] and Vp ∈ Rn is defined as Vp = δpi where δ is the Kronecker delta....
[...]

Proceedings Article•

Attention is All you Need

[...]

Ashish Vaswani¹, Noam Shazeer¹, Niki Parmar², Jakob Uszkoreit¹, Llion Jones¹, Aidan N. Gomez¹, Lukasz Kaiser¹, Illia Polosukhin¹ - Show less +4 more•Institutions (2)

Google¹, University of Southern California²

12 Jun 2017

TL;DR: This paper proposed a simple network architecture based solely on an attention mechanism, dispensing with recurrence and convolutions entirely and achieved state-of-the-art performance on English-to-French translation.

...read moreread less

Abstract: The dominant sequence transduction models are based on complex recurrent orconvolutional neural networks in an encoder and decoder configuration. The best performing such models also connect the encoder and decoder through an attentionm echanisms. We propose a novel, simple network architecture based solely onan attention mechanism, dispensing with recurrence and convolutions entirely.Experiments on two machine translation tasks show these models to be superiorin quality while being more parallelizable and requiring significantly less timeto train. Our single model with 165 million parameters, achieves 27.5 BLEU onEnglish-to-German translation, improving over the existing best ensemble result by over 1 BLEU. On English-to-French translation, we outperform the previoussingle state-of-the-art with model by 0.7 BLEU, achieving a BLEU score of 41.1.

...read moreread less

52,856 citations

Journal Article•DOI•

Generative Adversarial Nets

[...]

Ian Goodfellow¹, Jean Pouget-Abadie¹, Mehdi Mirza¹, Bing Xu¹, David Warde-Farley¹, Sherjil Ozair², Aaron Courville¹, Yoshua Bengio¹ - Show less +4 more•Institutions (2)

Université de Montréal¹, Indian Institute of Technology Delhi²

08 Dec 2014

TL;DR: A new framework for estimating generative models via an adversarial process, in which two models are simultaneously train: a generative model G that captures the data distribution and a discriminative model D that estimates the probability that a sample came from the training data rather than G.

...read moreread less

Abstract: We propose a new framework for estimating generative models via an adversarial process, in which we simultaneously train two models: a generative model G that captures the data distribution, and a discriminative model D that estimates the probability that a sample came from the training data rather than G. The training procedure for G is to maximize the probability of D making a mistake. This framework corresponds to a minimax two-player game. In the space of arbitrary functions G and D, a unique solution exists, with G recovering the training data distribution and D equal to ½ everywhere. In the case where G and D are defined by multilayer perceptrons, the entire system can be trained with backpropagation. There is no need for any Markov chains or unrolled approximate inference networks during either training or generation of samples. Experiments demonstrate the potential of the framework through qualitative and quantitative evaluation of the generated samples.

...read moreread less

38,211 citations

"Foundations and Modeling of Dynamic..." refers background in this paper

...Available: http://arxiv.org/abs/1906.01529 [106] K. Lei, M. Qin, B. Bai, G. Zhang, and M. Yang, ‘‘GCN-GAN: A non-linear temporal link prediction model for weighted dynamic networks,’’ in Proc....
[...]
...GCN-GAN [106] and DynGraphGAN [107] are two such models....
[...]
...GCN-GAN use a stacked DGNN as a generator and a dense feed-forward networks as a discriminator [106] and DynGraphGAN use a shallow generator and a GCN [75] stacked with a CNN as a discriminator [107]....
[...]
...These include: PATCHY-SAN, DyGGNN, RgGNN, StrGNN, EvolveGCN, JODIE, GC-LSTM, GCN-GAN, DynGraphGAN and DyREP. appearing and existing links disappearing....
[...]
...Generative adversarial networks (GAN) [104] have proven to be very successful in the computer vision field [105]....
[...]

Journal Article•DOI•

Emergence of Scaling in Random Networks

[...]

Albert-László Barabási¹, Réka Albert¹•Institutions (1)

University of Notre Dame¹

15 Oct 1999-Science

TL;DR: A model based on these two ingredients reproduces the observed stationary scale-free distributions, which indicates that the development of large networks is governed by robust self-organizing phenomena that go beyond the particulars of the individual systems.

...read moreread less

Abstract: Systems as diverse as genetic networks or the World Wide Web are best described as networks with complex topology. A common property of many large networks is that the vertex connectivities follow a scale-free power-law distribution. This feature was found to be a consequence of two generic mechanisms: (i) networks expand continuously by the addition of new vertices, and (ii) new vertices attach preferentially to sites that are already well connected. A model based on these two ingredients reproduces the observed stationary scale-free distributions, which indicates that the development of large networks is governed by robust self-organizing phenomena that go beyond the particulars of the individual systems.

...read moreread less

33,771 citations

"Foundations and Modeling of Dynamic..." refers background in this paper

...Many models define rules for how links are established [29], [30]....
[...]
...models such as preferential attachment [29], forest fire [30] and GraphRNN [31]....
[...]

Posted Content•

Semi-Supervised Classification with Graph Convolutional Networks

[...]

Thomas Kipf¹, Max Welling¹•Institutions (1)

University of Amsterdam¹

09 Sep 2016-arXiv: Learning

TL;DR: A scalable approach for semi-supervised learning on graph-structured data that is based on an efficient variant of convolutional neural networks which operate directly on graphs which outperforms related methods by a significant margin.

...read moreread less

Abstract: We present a scalable approach for semi-supervised learning on graph-structured data that is based on an efficient variant of convolutional neural networks which operate directly on graphs. We motivate the choice of our convolutional architecture via a localized first-order approximation of spectral graph convolutions. Our model scales linearly in the number of graph edges and learns hidden layer representations that encode both local graph structure and features of nodes. In a number of experiments on citation networks and on a knowledge graph dataset we demonstrate that our approach outperforms related methods by a significant margin.

...read moreread less

15,696 citations