Author

Hongyan Bao

Bio: Hongyan Bao is an academic researcher from King Abdullah University of Science and Technology. The author has contributed to research on the topics of Computer science and Embedding, has an h-index of 4, and has co-authored 4 publications receiving 93 citations.

Papers
Proceedings ArticleDOI
30 Jan 2019
TL;DR: Introduces a Co-embedding model for Attributed Networks (CAN), which learns low-dimensional representations of both attributes and nodes in the same semantic space so that the affinities between them can be effectively captured and measured, using a variational auto-encoder that embeds each node and attribute with the means and variances of Gaussian distributions.
Abstract: Existing embedding methods for attributed networks aim at learning low-dimensional vector representations for nodes only, not for both nodes and attributes, and therefore cannot capture the affinities between nodes and attributes. However, capturing such affinities is of great importance to the success of many real-world attributed network applications, such as attribute inference and user profiling. Accordingly, in this paper, we introduce a Co-embedding model for Attributed Networks (CAN), which learns low-dimensional representations of both attributes and nodes in the same semantic space such that the affinities between them can be effectively captured and measured. To obtain high-quality embeddings, we propose a variational auto-encoder that embeds each node and attribute with the means and variances of Gaussian distributions. Experimental results on real-world networks demonstrate that our model yields excellent performance in a number of applications compared with state-of-the-art techniques.
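A minimal sketch of the co-embedding idea, assuming PyTorch: every node and every attribute gets a Gaussian (mean and log-variance) in one shared space, samples are drawn with the reparameterization trick, and node-attribute affinity is scored by an inner product. The lookup-table encoders and the inner-product decoder are illustrative simplifications, not the authors' exact architecture.

```python
import torch
import torch.nn as nn

class CoEmbedder(nn.Module):
    """Sketch: co-embed nodes and attributes as Gaussians in one space."""
    def __init__(self, n_nodes, n_attrs, dim):
        super().__init__()
        # A mean and a log-variance vector for every node and attribute.
        self.node_mu = nn.Embedding(n_nodes, dim)
        self.node_logvar = nn.Embedding(n_nodes, dim)
        self.attr_mu = nn.Embedding(n_attrs, dim)
        self.attr_logvar = nn.Embedding(n_attrs, dim)

    @staticmethod
    def sample(mu, logvar):
        # Reparameterization trick: z = mu + sigma * eps.
        return mu + torch.exp(0.5 * logvar) * torch.randn_like(mu)

    def forward(self, node_ids, attr_ids):
        z_node = self.sample(self.node_mu(node_ids), self.node_logvar(node_ids))
        z_attr = self.sample(self.attr_mu(attr_ids), self.attr_logvar(attr_ids))
        # Node-attribute affinity: inner product in the shared semantic space.
        return torch.sigmoid((z_node * z_attr).sum(dim=-1))
```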

119 citations

Posted Content
TL;DR: The results demonstrate that PAGE not only converges much faster than SGD in training but also achieves higher test accuracy, validating the theoretical results and confirming the practical superiority of PAGE.
Abstract: In this paper, we propose a novel stochastic gradient estimator -- ProbAbilistic Gradient Estimator (PAGE) -- for nonconvex optimization. PAGE is easy to implement as it is designed via a small adjustment to vanilla SGD: in each iteration, PAGE uses the vanilla minibatch SGD update with probability $p_t$ or reuses the previous gradient with a small adjustment, at a much lower computational cost, with probability $1-p_t$. We give a simple formula for the optimal choice of $p_t$. Moreover, we prove the first tight lower bound $\Omega(n+\frac{\sqrt{n}}{\epsilon^2})$ for nonconvex finite-sum problems, which also leads to a tight lower bound $\Omega(b+\frac{\sqrt{b}}{\epsilon^2})$ for nonconvex online problems, where $b:= \min\{\frac{\sigma^2}{\epsilon^2}, n\}$. Then, we show that PAGE obtains the optimal convergence results $O(n+\frac{\sqrt{n}}{\epsilon^2})$ (finite-sum) and $O(b+\frac{\sqrt{b}}{\epsilon^2})$ (online) matching our lower bounds for both nonconvex finite-sum and online problems. Besides, we show that for nonconvex functions satisfying the Polyak-Łojasiewicz (PL) condition, PAGE can automatically switch to a faster linear convergence rate $O(\cdot\log \frac{1}{\epsilon})$. Finally, we conduct several deep learning experiments (e.g., LeNet, VGG, ResNet) on real datasets in PyTorch showing that PAGE not only converges much faster than SGD in training but also achieves higher test accuracy, validating the optimal theoretical results and confirming the practical superiority of PAGE.
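A minimal sketch of the PAGE update under the rule stated above, assuming NumPy; the function name, signature, and batch-sampling scheme are illustrative. With probability p the estimator is refreshed with a large minibatch gradient; otherwise the previous estimator is reused with a cheap correction computed on one small minibatch evaluated at both the new and the previous iterate (the paper also gives a simple formula for the optimal p, omitted here).

```python
import numpy as np

def page(grad_on_batch, n, x0, lr, p, b_large, b_small, n_iters, seed=0):
    """Sketch of the PAGE gradient estimator wrapped around SGD.

    grad_on_batch(x, idx) -> gradient of the objective averaged over the
    data points indexed by idx (a hypothetical user-supplied callable).
    """
    rng = np.random.default_rng(seed)
    x_prev = x = np.asarray(x0, dtype=float)
    g = grad_on_batch(x, rng.choice(n, size=b_large, replace=False))
    for _ in range(n_iters):
        x_prev, x = x, x - lr * g
        if rng.random() < p:
            # With probability p: fresh large-minibatch SGD gradient.
            g = grad_on_batch(x, rng.choice(n, size=b_large, replace=False))
        else:
            # With probability 1-p: reuse g with a cheap correction on one
            # SMALL batch, evaluated at both iterates:
            #   g <- g + grad_B(x_t) - grad_B(x_{t-1}).
            idx = rng.choice(n, size=b_small, replace=False)
            g = g + grad_on_batch(x, idx) - grad_on_batch(x_prev, idx)
    return x
```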

57 citations

Proceedings ArticleDOI
23 Aug 2020
TL;DR: Proposes a computationally efficient orthogonal matching pursuit-guided attack method for evasion attacks on discrete data, with provable guarantees on computational efficiency and attack performance; substantial experimental results validate the proposed attackability conditions and the effectiveness of the attack.
Abstract: Evasion attack on discrete data is a challenging yet practically interesting research topic. It is intrinsically an NP-hard combinatorial optimization problem. Characterizing the conditions guaranteeing the solvability of an evasion attack task thus becomes the key to understanding the adversarial threat. Our study is inspired by weak submodularity theory. We characterize the attackability of a targeted classifier on discrete data in evasion attack by bridging the attackability measurement and the regularity of the targeted classifier. Based on our attackability analysis, we propose a computationally efficient orthogonal matching pursuit-guided attack method for evasion attack on discrete data. It provides provable guarantees on computational efficiency and attack performance. Substantial experimental results on real-world datasets validate the proposed attackability conditions and the effectiveness of the proposed attack method.
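A generic greedy sketch in the matching-pursuit spirit, assuming NumPy and binary features; this is not the authors' exact algorithm, and `loss_grad` is a hypothetical callable. At each step the attack flips the single feature whose gradient direction promises the largest loss increase, up to a flip budget.

```python
import numpy as np

def greedy_flip_attack(loss_grad, x, budget):
    """Greedy, OMP-style evasion sketch on binary features (illustrative).

    loss_grad(x) -> (loss, gradient) of the targeted classifier's loss at x.
    """
    x_adv = x.copy().astype(float)
    flipped = set()
    for _ in range(budget):
        _, grad = loss_grad(x_adv)
        # Estimated loss change of flipping bit j: (1 - 2*x_j) * grad_j.
        scores = (1.0 - 2.0 * x_adv) * grad
        scores[list(flipped)] = -np.inf   # flip each feature at most once
        j = int(np.argmax(scores))
        if scores[j] <= 0:
            break                         # no remaining flip increases loss
        x_adv[j] = 1.0 - x_adv[j]
        flipped.add(j)
    return x_adv
```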

15 citations

Proceedings Article
18 Jul 2021
TL;DR: In this paper, a stochastic gradient estimator called the ProbAbilistic Gradient Estimator (PAGE) is proposed for nonconvex optimization; for functions satisfying the Polyak-Łojasiewicz condition, it automatically switches to a faster linear convergence rate.
Abstract: Identical to the abstract of the preprint version listed above.

7 citations


Cited by
Proceedings ArticleDOI
23 Aug 2020
TL;DR: This paper proposes adaptive multi-channel graph convolutional networks for semi-supervised classification (AM-GCN), which extract the specific and common embeddings from node features, topological structures, and their combinations simultaneously, and use the attention mechanism to learn adaptive importance weights for the embeddings.
Abstract: Graph Convolutional Networks (GCNs) have gained great popularity in tackling various analytics tasks on graph and network data. However, some recent studies raise concerns about whether GCNs can optimally integrate node features and topological structures in a complex graph with rich information. In this paper, we first present an experimental investigation. Surprisingly, our experimental results clearly show that the capability of the state-of-the-art GCNs in fusing node features and topological structures is distant from optimal or even satisfactory. The weakness may severely hinder the capability of GCNs in some classification tasks, since GCNs may not be able to adaptively learn some deep correlation information between topological structures and node features. Can we remedy the weakness and design a new type of GCNs that can retain the advantages of the state-of-the-art GCNs and, at the same time, enhance the capability of fusing topological structures and node features substantially? We tackle the challenge and propose adaptive multi-channel graph convolutional networks for semi-supervised classification (AM-GCN). The central idea is that we extract the specific and common embeddings from node features, topological structures, and their combinations simultaneously, and use the attention mechanism to learn adaptive importance weights of the embeddings. Our extensive experiments on benchmark data sets clearly show that AM-GCN extracts the most correlated information from both node features and topological structures, and improves the classification accuracy by a clear margin.
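A minimal sketch of the attention-based fusion step, assuming PyTorch; the layer sizes and the shared scorer are illustrative. Channel-specific node embeddings (e.g. from topology, feature, and common GCN channels) are combined with per-node softmax attention weights.

```python
import torch
import torch.nn as nn

class AttentionFusion(nn.Module):
    """Sketch: AM-GCN-style attention over per-channel node embeddings."""
    def __init__(self, dim, hidden=16):
        super().__init__()
        # Shared scorer producing one attention logit per node per channel.
        self.score = nn.Sequential(nn.Linear(dim, hidden), nn.Tanh(),
                                   nn.Linear(hidden, 1, bias=False))

    def forward(self, channels):
        # channels: list of [n_nodes, dim] embeddings from separate channels.
        z = torch.stack(channels, dim=1)           # [n_nodes, n_ch, dim]
        w = torch.softmax(self.score(z), dim=1)    # [n_nodes, n_ch, 1]
        return (w * z).sum(dim=1)                  # adaptive weighted fusion
```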

253 citations

Posted Content
TL;DR: This work presents a simple yet effective unsupervised network embedding method for attributed multiplex networks, called DMGI, inspired by Deep Graph Infomax (DGI), which maximizes the mutual information between local patches of a graph and the global representation of the entire graph.
Abstract: Nodes in a multiplex network are connected by multiple types of relations. However, most existing network embedding methods assume that only a single type of relation exists between nodes. Even those that consider the multiplexity of a network overlook node attributes, resort to node labels for training, and fail to model the global properties of a graph. We present a simple yet effective unsupervised network embedding method for attributed multiplex networks, called DMGI, inspired by Deep Graph Infomax (DGI), which maximizes the mutual information between local patches of a graph and the global representation of the entire graph. We devise a systematic way to jointly integrate the node embeddings from multiple graphs by introducing 1) a consensus regularization framework that minimizes the disagreements among the relation-type-specific node embeddings, and 2) a universal discriminator that discriminates true samples regardless of the relation types. We also show that the attention mechanism infers the importance of each relation type, and thus can be useful for filtering unnecessary relation types as a preprocessing step. Extensive experiments on various downstream tasks demonstrate that DMGI outperforms the state-of-the-art methods, even though DMGI is fully unsupervised.
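A minimal sketch of the consensus regularization idea, assuming PyTorch; the quadratic disagreement term and the optional weighting are illustrative simplifications (the paper's full objective also involves embeddings from corrupted graphs and the universal discriminator). A trainable consensus embedding is pulled toward every relation-type-specific embedding.

```python
import torch

def consensus_loss(consensus, per_relation, weights=None):
    """Sketch of DMGI-style consensus regularization (illustrative).

    consensus:    [n_nodes, dim] trainable consensus embedding.
    per_relation: list of [n_nodes, dim] relation-type-specific embeddings.
    Minimizing this reduces disagreement between the consensus embedding
    and each relation's node embeddings.
    """
    weights = weights or [1.0] * len(per_relation)
    return sum(w * ((consensus - z) ** 2).mean()
               for w, z in zip(weights, per_relation))
```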

86 citations

Posted Content
TL;DR: An extensive, standardized benchmark of fifteen particularly popular deep learning optimizers is performed, identifying a significantly reduced subset of specific algorithms and parameter choices that generally provided competitive results in the authors' experiments.
Abstract: Choosing the optimizer is considered to be among the most crucial design decisions in deep learning, and it is not an easy one. The growing literature now lists hundreds of optimization methods. In the absence of clear theoretical guidance and conclusive empirical evidence, the decision is often made based on anecdotes. In this work, we aim to replace these anecdotes, if not with a conclusive ranking, then at least with evidence-backed heuristics. To do so, we perform an extensive, standardized benchmark of fifteen particularly popular deep learning optimizers while giving a concise overview of the wide range of possible choices. Analyzing more than $50,000$ individual runs, we contribute the following three points: (i) Optimizer performance varies greatly across tasks. (ii) We observe that evaluating multiple optimizers with default parameters works approximately as well as tuning the hyperparameters of a single, fixed optimizer. (iii) While we cannot discern an optimization method clearly dominating across all tested tasks, we identify a significantly reduced subset of specific optimizers and parameter choices that generally lead to competitive results in our experiments: Adam remains a strong contender, with newer methods failing to significantly and consistently outperform it. Our open-sourced results are available as challenging and well-tuned baselines for more meaningful evaluations of novel optimization methods without requiring any further computational efforts.
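A minimal sketch of the heuristic behind finding (ii), assuming PyTorch; the candidate set, `model_fn`, and `train_eval` are illustrative stand-ins. Several optimizers are run at (near-)default hyperparameters from identical initializations, and the best validation score wins, instead of tuning a single optimizer.

```python
import torch

def best_default_optimizer(model_fn, train_eval):
    """Sketch: compare several optimizers at default settings.

    model_fn() -> a fresh, identically initialized model.
    train_eval(model, optimizer) -> validation score after training.
    """
    candidates = {
        "adam":    lambda p: torch.optim.Adam(p),
        "adamw":   lambda p: torch.optim.AdamW(p),
        "rmsprop": lambda p: torch.optim.RMSprop(p),
        "sgd":     lambda p: torch.optim.SGD(p, lr=0.01, momentum=0.9),
    }
    scores = {}
    for name, make in candidates.items():
        model = model_fn()   # same initialization for a fair comparison
        scores[name] = train_eval(model, make(model.parameters()))
    return max(scores, key=scores.get), scores
```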

85 citations

Proceedings ArticleDOI
TL;DR: In this article, adaptive multi-channel graph convolutional networks for semi-supervised classification (AM-GCN) are proposed, which extract the specific and common embeddings from node features, topological structures, and their combinations simultaneously, and use the attention mechanism to learn adaptive importance weights for the embeddings.
Abstract: Identical to the abstract of the conference version listed above.

83 citations

Proceedings ArticleDOI
TL;DR: This paper proposes High-order Deep Multiplex Infomax (HDMI) for learning node embeddings on multiplex networks in a self-supervised way.
Abstract: Networks have been widely used to represent the relations between objects such as academic networks and social networks, and learning embeddings for networks has thus garnered plenty of research attention. Self-supervised network representation learning aims at extracting node embeddings without external supervision. Recently, maximizing the mutual information between the local node embedding and the global summary (e.g. Deep Graph Infomax, or DGI for short) has shown promising results on many downstream tasks such as node classification. However, there are two major limitations of DGI. Firstly, DGI merely considers the extrinsic supervision signal (i.e., the mutual information between node embedding and global summary) while ignoring the intrinsic signal (i.e., the mutual dependence between node embedding and node attributes). Secondly, nodes in a real-world network are usually connected by multiple edges with different relations, while DGI does not fully explore the various relations among nodes. To address the above-mentioned problems, we propose a novel framework, called High-order Deep Multiplex Infomax (HDMI), for learning node embeddings on multiplex networks in a self-supervised way. To be more specific, we first design a joint supervision signal containing both extrinsic and intrinsic mutual information by high-order mutual information, and we propose a High-order Deep Infomax (HDI) to optimize the proposed supervision signal. Then we propose an attention-based fusion module to combine node embeddings from different layers of the multiplex network. Finally, we evaluate the proposed HDMI on various downstream tasks such as unsupervised clustering and supervised classification. The experimental results show that HDMI achieves state-of-the-art performance on these tasks.
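A minimal sketch of the DGI-style local-global discriminator that HDMI builds on, assuming PyTorch; the bilinear scorer and binary cross-entropy loss are illustrative, and the high-order (intrinsic plus extrinsic) signal and the multiplex fusion module are omitted.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class InfomaxDiscriminator(nn.Module):
    """Sketch: DGI/HDMI-style local-global discriminator (illustrative)."""
    def __init__(self, dim):
        super().__init__()
        self.bilinear = nn.Bilinear(dim, dim, 1)

    def forward(self, h_pos, h_neg, summary):
        # summary: [dim] readout of the graph, broadcast to every node.
        s = summary.expand_as(h_pos)
        pos = self.bilinear(h_pos, s).squeeze(-1)  # real node embeddings
        neg = self.bilinear(h_neg, s).squeeze(-1)  # corrupted embeddings
        # Push positives toward 1 and negatives toward 0 (MI surrogate).
        return (F.binary_cross_entropy_with_logits(pos, torch.ones_like(pos)) +
                F.binary_cross_entropy_with_logits(neg, torch.zeros_like(neg)))
```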

79 citations