Author

Zhengyan Zhang

Bio: Zhengyan Zhang is an academic researcher from Tsinghua University. The author has contributed to research on topics including computer science and language models. The author has an h-index of 12 and has co-authored 35 publications receiving 2,413 citations.

Topics: Computer science, Language model, Backdoor, Deep learning, Vertex (geometry)

Papers

Posted Content

Graph Neural Networks: A Review of Methods and Applications

Jie Zhou¹, Ganqu Cui¹, Shengding Hu¹, Zhengyan Zhang¹, Cheng Yang², Zhiyuan Liu¹, Lifeng Wang³, Changcheng Li³, Maosong Sun¹

Tsinghua University¹, Beijing University of Posts and Telecommunications², Tencent³

20 Dec 2018-arXiv: Learning

TL;DR: This survey provides a detailed review of existing graph neural network models, systematically categorizes their applications, and proposes four open problems for future research.

Abstract: Lots of learning tasks require dealing with graph data which contains rich relation information among elements. Modeling physics systems, learning molecular fingerprints, predicting protein interface, and classifying diseases demand a model to learn from graph inputs. In other domains such as learning from non-structural data like texts and images, reasoning on extracted structures (like the dependency trees of sentences and the scene graphs of images) is an important research topic which also needs graph reasoning models. Graph neural networks (GNNs) are neural models that capture the dependence of graphs via message passing between the nodes of graphs. In recent years, variants of GNNs such as graph convolutional network (GCN), graph attention network (GAT), graph recurrent network (GRN) have demonstrated ground-breaking performances on many deep learning tasks. In this survey, we propose a general design pipeline for GNN models and discuss the variants of each component, systematically categorize the applications, and propose four open problems for future research.

2,494 citations

Journal Article

Graph Neural Networks: A Review of Methods and Applications

Jie Zhou¹, Ganqu Cui¹, Shengding Hu¹, Zhengyan Zhang¹, Cheng Yang², Zhiyuan Liu¹, Lifeng Wang³, Changcheng Li³, Maosong Sun¹

Tsinghua University¹, Beijing University of Posts and Telecommunications², Tencent³

01 Jan 2020

TL;DR: In this paper, the authors propose a general design pipeline for GNN models and discuss the variants of each component, systematically categorize the applications, and propose four open problems for future research.

1,266 citations

Proceedings Article

ERNIE: Enhanced Language Representation with Informative Entities

Zhengyan Zhang¹, Xu Han¹, Zhiyuan Liu¹, Xin Jiang², Maosong Sun¹, Qun Liu²

Tsinghua University¹, Huawei²

17 May 2019

TL;DR: This paper utilizes both large-scale textual corpora and KGs to train an enhanced language representation model (ERNIE) which can take full advantage of lexical, syntactic, and knowledge information simultaneously, and is comparable with the state-of-the-art model BERT on other common NLP tasks.

Abstract: Neural language representation models such as BERT pre-trained on large-scale corpora can well capture rich semantic patterns from plain text, and be fine-tuned to consistently improve the performance of various NLP tasks. However, the existing pre-trained language models rarely consider incorporating knowledge graphs (KGs), which can provide rich structured knowledge facts for better language understanding. We argue that informative entities in KGs can enhance language representation with external knowledge. In this paper, we utilize both large-scale textual corpora and KGs to train an enhanced language representation model (ERNIE), which can take full advantage of lexical, syntactic, and knowledge information simultaneously. The experimental results have demonstrated that ERNIE achieves significant improvements on various knowledge-driven tasks, and meanwhile is comparable with the state-of-the-art model BERT on other common NLP tasks. The code and datasets will be available in the future.

1,076 citations

Posted Content

KEPLER: A Unified Model for Knowledge Embedding and Pre-trained Language Representation

Xiaozhi Wang¹, Tianyu Gao¹, Zhaocheng Zhu¹, Zhengyan Zhang¹, Zhiyuan Liu², Juanzi Li³, Jian Tang⁴

Tsinghua University¹, Princeton University², Université de Montréal³, HEC Montréal⁴

13 Nov 2019-arXiv: Computation and Language

TL;DR: This paper proposes a unified model for Knowledge Embedding and Pre-trained LanguagE Representation (KEPLER), which can not only better integrate factual knowledge into PLMs but also produce effective text-enhanced KE with the strong PLMs.

Abstract: Pre-trained language representation models (PLMs) cannot well capture factual knowledge from text. In contrast, knowledge embedding (KE) methods can effectively represent the relational facts in knowledge graphs (KGs) with informative entity embeddings, but conventional KE models cannot take full advantage of the abundant textual information. In this paper, we propose a unified model for Knowledge Embedding and Pre-trained LanguagE Representation (KEPLER), which can not only better integrate factual knowledge into PLMs but also produce effective text-enhanced KE with the strong PLMs. In KEPLER, we encode textual entity descriptions with a PLM as their embeddings, and then jointly optimize the KE and language modeling objectives. Experimental results show that KEPLER achieves state-of-the-art performances on various NLP tasks, and also works remarkably well as an inductive KE model on KG link prediction. Furthermore, for pre-training and evaluating KEPLER, we construct Wikidata5M, a large-scale KG dataset with aligned entity descriptions, and benchmark state-of-the-art KE methods on it. It shall serve as a new KE benchmark and facilitate the research on large KG, inductive KE, and KG with text. The source code can be obtained from this https URL.

269 citations

Journal Article

KEPLER: A Unified Model for Knowledge Embedding and Pre-trained Language Representation

Xiaozhi Wang¹, Tianyu Gao¹, Zhaocheng Zhu, Zhengyan Zhang¹, Zhiyuan Liu¹, Juanzi Li¹, Jian Tang

Tsinghua University¹

11 Mar 2021-Transactions of the Association for Computational Linguistics

TL;DR: The authors propose a unified model for Knowledge Embedding and Pre-trained LanguagE Representation (KEPLER), which can not only better integrate factual knowledge into PLMs but also produce effective text-enhanced KE with the strong PLMs.

179 citations

Cited by

Journal Article

Deep Learning for Generic Object Detection: A Survey

Li Liu¹,², Wanli Ouyang³, Xiaogang Wang⁴, Paul Fieguth⁵, Jie Chen¹, Xinwang Liu², Matti Pietikäinen¹

University of Oulu¹, National University of Defense Technology², University of Sydney³, The Chinese University of Hong Kong⁴, University of Waterloo⁵

01 Feb 2020-International Journal of Computer Vision

TL;DR: A comprehensive survey of the recent achievements in this field brought about by deep learning techniques, covering many aspects of generic object detection: detection frameworks, object feature representation, object proposal generation, context modeling, training strategies, and evaluation metrics.

Abstract: Object detection, one of the most fundamental and challenging problems in computer vision, seeks to locate object instances from a large number of predefined categories in natural images. Deep learning techniques have emerged as a powerful strategy for learning feature representations directly from data and have led to remarkable breakthroughs in the field of generic object detection. Given this period of rapid evolution, the goal of this paper is to provide a comprehensive survey of the recent achievements in this field brought about by deep learning techniques. More than 300 research contributions are included in this survey, covering many aspects of generic object detection: detection frameworks, object feature representation, object proposal generation, context modeling, training strategies, and evaluation metrics. We finish the survey by identifying promising directions for future research.

1,897 citations

Posted Content

Open Graph Benchmark: Datasets for Machine Learning on Graphs

Weihua Hu¹, Matthias Fey², Marinka Zitnik³, Yuxiao Dong¹, Hongyu Ren¹, Bowen Liu¹, Michele Catasta¹, Jure Leskovec¹

Stanford University¹, Technical University of Dortmund², Harvard University³

02 May 2020-arXiv: Learning

TL;DR: The OGB datasets are large-scale, encompass multiple important graph ML tasks, and cover a diverse range of domains, from social and information networks to biological networks, molecular graphs, source code ASTs, and knowledge graphs. Benchmark experiments suggest that they pose significant challenges of scalability and out-of-distribution generalization, indicating fruitful opportunities for future research.

Abstract: We present the Open Graph Benchmark (OGB), a diverse set of challenging and realistic benchmark datasets to facilitate scalable, robust, and reproducible graph machine learning (ML) research. OGB datasets are large-scale, encompass multiple important graph ML tasks, and cover a diverse range of domains, ranging from social and information networks to biological networks, molecular graphs, source code ASTs, and knowledge graphs. For each dataset, we provide a unified evaluation protocol using meaningful application-specific data splits and evaluation metrics. In addition to building the datasets, we also perform extensive benchmark experiments for each dataset. Our experiments suggest that OGB datasets present significant challenges of scalability to large-scale graphs and out-of-distribution generalization under realistic data splits, indicating fruitful opportunities for future research. Finally, OGB provides an automated end-to-end graph ML pipeline that simplifies and standardizes the process of graph data loading, experimental setup, and model evaluation. OGB will be regularly updated and welcomes inputs from the community. OGB datasets as well as data loaders, evaluation scripts, baseline code, and leaderboards are publicly available at this https URL .

1,097 citations

Journal Article

A Survey on Knowledge Graphs: Representation, Acquisition and Applications

Shaoxiong Ji¹, Shirui Pan², Erik Cambria³, Pekka Marttinen¹, Philip S. Yu⁴

Aalto University¹, Monash University, Clayton campus², Nanyang Technological University³, University of Illinois at Chicago⁴

26 Apr 2021-IEEE Transactions on Neural Networks and Learning Systems

TL;DR: A comprehensive review of knowledge graphs covering: 1) knowledge graph representation learning; 2) knowledge acquisition and completion; 3) temporal knowledge graphs; and 4) knowledge-aware applications, with a summary of recent breakthroughs and perspective directions to facilitate future research.

Abstract: Human knowledge provides a formal understanding of the world. Knowledge graphs that represent structural relations between entities have become an increasingly popular research direction toward cognition and human-level intelligence. In this survey, we provide a comprehensive review of the knowledge graph covering overall research topics about: 1) knowledge graph representation learning; 2) knowledge acquisition and completion; 3) temporal knowledge graph; and 4) knowledge-aware applications and summarize recent breakthroughs and perspective directions to facilitate future research. We propose a full-view categorization and new taxonomies on these topics. Knowledge graph embedding is organized from four aspects of representation space, scoring function, encoding models, and auxiliary information. For knowledge acquisition, especially knowledge graph completion, embedding methods, path inference, and logical rule reasoning are reviewed. We further explore several emerging topics, including metarelational learning, commonsense reasoning, and temporal knowledge graphs. To facilitate future research on knowledge graphs, we also provide a curated collection of data sets and open-source libraries on different tasks. In the end, we have a thorough outlook on several promising research directions.

1,025 citations

Journal Article

SpanBERT: Improving Pre-training by Representing and Predicting Spans

Mandar Joshi¹, Danqi Chen², Yinhan Liu³, Daniel S. Weld¹, Luke Zettlemoyer¹, Omer Levy³

University of Washington¹, Princeton University², Facebook³

12 Mar 2020-Transactions of the Association for Computational Linguistics

TL;DR: The approach extends BERT by masking contiguous random spans, rather than random tokens, and training the span boundary representations to predict the entire content of the masked span, without relying on the individual token representations within it.

Abstract: We present SpanBERT, a pre-training method that is designed to better represent and predict spans of text. Our approach extends BERT by (1) masking contiguous random spans, rather than random token...

1,018 citations

Journal Article

A Review of Uncertainty Quantification in Deep Learning: Techniques, Applications and Challenges

Moloud Abdar¹, Farhad Pourpanah², Sadiq Hussain³, Dana Rezazadegan⁴, Li Liu⁵, Mohammad Ghavamzadeh⁶, Paul Fieguth⁷, Xiaochun Cao⁸, Abbas Khosravi¹, U. Rajendra Acharya⁹,¹⁰,¹¹, Vladimir Makarenkov¹², Saeid Nahavandi¹

Deakin University¹, Shenzhen University², Dibrugarh University³, Swinburne University of Technology⁴, University of Oulu⁵, Google⁶, University of Waterloo⁷, Chinese Academy of Sciences⁸, Ngee Ann Polytechnic⁹, Asia University (Taiwan)¹⁰, National University of Singapore¹¹, Université du Québec¹²

12 Nov 2020-arXiv: Learning

TL;DR: This study reviews recent advances in UQ methods used in deep learning, investigates the application of these methods in reinforcement learning (RL), and outlines a few important applications of UQ methods.

Abstract: Uncertainty quantification (UQ) plays a pivotal role in reduction of uncertainties during both optimization and decision making processes. It can be applied to solve a variety of real-world applications in science and engineering. Bayesian approximation and ensemble learning techniques are two most widely-used UQ methods in the literature. In this regard, researchers have proposed different UQ methods and examined their performance in a variety of applications such as computer vision (e.g., self-driving cars and object detection), image processing (e.g., image restoration), medical image analysis (e.g., medical image classification and segmentation), natural language processing (e.g., text classification, social media texts and recidivism risk-scoring), bioinformatics, etc. This study reviews recent advances in UQ methods used in deep learning. Moreover, we also investigate the application of these methods in reinforcement learning (RL). Then, we outline a few important applications of UQ methods. Finally, we briefly highlight the fundamental research challenges faced by UQ methods and discuss the future research directions in this field.

809 citations
