Zhilin Yang

Researcher at Carnegie Mellon University

Publications - 69

Citations - 16355

Zhilin Yang is an academic researcher from Carnegie Mellon University. The author has contributed to research in topics: Computer science & Language model. The author has an hindex of 31, co-authored 57 publications receiving 11112 citations. Previous affiliations of Zhilin Yang include Tsinghua University.

Papers

PDF

Open Access

More filters

Proceedings Article

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Zhilin Yang, +5 more

TL;DR: The authors proposes XLNet, a generalized autoregressive pretraining method that enables learning bidirectional contexts by maximizing the expected likelihood over all permutations of the factorization order and overcomes the limitations of BERT The authors.

...read moreread less

Posted Content

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Zhilin Yang, +5 more

- 19 Jun 2019 -

arXiv: Computation and Language

TL;DR: XLNet is proposed, a generalized autoregressive pretraining method that enables learning bidirectional contexts by maximizing the expected likelihood over all permutations of the factorization order and overcomes the limitations of BERT thanks to its autore progressive formulation.

...read moreread less

Proceedings ArticleDOI

Transformer-XL: Attentive Language Models beyond a Fixed-Length Context.

Zihang Dai, +5 more

TL;DR: This work proposes a novel neural architecture Transformer-XL that enables learning dependency beyond a fixed length without disrupting temporal coherence, which consists of a segment-level recurrence mechanism and a novel positional encoding scheme.

...read moreread less

Proceedings Article

Revisiting semi-supervised learning with graph embeddings

Zhilin Yang, +2 more

TL;DR: In this article, a semi-supervised learning framework based on graph embeddings is proposed, where given a graph between instances, an embedding for each instance is trained to jointly predict the class label and the neighborhood context in the graph.

...read moreread less

Proceedings ArticleDOI

HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering

Zhilin Yang, +6 more

TL;DR: HotpotQA as discussed by the authors is a dataset with 113k Wikipedia-based question-answer pairs with four key features: finding and reasoning over multiple supporting documents to answer; the questions are diverse and not constrained to any pre-existing knowledge bases or knowledge schemas; providing sentence-level supporting facts required for reasoning; and offering a new type of factoid comparison questions to test QA systems' ability to extract relevant facts and perform necessary comparison.

...read moreread less

Collapse