Huaixiu Zheng
Publications - 5
Citations - 719
Huaixiu Zheng is an academic researcher whose work focuses on computer science. The author has an h-index of 5 and has co-authored 5 publications receiving 719 citations.
Papers
Journal Article
LaMDA: Language Models for Dialog Applications
Romal Thoppilan, Daniel Adiwardana, Jamie Hall, Noam Shazeer, Apoorv Kulshreshtha, Heng-Tze Cheng, Alicia Jin, Taylor Bos, Leslie Baker, Yu Du, Yaguang Li, Hongrae Lee, Huaixiu Zheng, Amin Ghafouri, Marcelo Menegali, Yanping Huang, Maxim Krikun, Dmitry Lepikhin, James Qin, Dehao Chen, Yuanzhong Xu, Zhifeng Chen, Adam Roberts, Maarten Bosma, Yanqi Zhou, Chung-Ching Chang, I. A. Krivokon, Willard J. Rusch, Marc Pickett, Kathleen S. Meier-Hellstern, Meredith Ringel Morris, Tulsee Doshi, Renelito Delos Santos, Toju Duke, Johnny Hartz Søraker, Bendert Zevenbergen, Vinodkumar Prabhakaran, Mark Díaz, Ben Hutchinson, Kristen Olson, Alejandra Aguirre Molina, Erin Hoffman-John, Josh Lee, Lora Aroyo, Ravindran Rajakumar, Alena Butryna, Matthew Lamm, V. O. Kuzmina, Joseph Fenton, Aaron Cohen, Rachel Bernstein, Raymond C. Kurzweil, Blaise Agüera y Arcas, Claire Cui, Marian Rogers Croak, Ed H. Chi, Quoc V. Le +56 more
TL;DR: The authors present LaMDA, a family of Transformer-based neural language models specialized for dialog, with up to 137B parameters and pre-trained on 1.56T words of public dialog data and web text. They demonstrate that fine-tuning with annotated data and enabling the model to consult external knowledge sources lead to significant improvements on the two key challenges of safety and factual grounding.
Journal Article
Unifying Language Learning Paradigms
Yi Tay, Mostafa Dehghani, Vinh Q. Tran, Xavier Garcia, Dara Bahri, T. Schuster, Huaixiu Zheng, Neil Houlsby, Donald Metzler +8 more
TL;DR: UL2 achieves SOTA performance on 50 well-established supervised NLP tasks spanning language generation, language understanding, text classification, question answering, commonsense reasoning, long-text reasoning, structured knowledge grounding, and information retrieval.
Proceedings ArticleDOI
HyperPrompt: Prompt-based Task-Conditioning of Transformers
Yun He, Huaixiu Zheng, Yi Tay, Jai Gupta, Yu Du, Vamsi Aribandi, Zhe Zhao, Yaguang Li, Zhao Chen, Donald Metzler, Heng-Tze Cheng, Ed H. Chi +11 more
TL;DR: Through extensive empirical experiments, the authors demonstrate that HyperPrompt achieves superior performance over strong T5 multi-task learning baselines and parameter-efficient adapter variants, including Prompt-Tuning and HyperFormer++, on the natural language understanding benchmarks GLUE and SuperGLUE across many model sizes.
Proceedings Article
UL2: Unifying Language Learning Paradigms
Yi Tay, Mostafa Dehghani, Vinh Q. Tran, Xavier Garcia, Jason Wei, Xuezhi Wang, Hyung Won Chung, Dara Bahri, T. Schuster, Huaixiu Zheng, Denny Zhou, Neil Houlsby, Donald Metzler +12 more
TL;DR: By scaling the model up to 20B parameters, this paper achieves SOTA performance on 50 well-established supervised NLP tasks spanning language generation, language understanding, text classification, question answering, commonsense reasoning, long-text reasoning, structured knowledge grounding, and information retrieval.
Journal ArticleDOI
Transcending Scaling Laws with 0.1% Extra Compute
Yi Tay, Jason Wei, Hyung Won Chung, Vinh Q. Tran, David R. So, Siamak Shakeri, Xavier Garcia, Huaixiu Zheng, Jinfeng Rao, Aakanksha Chowdhery, Denny Zhou, Donald Metzler, Slav Petrov, Neil Houlsby, Quoc V. Le, Mostafa Dehghani +15 more
TL;DR: U-PaLM outperforms PaLM on many few-shot setups, including English NLP tasks, reasoning tasks with chain-of-thought, multilingual tasks, MMLU, and challenging BIG-Bench tasks, and substantially improves the scaling properties of large language models on downstream metrics.