Dongju Park
Publications - 4
Citations - 21
Dongju Park is an academic researcher. The author has contributed to research in the topics of Language model and Tokenization (data security). The author has an h-index of 2 and has co-authored 4 publications receiving 12 citations.
Papers
Posted Content
GPT3Mix: Leveraging Large-scale Language Models for Text Augmentation
TL;DR: This paper proposes a data augmentation technique that leverages large-scale language models to generate realistic text samples from a mixture of real samples, effectively distilling knowledge from the language models and creating textual perturbations simultaneously.
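The core recipe described in the TL;DR can be pictured with a short sketch. The snippet below is a minimal, illustrative reconstruction of a GPT3Mix-style augmentation loop, not the paper's actual implementation: the prompt template, label names, and the call_language_model stand-in for a large-LM completion API are all assumptions.

```python
# Minimal, illustrative sketch of a GPT3Mix-style augmentation loop.
# `call_language_model` is a hypothetical stand-in for any large-LM completion API;
# the prompt template only paraphrases the idea of mixing real labeled samples.
import random

def call_language_model(prompt: str) -> str:
    """Placeholder completion call; returns a dummy sample so the sketch runs."""
    return " the plot drags but the cast saves it (Sentiment: positive)"

def build_prompt(examples, task="movie review", labels=("positive", "negative")):
    """Embed a few real labeled samples and ask the model to continue the list."""
    header = (f"Each item below is a {task} and its sentiment, "
              f"one of {' or '.join(labels)}.\n")
    body = "".join(f"Review: {text} (Sentiment: {label})\n" for text, label in examples)
    return header + body + "Review:"  # the model completes a new (text, label) pair

def augment(dataset, k=2, n_synthetic=10):
    """Draw k real examples per prompt and collect pseudo-labeled completions."""
    synthetic = []
    for _ in range(n_synthetic):
        completion = call_language_model(build_prompt(random.sample(dataset, k)))
        if "(Sentiment:" in completion:
            text, label = completion.split("(Sentiment:", 1)
            synthetic.append((text.strip(), label.strip(" )\n")))
    return synthetic

seed = [("a delightful surprise", "positive"), ("painfully dull", "negative")]
print(augment(seed, k=2, n_synthetic=3))
```

The synthetic (text, label) pairs produced this way would be added to the training set alongside the real data; in the paper's framing, the label emitted by the large model also acts as a soft, distilled supervision signal.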
Posted Content
What Changes Can Large-scale Language Models Bring? Intensive Study on HyperCLOVA: Billions-scale Korean Generative Pretrained Transformers
Boseop Kim, HyoungSeok Kim, Sang Woo Lee, Gichang Lee, Dong-Hyun Kwak, Dong Hyeon Jeon, Sunghyun Park, Sungju Kim, Seonhoon Kim, Dongpil Seo, Heungsub Lee, Minyoung Jeong, Sungjae Lee, Minsub Kim, Suk Hyun Ko, Seokhun Kim, Taeyong Park, Jinuk Kim, Soyoung Kang, Na-Hyeon Ryu, Kang Min Yoo, Minsuk Chang, Soobin Suh, Sookyo In, Jin-Seong Park, Kyungduk Kim, Hiun Kim, Jisu Jeong, Yong Goo Yeo, Donghoon Ham, Dongju Park, Min Young Lee, Jae-Wook Kang, Inho Kang, Jung-Woo Ha, Woo-Myoung Park, Nako Sung +36 more
TL;DR: HyperCLOVA is a Korean variant of the 82B GPT-3 trained on a Korean-centric corpus of 560B tokens; it shows state-of-the-art zero-shot and few-shot learning performance on various downstream tasks in Korean.
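The zero-shot and few-shot evaluation style mentioned in the TL;DR amounts to in-context prompting. The sketch below is a hedged illustration only: the Korean examples are made up and complete() is a placeholder for a HyperCLOVA-scale completion call, not the paper's benchmark setup.

```python
# Illustrative few-shot (in-context) prompting sketch; `complete` is a placeholder
# for a HyperCLOVA-scale completion call, and the Korean examples are made up.
FEW_SHOT = [
    ("이 영화 정말 재미있었어요.", "긍정"),  # "This movie was really fun." -> positive
    ("시간 낭비였습니다.", "부정"),          # "It was a waste of time." -> negative
]

def complete(prompt: str) -> str:
    """Dummy completion so the sketch runs end to end."""
    return " 긍정"

def classify(sentence: str) -> str:
    """Build a k-shot prompt from labeled examples and let the model emit the label."""
    shots = "".join(f"문장: {s}\n감정: {l}\n" for s, l in FEW_SHOT)
    return complete(shots + f"문장: {sentence}\n감정:").strip()

print(classify("배우들의 연기가 인상적이었다."))  # expected: 긍정
```

In the zero-shot case the FEW_SHOT list would simply be empty, leaving only the task description and the query sentence in the prompt.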
Proceedings Article
GPT3Mix: Leveraging Large-scale Language Models for Text Augmentation
TL;DR: The authors propose a data augmentation technique that leverages large-scale language models to generate realistic text samples from a mixture of real samples, effectively distilling knowledge from the language models and creating textual perturbations simultaneously.
Proceedings Article
What Changes Can Large-scale Language Models Bring? Intensive Study on HyperCLOVA: Billions-scale Korean Generative Pretrained Transformers
Boseop Kim, HyoungSeok Kim, Sang Woo Lee, Gichang Lee, Dong-Hyun Kwak, Dong Hyeon Jeon, Sunghyun Park, Sungju Kim, Seonhoon Kim, Dongpil Seo, Heungsub Lee, Minyoung Jeong, Sungjae Lee, Minsub Kim, Suk Hyun Ko, Seokhun Kim, Taeyong Park, Jinuk Kim, Soyoung Kang, Na-Hyeon Ryu, Kang Min Yoo, Minsuk Chang, Soobin Suh, Sookyo In, Jin-Seong Park, Kyungduk Kim, Hiun Kim, Jisu Jeong, Yong Goo Yeo, Donghoon Ham, Dongju Park, Min Young Lee, Jae-Wook Kang, Inho Kang, Jung-Woo Ha, Woo-Myoung Park, Nako Sung +36 more
TL;DR: HyperCLOVA is a Korean variant of the 82B GPT-3 trained on a Korean-centric corpus of 560B tokens; it shows state-of-the-art zero-shot and few-shot learning performance on various downstream tasks in Korean.