Open Access · Posted Content
KLUE: Korean Language Understanding Evaluation.
Sungjoon Park, Jihyung Moon, Sungdong Kim, Won Ik Cho, Jiyoon Han, Jang-Won Park, Chisung Song, Junseong Kim, Yongsook Song, Tae-Hwan Oh, Joohong Lee, Juhyun Oh, Sungwon Lyu, Younghoon Jeong, Inkwon Lee, Sangwoo Seo, Dongjun Lee, Hyunwoo Kim, Myeonghwa Lee, Seongbo Jang, Seungwon Do, Sunkyoung Kim, KyungTae Lim, Jongwon Lee, Kyumin Park, Jamin Shin, Seonghyun Kim, Lucy Park, Alice Oh, Jung-Woo Ha, Kyunghyun Cho
TLDR
The Korean Language Understanding Evaluation (KLUE) benchmark as mentioned in this paper is a collection of 8 Korean NLP tasks, including Topic Classification, Semantic Textual Similarity, Natural Language Inference, Named Entity Recognition, Relation Extraction, Dependency Parsing, Machine Reading Comprehension, and Dialogue State Tracking.
Abstract
We introduce the Korean Language Understanding Evaluation (KLUE) benchmark. KLUE is a collection of 8 Korean natural language understanding (NLU) tasks: Topic Classification, Semantic Textual Similarity, Natural Language Inference, Named Entity Recognition, Relation Extraction, Dependency Parsing, Machine Reading Comprehension, and Dialogue State Tracking. We build all of the tasks from scratch from diverse source corpora while respecting copyrights, to ensure accessibility for anyone without any restrictions. With ethical considerations in mind, we carefully design annotation protocols. Along with the benchmark tasks and data, we provide suitable evaluation metrics and fine-tuning recipes for pretrained language models for each task. We furthermore release the pretrained language models (PLMs) KLUE-BERT and KLUE-RoBERTa to help reproduce baseline models on KLUE and thereby facilitate future research. Preliminary experiments on the proposed benchmark suite already demonstrate its usefulness and yield a few interesting observations. First, we find that KLUE-RoBERTa-large outperforms other baselines, including multilingual PLMs and existing open-source Korean PLMs. Second, we see minimal degradation in performance even when we replace personally identifiable information in the pretraining corpus, suggesting that privacy and NLU capability are not at odds with each other. Lastly, we find that using BPE tokenization in combination with morpheme-level pre-tokenization is effective for tasks involving morpheme-level tagging, detection, and generation. In addition to accelerating Korean NLP research, our comprehensive documentation on creating KLUE will facilitate creating similar resources for other languages in the future. KLUE is available at this https URL.
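To make the "fine-tuning recipes" concrete, here is a minimal sketch of fine-tuning a released KLUE PLM on one KLUE task (topic classification). It assumes the resources are distributed through the Hugging Face Hub: the model id klue/roberta-large, the dataset id klue with configuration ynat, and the field names title and label are assumptions, not details confirmed by the abstract.

```python
from datasets import load_dataset
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

# Assumed Hub identifiers for the KLUE data and pretrained model.
dataset = load_dataset("klue", "ynat")  # topic classification task
tokenizer = AutoTokenizer.from_pretrained("klue/roberta-large")
model = AutoModelForSequenceClassification.from_pretrained(
    "klue/roberta-large", num_labels=7)  # YNAT is assumed to have 7 topics

def tokenize(batch):
    # "title" is the assumed input text field of the YNAT examples.
    return tokenizer(batch["title"], truncation=True, max_length=128)

encoded = dataset.map(tokenize, batched=True)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="klue-ynat", num_train_epochs=3),
    train_dataset=encoded["train"],
    eval_dataset=encoded["validation"],
    tokenizer=tokenizer,  # enables dynamic padding of each batch
)
trainer.train()
print(trainer.evaluate())
```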
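The tokenization finding can likewise be illustrated with a short sketch: segment each sentence into morphemes with a morphological analyzer first, then train a BPE vocabulary on the morpheme-segmented text. Mecab (via konlpy) stands in for whatever analyzer the authors used, and corpus.txt is a placeholder path; none of these specifics come from the paper.

```python
from konlpy.tag import Mecab  # assumed stand-in morphological analyzer
from tokenizers import Tokenizer
from tokenizers.models import BPE
from tokenizers.pre_tokenizers import Whitespace
from tokenizers.trainers import BpeTrainer

mecab = Mecab()

def pre_tokenize(line: str) -> str:
    # Morpheme-level pre-tokenization: split the raw sentence into
    # morphemes before BPE ever sees it.
    return " ".join(mecab.morphs(line))

# Train a BPE vocabulary on the morpheme-segmented corpus.
bpe = Tokenizer(BPE(unk_token="[UNK]"))
bpe.pre_tokenizer = Whitespace()
trainer = BpeTrainer(vocab_size=32000, special_tokens=["[UNK]", "[PAD]"])

with open("corpus.txt", encoding="utf-8") as f:  # placeholder corpus path
    segmented = [pre_tokenize(line.strip()) for line in f]
bpe.train_from_iterator(segmented, trainer=trainer)

print(bpe.encode(pre_tokenize("한국어 자연어 이해 벤치마크")).tokens)
```

Because BPE merges never cross the whitespace boundaries introduced by the analyzer, subword units stay aligned with morphemes, which is what makes the scheme attractive for morpheme-level tagging tasks.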
Citations
Posted Content
What Changes Can Large-scale Language Models Bring? Intensive Study on HyperCLOVA: Billions-scale Korean Generative Pretrained Transformers
Boseop Kim, HyoungSeok Kim, Sang Woo Lee, Gichang Lee, Dong-Hyun Kwak, Dong Hyeon Jeon, Sunghyun Park, Sungju Kim, Seonhoon Kim, Dongpil Seo, Heungsub Lee, Minyoung Jeong, Sungjae Lee, Minsub Kim, Suk Hyun Ko, Seokhun Kim, Taeyong Park, Jinuk Kim, Soyoung Kang, Na-Hyeon Ryu, Kang Min Yoo, Minsuk Chang, Soobin Suh, Sookyo In, Jin-Seong Park, Kyungduk Kim, Hiun Kim, Jisu Jeong, Yong Goo Yeo, Donghoon Ham, Dongju Park, Min Young Lee, Jae-Wook Kang, Inho Kang, Jung-Woo Ha, Woo-Myoung Park, Nako Sung
TL;DR: HyperCLOVA, as discussed by the authors, is an 82B-parameter Korean variant of GPT-3 trained on a Korean-centric corpus of 560B tokens; it shows state-of-the-art zero-shot and few-shot learning performance on various downstream tasks in Korean.
Journal Article
Enhancing Korean Named Entity Recognition With Linguistic Tokenization Strategies
TL;DR: In this article, the authors focus on the effect of tokenization strategies on the quality of input features for Korean named entity recognition, and quantitatively and qualitatively analyze how each tokenization strategy copes with the challenges the language poses.
Posted Content
AMMUS: A Survey of Transformer-based Pretrained Models in Natural Language Processing
TL;DR: Transformer-based pretrained language models (T-PTLMs), as discussed by the authors, have achieved great success in almost every NLP task; they are built on top of transformers, self-supervised learning, and transfer learning.
Posted Content
Language Models are Few-shot Multilingual Learners
TL;DR: The authors showed that, given a few English examples as context, pre-trained language models can predict not only English test samples but also non-English ones, performing significantly better than random prediction.
References
Proceedings Article
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
TL;DR: BERT, as mentioned in this paper, pre-trains deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers; the pretrained model can then be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.
Proceedings Article
Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation
Kyunghyun Cho, Bart van Merriënboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, Yoshua Bengio
TL;DR: In this paper, the encoder and decoder of the RNN Encoder-Decoder model are jointly trained to maximize the conditional probability of a target sequence given a source sequence.
Posted Content
RoBERTa: A Robustly Optimized BERT Pretraining Approach
Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, Mike Lewis, Luke Zettlemoyer, Veselin Stoyanov
TL;DR: It is found that BERT was significantly undertrained and that, when pretrained more carefully, it can match or exceed the performance of every model published after it; the best model achieves state-of-the-art results on GLUE, RACE, and SQuAD.
Proceedings Article
Language Models are Few-Shot Learners
Tom B. Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, Sandhini Agarwal, Ariel Herbert-Voss, Gretchen Krueger, Thomas Henighan, Rewon Child, Aditya Ramesh, Daniel M. Ziegler, Jeffrey Wu, Clemens Winter, Christopher Hesse, Mark Chen, Eric Sigler, Mateusz Litwin, Scott Gray, Benjamin Chess, Jack Clark, Christopher Berner, Samuel McCandlish, Alec Radford, Ilya Sutskever, Dario Amodei
TL;DR: GPT-3 achieves strong performance on many NLP datasets, including translation, question-answering, and cloze tasks, as well as several tasks that require on-the-fly reasoning or domain adaptation, such as unscrambling words, using a novel word in a sentence, or performing 3-digit arithmetic.
Proceedings Article
ROUGE: A Package for Automatic Evaluation of Summaries
TL;DR: Four different ROUGE measures are introduced: ROUGE-N, ROUGE-L, ROUGE-W, and ROUGE-S, all included in the ROUGE summarization evaluation package, along with their evaluations.