A Survey on Recent Approaches for Natural Language Processing in Low-Resource Scenarios.

doi:10.18653/V1/2021.NAACL-MAIN.201

Open AccessProceedings ArticleDOI

A Survey on Recent Approaches for Natural Language Processing in Low-Resource Scenarios.

- pp 2545-2568

TLDR

A structured overview of methods that enable learning when training data is sparse including mechanisms to create additional labeled data like data augmentation and distant supervision as well as transfer learning settings that reduce the need for target supervision are given.

Abstract:

Deep neural networks and huge language models are becoming omnipresent in natural language applications. As they are known for requiring large amounts of training data, there is a growing body of work to improve the performance in low-resource settings. Motivated by the recent fundamental changes towards neural models and the popular pre-train and fine-tune paradigm, we survey promising approaches for low-resource natural language processing. After a discussion about the different dimensions of data availability, we give a structured overview of methods that enable learning when training data is sparse. This includes mechanisms to create additional labeled data like data augmentation and distant supervision as well as transfer learning settings that reduce the need for target supervision. A goal of our survey is to explain how these methods differ in their requirements as understanding them is essential for choosing a technique suited for a specific low-resource setting. Further key aspects of this work are to highlight open issues and to outline promising directions for future research.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

Self-Training with Weak Supervision

Giannis Karamanolakis, +3 more

TL;DR: This work develops a weak supervision framework (ASTRA) that leverages all the available data for a given task and develops a rule attention network (teacher) that learns how to aggregate student pseudo-labels with weak rule labels, conditioned on their fidelity and the underlying context of an instance.

...read moreread less

Journal ArticleDOI

mGPT: Few-Shot Learners Go Multilingual

Oleh Shliazhko, +5 more

- 15 Apr 2022 -

arXiv.org

TL;DR: This paper introduces two autoregressive GPT-like models with 1.3 billion and 13 billion parameters trained on 60 languages from 25 language families using Wikipedia and Colossal Clean Crawled Corpus, and trains small versions of the model to choose the most optimal multilingual tokenization strategy.

...read moreread less

Posted Content

Neural Machine Translation for Low-Resource Languages: A Survey.

Surangika Ranathunga, +5 more

- 29 Jun 2021 -

arXiv: Computation and Language

TL;DR: A detailed survey of research advancements in low-resource language NMT (LRL-NMT), along with a quantitative analysis aimed at identifying the most popular solutions is presented in this paper.

...read moreread less

Book ChapterDOI

ZeroBERTo: Leveraging Zero-Shot Text Classification by Topic Modeling

Alexandre Alcoforado, +6 more

- 04 Jan 2022 -

Lecture Notes in Computer Science

TL;DR: ZeroBERTo as discussed by the authors leverages an unsupervised clustering step to obtain a compressed data representation before the classification task, which has better performance for long inputs and shorter execution time.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings Article

Attention is All you Need

Ashish Vaswani, +7 more

TL;DR: This paper proposed a simple network architecture based solely on an attention mechanism, dispensing with recurrence and convolutions entirely and achieved state-of-the-art performance on English-to-French translation.

...read moreread less

Journal ArticleDOI

Generative Adversarial Nets

Ian Goodfellow, +7 more

TL;DR: A new framework for estimating generative models via an adversarial process, in which two models are simultaneously train: a generative model G that captures the data distribution and a discriminative model D that estimates the probability that a sample came from the training data rather than G.

...read moreread less

Proceedings ArticleDOI

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Jacob Devlin, +3 more

TL;DR: BERT as mentioned in this paper pre-trains deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers, which can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.

...read moreread less

Journal ArticleDOI

A Survey on Transfer Learning

Sinno Jialin Pan, +1 more

- 01 Oct 2010 -

IEEE Transactions on Knowledge and Data ...

TL;DR: The relationship between transfer learning and other related machine learning techniques such as domain adaptation, multitask learning and sample selection bias, as well as covariate shift are discussed.

...read moreread less

Posted Content

RoBERTa: A Robustly Optimized BERT Pretraining Approach

Yinhan Liu, +9 more

- 26 Jul 2019 -

arXiv: Computation and Language

TL;DR: It is found that BERT was significantly undertrained, and can match or exceed the performance of every model published after it, and the best model achieves state-of-the-art results on GLUE, RACE and SQuAD.

...read moreread less

Collapse

A Survey on Recent Approaches for Natural Language Processing in Low-Resource Scenarios.

Citations

Self-Training with Weak Supervision

mGPT: Few-Shot Learners Go Multilingual

Neural Natural Language Generation: A Survey on Multilinguality, Multimodality, Controllability and Learning

Neural Machine Translation for Low-Resource Languages: A Survey.

ZeroBERTo: Leveraging Zero-Shot Text Classification by Topic Modeling

References

Attention is All you Need

Generative Adversarial Nets

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

A Survey on Transfer Learning

RoBERTa: A Robustly Optimized BERT Pretraining Approach

Related Papers (5)

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Don't Stop Pretraining: Adapt Language Models to Domains and Tasks

Unsupervised Cross-lingual Representation Learning at Scale

Introduction to the CoNLL-2003 shared task: language-independent named entity recognition

Cross-lingual Transfer Learning for Multilingual Task Oriented Dialog