Open Access · Posted Content
Release Strategies and the Social Impacts of Language Models
Irene Solaiman, Miles Brundage, Jack Clark, Amanda Askell, Ariel Herbert-Voss, Jeffrey Wu, Alec Radford, Jasmine Wang +7 more
TL;DR: This report discusses OpenAI's staged release of its GPT-2 language model, an approach that leaves time between model releases to conduct risk and benefit analyses as model sizes increase.

Abstract: Large language models have a range of beneficial uses: they can assist in prose, poetry, and programming; analyze dataset biases; and more. However, their flexibility and generative capabilities also raise misuse concerns. This report discusses OpenAI's work related to the release of its GPT-2 language model. It discusses staged release, which leaves time between model releases to conduct risk and benefit analyses as model sizes increase. It also discusses ongoing partnership-based research and provides recommendations for better coordination and responsible publication in AI.
Citations
Proceedings Article
Language Models are Few-Shot Learners
Tom B. Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, Sandhini Agarwal, Ariel Herbert-Voss, Gretchen Krueger, Thomas Henighan, Rewon Child, Aditya Ramesh, Daniel M. Ziegler, Jeffrey Wu, Clemens Winter, Christopher Hesse, Mark Chen, Eric Sigler, Mateusz Litwin, Scott Gray, Benjamin Chess, Jack Clark, Christopher Berner, Samuel McCandlish, Alec Radford, Ilya Sutskever, Dario Amodei +30 more
TL;DR: GPT-3 achieves strong performance on many NLP datasets, including translation, question-answering, and cloze tasks, as well as several tasks that require on-the-fly reasoning or domain adaptation, such as unscrambling words, using a novel word in a sentence, or performing 3-digit arithmetic.
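The few-shot setting summarized above can be illustrated with a minimal sketch: demonstration pairs are placed directly in the prompt and the model is asked to continue the pattern, with no gradient updates. The `build_few_shot_prompt` helper and the Q/A template here are hypothetical illustrations, not the paper's actual prompt format.

```python
def build_few_shot_prompt(examples, query):
    """Format (input, output) demonstration pairs plus a new query
    as one in-context prompt for a language model to complete."""
    lines = []
    for x, y in examples:
        lines.append(f"Q: {x}\nA: {y}")
    # The final query is left without an answer for the model to fill in.
    lines.append(f"Q: {query}\nA:")
    return "\n\n".join(lines)

# Two demonstrations, then a held-out query (toy arithmetic task).
demos = [("2 + 2", "4"), ("7 + 5", "12")]
prompt = build_few_shot_prompt(demos, "31 + 16")
print(prompt)
```

The model's continuation of the final `A:` line is taken as its answer; no parameters change between tasks, only the prompt.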
Proceedings ArticleDOI
Training language models to follow instructions with human feedback
Long Ouyang, Jeffrey Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, John Schulman, Jacob Hilton, Fraser Kelton, Luke E. Miller, Maddie Simens, Amanda Askell, Peter Welinder, Paul F. Christiano, Jan Leike, Ryan Lowe +19 more
TL;DR: The results show that fine-tuning with human feedback is a promising direction for aligning language models with human intent, yielding improvements in truthfulness and reductions in toxic output generation while incurring minimal performance regressions on public NLP datasets.
Posted Content
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
Patrick S. H. Lewis, Ethan Perez, Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Naman Goyal, Heinrich Küttler, Michael Lewis, Wen-tau Yih, Tim Rocktäschel, Sebastian Riedel, Douwe Kiela +11 more
TL;DR: A general-purpose fine-tuning recipe for retrieval-augmented generation (RAG), models that combine pre-trained parametric and non-parametric memory for language generation; RAG models are found to generate more specific, diverse, and factual language than a state-of-the-art parametric-only seq2seq baseline.
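The RAG pattern summarized above can be sketched in a few lines: retrieve passages from a non-parametric memory (a document index), then condition a parametric generator on them. The word-overlap scoring and string-building `generate` below are crude illustrative stand-ins, not the paper's DPR retriever and BART generator.

```python
def retrieve(query, docs, k=1):
    """Score documents by word overlap with the query (a toy proxy
    for dense retrieval) and return the top-k passages."""
    q = set(query.lower().split())
    scored = sorted(docs,
                    key=lambda d: len(q & set(d.lower().split())),
                    reverse=True)
    return scored[:k]

def generate(query, passages):
    """Stand-in for a seq2seq generator conditioned on query + passages."""
    context = " ".join(passages)
    return f"Answer based on: {context}"

docs = ["Paris is the capital of France.",
        "The Nile is a river in Africa."]
top = retrieve("What is the capital of France?", docs)
print(generate("What is the capital of France?", top))
```

In the actual recipe, both retriever and generator are trained jointly; this sketch only shows the data flow of non-parametric lookup feeding parametric generation.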
Proceedings Article
Defending Against Neural Fake News
Rowan Zellers, Ari Holtzman, Hannah Rashkin, Yonatan Bisk, Ali Farhadi, Franziska Roesner, Yejin Choi +6 more
TL;DR: Introduces Grover, a model for controllable text generation, and finds that the best current discriminators can classify neural fake news versus real, human-written news with 73% accuracy given a moderate amount of training data; the best defense against Grover turns out to be Grover itself, at 92% accuracy.
References
Journal ArticleDOI
Semantics derived automatically from language corpora contain human-like biases
TL;DR: This article showed that applying machine learning to ordinary human language results in human-like semantic biases and replicated a spectrum of known biases, as measured by the Implicit Association Test, using a widely used, purely statistical machine-learning model trained on a standard corpus of text from the World Wide Web.
Posted Content
The Curious Case of Neural Text Degeneration
TL;DR: This paper showed that decoding strategies alone can dramatically affect the quality of machine text, even when generated from exactly the same neural language model, and proposed Nucleus Sampling, a simple but effective method to draw the best out of neural generation.
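Nucleus (top-p) sampling, as summarized above, keeps the smallest set of highest-probability tokens whose cumulative mass reaches p, renormalizes over that set, and samples from it. The sketch below is a minimal pure-Python illustration over a token-to-probability dict (a hypothetical toy interface, not the paper's code).

```python
import random

def nucleus_sample(probs, p=0.9, rng=random):
    """Sample a token from `probs` (token -> probability) using
    nucleus / top-p truncation of the distribution."""
    # Rank tokens by probability, highest first.
    ranked = sorted(probs.items(), key=lambda kv: kv[1], reverse=True)
    # Keep the smallest prefix whose cumulative probability reaches p.
    nucleus, total = [], 0.0
    for tok, pr in ranked:
        nucleus.append((tok, pr))
        total += pr
        if total >= p:
            break
    # Sample proportionally within the truncated nucleus
    # (dividing by `total` renormalizes implicitly).
    r = rng.random() * total
    acc = 0.0
    for tok, pr in nucleus:
        acc += pr
        if r <= acc:
            return tok
    return nucleus[-1][0]
```

With p small enough that only the top token survives, the method degenerates to greedy decoding; with p = 1.0 it is ordinary ancestral sampling.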
Posted Content
Semi-supervised Sequence Learning
Andrew M. Dai, Quoc V. Le +1 more
TL;DR: This paper uses unlabeled data to improve sequence learning with recurrent networks: an unsupervised phase serves as a "pre-training" step, so that the parameters it produces become the starting point for later supervised sequence learning.
Journal ArticleDOI
Data Statements for Natural Language Processing: Toward Mitigating System Bias and Enabling Better Science
Emily M. Bender, Batya Friedman +1 more
TL;DR: It is argued that data statements will help alleviate issues of exclusion and bias in language technology, lead to better precision in claims about how natural language processing research generalizes (and thus to better engineering results), protect companies from public embarrassment, and ultimately yield language technology that meets its users in their own preferred linguistic style.
Proceedings Article
Defending Against Neural Fake News
Rowan Zellers, Ari Holtzman, Hannah Rashkin, Yonatan Bisk, Ali Farhadi, Franziska Roesner, Yejin Choi +6 more
TL;DR: Introduces Grover, a model for controllable text generation, and finds that the best current discriminators can classify neural fake news versus real, human-written news with 73% accuracy given a moderate amount of training data; the best defense against Grover turns out to be Grover itself, at 92% accuracy.