Tom B. Brown

Researcher at OpenAI

Publications - 30

Citations - 16934

Tom B. Brown is an academic researcher from OpenAI. The author has contributed to research in topics: Computer science & Language model. The author has an hindex of 18, co-authored 19 publications receiving 5251 citations.

Papers

PDF

Open Access

More filters

Proceedings Article

Language Models are Few-Shot Learners

Tom B. Brown, +30 more

TL;DR: GPT-3 achieves strong performance on many NLP datasets, including translation, question-answering, and cloze tasks, as well as several tasks that require on-the-fly reasoning or domain adaptation, such as unscrambling words, using a novel word in a sentence, or performing 3-digit arithmetic.

...read moreread less

Posted Content

Language Models are Few-Shot Learners

Tom B. Brown, +30 more

- 28 May 2020 -

arXiv: Computation and Language

TL;DR: This article showed that scaling up language models greatly improves task-agnostic, few-shot performance, sometimes even reaching competitiveness with prior state-of-the-art fine-tuning approaches.

...read moreread less

Posted Content

Scaling Laws for Neural Language Models

Jared Kaplan, +9 more

- 23 Jan 2020 -

arXiv: Learning

TL;DR: Larger models are significantly more sample-efficient, such that optimally compute-efficient training involves training very large models on a relatively modest amount of data and stopping significantly before convergence.

...read moreread less

Posted Content

Extracting Training Data from Large Language Models

Nicholas Carlini, +11 more

- 14 Dec 2020 -

arXiv: Cryptography and Security

TL;DR: This paper demonstrates that in such settings, an adversary can perform a training data extraction attack to recover individual training examples by querying the language model, and finds that larger models are more vulnerable than smaller models.

...read moreread less

Posted Content

Adversarial Patch

Tom B. Brown, +4 more

- 27 Dec 2017 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: A method to create universal, robust, targeted adversarial image patches in the real world, which can be printed, added to any scene, photographed, and presented to image classifiers; even when the patches are small, they cause the classifiers to ignore the other items in the scene and report a chosen target class.

...read moreread less

Collapse