Arvind Neelakantan
Researcher at University of Massachusetts Amherst
Publications: 41
Citations: 15,055
Arvind Neelakantan is an academic researcher at the University of Massachusetts Amherst. He has contributed to research on topics including artificial neural networks and knowledge bases, has an h-index of 23, and has co-authored 37 publications receiving 5,594 citations. His previous affiliations include BBN Technologies and Google.
Papers
Proceedings Article
Language Models are Few-Shot Learners
Tom B. Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, Sandhini Agarwal, Ariel Herbert-Voss, Gretchen Krueger, Thomas Henighan, Rewon Child, Aditya Ramesh, Daniel M. Ziegler, Jeffrey Wu, Clemens Winter, Christopher Hesse, Mark Chen, Eric Sigler, Mateusz Litwin, Scott Gray, Benjamin Chess, Jack Clark, Christopher Berner, Samuel McCandlish, Alec Radford, Ilya Sutskever, Dario Amodei
TL;DR: GPT-3 achieves strong performance on many NLP datasets, including translation, question-answering, and cloze tasks, as well as several tasks that require on-the-fly reasoning or domain adaptation, such as unscrambling words, using a novel word in a sentence, or performing 3-digit arithmetic.
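The few-shot setting described here supplies the model with a handful of worked demonstrations directly in the prompt and asks it to complete a final, unanswered example, with no gradient updates. Below is a minimal Python sketch of how such a prompt can be assembled; the helper name, the Q/A template, and the arithmetic demonstrations are illustrative, not taken from the paper.

```python
def build_few_shot_prompt(demonstrations, query, task_description=""):
    """Assemble a few-shot prompt from (input, output) demonstration pairs."""
    parts = [task_description] if task_description else []
    for x, y in demonstrations:
        parts.append(f"Q: {x}\nA: {y}")
    parts.append(f"Q: {query}\nA:")  # the model completes this final line
    return "\n\n".join(parts)

demos = [("23 + 54", "77"), ("615 + 281", "896")]  # 3-digit-arithmetic style
print(build_few_shot_prompt(demos, "412 + 307", "Add the two numbers."))
```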
Posted Content
Language Models are Few-Shot Learners
Tom B. Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, Sandhini Agarwal, Ariel Herbert-Voss, Gretchen Krueger, Thomas Henighan, Rewon Child, Aditya Ramesh, Daniel M. Ziegler, Jeffrey Wu, Clemens Winter, Christopher Hesse, Mark Chen, Eric Sigler, Mateusz Litwin, Scott Gray, Benjamin Chess, Jack Clark, Christopher Berner, Samuel McCandlish, Alec Radford, Ilya Sutskever, Dario Amodei
TL;DR: This article showed that scaling up language models greatly improves task-agnostic, few-shot performance, sometimes even reaching competitiveness with prior state-of-the-art fine-tuning approaches.
Proceedings Article
Efficient Non-parametric Estimation of Multiple Embeddings per Word in Vector Space
TL;DR: An extension to the Skip-gram model that efficiently learns multiple embeddings per word type is presented, and its scalability is demonstrated by training with one machine on a corpus of nearly 1 billion tokens in less than 6 hours.
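The non-parametric idea is that each occurrence of a word is represented by the average of its context vectors and assigned to that word's nearest sense cluster, with a new sense created when no existing cluster is similar enough. Below is a hedged Python sketch of just that assignment step; the similarity threshold, dimensions, and random data are illustrative, and a real implementation would interleave this with skip-gram training of the per-sense embeddings.

```python
import numpy as np

def cosine(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-8))

def assign_sense(word, context_vec, sense_clusters, lam=0.0):
    """Pick a sense for this occurrence, creating a new one if needed."""
    clusters = sense_clusters.setdefault(word, [])
    if clusters:
        sims = [cosine(context_vec, c["center"]) for c in clusters]
        best = int(np.argmax(sims))
        if sims[best] > lam:  # close enough to an existing sense: merge in
            c = clusters[best]
            c["center"] = (c["center"] * c["n"] + context_vec) / (c["n"] + 1)
            c["n"] += 1
            return best
    clusters.append({"center": context_vec.copy(), "n": 1})  # open a new sense
    return len(clusters) - 1

rng = np.random.default_rng(0)
sense_clusters = {}
for _ in range(5):
    ctx = rng.normal(size=50)  # stands in for the mean of context word vectors
    print(assign_sense("bank", ctx, sense_clusters))
```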
Posted Content
Adding Gradient Noise Improves Learning for Very Deep Networks
Arvind Neelakantan, Luke Vilnis, Quoc V. Le, Ilya Sutskever, Lukasz Kaiser, Karol Kurach, James Martens
TL;DR: This paper explores the low-overhead, easy-to-implement optimization technique of adding annealed Gaussian noise to the gradient, which is found to be surprisingly effective when training very deep architectures.
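The technique amounts to perturbing each gradient with zero-mean Gaussian noise whose variance is annealed over training steps, on the order of sigma_t^2 = eta / (1 + t)^gamma with gamma = 0.55 in the paper's experiments. The Python sketch below applies it to plain SGD on a toy quadratic; the learning rate, eta, and objective are illustrative choices, not the paper's settings.

```python
import numpy as np

def noisy_sgd_step(params, grad, t, lr=0.1, eta=0.3, gamma=0.55, rng=None):
    """One SGD step with annealed Gaussian gradient noise."""
    rng = rng if rng is not None else np.random.default_rng()
    sigma = np.sqrt(eta / (1.0 + t) ** gamma)  # noise std decays with step t
    noisy_grad = grad + rng.normal(scale=sigma, size=np.shape(grad))
    return params - lr * noisy_grad

rng = np.random.default_rng(0)
w = np.array([5.0, -3.0])
for t in range(200):
    grad = 2.0 * w  # gradient of the toy objective ||w||^2
    w = noisy_sgd_step(w, grad, t, rng=rng)
print(w)  # close to the optimum at the origin despite the injected noise
```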
Proceedings Article
Chains of Reasoning over Entities, Relations, and Text using Recurrent Neural Networks
TL;DR: The authors combine the rich multi-step inference of symbolic logical reasoning with the generalization capabilities of neural networks for complex reasoning about entities and relations in text and large-scale knowledge bases (KBs).
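The core mechanism is to compose a multi-step path of KB relations with a recurrent network and score the resulting path representation against the target relation's vector. Below is a minimal Python sketch of that idea; the toy relation vocabulary, random parameters, and dot-product scorer are illustrative, and the paper additionally pools over many paths and incorporates entity and textual information.

```python
import numpy as np

rng = np.random.default_rng(0)
d = 16
relations = ["born_in", "city_of", "located_in", "nationality"]
R = {r: rng.normal(scale=0.1, size=d) for r in relations}  # relation vectors
W_h = rng.normal(scale=0.1, size=(d, d))  # recurrent (hidden-to-hidden) weights
W_x = rng.normal(scale=0.1, size=(d, d))  # input (relation-to-hidden) weights

def encode_path(path):
    """Run a simple RNN over the sequence of relations forming a KB path."""
    h = np.zeros(d)
    for rel in path:
        h = np.tanh(W_h @ h + W_x @ R[rel])
    return h

def score(path, target_relation):
    """Higher score = the path is stronger evidence for the target relation."""
    return float(encode_path(path) @ R[target_relation])

print(score(["born_in", "city_of", "located_in"], "nationality"))
```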