Girish Sastry
Researcher at OpenAI
Publications - 17
Citations - 13037
Girish Sastry is an academic researcher at OpenAI. His research spans topics including language models and usability. He has an h-index of 8 and has co-authored 9 publications receiving 3,765 citations. Previous affiliations of Girish Sastry include the Indian Institute of Technology Kharagpur and the University of Oxford.
Papers
Proceedings Article
Language Models are Few-Shot Learners
Tom B. Brown,Benjamin Mann,Nick Ryder,Melanie Subbiah,Jared Kaplan,Prafulla Dhariwal,Arvind Neelakantan,Pranav Shyam,Girish Sastry,Amanda Askell,Sandhini Agarwal,Ariel Herbert-Voss,Gretchen Krueger,Thomas Henighan,Rewon Child,Aditya Ramesh,Daniel M. Ziegler,Jeffrey Wu,Clemens Winter,Christopher Hesse,Mark Chen,Eric Sigler,Mateusz Litwin,Scott Gray,Benjamin Chess,Jack Clark,Christopher Berner,Samuel McCandlish,Alec Radford,Ilya Sutskever,Dario Amodei +30 more
TL;DR: GPT-3 achieves strong performance on many NLP datasets, including translation, question-answering, and cloze tasks, as well as several tasks that require on-the-fly reasoning or domain adaptation, such as unscrambling words, using a novel word in a sentence, or performing 3-digit arithmetic.
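The few-shot setting summarized above conditions the model on a handful of worked examples in its context window, with no gradient updates. A minimal sketch of how such a prompt is assembled (the task, examples, and function name here are illustrative, not the paper's code):

```python
def build_few_shot_prompt(task_description, examples, query):
    """Assemble a K-shot prompt: instructions, K input->output demos, query."""
    lines = [task_description]
    for inp, out in examples:
        lines.append(f"Input: {inp}\nOutput: {out}")
    # The query is left open-ended for the model to complete.
    lines.append(f"Input: {query}\nOutput:")
    return "\n\n".join(lines)

# Example: the word-unscrambling task mentioned in the TL;DR.
prompt = build_few_shot_prompt(
    "Unscramble the letters to form an English word.",
    [("pplae", "apple"), ("aannba", "banana")],
    "rgaep",
)
print(prompt)
```

The model's completion after the final "Output:" is taken as its answer; varying K from 0 to a few dozen gives the zero-, one-, and few-shot regimes the paper compares.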
Posted Content
Language Models are Few-Shot Learners
Tom B. Brown,Benjamin Mann,Nick Ryder,Melanie Subbiah,Jared Kaplan,Prafulla Dhariwal,Arvind Neelakantan,Pranav Shyam,Girish Sastry,Amanda Askell,Sandhini Agarwal,Ariel Herbert-Voss,Gretchen Krueger,Thomas Henighan,Rewon Child,Aditya Ramesh,Daniel M. Ziegler,Jeffrey Wu,Clemens Winter,Christopher Hesse,Mark Chen,Eric Sigler,Mateusz Litwin,Scott Gray,Benjamin Chess,Jack Clark,Christopher Berner,Samuel McCandlish,Alec Radford,Ilya Sutskever,Dario Amodei +30 more
TL;DR: This article showed that scaling up language models greatly improves task-agnostic, few-shot performance, sometimes even reaching competitiveness with prior state-of-the-art fine-tuning approaches.
Posted Content
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford,Jong Wook Kim,Chris Hallacy,Aditya Ramesh,Gabriel Goh,Sandhini Agarwal,Girish Sastry,Amanda Askell,Pamela Mishkin,Jack Clark,Gretchen Krueger,Ilya Sutskever +11 more
TL;DR: In this article, a pre-training task of predicting which caption goes with which image is used to learn SOTA image representations from scratch on a dataset of 400 million (image, text) pairs collected from the internet.
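The caption-matching pretraining task described above is typically trained with a symmetric contrastive objective over cosine similarities between image and text embeddings. A hedged NumPy sketch of that objective (the temperature value and toy embeddings are assumptions for illustration):

```python
import numpy as np

def contrastive_loss(image_embs, text_embs, temperature=0.07):
    """Symmetric cross-entropy over cosine-similarity logits:
    each image should match its own caption, and vice versa."""
    img = image_embs / np.linalg.norm(image_embs, axis=1, keepdims=True)
    txt = text_embs / np.linalg.norm(text_embs, axis=1, keepdims=True)
    logits = img @ txt.T / temperature
    n = logits.shape[0]

    def xent(l):
        l = l - l.max(axis=1, keepdims=True)  # numerical stability
        log_probs = l - np.log(np.exp(l).sum(axis=1, keepdims=True))
        return -log_probs[np.arange(n), np.arange(n)].mean()

    return (xent(logits) + xent(logits.T)) / 2

# Toy usage: nearly aligned image/text pairs yield a small loss.
rng = np.random.default_rng(0)
imgs = rng.normal(size=(4, 8))
txts = imgs + 0.01 * rng.normal(size=(4, 8))
loss = contrastive_loss(imgs, txts)
```

Training on 400 million web-collected pairs with this kind of objective is what lets the learned representations transfer zero-shot to downstream classification tasks.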
Posted Content
Toward Trustworthy AI Development: Mechanisms for Supporting Verifiable Claims
Miles Brundage,Shahar Avin,Jasmine Wang,Haydn Belfield,Gretchen Krueger,Gillian K. Hadfield,Gillian K. Hadfield,Heidy Khlaaf,Jingying Yang,Helen Toner,Ruth Fong,Tegan Maharaj,Pang Wei Koh,Sara Hooker,Jade Leung,Andrew Trask,Emma Bluemke,Jonathan Lebensbold,Cullen O'Keefe,Mark Koren,Théo Ryffel,J. B. Rubinovitz,Tamay Besiroglu,Federica Carugati,Jack Clark,Peter Eckersley,Sarah de Haas,Maritza Johnson,Ben Laurie,Alex Ingerman,Igor Krawczuk,Amanda Askell,Rosario Cammarota,Andrew J. Lohn,David Krueger,Charlotte Stix,Peter Henderson,Logan Graham,Carina E. A. Prunkl,Bianca Martin,Elizabeth Seger,Noa Zilberman,Seán Ó hÉigeartaigh,Frens Kroeger,Girish Sastry,Rebecca Kagan,Adrian Weller,Adrian Weller,Brian Tse,Elizabeth A. Barnes,Allan Dafoe,Paul Scharre,Ariel Herbert-Voss,Martijn Rasser,Shagun Sodhani,Carrick Flynn,Thomas Krendl Gilbert,Lisa Dyer,Saif Khan,Yoshua Bengio,Markus Anderljung +60 more
TL;DR: This report suggests various steps that different stakeholders can take to improve the verifiability of claims made about AI systems and their associated development processes, with a focus on providing evidence about the safety, security, fairness, and privacy protection of AI systems.
Posted Content
Trial without Error: Towards Safe Reinforcement Learning via Human Intervention
TL;DR: This work formalizes human intervention for RL, shows how to reduce the human labor required by training a supervised learner to imitate the human's intervention decisions, and outlines extensions of the scheme that are necessary to train model-free agents without a single catastrophe.
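The scheme summarized above can be sketched as a learned "blocker" that imitates human block/allow decisions and vetoes actions predicted to be catastrophic. The environment, lookup-table imitator, and fallback action below are toy stand-ins, not the paper's implementation:

```python
def make_blocker(labeled_decisions):
    """Fit a trivial lookup-table imitator of human block/allow labels.
    labeled_decisions: list of ((state, action), human_blocked) pairs."""
    table = dict(labeled_decisions)

    def blocker(state, action):
        return table.get((state, action), False)  # default: allow

    return blocker

def safe_step(env_step, blocker, state, action, safe_action):
    """Substitute a safe fallback action whenever the blocker vetoes."""
    if blocker(state, action):
        action = safe_action
    return env_step(state, action)

# Toy usage: the human overseer previously blocked "right" in state 0.
blocker = make_blocker([((0, "right"), True), ((0, "left"), False)])
step = lambda s, a: s + 1 if a == "left" else s - 1
next_state = safe_step(step, blocker, 0, "right", "left")
```

In the paper's setting, the imitator is a learned classifier rather than a lookup table, so it can generalize to state–action pairs the human never labeled.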