Institution

OpenAI

About: OpenAI is known for research contributions in the topics of reinforcement learning and artificial neural networks. The organization has 105 authors who have published 213 publications receiving 68,067 citations. It is also known as Open AI and OpenAI LP.

Topics: Reinforcement learning, Artificial neural network, Computer science, Language model, Deep learning

Papers

Posted Content

Improving Variational Inference with Inverse Autoregressive Flow

Diederik P. Kingma¹, Tim Salimans², Rafal Jozefowicz³, Xi Chen⁴, Ilya Sutskever³, Max Welling⁵

University of Amsterdam¹, OpenAI², Google³, University of California, Berkeley⁴, Canadian Institute for Advanced Research⁵

15 Jun 2016-arXiv: Learning

TL;DR: This paper proposes inverse autoregressive flow (IAF), a new type of normalizing flow that, in contrast to earlier published flows, scales well to high-dimensional latent spaces. It also demonstrates that a novel type of variational autoencoder, coupled with IAF, is competitive with neural autoregressive models in terms of attained log-likelihood on natural images, while allowing significantly faster synthesis.

Abstract: The framework of normalizing flows provides a general strategy for flexible variational inference of posteriors over latent variables. We propose a new type of normalizing flow, inverse autoregressive flow (IAF), that, in contrast to earlier published flows, scales well to high-dimensional latent spaces. The proposed flow consists of a chain of invertible transformations, where each transformation is based on an autoregressive neural network. In experiments, we show that IAF significantly improves upon diagonal Gaussian approximate posteriors. In addition, we demonstrate that a novel type of variational autoencoder, coupled with IAF, is competitive with neural autoregressive models in terms of attained log-likelihood on natural images, while allowing significantly faster synthesis.
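
The "chain of invertible transformations" described in the abstract can be illustrated with a minimal pure-Python sketch. It assumes a gated update of the form z' = g * z + (1 - g) * m, where m and the gate g = sigmoid(s) are computed autoregressively from z. The toy `masked_linear` function stands in for the autoregressive neural network, and all names and parameter layouts here are hypothetical, not taken from the listing.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def masked_linear(z, weights, bias):
    """Toy autoregressive map: output i depends only on z[0..i-1]."""
    out = []
    for i in range(len(z)):
        acc = bias[i]
        for j in range(i):  # strictly lower-triangular dependence
            acc += weights[i][j] * z[j]
        out.append(acc)
    return out


def iaf_step(z, params):
    """One invertible step; returns the new z and log|det Jacobian|."""
    m = masked_linear(z, params["w_m"], params["b_m"])
    s = masked_linear(z, params["w_s"], params["b_s"])
    gate = [sigmoid(v) for v in s]
    z_new = [g * zi + (1.0 - g) * mi for g, zi, mi in zip(gate, z, m)]
    # The Jacobian is triangular with the gates on its diagonal,
    # so the log-determinant is just the sum of the log-gates.
    log_det = sum(math.log(g) for g in gate)
    return z_new, log_det


def iaf_chain(z, layers):
    """Apply several steps, reversing the variable order between them."""
    total_log_det = 0.0
    for params in layers:
        z, log_det = iaf_step(z, params)
        total_log_det += log_det
        z = z[::-1]  # a permutation, which leaves |det| unchanged
    return z, total_log_det
```

Because each output depends on its own input only through the gate, the log-density of the transformed sample is the log-density of the initial sample minus `total_log_det`, which is what keeps such a flow cheap in high dimensions.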

193 citations

Posted Content

Toward Trustworthy AI Development: Mechanisms for Supporting Verifiable Claims

Miles Brundage¹, Shahar Avin², Jasmine Wang³, Haydn Belfield², Gretchen Krueger¹, Gillian K. Hadfield¹,⁴, Heidy Khlaaf, Jingying Yang, Helen Toner, Ruth Fong⁵, Tegan Maharaj, Pang Wei Koh⁶, Sara Hooker⁷, Jade Leung⁵, Andrew Trask⁵, Emma Bluemke⁵, Jonathan Lebensbold³, Cullen O'Keefe¹, Mark Koren⁶, Théo Ryffel⁸, J. B. Rubinovitz, Tamay Besiroglu², Federica Carugati⁶, Jack Clark¹, Peter Eckersley, Sarah de Haas⁷, Maritza Johnson⁷, Ben Laurie⁷, Alex Ingerman⁷, Igor Krawczuk⁹, Amanda Askell¹, Rosario Cammarota¹⁰, Andrew J. Lohn¹¹, David Krueger¹², Charlotte Stix¹³, Peter Henderson⁶, Logan Graham⁵, Carina E. A. Prunkl⁵, Bianca Martin¹, Elizabeth Seger², Noa Zilberman⁵, Seán Ó hÉigeartaigh², Frens Kroeger, Girish Sastry¹, Rebecca Kagan, Adrian Weller²,¹⁴, Brian Tse⁵, Elizabeth A. Barnes¹, Allan Dafoe⁵, Paul Scharre¹⁵, Ariel Herbert-Voss¹, Martijn Rasser¹⁵, Shagun Sodhani¹², Carrick Flynn, Thomas Krendl Gilbert¹⁶, Lisa Dyer, Saif Khan, Yoshua Bengio¹², Markus Anderljung⁵

OpenAI¹, University of Cambridge², McGill University³, University of Toronto⁴, University of Oxford⁵, Stanford University⁶, Google⁷, École Normale Supérieure⁸, École Polytechnique Fédérale de Lausanne⁹, Intel¹⁰, RAND Corporation¹¹, Université de Montréal¹², Eindhoven University of Technology¹³, The Turing Institute¹⁴, Center for a New American Security¹⁵, University of California¹⁶

15 Apr 2020-arXiv: Computers and Society

TL;DR: This report suggests various steps that different stakeholders can take to improve the verifiability of claims made about AI systems and their associated development processes, with a focus on providing evidence about the safety, security, fairness, and privacy protection of AI systems.

Abstract: With the recent wave of progress in artificial intelligence (AI) has come a growing awareness of the large-scale impacts of AI systems, and recognition that existing regulations and norms in industry and academia are insufficient to ensure responsible AI development. In order for AI developers to earn trust from system users, customers, civil society, governments, and other stakeholders that they are building AI responsibly, they will need to make verifiable claims to which they can be held accountable. Those outside of a given organization also need effective means of scrutinizing such claims. This report suggests various steps that different stakeholders can take to improve the verifiability of claims made about AI systems and their associated development processes, with a focus on providing evidence about the safety, security, fairness, and privacy protection of AI systems. We analyze ten mechanisms for this purpose--spanning institutions, software, and hardware--and make recommendations aimed at implementing, exploring, or improving those mechanisms.

191 citations

Posted Content

Asymmetric Actor Critic for Image-Based Robot Learning

Lerrel Pinto¹, Marcin Andrychowicz², Peter Welinder², Wojciech Zaremba², Pieter Abbeel³

Carnegie Mellon University¹, OpenAI², University of California, Berkeley³

18 Oct 2017-arXiv: Robotics

TL;DR: This work exploits the full state observability in the simulator to train better policies that take only partial observations (RGBD images) as input. It combines this method with domain randomization and shows real-robot experiments for tasks such as picking, pushing, and moving a block.

Abstract: Deep reinforcement learning (RL) has proven a powerful technique in many sequential decision making domains. However, Robotics poses many challenges for RL, most notably training on a physical system can be expensive and dangerous, which has sparked significant interest in learning control policies using a physics simulator. While several recent works have shown promising results in transferring policies trained in simulation to the real world, they often do not fully utilize the advantage of working with a simulator. In this work, we exploit the full state observability in the simulator to train better policies which take as input only partial observations (RGBD images). We do this by employing an actor-critic training algorithm in which the critic is trained on full states while the actor (or policy) gets rendered images as input. We show experimentally on a range of simulated tasks that using these asymmetric inputs significantly improves performance. Finally, we combine this method with domain randomization and show real robot experiments for several tasks like picking, pushing, and moving a block. We achieve this simulation to real world transfer without training on any real world data.
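
The asymmetric split described in the abstract (critic on full states, actor on observations) can be sketched with a toy linear model. This is an illustration under stated assumptions, not the authors' implementation: the linear function approximators, the scalar action, the TD(0) critic step, and the deterministic-policy-gradient actor step are all choices made here for brevity, and the class and attribute names are hypothetical.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


class AsymmetricActorCritic:
    """Linear toy: the actor sees the observation, the critic the full state."""

    def __init__(self, obs_dim, state_dim, lr=0.1, gamma=0.9):
        self.theta = [0.0] * obs_dim      # actor weights: action = theta . obs
        self.w_state = [0.0] * state_dim  # critic weights on the full state
        self.w_action = 0.0               # critic weight on the scalar action
        self.lr = lr
        self.gamma = gamma

    def act(self, obs):
        # The policy never touches the simulator state.
        return dot(self.theta, obs)

    def q_value(self, state, action):
        # The critic uses the full state, available only in simulation.
        return dot(self.w_state, state) + self.w_action * action

    def update(self, state, obs, action, reward, next_state, next_obs):
        # Critic: one TD(0) step toward the bootstrapped target.
        next_q = self.q_value(next_state, self.act(next_obs))
        td_error = reward + self.gamma * next_q - self.q_value(state, action)
        self.w_state = [w + self.lr * td_error * s
                        for w, s in zip(self.w_state, state)]
        self.w_action += self.lr * td_error * action
        # Actor: ascend dQ/da * da/dtheta, which depends on obs only.
        self.theta = [t + self.lr * self.w_action * o
                      for t, o in zip(self.theta, obs)]
        return td_error
```

At deployment only `act` is needed, so the full state required by `q_value` never has to be measured on the real robot.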

170 citations

Proceedings Article

Asymmetric Actor Critic for Image-Based Robot Learning

Lerrel Pinto¹, Marcin Andrychowicz², Peter Welinder², Wojciech Zaremba², Pieter Abbeel³

Carnegie Mellon University¹, OpenAI², University of California, Berkeley³

26 Jun 2018

TL;DR: The authors exploit the full state observability in the simulator to train better policies which take as input only partial observations (RGBD images) by employing an actor-critic training algorithm in which the critic is trained on full states while the actor (or policy) gets rendered images as input.

170 citations

Posted Content

Zero-Shot Text-to-Image Generation

Aditya Ramesh¹, Mikhail Pavlov¹, Gabriel Goh¹, Scott Gray¹, Chelsea Voss¹, Alec Radford¹, Mark Chen¹, Ilya Sutskever¹

OpenAI¹

24 Feb 2021-arXiv: Computer Vision and Pattern Recognition

TL;DR: This paper proposes a simple approach for text-to-image generation based on a transformer that autoregressively models the text and image tokens as a single stream of data. With sufficient data and scale, the approach is competitive with previous domain-specific models when evaluated in a zero-shot fashion.

Abstract: Text-to-image generation has traditionally focused on finding better modeling assumptions for training on a fixed dataset. These assumptions might involve complex architectures, auxiliary losses, or side information such as object part labels or segmentation masks supplied during training. We describe a simple approach for this task based on a transformer that autoregressively models the text and image tokens as a single stream of data. With sufficient data and scale, our approach is competitive with previous domain-specific models when evaluated in a zero-shot fashion.
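
Modeling "the text and image tokens as a single stream of data" can be pictured with a short sketch. The abstract does not say how the two vocabularies are combined, so the ID-offset scheme below is an assumption, and the function names are hypothetical.

```python
def build_stream(text_tokens, image_tokens, text_vocab_size):
    """Concatenate text and image tokens into one sequence.

    Assumption: image token IDs are shifted past the text vocabulary
    so that both modalities share one ID space without collisions.
    """
    shifted = [text_vocab_size + t for t in image_tokens]
    return list(text_tokens) + shifted


def next_token_pairs(stream):
    """Autoregressive training pairs: predict stream[i] from stream[:i]."""
    return [(stream[:i], stream[i]) for i in range(1, len(stream))]
```

For example, `build_stream([3, 1], [0, 2], text_vocab_size=10)` yields `[3, 1, 10, 12]`, and every image token is then predicted from the full text prefix plus the image tokens before it.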

162 citations

Authors

Showing all 105 results

Name	H-index	Papers	Citations
Geoffrey E. Hinton	157	414	409047
Pieter Abbeel	126	589	70911
Ian Goodfellow	85	137	135390
Ilya Sutskever	75	131	235539
Kenneth O. Stanley	60	223	16921
Phillip Isola	48	101	45099
John Schulman	48	67	30168
Jeff Clune	48	140	21194
Wojciech Zaremba	39	58	34954
Elizabeth A. Barnes	39	132	5281
Igor Mordatch	36	89	6604
Dario Amodei	34	49	13108
Joel Lehman	33	98	5588
Gillian K. Hadfield	28	101	2420
Marcin Andrychowicz	28	49	6638