Institution

OpenAI

About: OpenAI is a based out in . It is known for research contribution in the topics: Reinforcement learning & Artificial neural network. The organization has 105 authors who have published 213 publications receiving 68067 citations. The organization is also known as: Open AI & OpenAI LP.

...read moreread less

Topics: Reinforcement learning, Artificial neural network, Computer science, Language model, Deep learning ...read more

Papers published on a yearly basis

Papers

PDF

Open Access

More filters

Proceedings Article•

Improved Variational Inference with Inverse Autoregressive Flow

[...]

Durk P. Kingma, Tim Salimans¹, Rafal Jozefowicz², Xi Chen³, Ilya Sutskever², Max Welling⁴ - Show less +2 more•Institutions (4)

OpenAI¹, Google², University of California, Berkeley³, University of Amsterdam⁴

01 Jan 2016

TL;DR: A new type of normalizing flow, inverse autoregressive flow (IAF), is proposed that, in contrast to earlier published flows, scales well to high-dimensional latent spaces and significantly improves upon diagonal Gaussian approximate posteriors.

...read moreread less

Abstract: The framework of normalizing flows provides a general strategy for flexible variational inference of posteriors over latent variables. We propose a new type of normalizing flow, inverse autoregressive flow (IAF), that, in contrast to earlier published flows, scales well to high-dimensional latent spaces. The proposed flow consists of a chain of invertible transformations, where each transformation is based on an autoregressive neural network. In experiments, we show that IAF significantly improves upon diagonal Gaussian approximate posteriors. In addition, we demonstrate that a novel type of variational autoencoder, coupled with IAF, is competitive with neural autoregressive models in terms of attained log-likelihood on natural images, while allowing significantly faster synthesis.

...read moreread less

901 citations

Proceedings Article•

Generative Pretraining From Pixels

[...]

Mark Chen¹, Alec Radford¹, Rewon Child¹, Jeffrey Wu¹, Heewoo Jun¹, David Luan¹, Ilya Sutskever¹ - Show less +3 more•Institutions (1)

OpenAI¹

12 Jul 2020

TL;DR: This work trains a sequence Transformer to auto-regressively predict pixels, without incorporating knowledge of the 2D input structure, and finds that a GPT-2 scale model learns strong image representations as measured by linear probing, fine-tuning, and low-data classification.

...read moreread less

Abstract: Inspired by progress in unsupervised representation learning for natural language, we examine whether similar models can learn useful representations for images. We train a sequence Transformer to auto-regressively predict pixels, without incorporating knowledge of the 2D input structure. Despite training on low-resolution ImageNet without labels, we find that a GPT-2 scale model learns strong image representations as measured by linear probing, fine-tuning, and low-data classification. On CIFAR-10, we achieve 96.3% accuracy with a linear probe, outperforming a supervised Wide ResNet, and 99.0% accuracy with full finetuning, matching the top supervised pre-trained models. An even larger model trained on a mixture of ImageNet and web images is competitive with self-supervised benchmarks on ImageNet, achieving 72.0% top-1 accuracy on a linear probe of our features.

...read moreread less

849 citations

Posted Content•

Practical Black-Box Attacks against Machine Learning

[...]

Nicolas Papernot¹, Patrick McDaniel¹, Ian Goodfellow², Somesh Jha³, Z. Berkay Celik¹, Ananthram Swami⁴ - Show less +2 more•Institutions (4)

Pennsylvania State University¹, OpenAI², University of Wisconsin-Madison³, United States Army Research Laboratory⁴

08 Feb 2016-arXiv: Cryptography and Security

TL;DR: In this article, a black-box attack strategy consists in training a local model to substitute for the target DNN, using inputs synthetically generated by an adversary and labeled by the targeted DNN.

...read moreread less

Abstract: Machine learning (ML) models, e.g., deep neural networks (DNNs), are vulnerable to adversarial examples: malicious inputs modified to yield erroneous model outputs, while appearing unmodified to human observers. Potential attacks include having malicious content like malware identified as legitimate or controlling vehicle behavior. Yet, all existing adversarial example attacks require knowledge of either the model internals or its training data. We introduce the first practical demonstration of an attacker controlling a remotely hosted DNN with no such knowledge. Indeed, the only capability of our black-box adversary is to observe labels given by the DNN to chosen inputs. Our attack strategy consists in training a local model to substitute for the target DNN, using inputs synthetically generated by an adversary and labeled by the target DNN. We use the local substitute to craft adversarial examples, and find that they are misclassified by the targeted DNN. To perform a real-world and properly-blinded evaluation, we attack a DNN hosted by MetaMind, an online deep learning API. We find that their DNN misclassifies 84.24% of the adversarial examples crafted with our substitute. We demonstrate the general applicability of our strategy to many ML techniques by conducting the same attack against models hosted by Amazon and Google, using logistic regression substitutes. They yield adversarial examples misclassified by Amazon and Google at rates of 96.19% and 88.94%. We also find that this black-box attack strategy is capable of evading defense strategies previously found to make adversarial example crafting harder.

...read moreread less

824 citations

Proceedings Article•

Weight normalization: a simple reparameterization to accelerate training of deep neural networks

[...]

Tim Salimans¹, Diederik P. Kingma¹•Institutions (1)

OpenAI¹

05 Dec 2016

TL;DR: A reparameterization of the weight vectors in a neural network that decouples the length of those weight vectors from their direction is presented, improving the conditioning of the optimization problem and speeding up convergence of stochastic gradient descent.

...read moreread less

Abstract: We present weight normalization: a reparameterization of the weight vectors in a neural network that decouples the length of those weight vectors from their direction. By reparameterizing the weights in this way we improve the conditioning of the optimization problem and we speed up convergence of stochastic gradient descent. Our reparameterization is inspired by batch normalization but does not introduce any dependencies between the examples in a minibatch. This means that our method can also be applied successfully to recurrent models such as LSTMs and to noise-sensitive applications such as deep reinforcement learning or generative models, for which batch normalization is less well suited. Although our method is much simpler, it still provides much of the speed-up of full batch normalization. In addition, the computational overhead of our method is lower, permitting more optimization steps to be taken in the same amount of time. We demonstrate the usefulness of our method on applications in supervised image recognition, generative modelling, and deep reinforcement learning.

...read moreread less

787 citations

Proceedings Article•

Improving Variational Inference with Inverse Autoregressive Flow

[...]

Diederik P. Kingma¹, Tim Salimans², Rafal Jozefowicz³, Xi Chen⁴, Ilya Sutskever³, Max Welling⁵ - Show less +2 more•Institutions (5)

University of Amsterdam¹, OpenAI², Google³, University of California, Berkeley⁴, Canadian Institute for Advanced Research⁵

15 Jun 2016

TL;DR: This article proposed a data transformation called inverse autoregressive flows (IAF) to transform a simple distribution over the latent variables into a much more flexible distribution, while still allowing us to compute the resulting variables' probability density function.

...read moreread less

Abstract: We propose a simple and scalable method for improving the flexibility of variational inference through a transformation with autoregressive neural networks. Autoregressive neural networks, such as RNNs or the PixelCNN, are very powerful models and potentially interesting for use as variational posterior approximation. However, ancestral sampling in such networks is a long sequential operation, and therefore typically very slow on modern parallel hardware, such as GPUs. We show that by inverting autoregressive neural networks we can obtain equally powerful posterior models from which we can sample efficiently on modern hardware. We show that such data transformations, inverse autoregressive flows (IAF), can be used to transform a simple distribution over the latent variables into a much more flexible distribution, while still allowing us to compute the resulting variables' probability density function. The method is simple to implement, can be made arbitrarily flexible and, in contrast with previous work, is well applicable to models with high-dimensional latent spaces, such as convolutional generative models. The method is applied to a novel deep architecture of variational auto-encoders. In experiments with natural images, we demonstrate that autoregressive flow leads to significant performance gains.

...read moreread less

767 citations

Collapse

Authors

Showing all 105 results

Name	H-index	Papers	Citations
Geoffrey E. Hinton	157	414	409047
Pieter Abbeel	126	589	70911
Ian Goodfellow	85	137	135390
Ilya Sutskever	75	131	235539
Kenneth O. Stanley	60	223	16921
Phillip Isola	48	101	45099
John Schulman	48	67	30168
Jeff Clune	48	140	21194
Wojciech Zaremba	39	58	34954
Elizabeth A. Barnes	39	132	5281
Igor Mordatch	36	89	6604
Dario Amodei	34	49	13108
Joel Lehman	33	98	5588
Gillian K. Hadfield	28	101	2420
Marcin Andrychowicz	28	49	6638

Network Information

Related Institutions (5)

Facebook

10.9K papers, 570.1K citations

89% related

Google

39.8K papers, 2.1M citations

88% related

Microsoft

86.9K papers, 4.1M citations

86% related

Adobe Systems

8K papers, 214.7K citations

85% related

Carnegie Mellon University

104.3K papers, 5.9M citations

84% related

Performance

Metrics

213

Papers

121,112

Citations

No. of papers from the Institution in previous years
Year	Papers
2021	29
2020	52
2019	21
2018	51
2017	36
2016	23