
Showing papers by "Rob Fergus published in 2020"


Posted Content
TL;DR: The addition of the augmentation method dramatically improves SAC's performance, enabling it to reach state-of-the-art performance on the DeepMind control suite, surpassing model-based methods and the recently proposed contrastive learning method CURL.
Abstract: We propose a simple data augmentation technique that can be applied to standard model-free reinforcement learning algorithms, enabling robust learning directly from pixels without the need for auxiliary losses or pre-training. The approach leverages input perturbations commonly used in computer vision tasks to regularize the value function. Existing model-free approaches, such as Soft Actor-Critic (SAC), are not able to train deep networks effectively from image pixels. However, the addition of our augmentation method dramatically improves SAC's performance, enabling it to reach state-of-the-art performance on the DeepMind control suite, surpassing model-based (Dreamer, PlaNet, and SLAC) methods and recently proposed contrastive learning (CURL). Our approach can be combined with any model-free reinforcement learning algorithm, requiring only minor modifications. An implementation can be found at this https URL.

395 citations
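The core idea in the abstract above, perturbing pixel observations with standard vision augmentations to regularize the value function, can be sketched as follows. This is a minimal illustration assuming a pad-and-random-shift augmentation; the function names (`random_shift`, `averaged_q_target`) are hypothetical, not the authors' code:

```python
import numpy as np

def random_shift(obs, pad=4, rng=None):
    """Pad an H x W x C image by edge replication, then crop a random
    window of the original size. A sketch of the shift-style input
    perturbation the abstract describes."""
    if rng is None:
        rng = np.random.default_rng()
    h, w, _ = obs.shape
    padded = np.pad(obs, ((pad, pad), (pad, pad), (0, 0)), mode="edge")
    top = rng.integers(0, 2 * pad + 1)
    left = rng.integers(0, 2 * pad + 1)
    return padded[top:top + h, left:left + w]

def averaged_q_target(q_fn, next_obs, k=2):
    """Regularize the value function by averaging a Q estimate over k
    independently augmented views of the same observation."""
    return np.mean([q_fn(random_shift(next_obs)) for _ in range(k)], axis=0)
```

Because the augmentation is applied to inputs of an otherwise standard algorithm, this requires only minor changes to an existing SAC-style critic update, matching the abstract's claim.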


Posted Content
TL;DR: This paper compares three approaches for automatically finding an appropriate augmentation and shows that the resulting agent outperforms other baselines specifically designed to improve generalization in RL, learning policies and representations that are more robust to environment changes that do not affect the agent.
Abstract: Deep reinforcement learning (RL) agents often fail to generalize to unseen scenarios, even when they are trained on many instances of semantically similar environments. Data augmentation has recently been shown to improve the sample efficiency and generalization of RL agents. However, different tasks tend to benefit from different kinds of data augmentation. In this paper, we compare three approaches for automatically finding an appropriate augmentation. These are combined with two novel regularization terms for the policy and value function, required to make the use of data augmentation theoretically sound for certain actor-critic algorithms. We evaluate our methods on the Procgen benchmark, which consists of 16 procedurally-generated environments, and show that it improves test performance by ~40% relative to standard RL algorithms. Our agent outperforms other baselines specifically designed to improve generalization in RL. In addition, we show that our agent learns policies and representations that are more robust to changes in the environment that do not affect the agent, such as the background. Our implementation is available at this https URL.

76 citations
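The two regularization terms the abstract mentions can be sketched for a discrete-action actor-critic: the policy on an augmented observation is pushed toward the policy on the original observation, and likewise for the value. A hedged illustration with hypothetical names, not the paper's implementation:

```python
import numpy as np

def augmentation_regularizers(obs, augment, policy_fn, value_fn):
    """Sketch of policy/value regularization under data augmentation:
    penalize disagreement between the agent's outputs on an observation
    and on its augmented counterpart. Illustrative, not the paper's code."""
    aug_obs = augment(obs)
    pi, pi_aug = policy_fn(obs), policy_fn(aug_obs)
    # KL(pi(.|s) || pi(.|aug(s))) over a discrete action distribution
    g_pi = float(np.sum(pi * (np.log(pi) - np.log(pi_aug))))
    # squared difference between the values of the two views
    g_v = (value_fn(obs) - value_fn(aug_obs)) ** 2
    return g_pi, g_v
```

Both terms are zero when the agent's policy and value are already invariant to the augmentation, so adding them to the actor-critic loss only penalizes augmentation-sensitive behavior.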


Posted Content
TL;DR: An investigation of the model's outputs and hidden representations finds that it captures physicochemical properties relevant to protein energy.
Abstract: We propose an energy-based model (EBM) of protein conformations that operates at atomic scale. The model is trained solely on crystallized protein data. By contrast, existing approaches for scoring conformations use energy functions that incorporate knowledge of physical principles and features that are the complex product of several decades of research and tuning. To evaluate the model, we benchmark on the rotamer recovery task, the problem of predicting the conformation of a side chain from its context within a protein structure, which has been used to evaluate energy functions for protein design. The model achieves performance close to that of the Rosetta energy function, a state-of-the-art method widely used in protein structure prediction and design. An investigation of the model's outputs and hidden representations finds that it captures physicochemical properties relevant to protein energy.

30 citations


Proceedings Article
30 Apr 2020
TL;DR: In this paper, an energy-based model (EBM) of protein conformations that operates at atomic scale is proposed; the model is trained solely on crystallized protein data.
Abstract: We propose an energy-based model (EBM) of protein conformations that operates at atomic scale. The model is trained solely on crystallized protein data. By contrast, existing approaches for scoring conformations use energy functions that incorporate knowledge of physical principles and features that are the complex product of several decades of research and tuning. To evaluate our model, we benchmark on the rotamer recovery task, a restricted problem setting used to evaluate energy functions for protein design. Our model achieves comparable performance to the Rosetta energy function, a state-of-the-art method widely used in protein structure prediction and design. An investigation of the model's outputs and hidden representations finds that it captures physicochemical properties relevant to protein energy.

17 citations
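The rotamer recovery benchmark both EBM entries describe can be sketched generically: score every candidate side-chain conformation with an energy function (learned or physics-based) and predict the lowest-energy one, reporting the fraction that matches the native rotamer. A hypothetical illustration, not the authors' pipeline:

```python
import numpy as np

def rotamer_recovery(energy_fn, contexts, candidates, native_idx):
    """For each side-chain environment, score every candidate conformation
    with the given energy function, predict the lowest-energy candidate,
    and report the fraction matching the native rotamer. Illustrative names."""
    correct = 0
    for ctx, cands, native in zip(contexts, candidates, native_idx):
        energies = [energy_fn(ctx, c) for c in cands]
        if int(np.argmin(energies)) == native:
            correct += 1
    return correct / len(contexts)
```

Because the metric only compares energies of alternatives at a fixed site, it evaluates learned and hand-crafted energy functions on equal footing, which is why the abstracts can compare directly against Rosetta.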


Posted Content
TL;DR: Policy-Dynamics Value Functions (PD-VF), a novel approach for rapidly adapting to dynamics different from those previously seen in training, is introduced and shown to adapt quickly to new dynamics on a set of MuJoCo domains.
Abstract: Standard RL algorithms assume fixed environment dynamics and require a significant amount of interaction to adapt to new environments. We introduce Policy-Dynamics Value Functions (PD-VF), a novel approach for rapidly adapting to dynamics different from those previously seen in training. PD-VF explicitly estimates the cumulative reward in a space of policies and environments. An ensemble of conventional RL policies is used to gather experience on training environments, from which embeddings of both policies and environments can be learned. Then, a value function conditioned on both embeddings is trained. At test time, a few actions are sufficient to infer the environment embedding, enabling a policy to be selected by maximizing the learned value function (which requires no additional environment interaction). We show that our method can rapidly adapt to new dynamics on a set of MuJoCo domains. Code available at this https URL.

10 citations
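The test-time procedure in the abstract, selecting a policy by maximizing the learned value function over policy embeddings once the environment embedding has been inferred from a few actions, can be sketched as follows. The names are illustrative; the real method learns both embedding spaces and the value network during training:

```python
import numpy as np

def select_policy(value_fn, policy_embs, env_emb):
    """Given an environment embedding inferred from a few initial actions,
    return the index of the policy embedding that maximizes the learned
    value function. No further environment interaction is required."""
    scores = [value_fn(z_pi, env_emb) for z_pi in policy_embs]
    best = int(np.argmax(scores))
    return best, scores[best]
```

The key design choice is that adaptation reduces to an argmax in embedding space, which is why only a handful of actions (to infer the environment embedding) are needed at test time.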