scispace - formally typeset
D

Denis Yarats

Researcher at New York University

Publications -  34
Citations -  4467

Denis Yarats is an academic researcher from New York University. The author has contributed to research in topics: Reinforcement learning & Computer science. The author has an hindex of 15, co-authored 30 publications receiving 3124 citations. Previous affiliations of Denis Yarats include Facebook & Courant Institute of Mathematical Sciences.

Papers
More filters
Proceedings Article

Convolutional Sequence to Sequence Learning

TL;DR: The authors introduced an architecture based entirely on convolutional neural networks, where computations over all elements can be fully parallelized during training and optimization is easier since the number of nonlinearities is fixed and independent of the input length.
Posted Content

Convolutional Sequence to Sequence Learning

TL;DR: The authors introduced an architecture based entirely on convolutional neural networks, where computations over all elements can be fully parallelized during training and optimization is easier since the number of nonlinearities is fixed and independent of the input length.
Posted Content

Image Augmentation Is All You Need: Regularizing Deep Reinforcement Learning from Pixels

TL;DR: The addition of the augmentation method dramatically improves SAC's performance, enabling it to reach state-of-the-art performance on the DeepMind control suite, surpassing model-based methods and recently proposed contrastive learning (CURL).
Proceedings ArticleDOI

Deal or No Deal? End-to-End Learning of Negotiation Dialogues

TL;DR: For the first time, it is shown it is possible to train end-to-end models for negotiation, which must learn both linguistic and reasoning skills with no annotated dialogue states, and this technique dramatically improves performance.
Posted Content

Improving Sample Efficiency in Model-Free Reinforcement Learning from Images

TL;DR: A simple approach capable of matching state-of-the-art model-free and model-based algorithms on MuJoCo control tasks and demonstrating robustness to observational noise, surpassing existing approaches in this setting.