scispace - formally typeset
O

Olivier Pietquin

Researcher at Google

Publications -  248
Citations -  7478

Olivier Pietquin is an academic researcher from Google. The author has contributed to research in topics: Reinforcement learning & Markov decision process. The author has an hindex of 35, co-authored 228 publications receiving 6279 citations. Previous affiliations of Olivier Pietquin include University of Grenoble & university of lille.

Papers
More filters
Posted Content

Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards

TL;DR: A general and model-free approach for Reinforcement Learning on real robotics with sparse rewards built upon the Deep Deterministic Policy Gradient algorithm to use demonstrations that out-performs DDPG, and does not require engineered rewards.
Posted Content

Deep Q-learning from Demonstrations

TL;DR: Deep Q-learning from Demonstrations (DQfD) as mentioned in this paper leverages small sets of demonstration data to massively accelerate the learning process, and is able to automatically assess the necessary ratio of demonstrating data while learning thanks to a prioritized replay mechanism.
Proceedings Article

Noisy Networks For Exploration

TL;DR: It is found that replacing the conventional exploration heuristics for A3C, DQN and dueling agents with NoisyNet yields substantially higher scores for a wide range of Atari games, in some cases advancing the agent from sub to super-human performance.
Proceedings Article

Modulating early visual processing by language

TL;DR: In this article, a conditional batch normalization (CBN) is used to modulate convolutional feature maps by a linguistic embedding, leading to the MODulatEd ResNet (MRN) architecture.