Reinforcement learning from demonstration through shaping

Open AccessProceedings Article

Reinforcement learning from demonstration through shaping

Tim Brys, +5 more

- pp 3352-3358

Chats0

TLDR

This paper investigates the intersection of reinforcement learning and expert demonstrations, leveraging the theoretical guarantees provided by reinforcement learning, and using expert demonstrations to speed up this learning by biasing exploration through a process called reward shaping.

Abstract:

Reinforcement learning describes how a learning agent can achieve optimal behaviour based on interactions with its environment and reward feedback. A limiting factor in reinforcement learning as employed in artificial intelligence is the need for an often prohibitively large number of environment samples before the agent reaches a desirable level of performance. Learning from demonstration is an approach that provides the agent with demonstrations by a supposed expert, from which it should derive suitable behaviour. Yet, one of the challenges of learning from demonstration is that no guarantees can be provided for the quality of the demonstrations, and thus the learned behavior. In this paper, we investigate the intersection of these two approaches, leveraging the theoretical guarantees provided by reinforcement learning, and using expert demonstrations to speed up this learning by biasing exploration through a process called reward shaping. This approach allows us to leverage human input without making an erroneous assumption regarding demonstration optimality. We show experimentally that this approach requires significantly fewer demonstrations, is more robust against suboptimality of demonstrations, and achieves much faster learning than the recently developed HAT algorithm.

Reinforcement learning from demonstration through shaping

Citations

Machine learning

Phd by thesis

Imitation Learning: A Survey of Learning Methods

Deep Q-learning from Demonstrations

Animal Intelligence: Experimental Studies

References

Reinforcement Learning: An Introduction

C4.5: Programs for Machine Learning

Machine learning

Phd by thesis

Introduction to Reinforcement Learning

Related Papers (5)

Human-level control through deep reinforcement learning

Reinforcement Learning: An Introduction

Mastering the game of Go with deep neural networks and tree search

A survey of robot learning from demonstration

Apprenticeship learning via inverse reinforcement learning