Framing reinforcement learning from human reward

doi:10.1016/J.ARTINT.2015.03.009

W. Bradley Knox, +1 more

- 01 Aug 2015 -

Artificial Intelligence

- Vol. 225, pp 24-50

TLDR

The primary learning algorithm introduced in this article, called "vi-tamer", is the first algorithm to successfully learn non-myopically from reward generated by a human trainer; the article also shows empirically that such non-myopic valuation facilitates higher-level understanding of the task.

About:

This article was published in Artificial Intelligence on 2015-08-01 and is currently open access. It has received 61 citations to date. The article focuses on the topics: Reward-based selection & Reinforcement learning.

Citations

Journal Article

A Review of User Interface Design for Interactive Machine Learning

John Dudley, +1 more

TL;DR: A structural and behavioural model of a generalised IML system is proposed, solution principles for building effective interfaces for IML are identified, and strands of user interface research key to unlocking more efficient and productive non-expert interactive machine learning applications are highlighted.

Posted Content

Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces

Garrett Warnell, +3 more

- 28 Sep 2017 -

arXiv: Artificial Intelligence

TL;DR: Deep TAMER is proposed, an extension of the TAMER framework that leverages the representational power of deep neural networks in order to learn complex tasks in just a short amount of time with a human trainer. Its success is demonstrated by using it with just 15 minutes of human-provided feedback to train an agent that performs better than humans on the Atari game of Bowling.

Journal Article

Human-Centered Reinforcement Learning: A Survey

Guangliang Li, +3 more

- 07 May 2019 -

IEEE Transactions on Human-Machine Syste...

TL;DR: The state-of-the-art human-centered RL algorithms are described as a starting point for researchers who are beginning work in human-centered RL, and references to the most interesting and successful works are provided.

Journal Article

Social is special: A normative framework for teaching with and learning from evaluative feedback

Mark K. Ho, +3 more

- 01 Oct 2017 -

Cognition

TL;DR: It is suggested that human learning from evaluative feedback depends on inferences about communicative intent, goals and other mental states, much like learning from other sources, such as demonstration, observation and instruction.

Journal Article

A Review on Interactive Reinforcement Learning From Human Social Feedback

Jinying Lin, +5 more

- 02 Jul 2020 -

IEEE Access

TL;DR: Methods for interactive reinforcement learning agents to learn from human social feedback, and the ways of delivering that feedback, are reviewed.

References

Book

Reinforcement Learning: An Introduction

Richard S. Sutton, +1 more

TL;DR: This book provides a clear and simple account of the key ideas and algorithms of reinforcement learning, which ranges from the history of the field's intellectual foundations to the most recent developments and applications.

Journal Article

An introduction to variable and feature selection

Isabelle Guyon, +1 more

- 01 Mar 2003 -

Journal of Machine Learning Research

TL;DR: The contributions of this special issue cover a wide range of aspects of variable selection: providing a better definition of the objective function, feature construction, feature ranking, multivariate feature selection, efficient search methods, and feature validity assessment methods.

Book

Introduction to Reinforcement Learning

Richard S. Sutton, +1 more

TL;DR: In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the key ideas and algorithms of reinforcement learning.

Journal Article

A survey of robot learning from demonstration

Brenna D. Argall, +3 more

- 01 May 2009 -

Robotics and Autonomous Systems

TL;DR: A comprehensive survey of robot Learning from Demonstration (LfD), a technique that develops policies from example state-to-action mappings. The survey analyzes and categorizes the multiple ways in which examples are gathered, as well as the various techniques for policy derivation.

Journal Article

A survey of cross-validation procedures for model selection

Sylvain Arlot, +1 more

- 01 Jan 2010 -

Statistics Surveys

TL;DR: This survey intends to relate the model selection performances of cross-validation procedures to the most recent advances of model selection theory, with a particular emphasis on distinguishing empirical statements from rigorous theoretical results.
