Reinforcement Learning for Improving Agent Design.

doi:10.1162/ARTL_A_00301

Open Access Journal Article

David Ha

- 20 Nov 2019 -

Artificial Life

- Vol. 25, Iss: 4, pp 352-365

Abstract:

In many reinforcement learning tasks, the goal is to learn a policy to manipulate an agent, whose design is fixed, to maximize some notion of cumulative reward. The design of the agent's physical s...

Citations

Posted Content

Paired Open-Ended Trailblazer (POET): Endlessly Generating Increasingly Complex and Diverse Learning Environments and Their Solutions

Rui Wang, +3 more

- 07 Jan 2019 -

arXiv: Neural and Evolutionary Computing

TL;DR: The Paired Open-Ended Trailblazer (POET) algorithm is introduced, which pairs the generation of environmental challenges and the optimization of agents to solve those challenges and allows these stepping-stone solutions to transfer between problems if better, catalyzing innovation.

Proceedings Article

Weight Agnostic Neural Networks

Adam Gaier, +1 more

TL;DR: The authors ask how important a neural network's weight parameters are compared to its architecture, and to what extent architecture alone, without learning any weight parameters, can encode solutions for a given task. They propose a search method for neural network architectures that can already perform a task without any explicit weight training.

Posted Content

AI-GAs: AI-generating algorithms, an alternate paradigm for producing general artificial intelligence

Jeff Clune

- 27 May 2019 -

arXiv: Artificial Intelligence

TL;DR: It is argued that the pursuit of AI-GAs should be considered a new grand challenge of computer science research and the ML community should increase its research investment in the AI-GA approach.

Teacher algorithms for curriculum learning of Deep RL in continuously parameterized environments

Rémy Portelas, +3 more

TL;DR: This paper considers the problem of how a teacher algorithm can enable an unknown Deep Reinforcement Learning (DRL) student to become good at a skill over a wide range of diverse environments.

Journal Article

Shape Changing Robots: Bioinspiration, Simulation, and Physical Realization.

Dylan S. Shah, +6 more

- 01 May 2021 -

Advanced Materials

TL;DR: Presents an overview of the literature on robots that change shape to enhance and expand their functionality, and discusses related grand challenges, including shape sensing, finding, and changing, which rely on innovations in multifunctional materials, distributed actuation and sensing, and somatic control to enable next-generation shape-changing robots.

References

Journal Article

Long short-term memory

Sepp Hochreiter, +1 more

- 01 Nov 1997 -

Neural Computation

TL;DR: A novel, efficient, gradient-based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.

Posted Content

Proximal Policy Optimization Algorithms

John Schulman, +4 more

- 20 Jul 2017 -

arXiv: Learning

TL;DR: A new family of policy gradient methods for reinforcement learning, which alternate between sampling data through interaction with the environment, and optimizing a "surrogate" objective function using stochastic gradient ascent, are proposed.

Journal Article

Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning

Ronald J. Williams

- 01 May 1992 -

Machine Learning

TL;DR: This article presents a general class of associative reinforcement learning algorithms for connectionist networks containing stochastic units. These algorithms are shown to make weight adjustments in a direction that lies along the gradient of expected reinforcement in both immediate-reinforcement tasks and certain limited forms of delayed-reinforcement tasks, and they do this without explicitly computing gradient estimates.

Proceedings Article

Asynchronous methods for deep reinforcement learning

Volodymyr Mnih, +7 more

TL;DR: A conceptually simple and lightweight framework for deep reinforcement learning that uses asynchronous gradient descent for optimization of deep neural network controllers and shows that asynchronous actor-critic succeeds on a wide variety of continuous motor control problems as well as on a new task of navigating random 3D mazes using a visual input.

Proceedings Article

MuJoCo: A physics engine for model-based control

Emanuel Todorov, +2 more

TL;DR: A new physics engine tailored to model-based control, based on the modern velocity-stepping approach, which avoids the difficulties with spring-dampers. The engine can compute both forward and inverse dynamics.

Related Papers (5)

Proximal Policy Optimization Algorithms

MuJoCo: A physics engine for model-based control

Automatic design and manufacture of robotic lifeforms

Evolving virtual creatures

Scalable co-optimization of morphology and control in embodied machines