Deep visual foresight for planning robot motion

doi:10.1109/ICRA.2017.7989324

Open AccessProceedings ArticleDOI

Deep visual foresight for planning robot motion

- pp 2786-2793

TLDR

This work develops a method for combining deep action-conditioned video prediction models with model-predictive control that uses entirely unlabeled training data and enables a real robot to perform nonprehensile manipulation — pushing objects — and can handle novel objects not seen during training.

Abstract:

A key challenge in scaling up robot learning to many skills and environments is removing the need for human supervision, so that robots can collect their own data and improve their own performance without being limited by the cost of requesting human feedback. Model-based reinforcement learning holds the promise of enabling an agent to learn to predict the effects of its actions, which could provide flexible predictive models for a wide range of tasks and environments, without detailed human supervision. We develop a method for combining deep action-conditioned video prediction models with model-predictive control that uses entirely unlabeled training data. Our approach does not require a calibrated camera, an instrumented training set-up, nor precise sensing and actuation. Our results show that our method enables a real robot to perform nonprehensile manipulation — pushing objects — and can handle novel objects not seen during training.

Citations

PDF

Open Access

More filters

Posted Content

NIPS 2016 Tutorial: Generative Adversarial Networks

Ian Goodfellow

- 31 Dec 2016 -

arXiv: Learning

TL;DR: This report summarizes the tutorial presented by the author at NIPS 2016 on generative adversarial networks (GANs), and describes state-of-the-art image models that combine GANs with other methods.

...read moreread less

Proceedings ArticleDOI

MagNet: A Two-Pronged Defense against Adversarial Examples

Dongyu Meng, +1 more

TL;DR: MagNet, a framework for defending neural network classifiers against adversarial examples, is proposed and it is shown empirically that MagNet is effective against the most advanced state-of-the-art attacks in blackbox and graybox scenarios without sacrificing false positive rate on normal examples.

...read moreread less

Posted Content

Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems

Sergey Levine, +3 more

- 04 May 2020 -

arXiv: Learning

TL;DR: This tutorial article aims to provide the reader with the conceptual tools needed to get started on research on offline reinforcement learning algorithms: reinforcementlearning algorithms that utilize previously collected data, without additional online data collection.

...read moreread less

Posted Content

Deep Reinforcement Learning: An Overview

Yuxi Li

- 25 Jan 2017 -

arXiv: Learning

TL;DR: This work discusses core RL elements, including value function, in particular, Deep Q-Network (DQN), policy, reward, model, planning, and exploration, and important mechanisms for RL, including attention and memory, unsupervised learning, transfer learning, multi-agent RL, hierarchical RL, and learning to learn.

...read moreread less

Journal ArticleDOI

Deep Reinforcement Learning for Multiagent Systems: A Review of Challenges, Solutions, and Applications

Thanh Nguyen, +2 more

- 20 Mar 2020 -

IEEE Transactions on Systems, Man, and C...

TL;DR: A survey of different approaches to problems related to multiagent deep RL (MADRL) is presented, including nonstationarity, partial observability, continuous state and action spaces, multiagent training schemes, and multiagent transfer learning.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings Article

Auto-Encoding Variational Bayes

Diederik P. Kingma, +1 more

TL;DR: A stochastic variational inference and learning algorithm that scales to large datasets and, under some mild differentiability conditions, even works in the intractable case is introduced.

...read moreread less

Book

A Mathematical Introduction to Robotic Manipulation

Richard M. Murray, +2 more

TL;DR: In this paper, the authors present a detailed overview of the history of multifingered hands and dextrous manipulation, and present a mathematical model for steerable and non-driveable hands.

...read moreread less

Proceedings Article

R-FCN: Object Detection via Region-based Fully Convolutional Networks

Jifeng Dai, +3 more

TL;DR: R-FCN as mentioned in this paper proposes position-sensitive score maps to address the dilemma between translation-invariance in image classification and translation-variance in object detection, and achieves state-of-the-art performance on the PASCAL VOC dataset.

...read moreread less

Posted Content

Layer Normalization

Jimmy Ba, +2 more

- 21 Jul 2016 -

arXiv: Machine Learning

TL;DR: In this paper, layer normalization is applied to recurrent neural networks by computing the mean and variance used for normalization from all of the summed inputs to the neurons in a layer on a single training case.

...read moreread less

Collapse

Related Papers (5)

End-to-end training of deep visuomotor policies

Sergey Levine, +3 more

- 01 Jan 2016 -

Journal of Machine Learning Research

Deep visual foresight for planning robot motion

Citations

NIPS 2016 Tutorial: Generative Adversarial Networks

MagNet: A Two-Pronged Defense against Adversarial Examples

Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems

Deep Reinforcement Learning: An Overview

Deep Reinforcement Learning for Multiagent Systems: A Review of Challenges, Solutions, and Applications

References

Human-level control through deep reinforcement learning

Auto-Encoding Variational Bayes

A Mathematical Introduction to Robotic Manipulation

R-FCN: Object Detection via Region-based Fully Convolutional Networks

Layer Normalization

Related Papers (5)

End-to-end training of deep visuomotor policies

Human-level control through deep reinforcement learning

Adam: A Method for Stochastic Optimization

Deep Residual Learning for Image Recognition

Auto-Encoding Variational Bayes