Decentralized non-communicating multiagent collision avoidance with deep reinforcement learning

doi:10.1109/ICRA.2017.7989037

Open AccessProceedings ArticleDOI

Decentralized non-communicating multiagent collision avoidance with deep reinforcement learning

- pp 285-292

TLDR

In this article, a value network is proposed to estimate the estimated time to the goal given an agent's joint configuration (positions and velocities) with its neighbors, and the value network not only admits efficient (i.e., realtime implementable) queries for finding a collision-free velocity vector, but also considers the uncertainty in the other agents' motion.

Abstract:

Finding feasible, collision-free paths for multiagent systems can be challenging, particularly in non-communicating scenarios where each agent's intent (e.g. goal) is unobservable to the others. In particular, finding time efficient paths often requires anticipating interaction with neighboring agents, the process of which can be computationally prohibitive. This work presents a decentralized multiagent collision avoidance algorithm based on a novel application of deep reinforcement learning, which effectively offloads the online computation (for predicting interaction patterns) to an offline learning procedure. Specifically, the proposed approach develops a value network that encodes the estimated time to the goal given an agent's joint configuration (positions and velocities) with its neighbors. Use of the value network not only admits efficient (i.e., real-time implementable) queries for finding a collision-free velocity vector, but also considers the uncertainty in the other agents' motion. Simulation results show more than 26% improvement in paths quality (i.e., time to reach the goal) when compared with optimal reciprocal collision avoidance (ORCA), a state-of-the-art collision avoidance strategy.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Human motion trajectory prediction: a survey:

Andrey Rudenko, +6 more

- 07 Jun 2020 -

The International Journal of Robotics Re...

TL;DR: In this article, the ability of intelligent autonomous systems to perceive, understand, and anticipate human behavior becomes increasingly important in a growing number of intelligent systems in human environments, and the ability to do so is discussed.

...read moreread less

Posted Content

CAD2RL: Real Single-Image Flight without a Single Real Image

Fereshteh Sadeghi, +1 more

- 13 Nov 2016 -

arXiv: Learning

TL;DR: In this paper, a collision avoidance policy is represented by a deep convolutional neural network that directly processes raw monocular images and outputs velocity commands, with a Monte Carlo policy evaluation algorithm that directly optimizes the network's ability to produce collision free flight.

...read moreread less

Proceedings ArticleDOI

Crowd-Robot Interaction: Crowd-Aware Robot Navigation With Attention-Based Deep Reinforcement Learning

Changan Chen, +3 more

TL;DR: This work proposes to rethink pairwise interactions with a self-attention mechanism, and jointly model Human-Robot as well as Human-Human interactions in the deep reinforcement learning framework, and captures the Human- human interactions occurring in dense crowds that indirectly affects the robot’s anticipation capability.

...read moreread less

Journal ArticleDOI

Human Motion Trajectory Prediction: A Survey

Andrey Rudenko, +6 more

- 15 May 2019 -

arXiv: Robotics

TL;DR: A survey of human motion trajectory prediction can be found in this article, where the authors provide an overview of the existing datasets and performance metrics and discuss limitations of the state-of-the-art and outline directions for further research.

...read moreread less

Journal ArticleDOI

Multi-agent deep reinforcement learning: a survey

Sven Gronauer, +1 more

- 15 Apr 2021 -

Artificial Intelligence Review

TL;DR: This article provides an overview of the current developments in the field of multi-agent deep reinforcement learning, focusing primarily on literature from recent years that combinesDeep reinforcement learning methods with a multi- agent scenario.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Book ChapterDOI

An Introduction to Reinforcement Learning

Eduardo F. Morales, +1 more

Book ChapterDOI

Reciprocal n-Body Collision Avoidance

Jur van den Berg, +3 more

TL;DR: This paper presents a formal approach to reciprocal n-body collision avoidance, where multiple mobile robots need to avoid collisions with each other while moving in a common workspace, and derives sufficient conditions for collision-free motion by reducing the problem to solving a low-dimensional linear program.

...read moreread less

Proceedings ArticleDOI

Reciprocal Velocity Obstacles for real-time multi-agent navigation

J. van den Berg, +2 more

TL;DR: This paper applies the "Reciprocal Velocity Obstacle" concept to navigation of hundreds of agents in densely populated environments containing both static and moving obstacles, and shows that real-time and scalable performance is achieved in such challenging scenarios.

...read moreread less