scispace - formally typeset
Open AccessProceedings ArticleDOI

Decentralized non-communicating multiagent collision avoidance with deep reinforcement learning

TLDR
In this article, a value network is proposed to estimate the estimated time to the goal given an agent's joint configuration (positions and velocities) with its neighbors, and the value network not only admits efficient (i.e., realtime implementable) queries for finding a collision-free velocity vector, but also considers the uncertainty in the other agents' motion.
Abstract
Finding feasible, collision-free paths for multiagent systems can be challenging, particularly in non-communicating scenarios where each agent's intent (e.g. goal) is unobservable to the others. In particular, finding time efficient paths often requires anticipating interaction with neighboring agents, the process of which can be computationally prohibitive. This work presents a decentralized multiagent collision avoidance algorithm based on a novel application of deep reinforcement learning, which effectively offloads the online computation (for predicting interaction patterns) to an offline learning procedure. Specifically, the proposed approach develops a value network that encodes the estimated time to the goal given an agent's joint configuration (positions and velocities) with its neighbors. Use of the value network not only admits efficient (i.e., real-time implementable) queries for finding a collision-free velocity vector, but also considers the uncertainty in the other agents' motion. Simulation results show more than 26% improvement in paths quality (i.e., time to reach the goal) when compared with optimal reciprocal collision avoidance (ORCA), a state-of-the-art collision avoidance strategy.

read more

Citations
More filters
Journal ArticleDOI

Human motion trajectory prediction: a survey:

TL;DR: In this article, the ability of intelligent autonomous systems to perceive, understand, and anticipate human behavior becomes increasingly important in a growing number of intelligent systems in human environments, and the ability to do so is discussed.
Posted Content

CAD2RL: Real Single-Image Flight without a Single Real Image

TL;DR: In this paper, a collision avoidance policy is represented by a deep convolutional neural network that directly processes raw monocular images and outputs velocity commands, with a Monte Carlo policy evaluation algorithm that directly optimizes the network's ability to produce collision free flight.
Proceedings ArticleDOI

Crowd-Robot Interaction: Crowd-Aware Robot Navigation With Attention-Based Deep Reinforcement Learning

TL;DR: This work proposes to rethink pairwise interactions with a self-attention mechanism, and jointly model Human-Robot as well as Human-Human interactions in the deep reinforcement learning framework, and captures the Human- human interactions occurring in dense crowds that indirectly affects the robot’s anticipation capability.
Journal ArticleDOI

Human Motion Trajectory Prediction: A Survey

TL;DR: A survey of human motion trajectory prediction can be found in this article, where the authors provide an overview of the existing datasets and performance metrics and discuss limitations of the state-of-the-art and outline directions for further research.
Journal ArticleDOI

Multi-agent deep reinforcement learning: a survey

TL;DR: This article provides an overview of the current developments in the field of multi-agent deep reinforcement learning, focusing primarily on literature from recent years that combinesDeep reinforcement learning methods with a multi- agent scenario.
References
More filters
Journal ArticleDOI

Human-level control through deep reinforcement learning

TL;DR: This work bridges the divide between high-dimensional sensory inputs and actions, resulting in the first artificial agent that is capable of learning to excel at a diverse array of challenging tasks.
Journal ArticleDOI

Mastering the game of Go with deep neural networks and tree search

TL;DR: Using this search algorithm, the program AlphaGo achieved a 99.8% winning rate against other Go programs, and defeated the human European Go champion by 5 games to 0.5, the first time that a computer program has defeated a human professional player in the full-sized game of Go.
Book ChapterDOI

Reciprocal n-Body Collision Avoidance

TL;DR: This paper presents a formal approach to reciprocal n-body collision avoidance, where multiple mobile robots need to avoid collisions with each other while moving in a common workspace, and derives sufficient conditions for collision-free motion by reducing the problem to solving a low-dimensional linear program.
Proceedings ArticleDOI

Reciprocal Velocity Obstacles for real-time multi-agent navigation

TL;DR: This paper applies the "Reciprocal Velocity Obstacle" concept to navigation of hundreds of agents in densely populated environments containing both static and moving obstacles, and shows that real-time and scalable performance is achieved in such challenging scenarios.
Related Papers (5)