Collision Avoidance in Pedestrian-Rich Environments With Deep Reinforcement Learning

doi:10.1109/ACCESS.2021.3050338

Michael Everett¹, Yu Fan Chen², Jonathan P. How¹

Massachusetts Institute of Technology¹, Facebook²

08 Jan 2021, IEEE Access (IEEE), Vol. 9, pp. 10357-10377

TL;DR: This work develops an algorithm that learns collision avoidance among a variety of heterogeneous, non-communicating, dynamic agents without assuming they follow any particular behavior rules. It extends the authors' previous work by introducing a Long Short-Term Memory (LSTM) strategy that lets the algorithm use observations of an arbitrary number of other agents.

Abstract: Collision avoidance algorithms are essential for safe and efficient robot operation among pedestrians. This work proposes using deep reinforcement learning (RL) as a framework to model the complex interactions and cooperation with nearby, decision-making agents, such as pedestrians and other robots. Existing RL-based works assume homogeneity of agent properties, use specific motion models over short timescales, or lack a principled method to handle a large, possibly varying number of agents. Therefore, this work develops an algorithm that learns collision avoidance among a variety of heterogeneous, non-communicating, dynamic agents without assuming they follow any particular behavior rules. It extends our previous work by introducing a strategy using Long Short-Term Memory (LSTM) that enables the algorithm to use observations of an arbitrary number of other agents, instead of a small, fixed number of neighbors. The proposed algorithm is shown to outperform a classical collision avoidance algorithm and another deep RL-based algorithm, and to scale with the number of agents better (fewer collisions, shorter time to goal) than our previously published learning-based approach. Analysis of the LSTM provides insights into how observations of nearby agents affect the hidden state and quantifies the performance impact of various agent ordering heuristics. The learned policy generalizes to several applications beyond the training scenarios: formation control (arrangement into letters), demonstrations on a fleet of four multirotors, and demonstrations on a fully autonomous robotic vehicle capable of traveling at human walking speed among pedestrians.
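
The idea of using an LSTM to turn observations of an arbitrary number of agents into a fixed-size input can be illustrated with a minimal sketch. This is not the authors' implementation: the `TinyLSTM` class, the five-element agent tuple `(x, y, vx, vy, radius)`, the hidden size, and the farthest-first ordering heuristic are all assumptions chosen for illustration, and the weights are random rather than trained.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


class TinyLSTM:
    """Minimal single-layer LSTM cell (hypothetical, randomly initialized, untrained)."""

    def __init__(self, input_size, hidden_size, seed=0):
        rng = random.Random(seed)
        self.hidden_size = hidden_size
        n = input_size + hidden_size
        # One weight matrix and bias per gate:
        # i = input, f = forget, o = output, g = candidate cell value.
        self.W = {
            gate: [[rng.uniform(-0.1, 0.1) for _ in range(n)]
                   for _ in range(hidden_size)]
            for gate in "ifog"
        }
        self.b = {gate: [0.0] * hidden_size for gate in "ifog"}

    def _affine(self, gate, z):
        return [sum(w * v for w, v in zip(row, z)) + b
                for row, b in zip(self.W[gate], self.b[gate])]

    def step(self, x, h, c):
        """Consume one agent observation x and update (h, c)."""
        z = list(x) + list(h)
        i = [sigmoid(v) for v in self._affine("i", z)]
        f = [sigmoid(v) for v in self._affine("f", z)]
        o = [sigmoid(v) for v in self._affine("o", z)]
        g = [math.tanh(v) for v in self._affine("g", z)]
        c_new = [fj * cj + ij * gj for fj, cj, ij, gj in zip(f, c, i, g)]
        h_new = [oj * math.tanh(cj) for oj, cj in zip(o, c_new)]
        return h_new, c_new

    def encode(self, sequence):
        """Return the final hidden state after reading the whole sequence."""
        h = [0.0] * self.hidden_size
        c = [0.0] * self.hidden_size
        for x in sequence:
            h, c = self.step(x, h, c)
        return h


def encode_other_agents(lstm, ego_xy, others):
    """Map any number of agent observations to one fixed-size vector.

    Assumed ordering heuristic: farthest agent first, so the closest
    agent is read last and influences the final hidden state most recently.
    """
    ordered = sorted(
        others,
        key=lambda a: math.hypot(a[0] - ego_xy[0], a[1] - ego_xy[1]),
        reverse=True,
    )
    return lstm.encode(ordered)
```

Whether two or five agents are observed, `encode_other_agents` returns a vector of length `hidden_size`, which a downstream policy network could then consume alongside the ego agent's own state. Swapping the `key` function is one way to experiment with different ordering heuristics of the kind the abstract mentions.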
