Learning to Navigate Through Complex Dynamic Environment With Modular Deep Reinforcement Learning

doi:10.1109/TG.2018.2849942

Open AccessJournal ArticleDOI

Learning to Navigate Through Complex Dynamic Environment With Modular Deep Reinforcement Learning

- Vol. 10, Iss: 4, pp 400-412

TLDR

The proposed end-to-end modular reinforcement learning architecture for a navigation task in complex dynamic environments with rapidly moving obstacles can efficiently avoid moving obstacles and complete the navigation task at a high success rate.

Abstract:

In this paper, we propose an end-to-end modular reinforcement learning architecture for a navigation task in complex dynamic environments with rapidly moving obstacles. In this architecture, the main task is divided into two subtasks: local obstacle avoidance and global navigation. For obstacle avoidance, we develop a two-stream Q-network, which processes spatial and temporal information separately and generates action values. The global navigation subtask is resolved by a conventional Q-network framework. An online learning network and an action scheduler are introduced to first combine two pretrained policies, and then continue exploring and optimizing until a stable policy is obtained. The two-stream Q-network obtains better performance than the conventional deep Q-learning approach in the obstacle avoidance subtask. Experiments on the main task demonstrate that the proposed architecture can efficiently avoid moving obstacles and complete the navigation task at a high success rate. The modular architecture enables parallel training and also demonstrates good generalization capability in different environments.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

A Survey of Machine Learning for Indoor Positioning

Ahasanun Nessa, +3 more

- 19 Nov 2020 -

IEEE Access

TL;DR: A comprehensive survey of ML enabled localization techniques using most common wireless technologies for accurate indoor positioning and how the ML algorithms can be effectively used for fusing different technologies and algorithms to achieve a comprehensive IPS is provided.

...read moreread less

Journal ArticleDOI

Deep reinforcement learning based mobile robot navigation: A review

Kai Zhu, +1 more

- 20 Apr 2021 -

Tsinghua Science & Technology

TL;DR: This paper systematically compares and analyzes the relationship and differences between four typical application scenarios: local obstacle avoidance, indoor navigation, multi-robot navigation, and social navigation; and describes the development of DRL-based navigation.

...read moreread less

Journal ArticleDOI

Deterministic Policy Gradient With Integral Compensator for Robust Quadrotor Control

Yuanda Wang, +3 more

- 01 Oct 2020 -

IEEE Transactions on Systems, Man, and C...

TL;DR: A deep reinforcement learning-based robust control strategy for quadrotor helicopters which introduces an integral compensator to the actor-critic structure and shows that the online learning could significantly improve the control performance.

...read moreread less

Journal ArticleDOI

End-to-End Navigation Strategy With Deep Reinforcement Learning for Mobile Robots

Haobin Shi, +3 more

- 01 Apr 2020 -

IEEE Transactions on Industrial Informat...

TL;DR: An end-to-end navigation planner that translates sparse laser ranging results into movement actions and achieves map-less navigation in complex environments through a reward signal that is enhanced by intrinsic motivation, the agent explores more efficiently, and the learned strategy is more reliable.

...read moreread less

Journal ArticleDOI

Cooperative control for multi-player pursuit-evasion games with reinforcement learning

Yuanda Wang, +2 more

- 28 Oct 2020 -

Neurocomputing

TL;DR: The training and evaluation results demonstrate that the pursuit team could learn highly efficient cooperative control and communication policies and can capture a superior evader driven by an intelligent escape policy with a high success rate.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings Article

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.

...read moreread less

Journal ArticleDOI

Deep learning

Yann LeCun, +4 more

- 28 May 2015 -

Nature

TL;DR: Deep learning is making major advances in solving problems that have resisted the best attempts of the artificial intelligence community for many years, and will have many more successes in the near future because it requires very little engineering by hand and can easily take advantage of increases in the amount of available computation and data.

...read moreread less

Book

Reinforcement Learning: An Introduction

Richard S. Sutton, +1 more

TL;DR: This book provides a clear and simple account of the key ideas and algorithms of reinforcement learning, which ranges from the history of the field's intellectual foundations to the most recent developments and applications.

...read moreread less

Collapse

IEEE Robotics & Automation Magazine

Learning Navigation Behaviors End-to-End With AutoRL

Hao-Tien Lewis Chiang, +3 more

Learning to Navigate Through Complex Dynamic Environment With Modular Deep Reinforcement Learning

Citations

A Survey of Machine Learning for Indoor Positioning

Deep reinforcement learning based mobile robot navigation: A review

Deterministic Policy Gradient With Integral Compensator for Robust Quadrotor Control

End-to-End Navigation Strategy With Deep Reinforcement Learning for Mobile Robots

Cooperative control for multi-player pursuit-evasion games with reinforcement learning

References

Adam: A Method for Stochastic Optimization

Deep learning

Reinforcement Learning: An Introduction

Human-level control through deep reinforcement learning

Mastering the game of Go with deep neural networks and tree search

Related Papers (5)

Human-level control through deep reinforcement learning

Virtual-to-real deep reinforcement learning: Continuous control of mobile robots for mapless navigation

Target-driven visual navigation in indoor scenes using deep reinforcement learning

The dynamic window approach to collision avoidance

Learning Navigation Behaviors End-to-End With AutoRL