Voronoi-Based Multi-Robot Autonomous Exploration in Unknown Environments via Deep Reinforcement Learning
Reads0
Chats0
TLDR
A novel cooperative exploration strategy is proposed for multiple mobile robots, which reduces the overall task completion time and energy costs compared to conventional methods and enables the control policy to learn from human demonstration data and thus improve the learning speed and performance.Abstract:
Autonomous exploration is an important application of multi-vehicle systems, where a team of networked robots are coordinated to explore an unknown environment collaboratively. This technique has earned significant research interest due to its usefulness in search and rescue, fault detection and monitoring, localization and mapping, etc. In this paper, a novel cooperative exploration strategy is proposed for multiple mobile robots, which reduces the overall task completion time and energy costs compared to conventional methods. To efficiently navigate the networked robots during the collaborative tasks, a hierarchical control architecture is designed which contains a high-level decision making layer and a low-level target tracking layer. The proposed cooperative exploration approach is developed using dynamic Voronoi partitions, which minimizes duplicated exploration areas by assigning different target locations to individual robots. To deal with sudden obstacles in the unknown environment, an integrated deep reinforcement learning based collision avoidance algorithm is then proposed, which enables the control policy to learn from human demonstration data and thus improve the learning speed and performance. Finally, simulation and experimental results are provided to demonstrate the effectiveness of the proposed scheme.read more
Citations
More filters
Journal ArticleDOI
A Decentralized Cluster Formation Containment Framework for Multirobot Systems
TL;DR: A unified cluster formation containment coordination framework for networked robots that can be decomposed into two layers containing the leaders and the followers is developed.
Journal ArticleDOI
A Comprehensive Review of Coverage Path Planning in Robotics Using Classical and Heuristic Algorithms
TL;DR: In this paper, the authors reviewed the principle of CPP and discussed the development trend, including design variations and the characteristic of optimization algorithms, such as classical, heuristic, and most recent deep learning methods, and compared the advantages and disadvantages of existing CPP-based modeling in the area and target coverage.
Journal ArticleDOI
Fault-tolerant cooperative navigation of networked UAV swarms for forest fire monitoring
TL;DR: In this paper , a cooperative navigation strategy based on network graph theory is proposed to coordinate all the connected UAVs in a swarm in the presence of unknown disturbances, and the stability of the aerial swarm system is guaranteed using the Lyapunov approach.
Journal ArticleDOI
Multi-Robot Space Exploration: An Augmented Arithmetic Approach
TL;DR: In this paper, a framework for the design of a hybrid stochastic optimizer (HSO) for multi-robot space exploration is presented, which augments deterministic Coordinated Multi-Robot Exploration (CME) and stochastically Arithmetic Optimization (AO) techniques for maximizing the utility.
Journal ArticleDOI
A Review of Safe Reinforcement Learning: Methods, Theory and Applications
Shangding Gu,Longyu Yang,Yali Du,Guang Chen,Florian Walter,Jun Wang,Yaodong Yang,Alois Knoll +7 more
TL;DR: A review of the progress of safe RL from the perspectives of methods, theory and applications, and problems that are crucial for safe RL being deployed in real-world applications, coined as “2H3W” are reviewed.
References
More filters
Posted Content
Continuous control with deep reinforcement learning
Timothy P. Lillicrap,Jonathan J. Hunt,Alexander Pritzel,Nicolas Heess,Tom Erez,Yuval Tassa,David Silver,Daan Wierstra +7 more
TL;DR: This work presents an actor-critic, model-free algorithm based on the deterministic policy gradient that can operate over continuous action spaces, and demonstrates that for many of the tasks the algorithm can learn policies end-to-end: directly from raw pixel inputs.
Proceedings ArticleDOI
Design and use paradigms for Gazebo, an open-source multi-robot simulator
Nathan Koenig,Andrew Howard +1 more
TL;DR: Gazebo is designed to fill this niche by creating a 3D dynamic multi-robot environment capable of recreating the complex worlds that would be encountered by the next generation of mobile robots.
Journal ArticleDOI
Coordinated multi-robot exploration
TL;DR: This paper presents an approach for the coordination of multiple robots, which simultaneously takes into account the cost of reaching a target point and its utility and describes how this algorithm can be extended to situations in which the communication range of the robots is limited.
Journal ArticleDOI
A Probabilistic Approach to Collaborative Multi-Robot Localization
TL;DR: This paper uses a sample-based version of Markov localization, capable of localizing mobile robots in an any-time fashion, to demonstrate drastic improvements in localization speed and accuracy when compared to conventional single-robot localization.
Proceedings ArticleDOI
Virtual-to-real deep reinforcement learning: Continuous control of mobile robots for mapless navigation
Lei Tai,Giuseppe Paolo,Ming Liu +2 more
TL;DR: In this paper, a mapless motion planner is proposed by taking the sparse 10-dimensional range findings and the target position with respect to the mobile robot coordinate frame as input and the continuous steering commands as output.