scispace - formally typeset
Open AccessJournal ArticleDOI

Voronoi-Based Multi-Robot Autonomous Exploration in Unknown Environments via Deep Reinforcement Learning

Reads0
Chats0
TLDR
A novel cooperative exploration strategy is proposed for multiple mobile robots, which reduces the overall task completion time and energy costs compared to conventional methods and enables the control policy to learn from human demonstration data and thus improve the learning speed and performance.
Abstract
Autonomous exploration is an important application of multi-vehicle systems, where a team of networked robots are coordinated to explore an unknown environment collaboratively. This technique has earned significant research interest due to its usefulness in search and rescue, fault detection and monitoring, localization and mapping, etc. In this paper, a novel cooperative exploration strategy is proposed for multiple mobile robots, which reduces the overall task completion time and energy costs compared to conventional methods. To efficiently navigate the networked robots during the collaborative tasks, a hierarchical control architecture is designed which contains a high-level decision making layer and a low-level target tracking layer. The proposed cooperative exploration approach is developed using dynamic Voronoi partitions, which minimizes duplicated exploration areas by assigning different target locations to individual robots. To deal with sudden obstacles in the unknown environment, an integrated deep reinforcement learning based collision avoidance algorithm is then proposed, which enables the control policy to learn from human demonstration data and thus improve the learning speed and performance. Finally, simulation and experimental results are provided to demonstrate the effectiveness of the proposed scheme.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

A Decentralized Cluster Formation Containment Framework for Multirobot Systems

TL;DR: A unified cluster formation containment coordination framework for networked robots that can be decomposed into two layers containing the leaders and the followers is developed.
Journal ArticleDOI

A Comprehensive Review of Coverage Path Planning in Robotics Using Classical and Heuristic Algorithms

TL;DR: In this paper, the authors reviewed the principle of CPP and discussed the development trend, including design variations and the characteristic of optimization algorithms, such as classical, heuristic, and most recent deep learning methods, and compared the advantages and disadvantages of existing CPP-based modeling in the area and target coverage.
Journal ArticleDOI

Fault-tolerant cooperative navigation of networked UAV swarms for forest fire monitoring

TL;DR: In this paper , a cooperative navigation strategy based on network graph theory is proposed to coordinate all the connected UAVs in a swarm in the presence of unknown disturbances, and the stability of the aerial swarm system is guaranteed using the Lyapunov approach.
Journal ArticleDOI

Multi-Robot Space Exploration: An Augmented Arithmetic Approach

TL;DR: In this paper, a framework for the design of a hybrid stochastic optimizer (HSO) for multi-robot space exploration is presented, which augments deterministic Coordinated Multi-Robot Exploration (CME) and stochastically Arithmetic Optimization (AO) techniques for maximizing the utility.
Journal ArticleDOI

A Review of Safe Reinforcement Learning: Methods, Theory and Applications

TL;DR: A review of the progress of safe RL from the perspectives of methods, theory and applications, and problems that are crucial for safe RL being deployed in real-world applications, coined as “2H3W” are reviewed.
References
More filters
Posted Content

Continuous control with deep reinforcement learning

TL;DR: This work presents an actor-critic, model-free algorithm based on the deterministic policy gradient that can operate over continuous action spaces, and demonstrates that for many of the tasks the algorithm can learn policies end-to-end: directly from raw pixel inputs.
Proceedings ArticleDOI

Design and use paradigms for Gazebo, an open-source multi-robot simulator

TL;DR: Gazebo is designed to fill this niche by creating a 3D dynamic multi-robot environment capable of recreating the complex worlds that would be encountered by the next generation of mobile robots.
Journal ArticleDOI

Coordinated multi-robot exploration

TL;DR: This paper presents an approach for the coordination of multiple robots, which simultaneously takes into account the cost of reaching a target point and its utility and describes how this algorithm can be extended to situations in which the communication range of the robots is limited.
Journal ArticleDOI

A Probabilistic Approach to Collaborative Multi-Robot Localization

TL;DR: This paper uses a sample-based version of Markov localization, capable of localizing mobile robots in an any-time fashion, to demonstrate drastic improvements in localization speed and accuracy when compared to conventional single-robot localization.
Proceedings ArticleDOI

Virtual-to-real deep reinforcement learning: Continuous control of mobile robots for mapless navigation

TL;DR: In this paper, a mapless motion planner is proposed by taking the sparse 10-dimensional range findings and the target position with respect to the mobile robot coordinate frame as input and the continuous steering commands as output.
Related Papers (5)