scispace - formally typeset
Proceedings ArticleDOI

Learning the Point Gathering Task Using Shared Value Function In Mobile Robots

TLDR
The algorithm was analyzed through simulations for up to 5 robots and the analysis indicates that communication helped robots perform significantly faster than when they acted independently - measured using the path-length of the first robot to reach the goal as the metric.
Abstract
Managing the actions of several agents to perform tasks which require coordination and cooperation pose significant research challenges. One such challenge is to synchronize the agents’ view of the system to help them take the ‘right’ actions. In this paper, we propose an algorithm called MRCPG (Mobile Robot Coordination Point Gathering Algorithm) for coordinating the actions of a team of mobile robots. The aim is to gather these robots at a particular location in a 2-dimensional plane which is determined during execution. The robots are randomly deployed in the plane and they achieve the goal by communicating periodically. In addition, we impose a Reinforcement Learning framework and the robots learn a Shared Value Function (SVF) based on scalar rewards received. The SVF is used to select the best possible action in each state until at least one robot reaches the goal. Then a Reach-distance heuristic is used to direct the remaining robots to the goal. The algorithm was analyzed through simulations for up to 5 robots and the analysis indicates that communication helped robots perform significantly faster than when they acted independently - measured using the path-length of the first robot to reach the goal as the metric. We also observed that increasing team size enhances the effect of communication and hastens task completion.

read more

Citations
More filters
Book ChapterDOI

Trust-Based Multiagent Credit Assignment (TMCA)

TL;DR: This work presents a novel approach to solving the distribution of the environmental feedback signal among learning agents in Multiagent Reinforcement Learning (MARL).
References
More filters
Journal ArticleDOI

Behavior-based formation control for multirobot teams

TL;DR: New reactive behaviors that implement formations in multirobot teams are presented and evaluated and demonstrate the value of various types of formations in autonomous, human-led and communications-restricted applications, and their appropriateness in different types of task environments.
Journal ArticleDOI

Motor Schema — Based Mobile Robot Navigation

TL;DR: A variant of the potential field method is used to produce the appropriate velocity and steering commands for the robot and demonstrates the feasibility of this approach.
Journal ArticleDOI

Distributed memoryless point convergence algorithm for mobile robots with limited visibility

TL;DR: The results of computer simulation under a more realistic model give convincing indication that the algorithm, if implemented on physical robots, will be robust against sensor and control error.
Journal ArticleDOI

Gathering of asynchronous robots with limited visibility

TL;DR: This paper presents a protocol that allows anonymous oblivious robots with limited visibility to gather in the same location in finite time, provided they have orientation (i.e., agreement on a coordinate system), indicating that, with respect to gathering, orientation is at least as powerful as instantaneous movements.
Journal ArticleDOI

Navigation strategies for multiple autonomous mobile robots moving in formation

TL;DR: A sufficient condition for the stability of a desired formation pattern for a fleet of robots each equipped with the navigation strategy based on nearest neighbor tracking is developed and simple navigation strategies for robots moving in formation are derived.