Journal ArticleDOI

Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning

Lukas Schäfer, +3 more
- 05 Jul 2022
- arXiv: abs/2207.02249
TLDR
This work discusses the problem of teamwork adaptation, in which a team of agents needs to adapt their policies to solve novel tasks with limited fine-tuning, and proposes three MATE training paradigms: independent MATE, centralised MATE, and mixed MATE, which vary in the information used for the task encoding.
Abstract
Successful deployment of multi-agent reinforcement learning often requires agents to adapt their behaviour. In this work, we discuss the problem of teamwork adaptation in which a team of agents needs to adapt their policies to solve novel tasks with limited fine-tuning. Motivated by the intuition that agents need to be able to identify and distinguish tasks in order to adapt their behaviour to the current task, we propose to learn multi-agent task embeddings (MATE). These task embeddings are trained using an encoder-decoder architecture optimised for reconstruction of the transition and reward functions which uniquely identify tasks. We show that a team of agents is able to adapt to novel tasks when provided with task embeddings. We propose three MATE training paradigms: independent MATE, centralised MATE, and mixed MATE which vary in the information used for the task encoding. We show that the embeddings learned by MATE identify tasks and provide useful information which agents leverage during adaptation to novel tasks.
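For illustration, the encoder-decoder reconstruction objective described in the abstract could be sketched roughly as below. This is not the authors' implementation: PyTorch, the module names (TaskEncoder, TaskDecoder), the network sizes, and the single-transition input format are all assumptions made here for clarity.

```python
# Minimal sketch of an encoder-decoder task embedding (hypothetical names, PyTorch assumed).
import torch
import torch.nn as nn

class TaskEncoder(nn.Module):
    """Maps a transition (o, a, r, o') to a task embedding z."""
    def __init__(self, obs_dim, act_dim, embed_dim=16, hidden=64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(2 * obs_dim + act_dim + 1, hidden), nn.ReLU(),
            nn.Linear(hidden, embed_dim),
        )

    def forward(self, obs, act, rew, next_obs):
        x = torch.cat([obs, act, rew, next_obs], dim=-1)
        return self.net(x)

class TaskDecoder(nn.Module):
    """Reconstructs the next observation and reward from (z, o, a)."""
    def __init__(self, obs_dim, act_dim, embed_dim=16, hidden=64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(embed_dim + obs_dim + act_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, obs_dim + 1),  # predicted next_obs plus scalar reward
        )

    def forward(self, z, obs, act):
        out = self.net(torch.cat([z, obs, act], dim=-1))
        return out[..., :-1], out[..., -1:]  # predicted next_obs, predicted reward

def reconstruction_loss(encoder, decoder, obs, act, rew, next_obs):
    """Reconstruction objective that trains the task embedding (rew has shape [..., 1])."""
    z = encoder(obs, act, rew, next_obs)
    pred_next_obs, pred_rew = decoder(z, obs, act)
    return (nn.functional.mse_loss(pred_next_obs, next_obs)
            + nn.functional.mse_loss(pred_rew, rew))
```

In the independent, centralised, and mixed paradigms, what changes is which observations and actions are fed into the encoder; the sketch above shows only the shared reconstruction idea.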



Citations
Journal ArticleDOI

Deep Reinforcement Learning for Multi-Agent Interaction

TL;DR: A broad overview of the ongoing research portfolio of the Autonomous Agents Research Group is provided and open problems for future directions are discussed.

Generating Diverse Teammates to Train Robust Agents For Ad Hoc Teamwork

TL;DR: In this paper, an automated teammate policy generation method is presented that optimises the best-response diversity (BRDiv) metric, which measures diversity based on the compatibility of teammate policies in terms of returns.
Journal ArticleDOI

Learning Embeddings for Sequential Tasks Using Population of Agents

TL;DR: In this paper, the authors leverage the idea that two tasks are similar to each other if observing an agent's performance on one task reduces our uncertainty about its performance on the other, and use a diverse population of agents to measure similarity between tasks in sequential decision-making settings.

Generating Teammates for Training Robust Ad Hoc Teamwork Agents via Best-Response Diversity

TL;DR: In this article, an automated teammate policy generation method is presented that optimises the best-response diversity (BRDiv) metric, which measures diversity based on the compatibility of teammate policies in terms of returns.
References
Proceedings Article

Adam: A Method for Stochastic Optimization

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
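As a reminder of the update rule this paper introduces, here is a minimal NumPy sketch of a single Adam step with the standard default hyperparameters; the function name and array shapes are illustrative, not taken from the paper's code.

```python
# One Adam update step using bias-corrected moment estimates (NumPy sketch).
import numpy as np

def adam_step(theta, grad, m, v, t, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    m = beta1 * m + (1 - beta1) * grad        # first moment: running mean of gradients
    v = beta2 * v + (1 - beta2) * grad ** 2   # second moment: running uncentred variance
    m_hat = m / (1 - beta1 ** t)              # bias correction for step t (t >= 1)
    v_hat = v / (1 - beta2 ** t)
    theta = theta - lr * m_hat / (np.sqrt(v_hat) + eps)
    return theta, m, v
```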
Proceedings Article

Attention is All you Need

TL;DR: This paper proposes a simple network architecture based solely on an attention mechanism, dispensing with recurrence and convolutions entirely, and achieves state-of-the-art performance on English-to-French translation.
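The core operation of that architecture is scaled dot-product attention, sketched below in NumPy; the function name and shapes are illustrative, and multi-head projections and masking are omitted.

```python
# Scaled dot-product attention: Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V (NumPy sketch).
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    d_k = Q.shape[-1]
    scores = Q @ K.swapaxes(-2, -1) / np.sqrt(d_k)      # similarity of queries and keys
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)       # softmax over keys
    return weights @ V                                    # weighted sum of values
```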
Journal Article

Visualizing Data using t-SNE

TL;DR: A new technique called t-SNE visualizes high-dimensional data by giving each datapoint a location in a two- or three-dimensional map; it is a variation of Stochastic Neighbor Embedding that is much easier to optimize and produces significantly better visualizations by reducing the tendency to crowd points together in the center of the map.
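A typical way to obtain such a 2-D map is sketched below, assuming scikit-learn's TSNE implementation and placeholder random data; the variable names are illustrative.

```python
# Projecting high-dimensional data to 2-D with t-SNE (scikit-learn assumed).
import numpy as np
from sklearn.manifold import TSNE

X = np.random.rand(200, 50)                                # placeholder high-dimensional data
X_2d = TSNE(n_components=2, perplexity=30).fit_transform(X)
print(X_2d.shape)                                          # (200, 2): one 2-D location per datapoint
```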
Proceedings Article

Auto-Encoding Variational Bayes

TL;DR: This work introduces a stochastic variational inference and learning algorithm that scales to large datasets and, under some mild differentiability conditions, even works in the intractable case.
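Two ingredients of that estimator, the reparameterisation trick and the analytic KL term for a Gaussian posterior, can be sketched as follows; PyTorch is assumed and the function names are illustrative.

```python
# Reparameterisation trick and Gaussian KL term used in Auto-Encoding Variational Bayes (PyTorch sketch).
import torch

def reparameterise(mu, log_var):
    """Sample z ~ N(mu, sigma^2) as a differentiable function of (mu, log_var)."""
    std = torch.exp(0.5 * log_var)
    eps = torch.randn_like(std)
    return mu + eps * std

def kl_to_standard_normal(mu, log_var):
    """Analytic KL(q(z|x) || N(0, I)) term of the variational lower bound."""
    return -0.5 * torch.sum(1 + log_var - mu ** 2 - log_var.exp(), dim=-1)
```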
Proceedings ArticleDOI

Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation

TL;DR: In this paper, the encoder and decoder of the RNN Encoder-Decoder model are jointly trained to maximize the conditional probability of a target sequence given a source sequence.
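A minimal sketch of such a jointly trained encoder-decoder is given below, assuming PyTorch GRUs and illustrative module and vocabulary names; teacher forcing, padding, and the training loop are omitted.

```python
# Minimal GRU encoder-decoder in the spirit of an RNN Encoder-Decoder model (PyTorch sketch).
import torch
import torch.nn as nn

class EncoderDecoder(nn.Module):
    def __init__(self, src_vocab, tgt_vocab, hidden=128):
        super().__init__()
        self.src_emb = nn.Embedding(src_vocab, hidden)
        self.tgt_emb = nn.Embedding(tgt_vocab, hidden)
        self.encoder = nn.GRU(hidden, hidden, batch_first=True)
        self.decoder = nn.GRU(hidden, hidden, batch_first=True)
        self.out = nn.Linear(hidden, tgt_vocab)

    def forward(self, src, tgt):
        # Encode the source sequence into a fixed-length summary state.
        _, h = self.encoder(self.src_emb(src))
        # Decode the target sequence conditioned on that state; training both parts
        # jointly with cross-entropy maximises the conditional likelihood of the target.
        dec_out, _ = self.decoder(self.tgt_emb(tgt), h)
        return self.out(dec_out)
```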