Open Access · Proceedings Article
GLoMo: Unsupervised Learning of Transferable Relational Graphs
Zhilin Yang, Junbo Jake Zhao, Bhuwan Dhingra, Kaiming He, William W. Cohen, Ruslan Salakhutdinov, Yann LeCun
Advances in Neural Information Processing Systems, Vol. 31, pp. 8950–8961
TL;DR: This work explores the possibility of learning generic latent relational graphs that capture dependencies between pairs of data units from large-scale unlabeled data and transferring the graphs to downstream tasks, and shows that the learned graphs are generic enough to be transferred to different embeddings on which the graphs have not been trained.

Abstract: Modern deep transfer learning approaches have mainly focused on learning generic feature vectors from one task that are transferable to other tasks, such as word embeddings in language and pretrained convolutional features in vision. However, these approaches usually transfer unary features and largely ignore more structured graphical representations. This work explores the possibility of learning generic latent relational graphs that capture dependencies between pairs of data units (e.g., words or pixels) from large-scale unlabeled data and transferring the graphs to downstream tasks. Our proposed transfer learning framework improves performance on various tasks including question answering, natural language inference, sentiment analysis, and image classification. We also show that the learned graphs are generic enough to be transferred to different embeddings on which the graphs have not been trained (including GloVe embeddings, ELMo embeddings, and task-specific RNN hidden units), or embedding-free units such as image pixels.
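The core idea in the abstract, scoring every pair of units to form a latent relational graph and then applying that graph to mix features for a downstream task, can be illustrated with a minimal NumPy sketch. This is not the paper's exact parameterization: the key/query projections, dimensions, and row-wise softmax normalization here are illustrative assumptions standing in for the learned graph predictor.

```python
import numpy as np

def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

rng = np.random.default_rng(0)

# Toy input: a sequence of T data units (e.g. words), each a d-dim embedding.
T, d = 6, 8
embeddings = rng.normal(size=(T, d))

# Graph predictor (illustrative): two projections score every pair of units;
# a row-wise softmax turns the scores into a dense affinity graph G, where
# G[i, j] is the predicted dependency weight of unit j for unit i.
W_q = rng.normal(size=(d, d)) * 0.1
W_k = rng.normal(size=(d, d)) * 0.1
scores = (embeddings @ W_q) @ (embeddings @ W_k).T
G = softmax(scores, axis=-1)

# Transfer step: the same graph can be applied to any features defined over
# these units (here, for illustration, the embeddings themselves), mixing
# each unit's features with those of its predicted neighbours.
propagated = G @ embeddings
```

The point of the sketch is the decoupling the abstract describes: `G` depends only on pairwise structure among the units, so it could be multiplied against GloVe vectors, ELMo states, or task-specific RNN hidden units for the same sequence without retraining the graph predictor on those representations.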
Citations
Posted Content
ROMA: Multi-Agent Reinforcement Learning with Emergent Roles
TL;DR: Experiments show that the proposed role-oriented MARL framework (ROMA) can learn specialized, dynamic, and identifiable roles, which help the method push forward the state of the art on the StarCraft II micromanagement benchmark.
Proceedings Article
Learning Dynamic Belief Graphs to Generalize on Text-Based Games
Ashutosh Adhikari,Xingdi Yuan,Marc-Alexandre Côté,Mikulas Zelinka,Marc-Antoine Rondeau,Romain Laroche,Pascal Poupart,Jian Tang,Adam Trischler,William L. Hamilton +9 more
TL;DR: This work proposes a novel graph-aided transformer agent (GATA) that infers and updates latent belief graphs during planning to enable effective action selection by capturing the underlying game dynamics.
Journal ArticleDOI
Graph Interaction Networks for Relation Transfer in Human Activity Videos
TL;DR: This work proposes a graph interaction network (GIN) model for transferring relation knowledge across two graphs, centered on a "self-learned" weight matrix that serves as a higher-level representation of the input data.