Rotational Unit of Memory: A Novel Representation Unit for RNNs with Scalable Applications
TLDR
This work derives a phase-coded representation of the memory state, the Rotational Unit of Memory (RUM), that unifies the concepts of unitary learning and associative memory, and shows experimentally that RNNs built on RUMs solve basic sequential tasks such as memory copying and memory recall substantially better than LSTMs/GRUs.
Abstract
Stacking long short-term memory (LSTM) cells or gated recurrent units (GRUs) as part of a recurrent neural network (RNN) has become a standard approach to solving a number of tasks ranging from lan…
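The core of RUM is an orthogonal rotation operation built from two vectors: the rotation that takes one toward the other within their common plane while leaving the orthogonal complement fixed. The sketch below is an illustrative reconstruction of that operation, not the paper's reference implementation; the function name and the `eps` stabilizer are assumptions:

```python
import numpy as np

def rotation_operator(a, b, eps=1e-8):
    """Build the orthogonal matrix that rotates `a` toward `b` within
    the plane span{a, b}, acting as the identity on the complement.
    (Illustrative sketch of the Rotation(a, b) operation in RUM.)"""
    u = a / (np.linalg.norm(a) + eps)
    v = b - (u @ b) * u                      # component of b orthogonal to a
    v = v / (np.linalg.norm(v) + eps)
    cos_t = (a @ b) / (np.linalg.norm(a) * np.linalg.norm(b) + eps)
    sin_t = np.sqrt(max(0.0, 1.0 - cos_t ** 2))
    n = a.shape[0]
    # Identity outside the (u, v) plane; 2-D rotation by theta inside it.
    R = (np.eye(n) - np.outer(u, u) - np.outer(v, v)
         + cos_t * (np.outer(u, u) + np.outer(v, v))
         + sin_t * (np.outer(v, u) - np.outer(u, v)))
    return R
```

Because `R` is orthogonal, applying it to the hidden state preserves norms, which is the connection to unitary learning: the rotation neither amplifies nor shrinks gradients passing through it.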
Citations
Journal Article
Review: Deep Learning in Electron Microscopy
TL;DR: In this paper, a review of deep learning in electron microscopy is presented, with a focus on hardware and software needed to get started with deep learning and interface with electron microscopes.
Posted Content
A Divide-and-Conquer Approach to the Summarization of Long Documents
TL;DR: This work exploits the discourse structure of the document and uses sentence similarity to split long-document summarization into an ensemble of smaller, simpler summarization problems, reducing computational complexity and creating more training examples.
Journal Article
Deep reinforcement and transfer learning for abstractive text summarization: A review
TL;DR: Automatic Text Summarization (ATS) is an important area in Natural Language Processing (NLP) whose goal is to shorten a long text into a more compact version that conveys the most important points in a readable form.
References
Proceedings Article
Adam: A Method for Stochastic Optimization
Diederik P. Kingma, Jimmy Ba
TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
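The "adaptive estimates of lower-order moments" can be made concrete with a single update step. Hyperparameter names follow the paper's notation, but this standalone function and its signature are an illustrative sketch, not a reference implementation:

```python
import numpy as np

def adam_step(theta, grad, m, v, t, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update for parameters `theta` at timestep t (1-indexed)."""
    m = beta1 * m + (1 - beta1) * grad        # biased first-moment estimate
    v = beta2 * v + (1 - beta2) * grad ** 2   # biased second-moment estimate
    m_hat = m / (1 - beta1 ** t)              # bias correction
    v_hat = v / (1 - beta2 ** t)
    theta = theta - lr * m_hat / (np.sqrt(v_hat) + eps)
    return theta, m, v
```

The per-coordinate division by `sqrt(v_hat)` is what makes the effective step size adaptive: coordinates with consistently large gradients take proportionally smaller steps.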
Journal Article
Long short-term memory
TL;DR: A novel, efficient, gradient-based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1,000 discrete time steps by enforcing constant error flow through constant error carousels within special units.
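The constant-error-carousel idea can be illustrated with a minimal forward step: the cell state is updated only additively and gated, so error signals can flow across many steps. Note this sketch uses the modern gate layout with a forget gate, which the original 1997 formulation did not include; the weight packing and names are assumptions:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x, h, c, W, b):
    """One LSTM step. The cell state `c` is the constant error carousel:
    it changes only via the additive, gated update f*c + i*g.
    W stacks the four gate weight matrices; b stacks their biases."""
    z = W @ np.concatenate([x, h]) + b
    n = h.shape[0]
    i = sigmoid(z[:n])        # input gate
    f = sigmoid(z[n:2*n])     # forget gate
    o = sigmoid(z[2*n:3*n])   # output gate
    g = np.tanh(z[3*n:])      # candidate cell update
    c_new = f * c + i * g
    h_new = o * np.tanh(c_new)
    return h_new, c_new
```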
Journal Article
Dropout: a simple way to prevent neural networks from overfitting
TL;DR: It is shown that dropout improves the performance of neural networks on supervised learning tasks in vision, speech recognition, document classification and computational biology, obtaining state-of-the-art results on many benchmark data sets.
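A minimal sketch of the technique follows. It uses the "inverted" formulation common in modern code, which scales surviving activations at train time; the paper instead scales the weights at test time. Function and parameter names are illustrative:

```python
import numpy as np

def dropout(x, p=0.5, training=True, rng=None):
    """Inverted dropout: zero each activation with probability p during
    training and scale survivors by 1/(1-p) so the expected activation
    is unchanged. At inference the input passes through untouched."""
    if not training or p == 0.0:
        return x
    rng = np.random.default_rng() if rng is None else rng
    mask = rng.random(x.shape) >= p   # keep each unit with probability 1-p
    return x * mask / (1.0 - p)
```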
Proceedings Article
Neural Machine Translation by Jointly Learning to Align and Translate
TL;DR: It is conjectured that the use of a fixed-length vector is a bottleneck in improving the performance of the basic encoder-decoder architecture, and the authors propose to extend it by allowing a model to automatically (soft-)search for parts of a source sentence that are relevant to predicting a target word, without having to form these parts as a hard segment explicitly.
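The (soft-)search amounts to scoring each source annotation against the current decoder state and taking a softmax-weighted context vector. This sketch uses dot-product scoring for brevity, whereas the paper uses a small additive network; all names here are assumptions:

```python
import numpy as np

def soft_attention(query, keys, values):
    """Score each source annotation (row of `keys`) against the decoder
    state `query`, softmax the scores into an alignment distribution,
    and return the weighted sum of `values` as the context vector."""
    scores = keys @ query                 # (T,) one score per source position
    weights = np.exp(scores - scores.max())
    weights = weights / weights.sum()     # alignment distribution, sums to 1
    context = weights @ values            # (d,) soft-selected context
    return context, weights
```

The alignment weights are differentiable, so the model learns where to "look" in the source sentence end-to-end, without hard segmentation.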
Proceedings Article
Learning Phrase Representations using RNN Encoder--Decoder for Statistical Machine Translation
Kyunghyun Cho, Bart van Merriënboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, Yoshua Bengio
TL;DR: In this paper, the encoder and decoder of the RNN Encoder-Decoder model are jointly trained to maximize the conditional probability of a target sequence given a source sequence.