Continual Learning with Deep Generative Replay

Open AccessProceedings Article

Continual Learning with Deep Generative Replay

Hanul Shin, +3 more

- Vol. 30, pp 2990-2999

Chats0

TLDR

The Deep Generative Replay is proposed, a novel framework with a cooperative dual model architecture consisting of a deep generative model ("generator") and a task solving model ("solver"), with only these two models, training data for previous tasks can easily be sampled and interleaved with those for a new task.

Abstract:

Attempts to train a comprehensive artificial intelligence capable of solving multiple tasks have been impeded by a chronic problem called catastrophic forgetting. Although simply replaying all previous data alleviates the problem, it requires large memory and even worse, often infeasible in real world applications where the access to past data is limited. Inspired by the generative nature of the hippocampus as a short-term memory system in primate brain, we propose the Deep Generative Replay, a novel framework with a cooperative dual model architecture consisting of a deep generative model (“generator”) and a task solving model (“solver”). With only these two models, training data for previous tasks can easily be sampled and interleaved with those for a new task. We test our methods in several sequential learning settings involving image classification tasks.

Citations

PDF

Open Access

More filters

Posted Content

Learning without Forgetting

Zhizhong Li, +1 more

- 29 Jun 2016 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: This work proposes the Learning without Forgetting method, which uses only new task data to train the network while preserving the original capabilities, and performs favorably compared to commonly used feature extraction and fine-tuning adaption techniques.

...read moreread less

Journal ArticleDOI

A continual learning survey: Defying forgetting in classification tasks.

Matthias Delange, +7 more

- 05 Feb 2021 -

IEEE Transactions on Pattern Analysis an...

TL;DR: This work focuses on task incremental classification, where tasks arrive sequentially and are delineated by clear boundaries and study the influence of model capacity, weight decay and dropout regularization, and the order in which the tasks are presented, and qualitatively compare methods in terms of required memory, computation time and storage.

...read moreread less

Proceedings ArticleDOI

Large Scale Incremental Learning

Yue Wu, +6 more

TL;DR: This work found that the last fully connected layer has a strong bias towards the new classes, and this bias can be corrected by a linear model, and with two bias parameters, this method performs remarkably well on two large datasets.

...read moreread less

Posted Content

Efficient Lifelong Learning with A-GEM

Arslan Chaudhry, +3 more

- 02 Dec 2018 -

arXiv: Learning

TL;DR: An improved version of GEM is proposed, dubbed Averaged GEM (A-GEM), which enjoys the same or even better performance as GEM, while being almost as computationally and memory efficient as EWC and other regularization-based methods.

...read moreread less

Posted Content

Experience Replay for Continual Learning

David Rolnick, +4 more

- 28 Nov 2018 -

arXiv: Learning

TL;DR: This work shows that using experience replay buffers for all past events with a mixture of on- and off-policy learning can still learn new tasks quickly yet can substantially reduce catastrophic forgetting in both Atari and DMLab domains, even matching the performance of methods that require task identities.

...read moreread less