Overcoming catastrophic forgetting in neural networks
Citations
7,027 citations
2,393 citations
2,095 citations
Cites background or methods or result from "Overcoming catastrophic forgetting ..."
...Additional experiments showed that this approach performs significantly worse than EWC (Kirkpatrick et al., 2017) on different permutation tasks (see Section 3....
[...]
...According to the reported results, EWC (Kirkpatrick et al., 2017) and LwF (Li & Hoiem, 2016) perform significantly worse in NC and NIC than in NI....
[...]
...The approaches considered were supervised: a standard MLP trained online as a baseline, the EWC (Kirkpatrick et al., 2017), the PathNet (Fernando et al....
[...]
...studies evidence the contribution of synaptic rewiring by structural plasticity on memory formation in adults (Knoblauch, 2017; Knoblauch, Körner, Krner, & Sommer, 2014), with a major role of structural plasticity in increasing information storage efficiency in terms of space and energy demands. While the hippocampus is normally associated with the immediate recall of recent memories (i.e., short-term memories), the prefrontal cortex (PFC) is usually associated with the preservation and recall of remote memories (i.e., long-term memories; Bontempi, Laurent-Demir, Destrade, and Jaffard (1999))....
[...]
...This method requires considerable more memory than other regularization approaches such as EWC (Kirkpatrick et al., 2017) at training time (with an episodicmemory Mk for each task k) but can work much better in the single pass setting....
[...]
1,864 citations
1,515 citations
References
73,978 citations
[...]
46,982 citations
23,074 citations
"Overcoming catastrophic forgetting ..." refers background in this paper
...These allow action values for each task to be learned off-policy, using an experience replay mechanism (25)....
[...]
...We asked whether Deep Q Networks (DQNs)—an architecture that has achieved impressive successes in such challenging RL settings (25)—could be harnessed with EWC to successfully support continual learning in the classic Atari 2600 task set (26)....
[...]
...As such, the overall system has memory on two timescales: Over short timescales, the experience replay mechanism allows learning in the DQN to be based on the interleaved and uncorrelated experiences (25)....
[...]
10,943 citations
4,301 citations