Author

Kieran Milan

Bio: Kieran Milan is a researcher at Google who has contributed to research on computer science and artificial neural networks. The author has an h-index of 4 and has co-authored 4 publications receiving 3,520 citations.

Papers

Posted Content

Overcoming catastrophic forgetting in neural networks

James Kirkpatrick, Razvan Pascanu, Neil C. Rabinowitz, Joel Veness, Guillaume Desjardins, Andrei Rusu, Kieran Milan, John Quan, Tiago Ramalho, Agnieszka Grabska-Barwinska, Demis Hassabis, Claudia Clopath¹, Dharshan Kumaran, Raia Hadsell

Imperial College London¹

02 Dec 2016-arXiv: Learning

TL;DR: It is shown that it is possible to overcome this limitation of connectionist models and train networks that can maintain expertise on tasks they have not experienced for a long time, by selectively slowing down learning on the weights important for previous tasks.

Abstract: The ability to learn tasks in a sequential fashion is crucial to the development of artificial intelligence. Neural networks are not, in general, capable of this and it has been widely thought that catastrophic forgetting is an inevitable feature of connectionist models. We show that it is possible to overcome this limitation and train networks that can maintain expertise on tasks which they have not experienced for a long time. Our approach remembers old tasks by selectively slowing down learning on the weights important for those tasks. We demonstrate our approach is scalable and effective by solving a set of classification tasks based on the MNIST hand written digit dataset and by learning several Atari 2600 games sequentially.
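The idea in the abstract, slowing learning on weights that matter for earlier tasks, is commonly written as a quadratic penalty added to the new task's loss. The following is a minimal sketch under that assumption, not the authors' implementation: the function names, the per-weight `importance` values (the published method uses the diagonal of the Fisher information) and the strength `lam` are illustrative.

```python
from typing import Sequence


def ewc_penalty(
    theta: Sequence[float],
    theta_star: Sequence[float],
    importance: Sequence[float],
    lam: float,
) -> float:
    """Return (lam / 2) * sum_i importance_i * (theta_i - theta_star_i) ** 2.

    theta       current weights while training on the new task
    theta_star  weights saved after training on the old task
    importance  per-weight importance for the old task (assumed given)
    lam         how strongly the old task is protected (hypothetical knob)
    """
    if not (len(theta) == len(theta_star) == len(importance)):
        raise ValueError("all weight vectors must have the same length")
    weighted_squares = (
        f * (t - s) ** 2 for t, s, f in zip(theta, theta_star, importance)
    )
    return 0.5 * lam * sum(weighted_squares)


def ewc_loss(
    new_task_loss: float,
    theta: Sequence[float],
    theta_star: Sequence[float],
    importance: Sequence[float],
    lam: float,
) -> float:
    """Total objective: new-task loss plus the penalty anchoring old weights."""
    return new_task_loss + ewc_penalty(theta, theta_star, importance, lam)
```

A weight with high importance is pulled strongly back toward its old value, while a weight with importance near zero is free to change, which is the "selective" part of the slowdown.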

3,026 citations

Journal Article

Overcoming catastrophic forgetting in neural networks

Imperial College London¹

28 Mar 2017-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: In this paper, the authors show that it is possible to train networks that can maintain expertise on tasks that they have not experienced for a long time by selectively slowing down learning on the weights important for those tasks.

Abstract: The ability to learn tasks in a sequential fashion is crucial to the development of artificial intelligence. Until now neural networks have not been capable of this and it has been widely thought that catastrophic forgetting is an inevitable feature of connectionist models. We show that it is possible to overcome this limitation and train networks that can maintain expertise on tasks that they have not experienced for a long time. Our approach remembers old tasks by selectively slowing down learning on the weights important for those tasks. We demonstrate our approach is scalable and effective by solving a set of classification tasks based on a hand-written digit dataset and by learning several Atari 2600 games sequentially.

2,917 citations

Proceedings Article

The Forget-me-not Process

Kieran Milan¹, Joel Veness¹, James E. Kirkpatrick, Michael Bowling², Anna Koop², Demis Hassabis¹

Google¹, University of Alberta²

01 Jan 2016

TL;DR: The Forget-me-not Process is introduced, an efficient, non-parametric meta-algorithm for online probabilistic sequence prediction for piecewise stationary, repeating sources by taking a Bayesian approach to partition a stream of data into postulated task-specific segments, while simultaneously building a model for each task.

Abstract: We introduce the Forget-me-not Process, an efficient, non-parametric meta-algorithm for online probabilistic sequence prediction for piecewise stationary, repeating sources. Our method works by taking a Bayesian approach to partition a stream of data into postulated task-specific segments, while simultaneously building a model for each task. We provide regret guarantees with respect to piecewise stationary data sources under the logarithmic loss, and validate the method empirically across a range of sequence prediction and task identification problems.

25 citations

Journal Article

Reply to Huszár: The elastic weight consolidation penalty is empirically valid.

James Kirkpatrick¹, Razvan Pascanu¹, Neil C. Rabinowitz¹, Joel Veness¹, Guillaume Desjardins¹, Andrei Rusu¹, Kieran Milan¹, John Quan¹, Tiago Ramalho¹, Agnieszka Grabska-Barwinska¹, Demis Hassabis¹, Claudia Clopath², Dharshan Kumaran², Raia Hadsell¹

Google¹, Imperial College London²

13 Mar 2018-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: The recent work on elastic weight consolidation shows that forgetting in neural networks can be alleviated by using a quadratic penalty whose derivation was inspired by Bayesian evidence accumulation, and Dr. Huszar provides an alternative form by following the standard work on expectation propagation using the Laplace approximation.

...read moreread less

Abstract: In our recent work on elastic weight consolidation (EWC) (1) we show that forgetting in neural networks can be alleviated by using a quadratic penalty whose derivation was inspired by Bayesian evidence accumulation. In his letter (2), Dr. Huszar provides an alternative form for this penalty by following the standard work on expectation propagation using the Laplace approximation (3). He correctly argues that in cases when more than two tasks are undertaken the two forms of the penalty are different. Dr. Huszar also shows that for a toy linear regression problem his expression appears to be better. We would like to thank Dr. Huszar for pointing out … Correspondence should be addressed to kirkpatrick@google.com.

13 citations

Journal Article

Faster sorting algorithms discovered using deep reinforcement learning

Daniel J. Mankowitz, Andrea Michi, A.P. Zhernov, Marco Selvi, Cosmin Paduraru, Edouard Leurent, Shariq Iqbal, Jean-Baptiste Lespiau, Stephen Gaffney, Jackson Broshear, Kieran Milan, Robert Tung, Taylan Cemgil, Mohammadamin Barekatain, Yujia Li, Amol Mandhane, Thomas Hubert, Julian Schrittwieser, Demis Hassabis, Pushmeet Kohli, Martin Riedmiller, Oriol Vinyals, David Silver

01 Jun 2023-Nature

TL;DR: In this paper, the authors formulate the task of finding a better sorting algorithm as a single-player game and train a new deep reinforcement learning agent, AlphaDev, to play this game.

Abstract: Fundamental algorithms such as sorting or hashing are used trillions of times on any given day. As demand for computation grows, it has become critical for these algorithms to be as performant as possible. Whereas remarkable progress has been achieved in the past, making further improvements on the efficiency of these routines has proved challenging for both human scientists and computational approaches. Here we show how artificial intelligence can go beyond the current state of the art by discovering hitherto unknown routines. To realize this, we formulated the task of finding a better sorting routine as a single-player game. We then trained a new deep reinforcement learning agent, AlphaDev, to play this game. AlphaDev discovered small sorting algorithms from scratch that outperformed previously known human benchmarks. These algorithms have been integrated into the LLVM standard C++ sort library. This change to this part of the sort library represents the replacement of a component with an algorithm that has been automatically discovered using reinforcement learning. We also present results in extra domains, showcasing the generality of the approach.

3 citations

Cited by

Proceedings Article

Model-agnostic meta-learning for fast adaptation of deep networks

Chelsea Finn¹, Pieter Abbeel¹, Sergey Levine¹

University of California, Berkeley¹

06 Aug 2017

TL;DR: An algorithm for meta-learning that is model-agnostic, in the sense that it is compatible with any model trained with gradient descent and applicable to a variety of different learning problems, including classification, regression, and reinforcement learning is proposed.

Abstract: We propose an algorithm for meta-learning that is model-agnostic, in the sense that it is compatible with any model trained with gradient descent and applicable to a variety of different learning problems, including classification, regression, and reinforcement learning. The goal of meta-learning is to train a model on a variety of learning tasks, such that it can solve new learning tasks using only a small number of training samples. In our approach, the parameters of the model are explicitly trained such that a small number of gradient steps with a small amount of training data from a new task will produce good generalization performance on that task. In effect, our method trains the model to be easy to fine-tune. We demonstrate that this approach leads to state-of-the-art performance on two few-shot image classification benchmarks, produces good results on few-shot regression, and accelerates fine-tuning for policy gradient reinforcement learning with neural network policies.

7,027 citations

Proceedings Article

iCaRL: Incremental Classifier and Representation Learning

Sylvestre-Alvise Rebuffi¹, Alexander Kolesnikov², Georg Sperl², Christoph H. Lampert²

University of Oxford¹, Institute of Science and Technology Austria²

01 Jul 2017

TL;DR: In this paper, the authors introduce a new training strategy, iCaRL, that allows learning in such a class-incremental way: only the training data for a small number of classes has to be present at the same time and new classes can be added progressively.

Abstract: A major open problem on the road to artificial intelligence is the development of incrementally learning systems that learn about more and more concepts over time from a stream of data. In this work, we introduce a new training strategy, iCaRL, that allows learning in such a class-incremental way: only the training data for a small number of classes has to be present at the same time and new classes can be added progressively. iCaRL learns strong classifiers and a data representation simultaneously. This distinguishes it from earlier works that were fundamentally limited to fixed data representations and therefore incompatible with deep learning architectures. We show by experiments on CIFAR-100 and ImageNet ILSVRC 2012 data that iCaRL can learn many classes incrementally over a long period of time where other strategies quickly fail.

2,393 citations

Journal Article

Continual lifelong learning with neural networks: A review.

German Ignacio Parisi¹, Ronald Kemker², Jose L. Part³, Christopher Kanan², Stefan Wermter¹

University of Hamburg¹, Rochester Institute of Technology², Heriot-Watt University³

01 May 2019-Neural Networks

TL;DR: This review critically summarizes the main challenges linked to lifelong learning for artificial learning systems and compares existing neural network approaches that alleviate, to different extents, catastrophic forgetting.

2,095 citations
