Author

Arnaud Doucet

Other affiliations: University of British Columbia, École nationale supérieure de l'électronique et de ses applications, Microsoft ...

Bio: Arnaud Doucet is an academic researcher at the University of Oxford. He has contributed to research on particle filters and Markov chain Monte Carlo. He has an h-index of 75 and has co-authored 386 publications that have received 43,388 citations. His previous affiliations include the University of British Columbia and École nationale supérieure de l'électronique et de ses applications.

Papers published on a yearly basis (chart covering 1996 to 2023)

Papers

Journal Article•DOI•

Maximum Likelihood Learning of Energy-Based Models for Simulation-Based Inference

Pierre Glaser, Michael Arbel, Arnaud Doucet, Arthur Gretton

arXiv.org

TL;DR: Introduces two synthetic likelihood methods for Simulation-Based Inference, for either amortized or targeted inference from experimental observations when a high-fidelity simulator is available. Both combine a flexible Energy-Based Model with the minimization of a KL loss.

Abstract: We introduce two synthetic likelihood methods for Simulation-Based Inference (SBI), to conduct either amortized or targeted inference from experimental observations when a high-fidelity simulator is available. Both methods learn a conditional energy-based model (EBM) of the likelihood using synthetic data generated by the simulator, conditioned on parameters drawn from a proposal distribution. The learned likelihood can then be combined with any prior to obtain a posterior estimate, from which samples can be drawn using MCMC. Our methods uniquely combine a flexible Energy-Based Model and the minimization of a KL loss: this is in contrast to other synthetic likelihood methods, which either rely on normalizing flows, or minimize score-based objectives; choices that come with known pitfalls. Our first method, Amortized Unnormalized Neural Likelihood Estimation (AUNLE), introduces a tilting trick during training that makes it possible to significantly lower the computational cost of inference by enabling the use of efficient MCMC techniques. Our second method, Sequential UNLE (SUNLE), employs a robust doubly intractable approach in order to re-use simulation data and improve posterior accuracy on a specific dataset. We demonstrate the properties of both methods on a range of synthetic datasets, and apply them to a neuroscience model of the pyloric network in the crab Cancer borealis, matching the performance of other synthetic likelihood methods at a fraction of the simulation budget.

1 citations

Proceedings Article•DOI•

Fixed-lag sequential Monte Carlo data association

Mark Briers¹, Arnaud Doucet², Simon Maskell³, Paul R. Horridge³•Institutions (3)

University of Cambridge¹, University of British Columbia², Qinetiq³

05 May 2006

TL;DR: This paper introduces a novel application of a recent innovation in the SMC literature that uses multiple scans of data to improve the stochastic approximation (and so the data association ability) of a multiple target Sequential Monte Carlo based tracking system.

Abstract: The use of multiple scans of data to improve target tracking performance is widespread in the tracking literature. In this paper, we introduce a novel application of a recent innovation in the SMC literature that uses multiple scans of data to improve the stochastic approximation (and so the data association ability) of a multiple target Sequential Monte Carlo based tracking system. Such an improvement is achieved by resimulating sampled variates over a fixed-lag time window by artificially extending the space of the target distribution. In doing so, the stochastic approximation is improved and so the data association ambiguity is more readily resolved.

1 citations

Bayesian models selection approaches to model selection

A Andrieu, Arnaud Doucet, WJ Fitzgerald, J-M Perez

22 Feb 2001

1 citations

DOI•

Monte Carlo and Quasi-Monte Carlo Methods 2008

Christophe Andrieu, Arnaud Doucet, Roman Holenstein

01 Jan 2009

1 citations

Posted Content•

Monte Carlo Variational Auto-Encoders

Achille Thin¹, Nikita Kotelevskii², Arnaud Doucet³, Alain Durmus⁴, Eric Moulines¹, Maxim Panov²•Institutions (4)

École Polytechnique¹, Skolkovo Institute of Science and Technology², University of Oxford³, École Normale Supérieure⁴

30 Jun 2021-arXiv: Machine Learning

TL;DR: This paper makes Annealed Importance Sampling (AIS) and its Sequential Importance Sampling (SIS) extensions usable for training VAEs, and demonstrates the resulting Monte Carlo VAEs on a variety of applications.

Abstract: Variational auto-encoders (VAE) are popular deep latent variable models which are trained by maximizing an Evidence Lower Bound (ELBO). To obtain tighter ELBO and hence better variational approximations, it has been proposed to use importance sampling to get a lower variance estimate of the evidence. However, importance sampling is known to perform poorly in high dimensions. While it has been suggested many times in the literature to use more sophisticated algorithms such as Annealed Importance Sampling (AIS) and its Sequential Importance Sampling (SIS) extensions, the potential benefits brought by these advanced techniques have never been realized for VAE: the AIS estimate cannot be easily differentiated, while SIS requires the specification of carefully chosen backward Markov kernels. In this paper, we address both issues and demonstrate the performance of the resulting Monte Carlo VAEs on a variety of applications.

1 citations

Cited by

Journal Article•DOI•

A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking

M.S. Arulampalam¹, Simon Maskell², Neil Gordon², T. Clapp•Institutions (2)

Defence Science and Technology Organization¹, University of Cambridge²

01 Feb 2002-IEEE Transactions on Signal Processing

TL;DR: Reviews both optimal and suboptimal Bayesian algorithms for nonlinear/non-Gaussian tracking problems, with a focus on particle filters.

Abstract: Increasingly, for many application areas, it is becoming important to include elements of nonlinearity and non-Gaussianity in order to model accurately the underlying dynamics of a physical system. Moreover, it is typically crucial to process data on-line as it arrives, both from the point of view of storage costs as well as for rapid adaptation to changing signal characteristics. In this paper, we review both optimal and suboptimal Bayesian algorithms for nonlinear/non-Gaussian tracking problems, with a focus on particle filters. Particle filters are sequential Monte Carlo methods based on point mass (or "particle") representations of probability densities, which can be applied to any state-space model and which generalize the traditional Kalman filtering methods. Several variants of the particle filter such as SIR, ASIR, and RPF are introduced within a generic framework of the sequential importance sampling (SIS) algorithm. These are discussed and compared with the standard EKF through an illustrative example.

11,409 citations
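
The abstract above describes particle filters as point-mass representations of probability densities, with SIR as one variant of sequential importance sampling. As a rough illustration only, here is a minimal sketch of a bootstrap (SIR-style) filter. The 1-D random-walk model, the Gaussian noise levels, the N(0, 5²) initial prior, and the function name `bootstrap_particle_filter` are all assumptions made for this example and are not taken from the paper.

```python
import math
import random


def bootstrap_particle_filter(observations, n_particles=500,
                              process_std=1.0, obs_std=1.0, seed=0):
    """Return filtering-mean estimates for an assumed toy model.

    Hypothetical model (illustration only, not from the paper):
        x_t = x_{t-1} + N(0, process_std^2)
        y_t = x_t     + N(0, obs_std^2)
    """
    rng = random.Random(seed)
    # Initial particles from a broad prior N(0, 5^2) (an assumption).
    particles = [rng.gauss(0.0, 5.0) for _ in range(n_particles)]
    means = []
    for y in observations:
        # 1. Propagate: sample each particle from the transition model.
        particles = [x + rng.gauss(0.0, process_std) for x in particles]
        # 2. Weight: Gaussian observation log-likelihood, up to a constant.
        log_w = [-0.5 * ((y - x) / obs_std) ** 2 for x in particles]
        max_lw = max(log_w)  # subtract the max for numerical stability
        weights = [math.exp(lw - max_lw) for lw in log_w]
        total = sum(weights)
        weights = [w / total for w in weights]
        # 3. Estimate: the weighted mean approximates E[x_t | y_1..t].
        means.append(sum(w * x for w, x in zip(weights, particles)))
        # 4. Resample (multinomial), which resets the weights to uniform.
        particles = rng.choices(particles, weights=weights, k=n_particles)
    return means
```

Calling `bootstrap_particle_filter([3.0] * 30)` should give estimates that settle near 3.0. Multinomial resampling at every step is the simplest choice; practical implementations often resample only when the effective sample size drops, which this sketch does not attempt.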

Pattern Recognition and Machine Learning

Christopher M. Bishop¹•Institutions (1)

Microsoft¹

01 Jan 2006

TL;DR: A textbook covering probability distributions, linear models for regression and classification, and further machine learning topics through to combining models.

Abstract: Probability Distributions.- Linear Models for Regression.- Linear Models for Classification.- Neural Networks.- Kernel Methods.- Sparse Kernel Machines.- Graphical Models.- Mixture Models and EM.- Approximate Inference.- Sampling Methods.- Continuous Latent Variables.- Sequential Data.- Combining Models.

10,141 citations

Book•

Machine Learning : A Probabilistic Perspective

Kevin P. Murphy

24 Aug 2012

TL;DR: This textbook offers a comprehensive and self-contained introduction to the field of machine learning, based on a unified, probabilistic approach, and is suitable for upper-level undergraduates with an introductory-level college math background and beginning graduate students.

Abstract: Today's Web-enabled deluge of electronic data calls for automated methods of data analysis. Machine learning provides these, developing methods that can automatically detect patterns in data and then use the uncovered patterns to predict future data. This textbook offers a comprehensive and self-contained introduction to the field of machine learning, based on a unified, probabilistic approach. The coverage combines breadth and depth, offering necessary background material on such topics as probability, optimization, and linear algebra as well as discussion of recent developments in the field, including conditional random fields, L1 regularization, and deep learning. The book is written in an informal, accessible style, complete with pseudo-code for the most important algorithms. All topics are copiously illustrated with color images and worked examples drawn from such application domains as biology, text processing, computer vision, and robotics. Rather than providing a cookbook of different heuristic methods, the book stresses a principled model-based approach, often using the language of graphical models to specify models in a concise and intuitive way. Almost all the models described have been implemented in a MATLAB software package--PMTK (probabilistic modeling toolkit)--that is freely available online. The book is suitable for upper-level undergraduates with an introductory-level college math background and beginning graduate students.

8,059 citations

Book•

Learning Deep Architectures for AI

Yoshua Bengio¹•Institutions (1)

Université de Montréal¹

01 Jan 2009

TL;DR: Discusses the motivations for and principles of learning algorithms for deep architectures, in particular those that use unsupervised learning of single-layer models such as Restricted Boltzmann Machines as building blocks for deeper models such as Deep Belief Networks.

Abstract: Can machine learning deliver AI? Theoretical results, inspiration from the brain and cognition, as well as machine learning experiments suggest that in order to learn the kind of complicated functions that can represent high-level abstractions (e.g. in vision, language, and other AI-level tasks), one would need deep architectures. Deep architectures are composed of multiple levels of non-linear operations, such as in neural nets with many hidden layers, graphical models with many levels of latent variables, or in complicated propositional formulae re-using many sub-formulae. Each level of the architecture represents features at a different level of abstraction, defined as a composition of lower-level features. Searching the parameter space of deep architectures is a difficult task, but new algorithms have been discovered and a new sub-area has emerged in the machine learning community since 2006, following these discoveries. Learning algorithms such as those for Deep Belief Networks and other related unsupervised learning algorithms have recently been proposed to train deep architectures, yielding exciting results and beating the state-of-the-art in certain areas. Learning Deep Architectures for AI discusses the motivations for and principles of learning algorithms for deep architectures. By analyzing and comparing recent results with different learning algorithms for deep architectures, explanations for their success are proposed and discussed, highlighting challenges and suggesting avenues for future explorations in this area.

7,767 citations

Journal Article•DOI•

Mixed-effects modeling with crossed random effects for subjects and items

R.H. Baayen¹, Douglas J. Davidson², Douglas M. Bates³•Institutions (3)

University of Alberta¹, Max Planck Society², University of Wisconsin-Madison³

01 Nov 2008-Journal of Memory and Language

TL;DR: In this article, the authors provide an introduction to mixed-effects models for the analysis of repeated measurement data with subjects and items as crossed random effects, and a worked-out example of how to use recent software for mixed effects modeling is provided.

6,853 citations
