Showing papers by "Fredrik Lindsten published in 2013"

PDF

Open Access

Book•

Backward Simulation Methods for Monte Carlo Statistical Inference

[...]

Fredrik Lindsten¹, Thomas B. Schön¹•Institutions (1)

11 Aug 2013

TL;DR: This tutorial reviews and discusses several related backward-simulation-based methods for state inference as well as learning of static parameters, both using a frequentistic and a Bayesian approach.

...read moreread less

Abstract: Monte Carlo methods, in particular those based on Markov chains and on interacting particle systems, are by now tools that are routinely used in machine learning. These methods have had a profound impact on statistical inference in a wide range of application areas where probabilistic models are used. Moreover, there are many algorithms in machine learning which are based on the idea of processing the data sequentially, first in the forward direction and then in the backward direction. In this tutorial, we will review a branch of Monte Carlo methods based on the forward–backward idea, referred to as backward simulators. These methods are useful for learning and inference in probabilistic models containing latent stochastic processes. The theory and practice of backward simulation algorithms have undergone a significant development in recent years and the algorithms keep finding new applications. The foundation for these methods is sequential Monte Carlo (SMC). SMC-based backward simulators are capable of addressing smoothing problems in sequential latent variable models, such as general, nonlinear/non-Gaussian state-space models (SSMs). However, we will also clearly show that the underlying backward simulation idea is by no means restricted to SSMs. Furthermore, backward simulation plays an important role in recent developments of Markov chain Monte Carlo (MCMC) methods. Particle MCMC is a systematic way of using SMC within MCMC. In this framework, backward simulation gives us a way to significantly improve the performance of the samplers. We review and discuss several related backward-simulation-based methods for state inference as well as learning of static parameters, both using a frequentistic and a Bayesian approach.

...read moreread less

170 citations

Posted Content•

Bayesian Inference and Learning in Gaussian Process State-Space Models with Particle MCMC

[...]

Roger Frigola¹, Fredrik Lindsten², Thomas B. Schön², Carl Edward Rasmussen¹•Institutions (2)

University of Cambridge¹, Linköping University²

12 Jun 2013-arXiv: Machine Learning

TL;DR: This work presents a fully Bayesian approach to inference and learning in nonlinear nonparametric state-space models and places a Gaussian process prior over the state transition dynamics, resulting in a flexible model able to capture complex dynamical phenomena.

...read moreread less

Abstract: State-space models are successfully used in many areas of science, engineering and economics to model time series and dynamical systems. We present a fully Bayesian approach to inference \emph{and learning} (i.e. state estimation and system identification) in nonlinear nonparametric state-space models. We place a Gaussian process prior over the state transition dynamics, resulting in a flexible model able to capture complex dynamical phenomena. To enable efficient inference, we marginalize over the transition dynamics function and infer directly the joint smoothing distribution using specially tailored Particle Markov Chain Monte Carlo samplers. Once a sample from the smoothing distribution is computed, the state transition predictive distribution can be formulated analytically. Our approach preserves the full nonparametric expressivity of the model and can make use of sparse Gaussian processes to greatly reduce computational complexity.

...read moreread less

127 citations

Posted Content•

Recursive maximum likelihood identification of jump Markov nonlinear systems

[...]

Emre Ozkan¹, Fredrik Lindsten², Carsten Fritsche¹, Fredrik Gustafsson¹•Institutions (2)

Linköping University¹, University of Cambridge²

03 Dec 2013-arXiv: Computation

TL;DR: An online method for joint state and parameter estimation in jump Markov non-linear systems (JMNLS) using a Rao-Blackwellized particle filter (RBPF) where the discrete mode is marginalized out analytically and this results in an efficient implementation of the algorithm and reduces the estimation error variance.

...read moreread less

Abstract: In this contribution, we present an online method for joint state and parameter estimation in jump Markov non-linear systems (JMNLS). State inference is enabled via the use of particle filters which makes the method applicable to a wide range of non-linear models. To exploit the inherent structure of JMNLS, we design a Rao-Blackwellized particle filter (RBPF) where the discrete mode is marginalized out analytically. This results in an efficient implementation of the algorithm and reduces the estimation error variance. The proposed RBPF is then used to compute, recursively in time, smoothed estimates of complete data sufficient statistics. Together with the online expectation maximization algorithm, this enables recursive identification of unknown model parameters. The performance of the method is illustrated in simulations and on a localization problem in wireless networks using real data.

...read moreread less

61 citations

Journal Article•DOI•

Bayesian semiparametric Wiener system identification

[...]

Fredrik Lindsten¹, Thomas B. Schön¹, Michael I. Jordan²•Institutions (2)

Linköping University¹, University of California, Berkeley²

01 Jul 2013-Automatica

TL;DR: An inference algorithm based on an ecient particle Markov chain Monte Carlo method, referred to as particle Gibbs with ancestor sampling, is derived based on a mixed parametric/nonparametric model of a Wiener system.

...read moreread less

60 citations

Proceedings Article•DOI•

An efficient stochastic approximation EM algorithm using conditional particle filters

[...]

Fredrik Lindsten¹•Institutions (1)

Linköping University¹

26 May 2013

TL;DR: The proposed method combines the efficient conditional particle filter with ancestor sampling (CPF-AS) with the stochastic approximation EM (SAEM) algorithm, which results in a procedure which does not rely on asymptotics in the number of particles for convergence, meaning that the method is very computationally competitive.

...read moreread less

Abstract: I present a novel method for maximum likelihood parameter estimation in nonlinear/non-Gaussian state-space models. It is an expectation maximization (EM) like method, which uses sequential Monte Carlo (SMC) for the intermediate state inference problem. Contrary to existing SMC-based EM algorithms, however, it makes efficient use of the simulated particles through the use of particle Markov chain Monte Carlo (PMCMC) theory. More precisely, the proposed method combines the efficient conditional particle filter with ancestor sampling (CPF-AS) with the stochastic approximation EM (SAEM) algorithm. This results in a procedure which does not rely on asymptotics in the number of particles for convergence, meaning that the method is very computationally competitive. Indeed, the method is evaluated in a simulation study, using a small number of particles, with promising results.

...read moreread less

59 citations

Proceedings Article•

Bayesian Inference and Learning in Gaussian Process State-Space Models with Particle MCMC

[...]

Roger Frigola¹, Fredrik Lindsten², Thomas B. Schön², Carl Edward Rasmussen¹•Institutions (2)

University of Cambridge¹, Linköping University²

05 Dec 2013

TL;DR: In this paper, a fully Bayesian approach to inference and learning in nonlinear nonparametric state-space models is presented, where a Gaussian process prior is placed over the state transition dynamics, resulting in a flexible model able to capture complex dynamical phenomena.

...read moreread less

Abstract: State-space models are successfully used in many areas of science, engineering and economics to model time series and dynamical systems. We present a fully Bayesian approach to inference and learning (i.e. state estimation and system identification) in nonlinear nonparametric state-space models. We place a Gaussian process prior over the state transition dynamics, resulting in a flexible model able to capture complex dynamical phenomena. To enable efficient inference, we marginalize over the transition dynamics function and, instead, infer directly the joint smoothing distribution using specially tailored Particle Markov Chain Monte Carlo samplers. Once a sample from the smoothing distribution is computed, the state transition predictive distribution can be formulated analytically. Our approach preserves the full nonparametric expressivity of the model and can make use of sparse Gaussian processes to greatly reduce computational complexity.

...read moreread less

56 citations

Proceedings Article•DOI•

Adaptive stopping for fast particle smoothing

[...]

Ehsan Taghavi¹, Fredrik Lindsten², Lennart Svensson¹, Thomas B. Schön²•Institutions (2)

Chalmers University of Technology¹, Linköping University²

26 May 2013

TL;DR: This contribution develops a hybrid method, governed by an adaptive stopping rule, in order to exploit the benefits, but avoid the drawbacks, of RS-FFBS, and is shown in a simulation study to be considerably more computationally efficient than both FFBS and RS- FFBS.

...read moreread less

Abstract: Particle smoothing is useful for offline state inference and parameter learning in nonlinear/non-Gaussian state-space models. However, many particle smoothers, such as the popular forward filter/backward simulator (FFBS), are plagued by a quadratic computational complexity in the number of particles. One approach to tackle this issue is to use rejection-sampling-based FFBS (RS-FFBS), which asymptotically reaches linear complexity. In practice, however, the constants can be quite large and the actual gain in computational time limited. In this contribution, we develop a hybrid method, governed by an adaptive stopping rule, in order to exploit the benefits, but avoid the drawbacks, of RS-FFBS. The resulting particle smoother is shown in a simulation study to be considerably more computationally efficient than both FFBS and RS-FFBS.

...read moreread less

31 citations

Proceedings Article•DOI•

Rao-Blackwellized particle smoothers for mixed linear/nonlinear state-space models

[...]

Fredrik Lindsten¹, Pete Bunch², Simon J. Godsill², Thomas B. Schön¹•Institutions (2)

Linköping University¹, University of Cambridge²

26 May 2013

TL;DR: A Rao-Blackwellized particle smoother is derived for a class of conditionally linear Gaussian state-space (CLGSS) models, referred to as mixed linear/nonlinear models, by exploiting its tractable substructure.

...read moreread less

Abstract: We consider the smoothing problem for a class of conditionally linear Gaussian state-space (CLGSS) models, referred to as mixed linear/nonlinear models. In contrast to the better studied hierarchical CLGSS models, these allow for an intricate cross dependence between the linear and the nonlinear parts of the state vector. We derive a Rao-Blackwellized particle smoother (RBPS) for this model class by exploiting its tractable substructure. The smoother is of the forward filtering/backward simulation type. A key feature of the proposed method is that, unlike existing RBPS for this model class, the linear part of the state vector is marginalized out in both the forward direction and in the backward direction.

...read moreread less

28 citations

Monograph•DOI•

Particle filters and Markov chains for learning of dynamical systems

[...]

Fredrik Lindsten¹•Institutions (1)

Linköping University¹

08 Oct 2013

TL;DR: SMC and Markov chain Monte Carlo methods provide computational tools for systematic inference and learning in complex dynamical systems, such as nonlinear and non-Gaussian systems.

...read moreread less

Abstract: Sequential Monte Carlo (SMC) and Markov chain Monte Carlo (MCMC) methods provide computational tools for systematic inference and learning in complex dynamical systems, such as nonlinear and non-Ga ...

...read moreread less

27 citations

Proceedings Article•DOI•

Particle metropolis hastings using Langevin dynamics

[...]

Johan Dahlin¹, Fredrik Lindsten¹, Thomas B. Schön¹•Institutions (1)

Linköping University¹

26 May 2013

TL;DR: This paper proposes to use log-likelihood gradients, i.e. the score, in the construction of the proposal, akin to the Langevin Monte Carlo method, but adapted to the PMCMC framework.

...read moreread less

Abstract: Particle Markov Chain Monte Carlo (PMCMC) samplers allow for routine inference of parameters and states in challenging nonlinear problems. A common choice for the parameter proposal is a simple random walk sampler, which can scale poorly with the number of parameters. In this paper, we propose to use log-likelihood gradients, i.e. the score, in the construction of the proposal, akin to the Langevin Monte Carlo method, but adapted to the PMCMC framework. This can be thought of as a way to guide a random walk proposal by using drift terms that are proportional to the score function. The method is successfully applied to a stochastic volatility model and the drift term exhibits intuitive behaviour.

...read moreread less

24 citations

Journal Article•DOI•

Particle filter-based Gaussian process optimisation for parameter inference

[...]

Johan Dahlin¹, Fredrik Lindsten²•Institutions (2)

Linköping University¹, University of Cambridge²

04 Nov 2013-arXiv: Computation

TL;DR: A novel method for maximum-likelihood-based parameter inference in nonlinear and/or non-Gaussian state space models, providing an automatic trade-off between exploration and exploitation of the surrogate model.

...read moreread less

Abstract: We propose a novel method for maximum likelihood-based parameter inference in nonlinear and/or non-Gaussian state space models. The method is an iterative procedure with three steps. At each iteration a particle filter is used to estimate the value of the log-likelihood function at the current parameter iterate. Using these log-likelihood estimates, a surrogate objective function is created by utilizing a Gaussian process model. Finally, we use a heuristic procedure to obtain a revised parameter iterate, providing an automatic trade-off between exploration and exploitation of the surrogate model. The method is profiled on two state space models with good performance both considering accuracy and computational cost.

...read moreread less

Posted Content•

Second-order Particle MCMC for Bayesian Parameter Inference

[...]

Johan Dahlin¹, Fredrik Lindsten², Thomas B. Schön³•Institutions (3)

Linköping University¹, University of Cambridge², Uppsala University³

04 Nov 2013

TL;DR: An improved proposal distribution in the Particle Metropolis-Hastings (PMH) algorithm for Bayesian parameter inference in nonlinear state space models is proposed, which incorporates second-order information about the parameter posterior distribution, which can be extracted from the particle filter already used within the PMH algorithm.

...read moreread less

Abstract: Particle Metropolis-Hastings (PMH) allows for Bayesian parameter inference in nonlinear state space models by combining Markov chain Monte Carlo (MCMC) and particle filtering. The latter is used to estimate the intractable likelihood. In its original formulation, PMH makes use of a marginal MCMC proposal for the parameters, typically a Gaussian random walk. However, this can lead to a poor exploration of the parameter space and an inefficient use of the generated particles. We propose a number of alternative versions of PMH that incorporate gradient and Hessian information about the posterior into the proposal. This information is more or less obtained as a byproduct of the likelihood estimation. Indeed, we show how to estimate the required information using a fixed-lag particle smoother, with a computational cost growing linearly in the number of particles. We conclude that the proposed methods can: (i) decrease the length of the burn-in phase, (ii) increase the mixing of the Markov chain at the stationary phase, and (iii) make the proposal distribution scale invariant which simplifies tuning.

...read moreread less

Journal Article•DOI•

Particle Metropolis-Hastings using gradient and Hessian information

[...]

Johan Dahlin¹, Fredrik Lindsten², Thomas B. Schön³•Institutions (3)

Linköping University¹, University of Cambridge², Uppsala University³

04 Nov 2013-arXiv: Computation

TL;DR: In this article, the authors proposed a number of alternative versions of particle metropolis-hastings (PMH) that incorporate gradient and Hessian information about the posterior into the proposal, which is more or less obtained as a byproduct of the likelihood estimation.

...read moreread less

Posted Content•

Identification of Gaussian Process State-Space Models with Particle Stochastic Approximation EM

[...]

Roger Frigola¹, Fredrik Lindsten², Thomas B. Schön², Thomas B. Schön³, Carl Edward Rasmussen¹ - Show less +1 more•Institutions (3)

University of Cambridge¹, Linköping University², Uppsala University³

17 Dec 2013-arXiv: Machine Learning

TL;DR: In this article, the authors present an approach to maximum likelihood identification of the parameters in GP-SSMs, while retaining the full nonparametric description of the dynamics, based on a stochastic approximation version of the EM algorithm.

...read moreread less

Abstract: Gaussian process state-space models (GP-SSMs) are a very flexible family of models of nonlinear dynamical systems. They comprise a Bayesian nonparametric representation of the dynamics of the system and additional (hyper-)parameters governing the properties of this nonparametric representation. The Bayesian formalism enables systematic reasoning about the uncertainty in the system dynamics. We present an approach to maximum likelihood identification of the parameters in GP-SSMs, while retaining the full nonparametric description of the dynamics. The method is based on a stochastic approximation version of the EM algorithm that employs recent developments in particle Markov chain Monte Carlo for efficient identification.

...read moreread less

Posted Content•

Inference in Gaussian models with missing data using Equalisation Maximisation

[...]

Johan Dahlin, Fredrik Lindsten, Thomas B. Schön

21 Aug 2013-arXiv: Computation

TL;DR: It is shown that EqM can be viewed as an approximation of a proximal point algorithm and derived the method for the entire class of Gaussian models and exemplify its use for estimation of ARMA models with missing data.

...read moreread less

Abstract: Equalisation Maximisation (EqM) is an algorithm for estimating parameters in auto-regressive (AR) models where some fraction of the data is missing. It has previously been shown that the EqM algorithm is a competitive alternative to expectation maximisation, estimating models with equal predictive capability at a lower computational cost. The EqM algorithm has previously been motivated as a heuristic. In this paper, we instead show that EqM can be viewed as an approximation of a proximal point algorithm. We also derive the method for the entire class of Gaussian models and exemplify its use for estimation of ARMA models with missing data. The resulting method is evaluated in numerical simulations, resulting in similar results as for the AR processes.

...read moreread less