Showing papers on "Gaussian process published in 2005"

PDF

Open Access

Proceedings Article•

Sparse Gaussian Processes using Pseudo-inputs

[...]

Edward Snelson¹, Zoubin Ghahramani¹•Institutions (1)

05 Dec 2005

TL;DR: It is shown that this new Gaussian process (GP) regression model can match full GP performance with small M, i.e. very sparse solutions, and it significantly outperforms other approaches in this regime.

...read moreread less

Abstract: We present a new Gaussian process (GP) regression model whose co-variance is parameterized by the the locations of M pseudo-input points, which we learn by a gradient based optimization. We take M ≪ N, where N is the number of real data points, and hence obtain a sparse regression method which has O(M2N) training cost and O(M2) prediction cost per test case. We also find hyperparameters of the covariance function in the same joint optimization. The method can be viewed as a Bayesian regression model with particular input dependent noise. The method turns out to be closely related to several other sparse GP approaches, and we discuss the relation in detail. We finally demonstrate its performance on some large data sets, and make a direct comparison to other sparse GP methods. We show that our method can match full GP performance with small M, i.e. very sparse solutions, and it significantly outperforms other approaches in this regime.

...read moreread less

1,708 citations

Journal Article•

Probabilistic Non-linear Principal Component Analysis with Gaussian Process Latent Variable Models

[...]

Neil D. Lawrence¹•Institutions (1)

University of Sheffield¹

01 Dec 2005-Journal of Machine Learning Research

TL;DR: A novel probabilistic interpretation of principal component analysis (PCA) that is based on a Gaussian process latent variable model (GP-LVM), and related to popular spectral techniques such as kernel PCA and multidimensional scaling.

...read moreread less

Abstract: Summarising a high dimensional data set with a low dimensional embedding is a standard approach for exploring its structure In this paper we provide an overview of some existing techniques for discovering such embeddings We then introduce a novel probabilistic interpretation of principal component analysis (PCA) that we term dual probabilistic PCA (DPPCA) The DPPCA model has the additional advantage that the linear mappings from the embedded space can easily be non-linearised through Gaussian processes We refer to this model as a Gaussian process latent variable model (GP-LVM) Through analysis of the GP-LVM objective function, we relate the model to popular spectral techniques such as kernel PCA and multidimensional scaling We then review a practical algorithm for GP-LVMs in the context of large data sets and develop it to also handle discrete valued data and missing attributes We demonstrate the model on a range of real-world and artificially generated data sets

...read moreread less

1,065 citations

Journal Article•DOI•

An unsupervised approach based on the generalized Gaussian model to automatic change detection in multitemporal SAR images

[...]

Yakoub Bazi¹, Lorenzo Bruzzone¹, Farid Melgani¹•Institutions (1)

University of Trento¹

04 Apr 2005-IEEE Transactions on Geoscience and Remote Sensing

TL;DR: Experiments carried out on two sets of multitemporal images acquired by the European Remote Sensing 2 satellite SAR sensor confirm the effectiveness of the proposed unsupervised approach, which results in change-detection accuracies very similar to those that can be achieved by a manual supervised thresholding.

...read moreread less

Abstract: We present a novel automatic and unsupervised change-detection approach specifically oriented to the analysis of multitemporal single-channel single-polarization synthetic aperture radar (SAR) images. This approach is based on a closed-loop process made up of three main steps: (1) a novel preprocessing based on a controlled adaptive iterative filtering; (2) a comparison between multitemporal images carried out according to a standard log-ratio operator; and (3) a novel approach to the automatic analysis of the log-ratio image for generating the change-detection map. The first step aims at reducing the speckle noise in a controlled way in order to maximize the discrimination capability between changed and unchanged classes. In the second step, the two filtered multitemporal images are compared to generate a log-ratio image that contains explicit information on changed areas. The third step produces the change-detection map according to a thresholding procedure based on a reformulation of the Kittler-Illingworth (KI) threshold selection criterion. In particular, the modified KI criterion is derived under the generalized Gaussian assumption for modeling the distributions of changed and unchanged classes. This parametric model was chosen because it is capable of better fitting the conditional densities of classes in the log-ratio image. In order to control the filtering step and, accordingly, the effects of the filtering process on change-detection accuracy, we propose to identify automatically the optimal number of despeckling filter iterations [Step 1] by analyzing the behavior of the modified KI criterion. This results in a completely automatic and self-consistent change-detection approach that avoids the use of empirical methods for the selection of the best number of filtering iterations. Experiments carried out on two sets of multitemporal images (characterized by different levels of speckle noise) acquired by the European Remote Sensing 2 satellite SAR sensor confirm the effectiveness of the proposed unsupervised approach, which results in change-detection accuracies very similar to those that can be achieved by a manual supervised thresholding.

...read moreread less

688 citations

Journal Article•DOI•

Combining Field Data and Computer Simulations for Calibration and Prediction

[...]

Dave Higdon¹, Marc C. Kennedy, James C. Cavendish, John A. Cafeo, Robert D. Ryne - Show less +1 more•Institutions (1)

Los Alamos National Laboratory¹

01 Feb 2005

TL;DR: A statistical approach for characterizing uncertainty in predictions that are made with the aid of a computer simulation model that uses a Bayesian formulation and relies on Gaussian process models to model unknown functions of the model inputs.

...read moreread less

Abstract: We develop a statistical approach for characterizing uncertainty in predictions that are made with the aid of a computer simulation model. Typically, the computer simulation code models a physical system and requires a set of inputs---some known and specified, others unknown. A limited amount of field data from the true physical system is available to inform us about the unknown inputs and also to inform us about the uncertainty that is associated with a simulation-based prediction. The approach given here allows for the following: uncertainty regarding model inputs (i.e., calibration); accounting for uncertainty due to limitations on the number of simulations that can be carried out; discrepancy between the simulation code and the actual physical system; uncertainty in the observation process that yields the actual field data on the true physical system. The resulting analysis yields predictions and their associated uncertainties while accounting for multiple sources of uncertainty. We use a Bayesian formulation and rely on Gaussian process models to model unknown functions of the model inputs. The estimation is carried out using a Markov chain Monte Carlo method. This methodology is applied to two examples: a charged particle accelerator and a spot welding process.

...read moreread less

629 citations

Journal Article•DOI•

A Gaussian mixture model based classification scheme for myoelectric control of powered upper limb prostheses

[...]

Yonghong Huang¹, Kevin Englehart¹, B. Hudgins¹, Adrian D. C. Chan²•Institutions (2)

University of New Brunswick¹, Carleton University²

17 Oct 2005-IEEE Transactions on Biomedical Engineering

TL;DR: The GMM-based limb motion classification system demonstrates exceptional classification accuracy and results in a robust method of motion classification with low computational load.

...read moreread less

Abstract: This paper introduces and evaluates the use of Gaussian mixture models (GMMs) for multiple limb motion classification using continuous myoelectric signals. The focus of this work is to optimize the configuration of this classification scheme. To that end, a complete experimental evaluation of this system is conducted on a 12 subject database. The experiments examine the GMMs algorithmic issues including the model order selection and variance limiting, the segmentation of the data, and various feature sets including time-domain features and autoregressive features. The benefits of postprocessing the results using a majority vote rule are demonstrated. The performance of the GMM is compared to three commonly used classifiers: a linear discriminant analysis, a linear perceptron network, and a multilayer perceptron neural network. The GMM-based limb motion classification system demonstrates exceptional classification accuracy and results in a robust method of motion classification with low computational load.

...read moreread less

597 citations

Proceedings Article•DOI•

Near-optimal sensor placements in Gaussian processes

[...]

Carlos Guestrin¹, Andreas Krause¹, Ajit P. Singh¹•Institutions (1)

Carnegie Mellon University¹

07 Aug 2005

TL;DR: A mutual information criteria is proposed, and it is proved that finding the configuration that maximizes mutual information is NP-complete, and a polynomial-time approximation is described that is within (1 -- 1/e) of the optimum by exploiting the submodularity of the criterion.

...read moreread less

Abstract: When monitoring spatial phenomena, which are often modeled as Gaussian Processes (GPs), choosing sensor locations is a fundamental task. A common strategy is to place sensors at the points of highest entropy (variance) in the GP model. We propose a mutual information criteria, and show that it produces better placements. Furthermore, we prove that finding the configuration that maximizes mutual information is NP-complete. To address this issue, we describe a polynomial-time approximation that is within (1 -- 1/e) of the optimum by exploiting the submodularity of our criterion. This algorithm is extended to handle local structure in the GP, yielding significant speedups. We demonstrate the advantages of our approach on two real-world data sets.

...read moreread less

537 citations

Journal Article•DOI•

Central limit theorems for sequences of multiple stochastic integrals

[...]

David Nualart, Giovanni Peccati¹•Institutions (1)

University of Luxembourg¹

01 Jan 2005-Annals of Probability

TL;DR: In this article, the convergence in distribution to a standard normal law for a sequence of multiple stochastic integrals of a fixed order with variance converging to 1 is characterized.

...read moreread less

Abstract: We characterize the convergence in distribution to a standard normal law for a sequence of multiple stochastic integrals of a fixed order with variance converging to 1. Some applications are given, in particular to study the limiting behavior of quadratic functionals of Gaussian processes.

...read moreread less

518 citations

Proceedings Article•

Gaussian Process Dynamical Models

[...]

Jack M. Wang¹, Aaron Hertzmann¹, David M. Blei¹•Institutions (1)

University of Toronto¹

05 Dec 2005

TL;DR: This paper marginalize out the model parameters in closed-form, using Gaussian Process (GP) priors for both the dynamics and the observation mappings, resulting in a nonparametric model for dynamical systems that accounts for uncertainty in the model.

...read moreread less

Abstract: This paper introduces Gaussian Process Dynamical Models (GPDM) for nonlinear time series analysis. A GPDM comprises a low-dimensional latent space with associated dynamics, and a map from the latent space to an observation space. We marginalize out the model parameters in closed-form, using Gaussian Process (GP) priors for both the dynamics and the observation mappings. This results in a nonparametric model for dynamical systems that accounts for uncertainty in the model. We demonstrate the approach on human motion capture data in which each pose is 62-dimensional. Despite the use of small data sets, the GPDM learns an effective representation of the nonlinear dynamics in these spaces. Webpage: http://www.dgp.toronto.edu/~jmwang/gpdm/

...read moreread less

517 citations

Journal Article•

Gaussian Processes for Ordinal Regression

[...]

Wei Chu, Zoubin Ghahramani

01 Dec 2005-Journal of Machine Learning Research

TL;DR: A probabilistic kernel approach to ordinal regression based on Gaussian processes is presented, where a threshold model that generalizes the probit function is used as the likelihood function for ordinal variables.

...read moreread less

Abstract: We present a probabilistic kernel approach to ordinal regression based on Gaussian processes. A threshold model that generalizes the probit function is used as the likelihood function for ordinal variables. Two inference techniques, based on the Laplace approximation and the expectation propagation algorithm respectively, are derived for hyperparameter learning and model selection. We compare these two Gaussian process approaches with a previous ordinal regression method based on support vector machines on some benchmark and real-world data sets, including applications of ordinal regression to collaborative filtering and gene expression analysis. Experimental results on these data sets verify the usefulness of our approach.

...read moreread less

475 citations

Journal Article•DOI•

Space–Time Covariance Functions

[...]

Michael L. Stein

01 Mar 2005-Journal of the American Statistical Association

TL;DR: In this paper, the authors consider a number of properties of space-time covariance functions and how these relate to the spatial-temporal interactions of the process and obtain a parametric class of spectral densities whose corresponding space time covariance function are infinitely differentiable away from the origin and allow for essentially arbitrary and possibly different degrees of smoothness for the process in space and time.

...read moreread less

Abstract: This work considers a number of properties of space–time covariance functions and how these relate to the spatial-temporal interactions of the process. First, it examines how the smoothness away from the origin of a space–time covariance function affects, for example, temporal correlations of spatial differences. Models that are not smoother away from the origin than they are at the origin, such as separable models, have a kind of discontinuity to certain correlations that one might wish to avoid in some circumstances. Smoothness away from the origin of a covariance function is shown to follow from the corresponding spectral density having derivatives with finite moments. These results are used to obtain a parametric class of spectral densities whose corresponding space–time covariance functions are infinitely differentiable away from the origin and that allows for essentially arbitrary and possibly different degrees of smoothness for the process in space and time. Second, this work considers models that ...

...read moreread less

440 citations

Proceedings Article•DOI•

Learning Gaussian processes from multiple tasks

[...]

Kai Yu¹, Volker Tresp¹, Anton Schwaighofer²•Institutions (2)

Siemens¹, Analysis Group²

07 Aug 2005

TL;DR: This work considers the problem of multi-task learning, that is, learning multiple related functions, and presents a hierarchical Bayesian framework, that exploits the equivalence between parametric linear models and nonparametric Gaussian processes.

...read moreread less

Abstract: We consider the problem of multi-task learning, that is, learning multiple related functions. Our approach is based on a hierarchical Bayesian framework, that exploits the equivalence between parametric linear models and nonparametric Gaussian processes (GPs). The resulting models can be learned easily via an EM-algorithm. Empirical studies on multi-label text categorization suggest that the presented models allow accurate solutions of these multi-task problems.

...read moreread less

Proceedings Article•DOI•

Reinforcement learning with Gaussian processes

[...]

Yaakov Engel¹, Shie Mannor², Ron Meir³•Institutions (3)

University of Alberta¹, McGill University², Technion – Israel Institute of Technology³

07 Aug 2005

TL;DR: A SARSA based extension of GPTD is presented, termed GPSARSA, that allows the selection of actions and the gradual improvement of policies without requiring a world-model.

...read moreread less

Abstract: Gaussian Process Temporal Difference (GPTD) learning offers a Bayesian solution to the policy evaluation problem of reinforcement learning. In this paper we extend the GPTD framework by addressing two pressing issues, which were not adequately treated in the original GPTD paper (Engel et al., 2003). The first is the issue of stochasticity in the state transitions, and the second is concerned with action selection and policy improvement. We present a new generative model for the value function, deduced from its relation with the discounted return. We derive a corresponding on-line algorithm for learning the posterior moments of the value Gaussian process. We also present a SARSA based extension of GPTD, termed GPSARSA, that allows the selection of actions and the gradual improvement of policies without requiring a world-model.

...read moreread less

Journal Article•DOI•

Observation of a power-law memory kernel for fluctuations within a single protein molecule.

[...]

Wei Min¹, Guobin Luo¹, Binny J. Cherayil¹, Samuel C. Kou¹, X. Sunney Xie¹ - Show less +1 more•Institutions (1)

Harvard University¹

18 May 2005-Physical Review Letters

TL;DR: The fluctuation of the distance between a fluorescein-tyrosine pair within a single protein complex was directly monitored in real time by photoinduced electron transfer and found to be a stationary, time-reversible, and non-Markovian Gaussian process.

...read moreread less

Abstract: The fluctuation of the distance between a fluorescein-tyrosine pair within a single protein complex was directly monitored in real time by photoinduced electron transfer and found to be a stationary, time-reversible, and non-Markovian Gaussian process. Within the generalized Langevin equation formalism, we experimentally determine the memory kernel K(t), which is proportional to the autocorrelation function of the random fluctuating force. K(t) is a power-law decay, t(-0.51 +/- 0.07) in a broad range of time scales (10(-3)-10 s). Such a long-time memory effect could have implications for protein functions.

...read moreread less

Journal Article•DOI•

Bayesian Nonparametric Spatial Modeling With Dirichlet Process Mixing

[...]

Alan E. Gelfand¹, Athanasios Kottas¹, Steven N MacEachern¹•Institutions (1)

University of California, Santa Cruz¹

01 Jan 2005-Journal of the American Statistical Association

TL;DR: This work develops a spatial Dirichlet process model for spatial data and discusses its properties, and introduces mixing by convolving this process with a pure error process to produce a random spatial process that is neither Gaussian nor stationary.

...read moreread less

Abstract: Customary modeling for continuous point-referenced data assumes a Gaussian process that is often taken to be stationary. When such models are fitted within a Bayesian framework, the unknown parameters of the process are assumed to be random, so a random Gaussian process results. Here we propose a novel spatial Dirichlet process mixture model to produce a random spatial process that is neither Gaussian nor stationary. We first develop a spatial Dirichlet process model for spatial data and discuss its properties. Because of familiar limitations associated with direct use of Dirichlet process models, we introduce mixing by convolving this process with a pure error process. We then examine properties of models created through such Dirichlet process mixing. In the Bayesian framework, we implement posterior inference using Gibbs sampling. Spatial prediction raises interesting questions, but these can be handled. Finally, we illustrate the approach using simulated data, as well as a dataset involving precipitati...

...read moreread less

Proceedings Article•DOI•

A robust algorithm for point set registration using mixture of Gaussians

[...]

Bing Jian¹, Baba C. Vemuri¹•Institutions (1)

University of Florida¹

17 Oct 2005

TL;DR: A novel and robust approach to the point set registration problem in the presence of large amounts of noise and outliers is proposed, which derives a closed-form expression for the L/sub 2/distance between two Gaussian mixtures, which leads to a computationally efficient registration algorithm.

...read moreread less

Abstract: This paper proposes a novel and robust approach to the point set registration problem in the presence of large amounts of noise and outliers. Each of the point sets is represented by a mixture of Gaussians and the point set registration is treated as a problem of aligning the two mixtures. We derive a closed-form expression for the L/sub 2/distance between two Gaussian mixtures, which in turn leads to a computationally efficient registration algorithm. This new algorithm has an intuitive interpretation, is simple to implement and exhibits inherent statistical robustness. Experimental results indicate that our algorithm achieves very good performance in terms of both robustness and accuracy.

...read moreread less

Proceedings Article•DOI•

Preference learning with Gaussian processes

[...]

Wei Chu¹, Zoubin Ghahramani¹•Institutions (1)

University College London¹

07 Aug 2005

TL;DR: A probabilistic kernel approach to preference learning based on Gaussian processes and a new likelihood function is proposed to capture the preference relations in the Bayesian framework.

...read moreread less

Abstract: In this paper, we propose a probabilistic kernel approach to preference learning based on Gaussian processes. A new likelihood function is proposed to capture the preference relations in the Bayesian framework. The generalized formulation is also applicable to tackle many multiclass problems. The overall approach has the advantages of Bayesian methods for model selection and probabilistic prediction. Experimental results compared against the constraint classification approach on several benchmark datasets verify the usefulness of this algorithm.

...read moreread less

Journal Article•DOI•

Speech enhancement based on minimum mean-square error estimation and supergaussian priors

[...]

Rainer Martin¹•Institutions (1)

Ruhr University Bochum¹

15 Aug 2005-IEEE Transactions on Speech and Audio Processing

TL;DR: Compared to algorithms based on the Gaussian assumption, such as the Wiener filter or the Ephraim and Malah (1984) MMSE short-time spectral amplitude estimator, the estimators based on these supergaussian densities deliver an improved signal-to-noise ratio.

...read moreread less

Abstract: This paper presents a class of minimum mean-square error (MMSE) estimators for enhancing short-time spectral coefficients of a noisy speech signal. In contrast to most of the presently used methods, we do not assume that the spectral coefficients of the noise or of the clean speech signal obey a (complex) Gaussian probability density. We derive analytical solutions to the problem of estimating discrete Fourier transform (DFT) coefficients in the MMSE sense when the prior probability density function of the clean speech DFT coefficients can be modeled by a complex Laplace or by a complex bilateral Gamma density. The probability density function of the noise DFT coefficients may be modeled either by a complex Gaussian or by a complex Laplacian density. Compared to algorithms based on the Gaussian assumption, such as the Wiener filter or the Ephraim and Malah (1984) MMSE short-time spectral amplitude estimator, the estimators based on these supergaussian densities deliver an improved signal-to-noise ratio.

...read moreread less

Journal Article•DOI•

Managing Diversity in Regression Ensembles

[...]

Gavin Brown, Jeremy L. Wyatt, Peter Tiňo

01 Dec 2005-Journal of Machine Learning Research

TL;DR: It is demonstrated that these methods control the bias-variance-covariance trade-off systematically, and can be utilised with any estimator capable of minimising a quadratic error function, for example MLPs, or RBF networks.

...read moreread less

Abstract: Ensembles are a widely used and effective technique in machine learning---their success is commonly attributed to the degree of disagreement, or 'diversity', within the ensemble. For ensembles where the individual estimators output crisp class labels, this 'diversity' is not well understood and remains an open research issue. For ensembles of regression estimators, the diversity can be exactly formulated in terms of the covariance between individual estimator outputs, and the optimum level is expressed in terms of a bias-variance-covariance trade-off. Despite this, most approaches to learning ensembles use heuristics to encourage the right degree of diversity. In this work we show how to explicitly control diversity through the error function. The first contribution of this paper is to show that by taking the combination mechanism for the ensemble into account we can derive an error function for each individual that balances ensemble diversity with individual accuracy. We show the relationship between this error function and an existing algorithm called negative correlation learning, which uses a heuristic penalty term added to the mean squared error function. It is demonstrated that these methods control the bias-variance-covariance trade-off systematically, and can be utilised with any estimator capable of minimising a quadratic error function, for example MLPs, or RBF networks. As a second contribution, we derive a strict upper bound on the coefficient of the penalty term, which holds for any estimator that can be cast in a generalised linear regression framework, with mild assumptions on the basis functions. Finally we present the results of an empirical study, showing significant improvements over simple ensemble learning, and finding that this technique is competitive with a variety of methods, including boosting, bagging, mixtures of experts, and Gaussian processes, on a number of tasks.

...read moreread less

Journal Article•DOI•

Accelerating evolutionary algorithms with Gaussian process fitness function models

[...]

Dirk Büche¹, Nicol N. Schraudolph¹, Petros Koumoutsakos•Institutions (1)

ETH Zurich¹

01 May 2005

TL;DR: The Gaussian process model is described and proposed using it as an inexpensive fitness function surrogate and clearly outperforms other evolutionary strategies on standard test functions as well as on a real-world problem: the optimization of stationary gas turbine compressor profiles.

...read moreread less

Abstract: We present an overview of evolutionary algorithms that use empirical models of the fitness function to accelerate convergence, distinguishing between evolution control and the surrogate approach. We describe the Gaussian process model and propose using it as an inexpensive fitness function surrogate. Implementation issues such as efficient and numerically stable computation, exploration versus exploitation, local modeling, multiple objectives and constraints, and failed evaluations are addressed. Our resulting Gaussian process optimization procedure clearly outperforms other evolutionary strategies on standard test functions as well as on a real-world problem: the optimization of stationary gas turbine compressor profiles.

...read moreread less

Journal Article•DOI•

An online kernel change detection algorithm

[...]

F. Desobry¹, Manuel Davy², Christian Doncarli³•Institutions (3)

Centre national de la recherche scientifique¹, École centrale de Lille², École centrale de Nantes³

01 Aug 2005-IEEE Transactions on Signal Processing

TL;DR: This work considers the case where no accurate nor tractable model can be found, using a model-free approach, called Kernel change detection (KCD), and builds a dissimilarity measure in feature space between two sets of descriptors, shown to be asymptotically equivalent to the Fisher ratio in the Gaussian case.

...read moreread less

Abstract: A number of abrupt change detection methods have been proposed in the past, among which are efficient model-based techniques such as the Generalized Likelihood Ratio (GLR) test. We consider the case where no accurate nor tractable model can be found, using a model-free approach, called Kernel change detection (KCD). KCD compares two sets of descriptors extracted online from the signal at each time instant: The immediate past set and the immediate future set. Based on the soft margin single-class Support Vector Machine (SVM), we build a dissimilarity measure in feature space between those sets, without estimating densities as an intermediary step. This dissimilarity measure is shown to be asymptotically equivalent to the Fisher ratio in the Gaussian case. Implementation issues are addressed; in particular, the dissimilarity measure can be computed online in input space. Simulation results on both synthetic signals and real music signals show the efficiency of KCD.

...read moreread less

Journal Article•DOI•

Spatial distribution model for tracking extended objects

[...]

K. Gilholm¹, D. Salmond¹•Institutions (1)

Qinetiq¹

17 Oct 2005

TL;DR: In this paper, a Bayesian filter was developed for tracking an extended object in clutter based on two simple axioms: (i) the number of received target and clutter measurements in a frame are Poisson distributed (so several measurements may originate from the target) and (ii) target extent is modelled by a spatial probability distribution and each targetrelated measurement is an independent 'random draw' from this spatial distribution (convolved with a sensor model).

...read moreread less

Abstract: A Bayesian filter has been developed for tracking an extended object in clutter based on two simple axioms: (i) the numbers of received target and clutter measurements in a frame are Poisson distributed (so several measurements may originate from the target) and (ii) target extent is modelled by a spatial probability distribution and each target-related measurement is an independent 'random draw' from this spatial distribution (convolved with a sensor model). Diffuse spatial models of target extent are of particular interest. This model is especially suitable for a particle filter implementation, and examples are presented for a Gaussian mixture model and for a uniform stick target convolved with a Gaussian error. A rather restrictive special case that admits a solution in the form of a multiple hypothesis Kalman filter is also discussed and demonstrated.

...read moreread less

Proceedings Article•DOI•

Priors for people tracking from small training sets

[...]

Raquel Urtasun¹, David J. Fleet², Aaron Hertzmann², Pascal Fua¹•Institutions (2)

École Polytechnique Fédérale de Lausanne¹, University of Toronto²

17 Oct 2005

TL;DR: It is shown that the SGPLVM sufficiently constrains the problem such that tracking can be accomplished with straightforward deterministic optimization.

...read moreread less

Abstract: We advocate the use of scaled Gaussian process latent variable models (SGPLVM) to learn prior models of 3D human pose for 3D people tracking. The SGPLVM simultaneously optimizes a low-dimensional embedding of the high-dimensional pose data and a density function that both gives higher probability to points close to training data and provides a nonlinear probabilistic mapping from the low-dimensional latent space to the full-dimensional pose space. The SGPLVM is a natural choice when only small amounts of training data are available. We demonstrate our approach with two distinct motions, golfing and walking. We show that the SGPLVM sufficiently constrains the problem such that tracking can be accomplished with straightforward deterministic optimization.

...read moreread less

Book•

The Generic chaining : upper and lower bounds of stochastic processes

[...]

Michel Talagrand

01 Jan 2005

TL;DR: In this paper, the Bernoulli Conjecture and families of distances have been used in the application of Gaussian Processes and Related Structures to Banach Space Theory.

...read moreread less

Abstract: Overview and Basic Facts.- Gaussian Processes and Related Structures.- Matching Theorems.- The Bernoulli Conjecture.- Families of distances.- Applications to Banach Space Theory.

...read moreread less

Proceedings Article•

Kernel methods for missing variables

[...]

Alexander J. Smola, S. V. N. Vishwanathan, Thomas Hofmann

01 Jan 2005

TL;DR: An optimization scheme is proposed which extends the Concave Convex Procedure (CCP) of Yuille and Rangarajan and it is shown how the algorithm can be specialized to various cases in order to efficiently solve the optimization problems that arise.

...read moreread less

Abstract: We present methods for dealing with missing variables in the context of Gaussian Processes and Support Vector Machines. This solves an important problem which has largely been ignored by kernel methods: How to systematically deal with incomplete data? Our method can also be applied to problems with partially observed labels as well as to the transductive setting where we view the labels as missing data. Our approach relies on casting kernel methods as an estimation problem in exponential families. Hence, estimation with missing variables becomes a problem of computing marginal distributions, and finding efficient optimization methods. To that extent we propose an optimization scheme which extends the Concave Convex Procedure (CCP) of Yuille and Rangarajan, and present a simplified and intuitive proof of its convergence. We show how our algorithm can be specialized to various cases in order to efficiently solve the optimization problems that arise. Encouraging preliminary experimental results on the USPS dataset are also presented.

...read moreread less

Journal Article•DOI•

Sampling and reconstruction of signals with finite rate of innovation in the presence of noise

[...]

Irena Maravic¹, Martin Vetterli¹•Institutions (1)

University of California, Berkeley¹

01 Aug 2005-IEEE Transactions on Signal Processing

TL;DR: This paper revisits the problem of sampling and reconstruction of signals with finite rate of innovation and proposes improved, more robust methods that have better numerical conditioning in the presence of noise and yield more accurate reconstruction.

...read moreread less

Abstract: Recently, it was shown that it is possible to develop exact sampling schemes for a large class of parametric nonbandlimited signals, namely certain signals of finite rate of innovation. A common feature of such signals is that they have a finite number of degrees of freedom per unit of time and can be reconstructed from a finite number of uniform samples. In order to prove sampling theorems, Vetterli et al. considered the case of deterministic, noiseless signals and developed algebraic methods that lead to perfect reconstruction. However, when noise is present, many of those schemes can become ill-conditioned. In this paper, we revisit the problem of sampling and reconstruction of signals with finite rate of innovation and propose improved, more robust methods that have better numerical conditioning in the presence of noise and yield more accurate reconstruction. We analyze, in detail, a signal made up of a stream of Diracs and develop algorithmic tools that will be used as a basis in all constructions. While some of the techniques have been already encountered in the spectral estimation framework, we further explore preconditioning methods that lead to improved resolution performance in the case when the signal contains closely spaced components. For classes of periodic signals, such as piecewise polynomials and nonuniform splines, we propose novel algebraic approaches that solve the sampling problem in the Laplace domain, after appropriate windowing. Building on the results for periodic signals, we extend our analysis to finite-length signals and develop schemes based on a Gaussian kernel, which avoid the problem of ill-conditioning by proper weighting of the data matrix. Our methods use structured linear systems and robust algorithmic solutions, which we show through simulation results.

...read moreread less

Journal Article•DOI•

Detecting multiple change-points in the mean of Gaussian process by model selection

[...]

E. Lebarbier¹•Institutions (1)

Institut national de la recherche agronomique¹

01 Apr 2005-Signal Processing

TL;DR: This paper proposes to estimate change-points in the mean of a signal corrupted by an additive Gaussian noise with a method based on a penalized least-squares criterion, and chooses the penalty function such that the resulting estimator minimizes the quadratic risk.

...read moreread less

Proceedings Article•DOI•

Distributed particle filter with GMM approximation for multiple targets localization and tracking in wireless sensor network

[...]

Xiaohong Sheng¹, Yu Hen Hu¹, Parameswaran Ramanathan¹•Institutions (1)

University of Wisconsin-Madison¹

24 Apr 2005

TL;DR: Simulation results show that the proposed DPF with GMM approximation algorithms provide robust localization and tracking performance at much reduced communication overhead.

...read moreread less

Abstract: Two novel distributed particle filters with Gaussian mixer approximation are proposed to localize and track multiple moving targets in a wireless sensor network. The distributed particle filters run on a set of uncorrelated sensor cliques that are dynamically organized based on moving target trajectories. These two algorithms differ in how the distributive computing is performed. In the first algorithm, partial results are updated at each sensor clique sequentially based on partial results forwarded from a neighboring clique and local observations. In the second algorithm, all individual cliques compute partial estimates based only on local observations in parallel, and forward their estimates to a fusion center to obtain final output. In order to conserve bandwidth and power, the local sufficient statistics (belief) is approximated by a low dimensional Gaussian mixture model (GMM) before propagating among sensor cliques. We further prove that the posterior distribution estimated by distributed particle filter convergence almost surely to the posterior distribution estimated from a centralized Bayesian formula. Moreover, a data-adaptive application layer communication protocol is proposed to facilitate sensor self-organization and collaboration. Simulation results show that the proposed DPF with GMM approximation algorithms provide robust localization and tracking performance at much reduced communication overhead.

...read moreread less

Proceedings Article•

Semiparametric Latent Factor Models

[...]

Yee Whye Teh, Matthias Seeger, Michael I. Jordan

01 Jan 2005

TL;DR: A semiparametric model for regression and classification problems involving multiple response variables makes use of a set of Gaussian processes to model the relationship to the inputs in a nonparametric fashion.

...read moreread less

Abstract: We propose a semiparametric model for regression and classification problems involving multiple response variables. The model makes use of a set of Gaussian processes to model the relationship to the inputs in a nonparametric fashion. Conditional dependencies between the responses can be captured through a linear mixture of the driving processes. This feature becomes important if some of the responses of predictive interest are less densely supplied by observed data than related auxiliary ones. We propose an efficient approximate inference scheme for this semiparametric model whose complexity is linear in the number of training data points.

...read moreread less

Journal Article•DOI•

Dynamic systems identification with Gaussian processes

[...]

J. Kocijan¹, Agathe Girard², Blaž Banko¹, Roderick Murray-Smith²•Institutions (2)

Jožef Stefan Institute¹, University of Glasgow²

01 Dec 2005-Mathematical and Computer Modelling of Dynamical Systems

TL;DR: This model is an example of the use of a probabilistic non-parametric modelling approach and can be used to highlight areas of the input space where prediction quality is poor, owing to the lack of data or complexity (high variance).

...read moreread less

Abstract: This paper describes the identification of nonlinear dynamic systems with a Gaussian process (GP) prior model. This model is an example of the use of a probabilistic non-parametric modelling approach. GPs are flexible models capable of modelling complex nonlinear systems. Also, an attractive feature of this model is that the variance associated with the model response is readily obtained, and it can be used to highlight areas of the input space where prediction quality is poor, owing to the lack of data or complexity (high variance). We illustrate the GP modelling technique on a simulated example of a nonlinear system.

...read moreread less

Journal Article•DOI•

Hierarchical Mixture Modeling With Normalized Inverse-Gaussian Priors

[...]

Antonio Lijoi, Ramsés H. Mena, Igor Prünster

01 Dec 2005-Journal of the American Statistical Association

TL;DR: In this article, the normalized inverse-Gaussian (N-IG) prior is proposed as an alternative to the Dirichlet process to be used in Bayesian hierarchical models.

...read moreread less

Abstract: In recent years the Dirichlet process prior has experienced a great success in the context of Bayesian mixture modeling. The idea of overcoming discreteness of its realizations by exploiting it in hierarchical models, combined with the development of suitable sampling techniques, represent one of the reasons of its popularity. In this article we propose the normalized inverse-Gaussian (N–IG) process as an alternative to the Dirichlet process to be used in Bayesian hierarchical models. The N–IG prior is constructed via its finite-dimensional distributions. This prior, although sharing the discreteness property of the Dirichlet prior, is characterized by a more elaborate and sensible clustering which makes use of all the information contained in the data. Whereas in the Dirichlet case the mass assigned to each observation depends solely on the number of times that it occurred, for the N–IG prior the weight of a single observation depends heavily on the whole number of ties in the sample. Moreover, expressio...

...read moreread less

Collapse