
Showing papers by "Robert B. Gramacy published in 2010"


Journal ArticleDOI
TL;DR: This document briefly covers the background and methodology underpinning the implementation provided by the amei package for R and contains extensive examples showing the functions and methods in action.
Abstract: The amei package for R is a tool that provides a flexible statistical framework for generating optimal epidemiological interventions that are designed to minimize the total expected cost of an emerging epidemic. Uncertainty regarding the underlying disease parameters is propagated through to the decision process via Bayesian posterior inference. The strategies produced through this framework are adaptive: vaccination schedules are iteratively adjusted to reflect the anticipated trajectory of the epidemic given the current population state and updated parameter estimates. This document briefly covers the background and methodology underpinning the implementation provided by the package and contains extensive examples showing the functions and methods in action.
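A minimal, self-contained R sketch of the decision step described above (this is not the amei interface; the SIR projection, cost settings, and "posterior" draws are all illustrative assumptions): propagate uncertainty about the transmission rate through a simple epidemic projection and pick the vaccination fraction with the smallest expected cost.

## one adaptive decision step: choose a vaccination fraction under parameter uncertainty
set.seed(1)

## hypothetical posterior draws of the transmission rate (centered near 0.2)
beta.draws <- rgamma(200, shape = 20, rate = 100)

## simple deterministic SIR projection of total new infections over a horizon
project.cases <- function(beta, vac.frac, S0 = 900, I0 = 10, gamma = 0.1, days = 60) {
  S <- S0 * (1 - vac.frac); I <- I0; N <- S0 + I0; cases <- 0
  for (t in 1:days) {
    new.inf <- beta * S * I / N
    S <- S - new.inf
    I <- I + new.inf - gamma * I
    cases <- cases + new.inf
  }
  cases
}

## expected total cost (infections plus doses) of each candidate policy
vac.grid <- seq(0, 0.8, by = 0.1)
cost.inf <- 1; cost.vac <- 0.5                 # assumed unit costs
exp.cost <- sapply(vac.grid, function(v) {
  mean(sapply(beta.draws, project.cases, vac.frac = v)) * cost.inf + 900 * v * cost.vac
})
vac.grid[which.min(exp.cost)]                  # vaccination fraction with lowest expected cost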

186 citations


Posted Content
TL;DR: In this paper, the authors show that estimating a non-zero nugget can lead to surrogate models with better statistical properties, such as predictive accuracy and coverage, in a variety of common situations.
Abstract: Most surrogate models for computer experiments are interpolators, and the most common interpolator is a Gaussian process (GP) that deliberately omits a small-scale (measurement) error term called the nugget. The explanation is that computer experiments are, by definition, "deterministic", and so there is no measurement error. We think this is too narrow a focus for a computer experiment and a statistically inefficient way to model them. We show that estimating a (non-zero) nugget can lead to surrogate models with better statistical properties, such as predictive accuracy and coverage, in a variety of common situations.
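A hand-built R sketch of the comparison being made (the test function, kernel, lengthscale, and nugget values are arbitrary illustrations, not the paper's experiments): predict a deterministic function with a GP using an essentially zero nugget and then a small positive one, and report out-of-sample interval coverage.

set.seed(2)
f <- function(x) sin(5 * x) + 0.2 * sin(20 * x)     # "deterministic" simulator output
X <- matrix(seq(0, 1, length = 12), ncol = 1)
y <- f(X)
XX <- matrix(seq(0, 1, length = 101), ncol = 1)

sqexp <- function(A, B, ell = 0.15)
  exp(-outer(A[, 1], B[, 1], "-")^2 / (2 * ell^2))

gp.pred <- function(nugget) {
  K  <- sqexp(X, X) + diag(nugget, nrow(X))
  Ks <- sqexp(XX, X)
  Ki <- solve(K)
  mu <- Ks %*% Ki %*% y
  s2 <- pmax(1 - diag(Ks %*% Ki %*% t(Ks)), 0) + nugget
  list(mu = mu, sd = sqrt(s2))
}

for (g in c(1e-8, 1e-2)) {                          # (near) interpolation vs. a small nugget
  p <- gp.pred(g)
  cover <- mean(abs(f(XX) - p$mu) <= 1.96 * p$sd)
  cat(sprintf("nugget = %g: 95%% interval coverage = %.2f\n", g, cover))
}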

175 citations


Journal ArticleDOI
TL;DR: The topics covered include methods for dealing with categorical inputs and excluding inputs from the tree or GP part of the model; fully Bayesian sensitivity analysis for inputs/covariates; sequential optimization of black-box functions; and a new Monte Carlo method for inference in multi-modal posterior distributions that combines simulated tempering and importance sampling.
Abstract: This document describes the new features in version 2.x of the tgp package for R, implementing treed Gaussian process (GP) models. The topics covered include methods for dealing with categorical inputs and excluding inputs from the tree or GP part of the model; fully Bayesian sensitivity analysis for inputs/covariates; sequential optimization of black-box functions; and a new Monte Carlo method for inference in multi-modal posterior distributions that combines simulated tempering and importance sampling. These additions extend the functionality of tgp across all models in the hierarchy: from Bayesian linear models, to classification and regression trees (CART), to treed Gaussian processes with jumps to the limiting linear model. It is assumed that the reader is familiar with the baseline functionality of the package, outlined in the first vignette (Gramacy 2007).
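A minimal usage sketch with the tgp package, written from memory of its documented interface; consult the package vignettes for the authoritative arguments and options.

library(tgp)

## synthetic 2-d exponential data shipped with the package
ed <- exp2d.rand()
X  <- ed$X;  Z <- ed$Z            # training inputs and responses
XX <- ed$XX                       # predictive locations

## treed Gaussian process fit with predictions at XX
fit <- btgp(X = X, Z = Z, XX = XX)
plot(fit)                         # posterior predictive surface and partition

## fully Bayesian sensitivity analysis is available via sens(), e.g. (untested sketch):
## s <- sens(X = X, Z = Z, nn.lhs = 300, model = btgp)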

145 citations


Posted Content
TL;DR: A new integrated improvement criterion is proposed to recognize that responses from inputs that violate the constraint may still be informative about the function, and thus could potentially be useful in the optimization.
Abstract: Optimization of complex functions, such as the output of computer simulators, is a difficult task that has received much attention in the literature. A less studied problem is that of optimization under unknown constraints, i.e., when the simulator must be invoked both to determine the typical real-valued response and to determine if a constraint has been violated, either for physical or policy reasons. We develop a statistical approach based on Gaussian processes and Bayesian learning to both approximate the unknown function and estimate the probability of meeting the constraints. A new integrated improvement criterion is proposed to recognize that responses from inputs that violate the constraint may still be informative about the function, and thus could potentially be useful in the optimization. The new criterion is illustrated on synthetic data, and on a motivating optimization problem from health care policy.
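The one-dimensional R sketch below conveys the flavour using the simpler "expected improvement times probability of feasibility" heuristic rather than the integrated criterion proposed in the paper; the objective, constraint, and hyperparameters are made up for illustration.

f <- function(x) -sin(6 * x) - x             # objective to be minimized
c.ok <- function(x) as.numeric(x < 0.7)      # unknown constraint (1 = feasible)

X <- seq(0.05, 0.95, length = 8)
y <- f(X); ok <- c.ok(X)

## GP predictive mean/sd for the objective (fixed hyperparameters)
k <- function(a, b, ell = 0.2) exp(-outer(a, b, "-")^2 / (2 * ell^2))
K  <- k(X, X) + diag(1e-6, length(X))
xx <- seq(0, 1, length = 200)
Ks <- k(xx, X); Ki <- solve(K)
mu <- as.vector(Ks %*% Ki %*% y)
s  <- sqrt(pmax(1 - rowSums((Ks %*% Ki) * Ks), 1e-12))

## expected improvement over the best feasible observation
fbest <- min(y[ok == 1])
z  <- (fbest - mu) / s
ei <- s * (z * pnorm(z) + dnorm(z))

## probability of feasibility from a kernel smooth of the 0/1 constraint labels
W <- k(xx, X, ell = 0.3)
pfeas <- as.vector(W %*% ok) / rowSums(W)

xx[which.max(ei * pfeas)]                    # suggested next input to evaluate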

144 citations


Journal ArticleDOI
01 Jan 2010
TL;DR: In this article, the authors derive an optimal method for combining multiple importance sampling estimators and prove that the resulting estimator has a highly desirable property related to the notion of effective sample size.
Abstract: Simulated tempering (ST) is an established Markov chain Monte Carlo (MCMC) method for sampling from a multimodal density π(θ). Typically, ST involves introducing an auxiliary variable k taking values in a finite subset of [0,1] and indexing a set of tempered distributions, say π_k(θ) ∝ π(θ)^k. In this case, small values of k encourage better mixing, but samples from π are only obtained when the joint chain for (θ,k) reaches k=1. However, the entire chain can be used to estimate expectations under π of functions of interest, provided that importance sampling (IS) weights are calculated. Unfortunately this method, which we call importance tempering (IT), can disappoint. This is partly because the most immediately obvious implementation is naive and can lead to high variance estimators. We derive a new optimal method for combining multiple IS estimators and prove that the resulting estimator has a highly desirable property related to the notion of effective sample size. We briefly report on the success of the optimal combination in two modelling scenarios requiring reversible-jump MCMC, where the naive approach fails.
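A small R sketch of the IT mechanics (the target density, temperature ladder, and chain lengths are arbitrary): run Metropolis chains at several temperatures, importance-weight each back to the target, and combine the per-temperature estimators with weights proportional to their effective sample sizes, standing in for the optimal combination derived in the paper.

set.seed(4)
lpi <- function(x) log(0.5 * dnorm(x, -3, 0.5) + 0.5 * dnorm(x, 3, 0.5))

metropolis <- function(k, n = 5000, sd = 2) {        # targets pi(x)^k
  x <- numeric(n)
  for (i in 2:n) {
    prop <- x[i - 1] + rnorm(1, 0, sd)
    x[i] <- if (log(runif(1)) < k * (lpi(prop) - lpi(x[i - 1]))) prop else x[i - 1]
  }
  x
}

ks <- c(0.1, 0.4, 0.7, 1.0)                          # temperature ladder
est <- ess <- numeric(length(ks))
for (j in seq_along(ks)) {
  x  <- metropolis(ks[j])
  lw <- (1 - ks[j]) * lpi(x)                         # log IS weights: pi / pi^k
  w  <- exp(lw - max(lw))
  est[j] <- sum(w * x^2) / sum(w)                    # estimate of E[x^2] under pi
  ess[j] <- sum(w)^2 / sum(w^2)                      # effective sample size
}

lambda <- ess / sum(ess)                             # ESS-proportional combination weights
sum(lambda * est)                                    # combined IT estimate of E[x^2]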

71 citations


Journal ArticleDOI
TL;DR: This paper detail a fully Bayesian hierarchical formulation that extends the framework further by allowing for heavy-tailed errors, relaxing the historical missingness assumption, and accounting for estimation risk.
Abstract: Portfolio balancing requires estimates of covariance between asset returns. Returns data have histories which greatly vary in length, since assets begin public trading at different times. This can lead to a huge amount of missing data---too much for the conventional imputation-based approach. Fortunately, a well-known factorization of the MVN likelihood under the prevailing historical missingness pattern leads to a simple algorithm of OLS regressions that is much more reliable. When there are more assets than returns, however, OLS becomes unstable. Gramacy et al. (2008) showed how classical shrinkage regression may be used instead, thus extending the state of the art to much bigger asset collections, with further accuracy and interpretation advantages. In this paper, we detail a fully Bayesian hierarchical formulation that extends the framework further by allowing for heavy-tailed errors, relaxing the historical missingness assumption, and accounting for estimation risk. We illustrate how this approach compares favorably to the classical one using synthetic data and an investment exercise with real returns. An accompanying R package is on CRAN.
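The classical building block referenced above, the MVN factorization under monotone missingness estimated by OLS regressions, is simple enough to sketch directly in R; the synthetic returns and history lengths are illustrative, and the paper's Bayesian hierarchical model replaces these OLS steps.

set.seed(5)

## synthetic returns: 4 assets, monotone missingness (later listings have shorter histories)
n <- 200
R <- matrix(rnorm(n * 4), n, 4) %*% chol(matrix(0.4, 4, 4) + diag(0.6, 4))
R[1:60,  3] <- NA                 # asset 3 starts trading at t = 61
R[1:120, 4] <- NA                 # asset 4 starts trading at t = 121

## columns ordered longest to shortest history: regress each shorter-history
## asset on its longer-history peers to extend the mean vector and covariance
mu  <- mean(R[, 1]); Sig <- matrix(var(R[, 1]), 1, 1)
for (j in 2:ncol(R)) {
  obs   <- !is.na(R[, j])
  fit   <- lm(R[obs, j] ~ R[obs, 1:(j - 1)])
  beta  <- coef(fit)[-1]; s2 <- summary(fit)$sigma^2
  mu    <- c(mu, coef(fit)[1] + sum(beta * mu))
  cross <- as.vector(Sig %*% beta)
  Sig   <- rbind(cbind(Sig, cross), c(cross, s2 + sum(beta * cross)))
}
mu                                # extended mean vector
Sig                               # extended covariance matrix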

25 citations


Posted Content
TL;DR: This work shows that a particular Gaussian process (GP) formulation is simple to work with and ideal as an emulator for some types of computer experiment as it can outperform the canonical separable GP regression model commonly used in this setting.
Abstract: A single-index model (SIM) provides for parsimonious multi-dimensional nonlinear regression by combining parametric (linear) projection with univariate nonparametric (non-linear) regression models. We show that a particular Gaussian process (GP) formulation is simple to work with and ideal as an emulator for some types of computer experiment as it can outperform the canonical separable GP regression model commonly used in this setting. Our contribution focuses on drastically simplifying, re-interpreting, and then generalizing a recently proposed fully Bayesian GP-SIM combination, and then illustrating its favorable performance on synthetic data and a real-data computer experiment. Two R packages, both released on CRAN, have been augmented to facilitate inference under our proposed model(s).
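A rough R sketch of the single-index construction (not the paper's fully Bayesian treatment): estimate a projection direction, here crudely by least squares, then fit a one-dimensional GP to the projected inputs with hand-picked hyperparameters.

set.seed(6)
n <- 80; d <- 4
X <- matrix(runif(n * d), n, d)
w.true <- c(2, -1, 0.5, 0) / sqrt(5.25)             # true index direction
y <- as.vector(sin(3 * X %*% w.true)) + rnorm(n, 0, 0.05)

w.hat <- coef(lm(y ~ X - 1))                        # crude estimate of the projection
w.hat <- w.hat / sqrt(sum(w.hat^2))
u <- as.vector(X %*% w.hat)                         # 1-d projected inputs

## one-dimensional GP predictive on the projection (fixed hyperparameters)
k <- function(a, b, ell = 0.3) exp(-outer(a, b, "-")^2 / (2 * ell^2))
K  <- k(u, u) + diag(0.05^2, n)
uu <- seq(min(u), max(u), length = 100)
pred <- k(uu, u) %*% solve(K, y)
plot(u, y); lines(uu, pred, col = 2)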

12 citations


Proceedings Article
08 Jul 2010
TL;DR: A sparse Gaussian process parameterization is introduced that defines a non-linear structure connecting latent variables, unlike common formulations of Gaussian process latent variable models.
Abstract: In a variety of disciplines such as social sciences, psychology, medicine and economics, the recorded data are considered to be noisy measurements of latent variables connected by some causal structure. This corresponds to a family of graphical models known as the structural equation model with latent variables. While linear non-Gaussian variants have been well-studied, inference in nonparametric structural equation models is still underdeveloped. We introduce a sparse Gaussian process parameterization that defines a non-linear structure connecting latent variables, unlike common formulations of Gaussian process latent variable models. The sparse parameterization is given a full Bayesian treatment without compromising Markov chain Monte Carlo efficiency. We compare the stability of the sampling procedure and the predictive ability of the model against the current practice.
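A generative R sketch of the model class being described (inference, the sparse parameterization, and MCMC are not shown; all settings are illustrative): latent variables connected by a nonlinear function drawn from a GP, each observed only through noisy indicators.

set.seed(7)
n <- 100
X1 <- rnorm(n)                                      # exogenous latent variable

## draw a nonlinear structural function f ~ GP at the X1 locations
k <- function(a, b, ell = 1) exp(-outer(a, b, "-")^2 / (2 * ell^2))
K <- k(X1, X1) + diag(1e-6, n)
f <- as.vector(t(chol(K)) %*% rnorm(n))
X2 <- f + rnorm(n, 0, 0.1)                          # endogenous latent variable

## noisy measurements (indicators) of the latents are all the analyst sees
Y1 <- X1 + rnorm(n, 0, 0.3)
Y2 <- X2 + rnorm(n, 0, 0.3)
plot(Y1, Y2)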

10 citations


Posted Content
TL;DR: The authors developed a simulation-based framework for regularized logistic regression, exploiting two novel results for scale mixtures of normals, by carefully choosing a hierarchical model for the likelihood by one type of mixture, and implementing regularization with another.
Abstract: In this paper, we develop a simulation-based framework for regularized logistic regression, exploiting two novel results for scale mixtures of normals. By carefully choosing a hierarchical model for the likelihood by one type of mixture, and implementing regularization with another, we obtain new MCMC schemes with varying efficiency depending on the data type (binary v. binomial, say) and the desired estimator (maximum likelihood, maximum a posteriori, posterior mean). Advantages of our omnibus approach include flexibility, computational efficiency, applicability in p >> n settings, uncertainty estimates, variable selection, and assessing the optimal degree of regularization. We compare our methodology to modern alternatives on both synthetic and real data. An R package called reglogit is available on CRAN.
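The regularization ingredient can be illustrated directly: a Laplace (lasso) penalty corresponds to a normal prior with an exponentially distributed variance, which is what makes conditionally Gaussian updates possible. A short empirical check of that identity follows (the rate value is arbitrary; this is not the reglogit code).

set.seed(8)
lambda <- 2
n <- 1e5

## hierarchical draws: tau2 ~ Exp(rate = lambda^2/2), beta | tau2 ~ N(0, tau2)
tau2 <- rexp(n, rate = lambda^2 / 2)
beta.mix <- rnorm(n, 0, sqrt(tau2))

## direct Laplace(rate = lambda) draws for comparison
beta.lap <- rexp(n, lambda) * sample(c(-1, 1), n, replace = TRUE)

qqplot(beta.mix, beta.lap); abline(0, 1)            # the two samples agree in distribution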

9 citations


Book ChapterDOI
01 Jan 2010
TL;DR: This work develops a Bayesian model averaging scheme to traverse the full space of classification TGPs (CTGPs) and illustrates how the combined approach is highly flexible, offers tractable inference, produces rules that are easy to interpret, and performs well out of sample.
Abstract: Recognizing the success of the treed Gaussian process (TGP) model as an interpretable and thrifty model for nonstationary regression, we seek to extend the model to classification. By combining Bayesian CART and the latent variable approach to classification via Gaussian processes (GPs), we develop a Bayesian model averaging scheme to traverse the full space of classification TGPs (CTGPs). We illustrate our method on synthetic and real data and thereby show how the combined approach is highly flexible, offers tractable inference, produces rules that are easy to interpret, and performs well out of sample.
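A toy R sketch of the latent-variable ingredient, without the treed partitioning (all settings are illustrative, and a crude kernel smooth stands in for full posterior inference over the latent surface): class labels arise by pushing a latent GP draw through a probit link.

set.seed(9)
n <- 120
x <- sort(runif(n))

## latent GP draw and probit-link class labels
k <- function(a, b, ell = 0.1) exp(-outer(a, b, "-")^2 / (2 * ell^2))
K <- k(x, x) + diag(1e-6, n)
z <- 2 * as.vector(t(chol(K)) %*% rnorm(n))         # latent function
y <- rbinom(n, 1, pnorm(z))                         # observed class labels

## crude estimate of the class probability surface
xx <- seq(0, 1, length = 200)
W <- k(xx, x)
phat <- as.vector(W %*% y) / rowSums(W)
plot(x, y); lines(xx, phat, col = 2)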

7 citations