Showing papers on "Bayesian probability published in 2013"

PDF

Open Access

Posted Content•

Empirical Analysis of Predictive Algorithms for Collaborative Filtering

[...]

John S. Breese¹, David Heckerman¹, Carl M. Kadie¹•Institutions (1)

30 Jan 2013-arXiv: Information Retrieval

TL;DR: In this article, the authors compare the predictive accuracy of various methods in a set of representative problem domains, including correlation coefficients, vector-based similarity calculations, and statistical Bayesian methods.

...read moreread less

Abstract: Collaborative filtering or recommender systems use a database about user preferences to predict additional topics or products a new user might like. In this paper we describe several algorithms designed for this task, including techniques based on correlation coefficients, vector-based similarity calculations, and statistical Bayesian methods. We compare the predictive accuracy of the various methods in a set of representative problem domains. We use two basic classes of evaluation metrics. The first characterizes accuracy over a set of individual predictions in terms of average absolute deviation. The second estimates the utility of a ranked list of suggested items. This metric uses an estimate of the probability that a user will see a recommendation in an ordered list. Experiments were run for datasets associated with 3 application areas, 4 experimental protocols, and the 2 evaluation metrics for the various algorithms. Results indicate that for a wide range of conditions, Bayesian networks with decision trees at each node and correlation methods outperform Bayesian-clustering and vector-similarity methods. Between correlation and Bayesian networks, the preferred method depends on the nature of the dataset, nature of the application (ranked versus one-by-one presentation), and the availability of votes with which to make predictions. Other considerations include the size of database, speed of predictions, and learning time.

...read moreread less

4,883 citations

Posted Content•

Estimating Continuous Distributions in Bayesian Classifiers

[...]

George H. John¹, Pat Langley¹•Institutions (1)

Stanford University¹

20 Feb 2013-arXiv: Learning

TL;DR: This paper abandon the normality assumption and instead use statistical methods for nonparametric density estimation for kernel estimation, which suggests that kernel estimation is a useful tool for learning Bayesian models.

...read moreread less

Abstract: When modeling a probability distribution with a Bayesian network, we are faced with the problem of how to handle continuous variables. Most previous work has either solved the problem by discretizing, or assumed that the data are generated by a single Gaussian. In this paper we abandon the normality assumption and instead use statistical methods for nonparametric density estimation. For a naive Bayesian classifier, we present experimental results on a variety of natural and artificial domains, comparing two methods of density estimation: assuming normality and modeling each conditional distribution with a single Gaussian; and using nonparametric kernel density estimation. We observe large reductions in error on several natural and artificial data sets, which suggests that kernel estimation is a useful tool for learning Bayesian models.

...read moreread less

3,071 citations

Journal Article•DOI•

Statistical Methods in Medical Research

[...]

Wenyaw Chan¹•Institutions (1)

University of Texas Health Science Center at Houston¹

01 Jan 2013-Model Assisted Statistics and Applications

TL;DR: This issue focuses on statistical methods in medical research and proposes two probabilistic models to estimate male-to-female HIV-1 transmission rate in one sexual contact.

...read moreread less

Abstract: Since John Snow first conducted a modern epidemiological study in 1854 during a cholera epidemic in London, statistics has been associated with medical research. After Austin Bradford Hill published a series of articles on the use of statistical methodology in medical research in 1937, statistical considerations and computational tools have been paramount in conductingmedical research [1]. For the past century, statistics has played an important role in the advancement of medical research and medical research has stimulated rapid development of statistical methods. For example, the development of modern survival analysis-an important branch of statistics has aimed to solve problems encountered in clinical trials and large-scale epidemiological studies. In this era of evidence-based medicine, the development of novel statistical methods will continue to be crucial in medical research. With the expansion of computer capacity and advancement of computational techniques, it is inevitable that modern statistical methods will likely incorporate, to a greater degree, complex computational procedures. This issue focuses on statistical methods in medical research. Several novel methods aiming on solving different medical research questions are introduced. Some unique approaches of statistical analysis are also present. Hanagal and Sharma contribute two papers. The first one deals with a bivariate survival model. They examine a parameter estimation issue when the samples are taken from a bivariate log-logistic distribution with shared gamma frailty. They propose to use a Bayesian approach along with theMarkov ChainMonte Carlo computational technique for implementation. The computer simulation is conducted for performance evaluation. Two well-known datasets, one about acute leukemia and the other about kidney infection are applied as examples. The second paper contributed by Hanagal and Sharma examines the shared inverse Gaussian frailty model with the bivariate exponential baseline hazard. They first derive the likelihood of the joint survival function. In their Bayesian approach, the parameters of the baseline hazard are assumed to follow a gamma distribution while the coefficients of the regression relationship are assumed to follow an independent normal distribution. The dependence of two components of the survival function is tested. Three information criteria are used for model comparisons. The proposed method is applied to analyze diabetic retinopathy data. The paper by Chang, Lyer, Bullitt and Wang provides a method to find determinants of the brain arterial system. They represent the brain arterial system as a binary tree and apply the mixed logistic regression model to find significant covariates. The authors also demonstrate model selection methods for both fixed and random effects. A case study is presented using the method. This paper provides a rigorous approach for analyzing the binary branching structure data. It is potentially applicable to other tree structure data. Chakraborty proposes two probabilistic models to estimate male-to-female HIV-1 transmission rate in one sexual contact. One model is applicable when the transmitter cell counts are known and the other model is applicable when the receptor cell counts are known. By first uniformizing each transmitter (or receptor) cell count and assuming as a beta distribution, this paper algebraically derives the transition probability by imposing some boundary conditions based on scientific phenomena related to HIV infection. The paper by Yeh, Jiang, Garrard, Lei and Gajewski proposes to use a zero-truncated Poisson model to analyze human cancer tissues transplanted to mice when the positive counts of affected ducts is subject to right censoring. A Bayesian approach choosing a Gamma distribution as the prior is adopted. After implementing through complex computational procedures, this paper obtains the estimates of the coefficients and demonstrates model fitting through

...read moreread less

1,127 citations

Journal Article•DOI•

FUBAR : A Fast, Unconstrained Bayesian AppRoximation for inferring selection

[...]

Ben Murrell¹, Sasha Moola¹, Sasha Moola², Amandla Mabona¹, Amandla Mabona², Thomas Weighill¹, Daniel J. Sheward², Sergei L. Kosakovsky Pond³, Konrad Scheffler³, Konrad Scheffler¹ - Show less +6 more•Institutions (3)

Stellenbosch University¹, University of Cape Town², University of California, San Diego³

01 May 2013-Molecular Biology and Evolution

TL;DR: This work presents an approximate hierarchical Bayesian method using a Markov chain Monte Carlo (MCMC) routine that ensures robustness against model misspecification by averaging over a large number of predefined site classes, and leaves the distribution of selection parameters essentially unconstrained.

...read moreread less

Abstract: Model-based analyses of natural selection often categorize sites into a relatively small number of site classes. Forcing each site to belong to one of these classes places unrealistic constraints on the distribution of selection parameters, which can result in misleading inference due to model misspecification. We present an approximate hierarchical Bayesian method using a Markov chain Monte Carlo (MCMC) routine that ensures robustness against model misspecification by averaging over a large number of predefined site classes. This leaves the distribution of selection parameters essentially unconstrained, and also allows sites experiencing positive and purifying selection to be identified orders of magnitude faster than by existing methods. We demonstrate that popular random effects likelihood methods can produce misleading results when sites assigned to the same site class experience different levels of positive or purifying selection—an unavoidable scenario when using a small number of site classes. Our Fast Unconstrained Bayesian AppRoximation (FUBAR) is unaffected by this problem, while achieving higher power than existing unconstrained (fixed effects likelihood) methods. The speed advantage of FUBAR allows us to analyze larger data sets than other methods: We illustrate this on a large influenza hemagglutinin data set (3,142 sequences). FUBAR is available as a batch file within the latest HyPhy distribution (http://www.hyphy.org), as well as on the Datamonkey web server (http://www.datamonkey.org/).

...read moreread less

939 citations

Journal Article•DOI•

Polygenic modeling with bayesian sparse linear mixed models.

[...]

Xiang Zhou¹, Peter Carbonetto¹, Matthew Stephens¹•Institutions (1)

University of Chicago¹

07 Feb 2013-PLOS Genetics

TL;DR: This work applies Bayesian sparse linear mixed model (BSLMM) and compares it with other methods for two polygenic modeling applications: estimating the proportion of variance in phenotypes explained (PVE) by available genotypes, and phenotype (or breeding value) prediction, and demonstrates that BSLMM considerably outperforms either of the other two methods.

...read moreread less

Abstract: Both linear mixed models (LMMs) and sparse regression models are widely used in genetics applications, including, recently, polygenic modeling in genome-wide association studies. These two approaches make very different assumptions, so are expected to perform well in different situations. However, in practice, for a given dataset one typically does not know which assumptions will be more accurate. Motivated by this, we consider a hybrid of the two, which we refer to as a “Bayesian sparse linear mixed model” (BSLMM) that includes both these models as special cases. We address several key computational and statistical issues that arise when applying BSLMM, including appropriate prior specification for the hyper-parameters and a novel Markov chain Monte Carlo algorithm for posterior inference. We apply BSLMM and compare it with other methods for two polygenic modeling applications: estimating the proportion of variance in phenotypes explained (PVE) by available genotypes, and phenotype (or breeding value) prediction. For PVE estimation, we demonstrate that BSLMM combines the advantages of both standard LMMs and sparse regression modeling. For phenotype prediction it considerably outperforms either of the other two methods, as well as several other large-scale regression methods previously suggested for this problem. Software implementing our method is freely available from http://stephenslab.uchicago.edu/software.html.

...read moreread less

764 citations

Journal Article•DOI•

Labeled Random Finite Sets and Multi-Object Conjugate Priors

[...]

Ba-Tuong Vo¹, Ba-Ngu Vo¹•Institutions (1)

Curtin University¹

01 Jul 2013-IEEE Transactions on Signal Processing

TL;DR: A new class of RFS distributions is proposed that is conjugate with respect to the multiobject observation likelihood and closed under the Chapman-Kolmogorov equation and is tested on a Bayesian multi-target tracking algorithm.

...read moreread less

Abstract: The objective of multi-object estimation is to simultaneously estimate the number of objects and their states from a set of observations in the presence of data association uncertainty, detection uncertainty, false observations, and noise. This estimation problem can be formulated in a Bayesian framework by modeling the (hidden) set of states and set of observations as random finite sets (RFSs) that covers thinning, Markov shifts, and superposition. A prior for the hidden RFS together with the likelihood of the realization of the observed RFS gives the posterior distribution via the application of Bayes rule. We propose a new class of RFS distributions that is conjugate with respect to the multiobject observation likelihood and closed under the Chapman-Kolmogorov equation. This result is tested on a Bayesian multi-target tracking algorithm.

...read moreread less

762 citations

Journal Article•DOI•

Importance Nested Sampling and the MultiNest Algorithm

[...]

Farhan Feroz¹, Michael P. Hobson, Ewan Cameron, Anthony N. Pettitt•Institutions (1)

University of Cambridge¹

10 Jun 2013-arXiv: Instrumentation and Methods for Astrophysics

TL;DR: In this article, an alternative summation of the MultiNest draws, called importance nested sampling (INS), is presented, which can calculate the Bayesian evidence at up to an order of magnitude higher accuracy than vanilla NS with no change in the way Multi-Nest explores the parameter space.

...read moreread less

Abstract: Bayesian inference involves two main computational challenges. First, in estimating the parameters of some model for the data, the posterior distribution may well be highly multi-modal: a regime in which the convergence to stationarity of traditional Markov Chain Monte Carlo (MCMC) techniques becomes incredibly slow. Second, in selecting between a set of competing models the necessary estimation of the Bayesian evidence for each is, by definition, a (possibly high-dimensional) integration over the entire parameter space; again this can be a daunting computational task, although new Monte Carlo (MC) integration algorithms offer solutions of ever increasing efficiency. Nested sampling (NS) is one such contemporary MC strategy targeted at calculation of the Bayesian evidence, but which also enables posterior inference as a by-product, thereby allowing simultaneous parameter estimation and model selection. The widely-used MultiNest algorithm presents a particularly efficient implementation of the NS technique for multi-modal posteriors. In this paper we discuss importance nested sampling (INS), an alternative summation of the MultiNest draws, which can calculate the Bayesian evidence at up to an order of magnitude higher accuracy than `vanilla' NS with no change in the way MultiNest explores the parameter space. This is accomplished by treating as a (pseudo-)importance sample the totality of points collected by MultiNest, including those previously discarded under the constrained likelihood sampling of the NS algorithm. We apply this technique to several challenging test problems and compare the accuracy of Bayesian evidences obtained with INS against those from vanilla NS.

...read moreread less

674 citations

Posted Content•

Induction of Selective Bayesian Classifiers

[...]

Pat Langley, Stephanie Sage

27 Feb 2013-arXiv: Learning

TL;DR: In this paper, the naive Bayesian classifier is extended with a greedy search through the space of features to improve asymptotic accuracy in domains that involve correlated features without reducing the rate of learning in ones that do not.

...read moreread less

Abstract: In this paper, we examine previous work on the naive Bayesian classifier and review its limitations, which include a sensitivity to correlated features. We respond to this problem by embedding the naive Bayesian induction scheme within an algorithm that c arries out a greedy search through the space of features. We hypothesize that this approach will improve asymptotic accuracy in domains that involve correlated features without reducing the rate of learning in ones that do not. We report experimental results on six natural domains, including comparisons with decision-tree induction, that support these hypotheses. In closing, we discuss other approaches to extending naive Bayesian classifiers and outline some directions for future research.

...read moreread less

660 citations

Journal Article•DOI•

Bayesian computing with INLA: New features

[...]

Thiago G. Martins¹, Daniel Simpson¹, Finn Lindgren¹, Håvard Rue¹•Institutions (1)

Norwegian University of Science and Technology¹

01 Nov 2013-Computational Statistics & Data Analysis

TL;DR: The INLA approach for approximate Bayesian inference for latent Gaussian models has been shown to give fast and accurate estimates of posterior marginals and to be a valuable tool in practice via the R-package R-INLA.

...read moreread less

491 citations

Journal Article•DOI•

Philosophy and the practice of Bayesian statistics

[...]

Andrew Gelman¹, Cosma Rohilla Shalizi²•Institutions (2)

Columbia University¹, Carnegie Mellon University²

01 Feb 2013-British Journal of Mathematical and Statistical Psychology

TL;DR: The authors argue that the most successful forms of Bayesian statistics do not actually support that particular philosophy but rather accord much better with sophisticated forms of hypothetico-deductivism, and examine the actual role played by prior distributions in Bayesian models, and the crucial aspects of model checking and model revision.

...read moreread less

Abstract: A substantial school in the philosophy of science identies Bayesian inference with inductive inference and even rationality as such, and seems to be strengthened by the rise and practical success of Bayesian statistics. We argue that the most successful forms of Bayesian statistics do not actually support that particular philosophy but rather accord much better with sophisticated forms of hypothetico-deductivism. We examine the actual role played by prior distributions in Bayesian models, and the crucial aspects of model checking and model revision, which fall outside the scope of Bayesian

...read moreread less

450 citations

Journal Article•DOI•

Improving Bayesian population dynamics inference: a coalescent-based model for multiple loci

[...]

Mandev S. Gill¹, Philippe Lemey², Nuno R. Faria², Andrew Rambaut³, Beth Shapiro⁴, Marc A. Suchard¹ - Show less +2 more•Institutions (4)

University of California, Los Angeles¹, Katholieke Universiteit Leuven², University of Edinburgh³, University of California, Santa Cruz⁴

01 Mar 2013-Molecular Biology and Evolution

TL;DR: In this article, a Gaussian Markov random field (GMRF) model was proposed for the analysis of multilocus sequence data and the time to the most recent common ancestor (TMRCA) was recovered.

...read moreread less

Abstract: Effective population size is fundamental in population genetics and characterizes genetic diversity. To infer past population dynamics from molecular sequence data, coalescent-based models have been developed for Bayesian nonparametric estimation of effective population size over time. Among the most successful is a Gaussian Markov random field (GMRF) model for a single gene locus. Here, we present a generalization of the GMRF model that allows for the analysis of multilocus sequence data. Using simulated data, we demonstrate the improved performance of our method to recover true population trajectories and the time to the most recent common ancestor (TMRCA). We analyze a multilocus alignment of HIV-1 CRF02_AG gene sequences sampled from Cameroon. Our results are consistent with HIV prevalence data and uncover some aspects of the population history that go undetected in Bayesian parametric estimation. Finally, we recover an older and more reconcilable TMRCA for a classic ancient DNA data set.

...read moreread less

Journal Article•DOI•

Studies in astronomical time series analysis. vi. bayesian block representations

[...]

Jeffrey D. Scargle¹, Jay P. Norris², Brad Jackson³, James Chiang⁴•Institutions (4)

Ames Research Center¹, Boise State University², San Jose State University³, SLAC National Accelerator Laboratory⁴

04 Feb 2013-The Astrophysical Journal

TL;DR: A simple nonparametric modeling technique and an algorithm implementing it—an improved and generalized version of Bayesian Blocks [Scargle 1998]—that finds the optimal segmentation of the data in the observation interval are presented.

...read moreread less

Abstract: This paper addresses the problem of detecting and characterizing local variability in time series and other forms of sequential data. The goal is to identify and characterize statistically significant variations, at the same time suppressing the inevitable corrupting observational errors. We present a simple nonparametric modeling technique and an algorithm implementing it—an improved and generalized version of Bayesian Blocks [Scargle 1998]—that finds the optimal segmentation of the data in the observation interval. The structure of the algorithm allows it to be used in either a real-time trigger mode, or a retrospective mode. Maximum likelihood or marginal posterior functions to measure model fitness are presented for events, binned counts, and measurements at arbitrary times with known error distributions. Problems addressed include those connected with data gaps, variable exposure, extension to piecewise linear and piecewise exponential representations, multi-variate time series data, analysis of variance, data on the circle, other data modes, and dispersed data. Simulations provide evidence that the detection efficiency for weak signals is close to a theoretical asymptotic limit derived by [Arias-Castro, Donoho and Huo 2003]. In the spirit of Reproducible Research [Donoho et al. (2008)] all of the code and data necessary to reproduce all of the figures in this paper are included as auxiliary material.

...read moreread less

Journal Article•DOI•

Simulation-based optimal Bayesian experimental design for nonlinear systems

[...]

Xun Huan¹, Youssef M. Marzouk¹•Institutions (1)

Massachusetts Institute of Technology¹

01 Jan 2013-Journal of Computational Physics

TL;DR: This work proposes a general mathematical framework and an algorithmic approach for optimal experimental design with nonlinear simulation-based models, and focuses on finding sets of experiments that provide the most information about targeted sets of parameters.

...read moreread less

Journal Article•DOI•

Priors in Whole-Genome Regression: The Bayesian Alphabet Returns

[...]

Daniel Gianola¹•Institutions (1)

University of Wisconsin-Madison¹

01 Jul 2013-Genetics

TL;DR: It is concluded that members of the alphabet have a room in whole-genome prediction of phenotypes, but have somewhat doubtful inferential value, at least when sample size is such that n ≪ p.

...read moreread less

Abstract: Whole-genome enabled prediction of complex traits has received enormous attention in animal and plant breeding and is making inroads into human and even Drosophila genetics. The term “Bayesian alphabet” denotes a growing number of letters of the alphabet used to denote various Bayesian linear regressions that differ in the priors adopted, while sharing the same sampling model. We explore the role of the prior distribution in whole-genome regression models for dissecting complex traits in what is now a standard situation with genomic data where the number of unknown parameters (p) typically exceeds sample size (n). Members of the alphabet aim to confront this overparameterization in various manners, but it is shown here that the prior is always influential, unless n ≫ p. This happens because parameters are not likelihood identified, so Bayesian learning is imperfect. Since inferences are not devoid of the influence of the prior, claims about genetic architecture from these methods should be taken with caution. However, all such procedures may deliver reasonable predictions of complex traits, provided that some parameters (“tuning knobs”) are assessed via a properly conducted cross-validation. It is concluded that members of the alphabet have a room in whole-genome prediction of phenotypes, but have somewhat doubtful inferential value, at least when sample size is such that n ≪ p.

...read moreread less

Journal Article•DOI•

Spatio-temporal modeling of particulate matter concentration through the SPDE approach

[...]

Michela Cameletti¹, Finn Lindgren², Daniel Simpson², Håvard Rue²•Institutions (2)

University of Bergamo¹, Norwegian University of Science and Technology²

01 Apr 2013-AStA Advances in Statistical Analysis

TL;DR: This work considers a hierarchical spatio-temporal model for particulate matter (PM) concentration in the North-Italian region Piemonte and proposes a strategy to represent a GF with Matérn covariance function as a Gaussian Markov Random Field (GMRF) through the SPDE approach.

...read moreread less

Abstract: In this work, we consider a hierarchical spatio-temporal model for particulate matter (PM) concentration in the North-Italian region Piemonte. The model involves a Gaussian Field (GF), affected by a measurement error, and a state process characterized by a first order autoregressive dynamic model and spatially correlated innovations. This kind of model is well discussed and widely used in the air quality literature thanks to its flexibility in modelling the effect of relevant covariates (i.e. meteorological and geographical variables) as well as time and space dependence. However, Bayesian inference—through Markov chain Monte Carlo (MCMC) techniques—can be a challenge due to convergence problems and heavy computational loads. In particular, the computational issue refers to the infeasibility of linear algebra operations involving the big dense covariance matrices which occur when large spatio-temporal datasets are present. The main goal of this work is to present an effective estimating and spatial prediction strategy for the considered spatio-temporal model. This proposal consists in representing a GF with Matern covariance function as a Gaussian Markov Random Field (GMRF) through the Stochastic Partial Differential Equations (SPDE) approach. The main advantage of moving from a GF to a GMRF stems from the good computational properties that the latter enjoys. In fact, GMRFs are defined by sparse matrices that allow for computationally effective numerical methods. Moreover, when dealing with Bayesian inference for GMRFs, it is possible to adopt the Integrated Nested Laplace Approximation (INLA) algorithm as an alternative to MCMC methods giving rise to additional computational advantages. The implementation of the SPDE approach through the R-library INLA ( www.r-inla.org ) is illustrated with reference to the Piemonte PM data. In particular, providing the step-by-step R-code, we show how it is easy to get prediction and probability of exceedance maps in a reasonable computing time.

...read moreread less

Journal Article•DOI•

Quantitative risk analysis of offshore drilling operations: A Bayesian approach

[...]

Nima Khakzad¹, Faisal Khan¹, Paul Amyotte²•Institutions (2)

Memorial University of Newfoundland¹, Dalhousie University²

01 Aug 2013-Safety Science

TL;DR: The Bayesian network method provides greater value than the bow-tie model since it can consider common cause failures and conditional dependencies along with performing probability updating and sequential learning using accident precursors.

...read moreread less

Journal Article•DOI•

New Important Developments in Small Area Estimation

[...]

Danny Pfeffermann¹•Institutions (1)

University of Southampton¹

01 Feb 2013-Statistical Science

TL;DR: The problem of small area estimation (SAE) is how to produce reliable estimates of characteristics of interest such as means, counts, quantiles, etc., for areas or domains for which only small samples or no samples are available, and how to assess their precision.

...read moreread less

Abstract: The problem of small area estimation (SAE) is how to produce reliable estimates of characteristics of interest such as means, counts, quantiles, etc., for areas or domains for which only small samples or no samples are available, and how to assess their precision. The purpose of this paper is to review and discuss some of the new important developments in small area estimation methods. Rao (2003) wrote a very comprehensive book, which covers all the main developments in this topic until that time. A few review papers have been written after 2003 but they are limited in scope. Hence, the focus of this review is on new developments in the last 7-8 years but to make the review more self-contained, I also mention shortly some of the older developments. The review covers both design-based and model-dependent methods, with the latter methods further classified into frequentist and Bayesian methods. The style of the paper is similar to the style of my previous review on SAE published in 2002, explaining the new problems investigated and describing the proposed solutions, but without dwelling on theoretical details, which can be found in the original articles. I hope that this paper will be useful both to researchers who like to learn more on the research carried out in SAE and to practitioners who might be interested in the application of the new methods.

...read moreread less

Book•

A Beginner's Guide to GLM and GLMM with R: A Frequentist and Bayesian Perspective for Ecologists

[...]

Alain F. Zuur, Joseph Hilbe, Elena N. Ieno

01 Jan 2013

Posted Content•

Being Bayesian about Network Structure

[...]

Nir Friedman¹, Daphne Koller²•Institutions (2)

Hebrew University of Jerusalem¹, Stanford University²

16 Jan 2013-arXiv: Learning

TL;DR: In this paper, a Markov Chain Monte Carlo (MCMC) method is proposed to compute the posterior probability of a feature over a given number of networks that are consistent with a fixed ordering over network variables.

...read moreread less

Abstract: In many domains, we are interested in analyzing the structure of the underlying distribution, e.g., whether one variable is a direct parent of the other. Bayesian model-selection attempts to find the MAP model and use its structure to answer these questions. However, when the amount of available data is modest, there might be many models that have non-negligible posterior. Thus, we want compute the Bayesian posterior of a feature, i.e., the total posterior probability of all models that contain it. In this paper, we propose a new approach for this task. We first show how to efficiently compute a sum over the exponential number of networks that are consistent with a fixed ordering over network variables. This allows us to compute, for a given ordering, both the marginal probability of the data and the posterior of a feature. We then use this result as the basis for an algorithm that approximates the Bayesian posterior of a feature. Our approach uses a Markov Chain Monte Carlo (MCMC) method, but over orderings rather than over network structures. The space of orderings is much smaller and more regular than the space of structures, and has a smoother posterior `landscape'. We present empirical results on synthetic and real-life datasets that compare our approach to full model averaging (when possible), to MCMC over network structures, and to a non-Bayesian bootstrap approach.

...read moreread less

Posted Content•

Streaming Variational Bayes

[...]

Tamara Broderick¹, Nicholas Boyd¹, Andre Wibisono¹, Ashia C. Wilson¹, Michael I. Jordan¹ - Show less +1 more•Institutions (1)

University of California, Berkeley¹

25 Jul 2013-arXiv: Machine Learning

TL;DR: SDA-Bayes as mentioned in this paper is a framework for streaming and distributed computation of a Bayesian posterior, which makes streaming updates to the estimated posterior according to a user-specified approximation batch primitive.

...read moreread less

Abstract: We present SDA-Bayes, a framework for (S)treaming, (D)istributed, (A)synchronous computation of a Bayesian posterior. The framework makes streaming updates to the estimated posterior according to a user-specified approximation batch primitive. We demonstrate the usefulness of our framework, with variational Bayes (VB) as the primitive, by fitting the latent Dirichlet allocation model to two large-scale document collections. We demonstrate the advantages of our algorithm over stochastic variational inference (SVI) by comparing the two after a single pass through a known amount of data---a case where SVI may be applied---and in the streaming setting, where SVI does not apply.

...read moreread less

Journal Article•DOI•

Generalized double pareto shrinkage.

[...]

Artin Armagan¹, David B. Dunson², Jaeyong Lee³•Institutions (3)

SAS Institute¹, Duke University², Seoul National University³

01 Jan 2013-Statistica Sinica

TL;DR: In this paper, a generalized double Pareto prior is proposed for Bayesian shrinkage estimation and inferences in linear models, which can be obtained via a scale mixture of Laplace or normal distributions, forming a bridge between the Laplace and Normal-Jeffreys' priors.

...read moreread less

Abstract: We propose a generalized double Pareto prior for Bayesian shrinkage estimation and inferences in linear models. The prior can be obtained via a scale mixture of Laplace or normal distributions, forming a bridge between the Laplace and Normal-Jeffreys' priors. While it has a spike at zero like the Laplace density, it also has a Student's t-like tail behavior. Bayesian computation is straightforward via a simple Gibbs sampling algorithm. We investigate the properties of the maximum a posteriori estimator, as sparse estimation plays an important role in many problems, reveal connections with some well-established regularization procedures, and show some asymptotic results. The performance of the prior is tested through simulations and an application.

...read moreread less

Journal Article•DOI•

Improved Reversible Jump Algorithms for Bayesian Species Delimitation

[...]

Bruce Rannala¹, Bruce Rannala², Bruce Rannala³, Ziheng Yang⁴, Ziheng Yang¹ - Show less +1 more•Institutions (4)

Chinese Academy of Sciences¹, University of California, Davis², Joseph Fourier University³, University College London⁴

01 May 2013-Genetics

TL;DR: Modifications are introduced to the rjMCMC algorithms that remove the constraint on the new species divergence time when splitting and alter the gene trees to remove incompatibilities, and are found to improve mixing of the Markov chain for both simulated and empirical data sets.

...read moreread less

Abstract: Several computational methods have recently been proposed for delimiting species using multilocus sequence data. Among them, the Bayesian method of Yang and Rannala uses the multispecies coalescent model in the likelihood framework to calculate the posterior probabilities for the different species-delimitation models. It has a sound statistical basis and is found to have nice statistical properties in simulation studies, such as low error rates of undersplitting and oversplitting. However, the method suffers from poor mixing of the reversible-jump Markov chain Monte Carlo (rjMCMC) algorithms. Here, we describe several modifications to the algorithms. We propose a flexible prior that allows the user to specify the probability that each node on the guide tree represents a true speciation event. We also introduce modifications to the rjMCMC algorithms that remove the constraint on the new species divergence time when splitting and alter the gene trees to remove incompatibilities. The new algorithms are found to improve mixing of the Markov chain for both simulated and empirical data sets.

...read moreread less

Journal Article•DOI•

The Best Inflationary Models After Planck

[...]

Jérôme Martin¹, Christophe Ringeval², Roberto Trotta³, Vincent Vennin¹•Institutions (3)

Institut d'Astrophysique de Paris¹, Université catholique de Louvain², Imperial College London³

12 Dec 2013-arXiv: Cosmology and Nongalactic Astrophysics

TL;DR: This analysis singles out the most probable models (from an Occam's razor point of view) that are compatible with Planck data, while ruling out with very strong evidence 34% of the models considered.

...read moreread less

Abstract: We compute the Bayesian evidence and complexity of 193 slow-roll single-field models of inflation using the Planck 2013 Cosmic Microwave Background data, with the aim of establishing which models are favoured from a Bayesian perspective Our calculations employ a new numerical pipeline interfacing an inflationary effective likelihood with the slow-roll library ASPIC and the nested sampling algorithm MULTINEST The models considered represent a complete and systematic scan of the entire landscape of inflationary scenarios proposed so far Our analysis singles out the most probable models (from an Occam's razor point of view) that are compatible with Planck data, while ruling out with very strong evidence 34% of the models considered We identify 26% of the models that are favoured by the Bayesian evidence, corresponding to 15 different potential shapes If the Bayesian complexity is included in the analysis, only 9% of the models are preferred, corresponding to only 9 different potential shapes These shapes are all of the plateau type

...read moreread less

Journal Article•DOI•

CARBayes: An R Package for Bayesian Spatial Modeling with Conditional Autoregressive Priors

[...]

Duncan Lee

20 Nov 2013-Journal of Statistical Software

TL;DR: The general class of Bayesian hierarchical models that can be implemented in the CARBayes software are outlined, their implementation via MCMC simulation techniques are described, and their use with two worked examples in the fields of house price analysis and disease mapping are illustrated.

...read moreread less

Abstract: Conditional autoregressive models are commonly used to represent spatial autocorrelation in data relating to a set of non-overlapping areal units, which arise in a wide variety of applications including agriculture, education, epidemiology and image analysis. Such models are typically specified in a hierarchical Bayesian framework, with inference based on Markov chain Monte Carlo (MCMC) simulation. The most widely used software to fit such models is WinBUGS or OpenBUGS, but in this paper we introduce the R package CARBayes. The main advantage of CARBayes compared with the BUGS software is its ease of use, because: (1) the spatial adjacency information is easy to specify as a binary neighbourhood matrix; and (2) given the neighbourhood matrix the models can be implemented by a single function call in R. This paper outlines the general class of Bayesian hierarchical models that can be implemented in the CARBayes software, describes their implementation via MCMC simulation techniques, and illustrates their use with two worked examples in the fields of house price analysis and disease mapping.

...read moreread less

Journal Article•DOI•

A Computational Framework for Infinite-Dimensional Bayesian Inverse Problems Part I: The Linearized Case, with Application to Global Seismic Inversion

[...]

Tan Bui-Thanh¹, Omar Ghattas¹, James Martin¹, Georg Stadler¹•Institutions (1)

University of Texas at Austin¹

12 Nov 2013-SIAM Journal on Scientific Computing

TL;DR: In this article, the uncertainty in the numerical solution of linearized infinite-dimensional statistical inverse problems is estimated using the Bayesian inference formulation, where the prior probability distribution is chosen appropriately in order to guarantee wellposedness of the inverse problem and facilitate computation of the posterior.

...read moreread less

Abstract: We present a computational framework for estimating the uncertainty in the numerical solution of linearized infinite-dimensional statistical inverse problems. We adopt the Bayesian inference formulation: given observational data and their uncertainty, the governing forward problem and its uncertainty, and a prior probability distribution describing uncertainty in the parameter field, find the posterior probability distribution over the parameter field. The prior must be chosen appropriately in order to guarantee well-posedness of the infinite-dimensional inverse problem and facilitate computation of the posterior. Furthermore, straightforward discretizations may not lead to convergent approximations of the infinite-dimensional problem. And finally, solution of the discretized inverse problem via explicit construction of the covariance matrix is prohibitive due to the need to solve the forward problem as many times as there are parameters. Our computational framework builds on the infinite-dimensional form...

...read moreread less

Journal Article•DOI•

Incorporating pro-environmental preferences towards green automobile technologies through a Bayesian hybrid choice model

[...]

Ricardo A. Daziano¹, Denis Bolduc¹•Institutions (1)

Laval University¹

02 Jan 2013-Transportmetrica

TL;DR: In this paper, the authors developed, implemented and applied a Markov chain Monte Carlo (MCMC) Gibbs sampler for Bayesian estimation of a hybrid choice model (HCM), using stated data on both vehicle purchase decisions and environmental concerns.

...read moreread less

Abstract: In this article we develop, implement and apply a Markov chain Monte Carlo (MCMC) Gibbs sampler for Bayesian estimation of a hybrid choice model (HCM), using stated data on both vehicle purchase decisions and environmental concerns. Our study has two main contributions. The first is the feasibility of the Bayesian estimator we derive. Whereas classical estimation of HCMs is fairly complex, we show that the Bayesian approach for HCMs is methodologically easier to implement than simulated maximum likelihood because the inclusion of latent variables translates into adding independent ordinary regressions; we also find that, using the Bayesian estimates, forecasting and deriving confidence intervals for willingness to pay measures is straightforward. The second is the capacity of HCMs to adapt to practical situations. Our empirical results coincide with a priori expectations, namely that environmentally-conscious consumers are willing to pay more for low-emission vehicles. The model outperforms standard discr...

...read moreread less

Journal Article•DOI•

Fixed-form variational posterior approximation through stochastic linear regression

[...]

Tim Salimans, David A. Knowles

01 Dec 2013-Bayesian Analysis

TL;DR: A general algorithm for approximating nonstandard Bayesian posterior distributions that minimizes the Kullback-Leibler divergence of an approximating distribution to the intractable posterior distribu- tion.

...read moreread less

Abstract: textWe propose a general algorithm for approximating nonstandard Bayesian posterior distributions. The algorithm minimizes the Kullback-Leibler divergence of an approximating distribution to the intractable posterior distribu- tion. Our method can be used to approximate any posterior distribution, provided that it is given in closed form up to the proportionality constant. The approxi- mation can be any distribution in the exponential family or any mixture of such distributions, which means that it can be made arbitrarily precise. Several exam- ples illustrate the speed and accuracy of our approximation method in practice.

...read moreread less

Journal Article•DOI•

A practical introduction to multivariate meta-analysis.

[...]

Dimitris Mavridis¹, Georgia Salanti¹•Institutions (1)

University of Ioannina¹

01 Apr 2013-Statistical Methods in Medical Research

TL;DR: The statistical methods and related software for multivariate meta-analysis are reviewed, emphasis is placed on Bayesian methods using Markov chain Monte Carlo, and codes in WinBUGS are provided.

...read moreread less

Abstract: Multivariate meta-analysis is becoming increasingly popular and official routines or self-programmed functions have been included in many statistical software. In this article, we review the statistical methods and the related software for multivariate meta-analysis. Emphasis is placed on Bayesian methods using Markov chain Monte Carlo, and codes in WinBUGS are provided. The various model-fitting options are illustrated in two examples and specific guidance is provided on how to run a multivariate meta-analysis using various software packages.

...read moreread less

Journal Article•DOI•

Bayesian inversion for finite fault earthquake source models I—theory and algorithm

[...]

Sarah E. Minson¹, Mark Simons¹, James L. Beck¹•Institutions (1)

California Institute of Technology¹

01 Sep 2013-Geophysical Journal International

TL;DR: CATMIP as discussed by the authors combines the Metropolis algorithm with elements of simulated annealing and genetic algorithms to dynamically optimize the algorithm's efficiency as it runs, and it works independently of the model design, a priori constraints and data under consideration, and can be used for a wide variety of scientific problems.

...read moreread less

Abstract: The estimation of finite fault earthquake source models is an inherently underdetermined problem: there is no unique solution to the inverse problem of determining the rupture history at depth as a function of time and space when our data are limited to observations at the Earth’s surface. Bayesian methods allow us to determine the set of all plausible source model parameters that are consistent with the observations, our a priori assumptions about the physics of the earthquake source and wave propagation, and models for the observation errors and the errors due to the limitations in our forward model. Because our inversion approach does not require inverting any matrices other than covariance matrices, we can restrict our ensemble of solutions to only those models that are physically defensible while avoiding the need to restrict our class of models based on considerations of numerical invertibility. We only use prior information that is consistent with the physics of the problem rather than some artefice (such as smoothing) needed to produce a unique optimal model estimate. Bayesian inference can also be used to estimate model-dependent and internally consistent effective errors due to shortcomings in the forward model or data interpretation, such as poor Green’s functions or extraneous signals recorded by our instruments. Until recently, Bayesian techniques have been of limited utility for earthquake source inversions because they are computationally intractable for problems with as many free parameters as typically used in kinematic finite fault models. Our algorithm, called cascading adaptive transitional metropolis in parallel (CATMIP), allows sampling of high-dimensional problems in a parallel computing framework. CATMIP combines the Metropolis algorithm with elements of simulated annealing and genetic algorithms to dynamically optimize the algorithm’s efficiency as it runs. The algorithm is a generic Bayesian Markov Chain Monte Carlo sampler; it works independently of the model design, a priori constraints and data under consideration, and so can be used for a wide variety of scientific problems. We compare CATMIP’s efficiency relative to several existing sampling algorithms and then present synthetic performance tests of finite fault earthquake rupture models computed using CATMIP.

...read moreread less

Journal Article•

Bayesian Canonical correlation analysis

[...]

Arto Klami¹, Seppo Virtanen¹, Samuel Kaski¹•Institutions (1)

Helsinki Institute for Information Technology¹

01 Jan 2013-Journal of Machine Learning Research

TL;DR: This work introduces a novel efficient solution that imposes group-wise sparsity to estimate the posterior of an extended model which not only extracts the statistical dependencies between data sets but also decomposes the data into shared and data set-specific components.

...read moreread less

Abstract: Canonical correlation analysis (CCA) is a classical method for seeking correlations between two multivariate data sets. During the last ten years, it has received more and more attention in the machine learning community in the form of novel computational formulations and a plethora of applications. We review recent developments in Bayesian models and inference methods for CCA which are attractive for their potential in hierarchical extensions and for coping with the combination of large dimensionalities and small sample sizes. The existing methods have not been particularly successful in fulfilling the promise yet; we introduce a novel efficient solution that imposes group-wise sparsity to estimate the posterior of an extended model which not only extracts the statistical dependencies (correlations) between data sets but also decomposes the data into shared and data set-specific components. In statistics literature the model is known as inter-battery factor analysis (IBFA), for which we now provide a Bayesian treatment.

...read moreread less

Collapse