Author

Matias Quiroz

Bio: Matias Quiroz is an academic researcher from the University of Technology Sydney. The author has contributed to research in the topics of Markov chain Monte Carlo and Bayesian inference. The author has an h-index of 11 and has co-authored 33 publications receiving 474 citations. Previous affiliations of Matias Quiroz include Stockholm University and the University of New South Wales.

Papers
Journal ArticleDOI
TL;DR: Subsampling Markov chain Monte Carlo is substantially more efficient than standard MCMC in terms of sampling efficiency for a given computational budget, and it outperforms other subsampling methods for MCMC proposed in the literature.
Abstract: We propose subsampling Markov chain Monte Carlo (MCMC), an MCMC framework where the likelihood function for n observations is estimated from a random subset of m observations. We introduce a highly efficient unbiased estimator of the log-likelihood based on control variates, such that the computing cost is much smaller than that of the full log-likelihood in standard MCMC. The likelihood estimate is bias-corrected and used in two dependent pseudo-marginal algorithms to sample from a perturbed posterior, for which we derive the asymptotic error with respect to n and m, respectively. We propose a practical estimator of the error and show that the error is negligible even for a very small m in our applications. We demonstrate that subsampling MCMC is substantially more efficient than standard MCMC in terms of sampling efficiency for a given computational budget, and that it outperforms other subsampling methods for MCMC proposed in the literature. Supplementary materials for this article are available online.

162 citations
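The control-variate construction described above can be sketched in a few lines. The following Python toy is illustrative only; names such as ell, q, and loglik_estimate are assumptions, not the paper's code. It estimates the full-data log-likelihood without bias from a subsample of m observations, using a second-order Taylor expansion of each log-likelihood term around a reference value as the control variate.

import numpy as np

rng = np.random.default_rng(0)

# Toy data: y_i ~ N(theta, 1), so each log-likelihood term has a simple form.
n, m = 100_000, 500
y = rng.normal(1.0, 1.0, size=n)
theta_star = y.mean()  # reference point for the control variates

def ell(theta, yi):
    # log-density of one observation under N(theta, 1)
    return -0.5 * np.log(2 * np.pi) - 0.5 * (yi - theta) ** 2

# Per-observation control variates: second-order Taylor expansion of ell
# around theta_star. For this quadratic toy model the expansion is exact,
# so the estimator below is noiseless; in general it only reduces variance.
q0 = ell(theta_star, y)          # values at theta_star
q1 = y - theta_star              # first derivatives at theta_star
q2 = -np.ones(n)                 # second derivatives at theta_star

def q(theta, idx):
    dt = theta - theta_star
    return q0[idx] + q1[idx] * dt + 0.5 * q2[idx] * dt ** 2

def loglik_estimate(theta):
    # Unbiased estimate of sum_i ell(theta, y_i) from a subsample of size m.
    idx = rng.integers(0, n, size=m)            # sample with replacement
    diff = ell(theta, y[idx]) - q(theta, idx)   # residuals after control variates
    dt = theta - theta_star
    total_q = q0.sum() + q1.sum() * dt + 0.5 * q2.sum() * dt ** 2  # sum of all control variates
    return total_q + n * diff.mean()

print(loglik_estimate(0.9), ell(0.9, y).sum())  # estimate vs. exact value

In the paper the likelihood estimate is additionally bias-corrected and used inside pseudo-marginal samplers; the sketch only shows the variance-reduction idea behind the estimator.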

Journal ArticleDOI
TL;DR: A more precise likelihood estimator is proposed that incorporates auxiliary information about the full data likelihood while only operating on a sparse set of the data and is provably within O(m^-2) of the true posterior.
Abstract: The complexity of the Metropolis–Hastings (MH) algorithm arises from the requirement of a likelihood evaluation for the full dataset in each iteration. One solution has been proposed to speed up the algorithm by a delayed acceptance approach where the acceptance decision proceeds in two stages. In the first stage, an estimate of the likelihood based on a random subsample determines if it is likely that the draw will be accepted and, if so, the second stage uses the full data likelihood to decide upon final acceptance. Evaluating the full data likelihood is thus avoided for draws that are unlikely to be accepted. We propose a more precise likelihood estimator that incorporates auxiliary information about the full data likelihood while only operating on a sparse set of the data. We prove that the resulting delayed acceptance MH is more efficient. The caveat of this approach is that the full dataset needs to be evaluated in the second stage. We therefore propose to substitute this evaluation by an estimate.

41 citations
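A minimal sketch of the two-stage acceptance mechanism described above, assuming a symmetric random-walk proposal; log_post_approx and log_post_full are hypothetical callables standing in for the cheap subsample-based surrogate and the full-data posterior.

import numpy as np

rng = np.random.default_rng(1)

def delayed_acceptance_mh(theta0, n_iter, log_post_approx, log_post_full, proposal_sd=1.0):
    # Two-stage (delayed acceptance) Metropolis-Hastings with a symmetric
    # random-walk proposal. Stage 1 screens with a cheap surrogate posterior;
    # stage 2 corrects with the full-data posterior only for survivors.
    theta = theta0
    lp_a, lp_f = log_post_approx(theta), log_post_full(theta)
    draws = []
    for _ in range(n_iter):
        prop = theta + proposal_sd * rng.normal()
        lp_a_prop = log_post_approx(prop)
        # Stage 1: cheap screening acceptance using the surrogate
        if np.log(rng.uniform()) < lp_a_prop - lp_a:
            # Stage 2: expensive full-data correction; the surrogate ratio divides out
            lp_f_prop = log_post_full(prop)
            if np.log(rng.uniform()) < (lp_f_prop - lp_f) - (lp_a_prop - lp_a):
                theta, lp_a, lp_f = prop, lp_a_prop, lp_f_prop
        draws.append(theta)
    return np.array(draws)

# Toy usage: the surrogate is a slightly flattened version of the true log-posterior
log_post_full = lambda th: -0.5 * th ** 2
log_post_approx = lambda th: -0.45 * th ** 2
print(delayed_acceptance_mh(0.0, 5000, log_post_approx, log_post_full).std())

Because the surrogate ratio divides out in stage 2, the chain still targets the posterior defined by log_post_full; the saving comes from evaluating it only for proposals that survive the cheap screening step.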

Posted Content
TL;DR: This work proposes a simulation-consistent subsampling method for estimating expectations of any function of the parameters, using a combination of subsampling MCMC and the importance sampling correction for occasionally negative likelihood estimates in Lyne et al. (2015).
Abstract: Speeding up Markov Chain Monte Carlo (MCMC) for datasets with many observations by data subsampling has recently received considerable attention in the literature. Most of the proposed methods are approximate, and the only exact solution has been documented to be highly inefficient. We propose a simulation consistent subsampling method for estimating expectations of any function of the parameters using a combination of MCMC subsampling and the importance sampling correction for occasionally negative likelihood estimates in Lyne et al. (2015). Our algorithm is based on first obtaining an unbiased but not necessarily positive estimate of the likelihood. The estimator uses a soft lower bound such that the likelihood estimate is positive with a high probability, and computationally cheap control variables to lower variability. Second, we carry out a correlated pseudo marginal MCMC on the absolute value of the likelihood estimate. Third, the sign of the likelihood is corrected using an importance sampling step that has low variance by construction. We illustrate the usefulness of the method with two examples.

39 citations
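The sign-correction step mentioned above reduces to a simple importance-sampling ratio over the chain's output. A hedged sketch, with illustrative names only:

import numpy as np

def sign_corrected_mean(f_values, signs):
    # Importance-sampling sign correction for a pseudo-marginal chain run on
    # the absolute value of a possibly negative likelihood estimate:
    #   E[f] ~= sum_k f(theta_k) s_k / sum_k s_k,
    # where s_k is the sign of the likelihood estimate at draw k. The variance
    # of the correction is low when negative estimates are rare.
    f_values = np.asarray(f_values, dtype=float)
    signs = np.asarray(signs, dtype=float)
    return (f_values * signs).sum() / signs.sum()

# Toy illustration: chain draws together with mostly positive signs
rng = np.random.default_rng(2)
theta_draws = rng.normal(size=10_000)
signs = np.where(rng.uniform(size=theta_draws.size) < 0.98, 1.0, -1.0)
print(sign_corrected_mean(theta_draws ** 2, signs))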

Journal Article
TL;DR: In this article, Hamiltonian Monte Carlo (HMC) is shown to sample efficiently from high-dimensional posterior distributions, with proposed parameter draws obtained by iterating on a discretized version of the Hamiltonian dynamics.
Abstract: Hamiltonian Monte Carlo (HMC) samples efficiently from high-dimensional posterior distributions with proposed parameter draws obtained by iterating on a discretized version of the Hamiltonian dynamics.

39 citations
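For context, a generic Hamiltonian Monte Carlo transition built on the leapfrog discretization reads roughly as follows; this is a textbook sketch under stated assumptions, not the specific algorithm studied in the paper above.

import numpy as np

rng = np.random.default_rng(3)

def hmc_step(theta, log_post, grad_log_post, step_size=0.1, n_leapfrog=20):
    # One HMC transition: refresh the momentum, simulate the Hamiltonian
    # dynamics with the leapfrog discretization, then accept/reject to
    # correct for the discretization error.
    p = rng.normal(size=theta.shape)
    theta_new, p_new = theta.copy(), p.copy()
    for _ in range(n_leapfrog):
        p_new = p_new + 0.5 * step_size * grad_log_post(theta_new)  # half momentum step
        theta_new = theta_new + step_size * p_new                   # full position step
        p_new = p_new + 0.5 * step_size * grad_log_post(theta_new)  # half momentum step
    h_old = -log_post(theta) + 0.5 * (p ** 2).sum()
    h_new = -log_post(theta_new) + 0.5 * (p_new ** 2).sum()
    return theta_new if np.log(rng.uniform()) < h_old - h_new else theta

# Toy usage: sample a 2-dimensional standard normal posterior
log_post = lambda th: -0.5 * (th ** 2).sum()
grad_log_post = lambda th: -th
theta, draws = np.zeros(2), []
for _ in range(1000):
    theta = hmc_step(theta, log_post, grad_log_post)
    draws.append(theta)
print(np.std(draws, axis=0))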

Posted Content
TL;DR: In order to speed up Sequential Monte Carlo (SMC) for Bayesian inference in large data problems by data subsampling, an approximately unbiased and efficient annealed likelihood estimator based on data subsampling is used.
Abstract: We show how to speed up Sequential Monte Carlo (SMC) for Bayesian inference in large data problems by data subsampling. SMC sequentially updates a cloud of particles through a sequence of distributions, beginning with a distribution that is easy to sample from such as the prior and ending with the posterior distribution. Each update of the particle cloud consists of three steps: reweighting, resampling, and moving. In the move step, each particle is moved using a Markov kernel; this is typically the most computationally expensive part, particularly when the dataset is large. It is crucial to have an efficient move step to ensure particle diversity. Our article makes two important contributions. First, in order to speed up the SMC computation, we use an approximately unbiased and efficient annealed likelihood estimator based on data subsampling. The subsampling approach is more memory efficient than the corresponding full data SMC, which is an advantage for parallel computation. Second, we use a Metropolis within Gibbs kernel with two conditional updates. A Hamiltonian Monte Carlo update makes distant moves for the model parameters, and a block pseudo-marginal proposal is used for the particles corresponding to the auxiliary variables for the data subsampling. We demonstrate both the usefulness and limitations of the methodology for estimating four generalized linear models and a generalized additive model with large datasets.

36 citations
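The reweight/resample/move cycle described in the abstract can be sketched as a generic likelihood-tempering SMC skeleton. The callables sample_prior, log_target, and mcmc_move are placeholders; the paper's subsampling variant would replace the exact log-likelihood inside log_target with an annealed subsample-based estimate and use the Metropolis-within-Gibbs kernel it describes in the move step.

import numpy as np

rng = np.random.default_rng(4)

def smc_sampler(sample_prior, log_target, n_particles=1000, temps=None, mcmc_move=None):
    # Likelihood-tempering SMC skeleton: reweight, resample, move.
    # log_target(particles, temp) is assumed to return log prior + temp * log likelihood.
    if temps is None:
        temps = np.linspace(0.0, 1.0, 21)
    particles = sample_prior(n_particles)          # start from the prior
    log_w = np.zeros(n_particles)
    for t_prev, t_next in zip(temps[:-1], temps[1:]):
        # 1) reweight with the incremental importance weights
        log_w += log_target(particles, t_next) - log_target(particles, t_prev)
        w = np.exp(log_w - log_w.max())
        w /= w.sum()
        # 2) resample (multinomial) to counter weight degeneracy
        idx = rng.choice(n_particles, size=n_particles, p=w)
        particles, log_w = particles[idx], np.zeros(n_particles)
        # 3) move each particle with an MCMC kernel invariant for the tempered target
        if mcmc_move is not None:
            particles = mcmc_move(particles, t_next)
    return particles

# Toy usage: prior N(0, 10^2), 50 observations ~ N(2, 1), move step omitted
y = rng.normal(2.0, 1.0, size=50)
sample_prior = lambda k: rng.normal(0.0, 10.0, size=k)
def log_target(th, temp):
    return -0.5 * (th / 10.0) ** 2 + temp * (-0.5 * ((y[None, :] - th[:, None]) ** 2).sum(axis=1))
print(smc_sampler(sample_prior, log_target).mean())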


Cited by
Journal Article
TL;DR: The methodology proposed automatically adapts to the local structure when simulating paths across this manifold, providing highly efficient convergence and exploration of the target density, and substantial improvements in the time-normalized effective sample size are reported when compared with alternative sampling approaches.
Abstract: The paper proposes Metropolis adjusted Langevin and Hamiltonian Monte Carlo sampling methods defined on the Riemann manifold to resolve the shortcomings of existing Monte Carlo algorithms when sampling from target densities that may be high dimensional and exhibit strong correlations. The methods provide fully automated adaptation mechanisms that circumvent the costly pilot runs that are required to tune proposal densities for Metropolis-Hastings or indeed Hamiltonian Monte Carlo and Metropolis adjusted Langevin algorithms. This allows for highly efficient sampling even in very high dimensions where different scalings may be required for the transient and stationary phases of the Markov chain. The methodology proposed exploits the Riemann geometry of the parameter space of statistical models and thus automatically adapts to the local structure when simulating paths across this manifold, providing highly efficient convergence and exploration of the target density. The performance of these Riemann manifold Monte Carlo methods is rigorously assessed by performing inference on logistic regression models, log-Gaussian Cox point processes, stochastic volatility models and Bayesian estimation of dynamic systems described by non-linear differential equations. Substantial improvements in the time-normalized effective sample size are reported when compared with alternative sampling approaches. MATLAB code that is available from http://www.ucl.ac.uk/statistics/research/rmhmc allows replication of all the results reported.

1,031 citations
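A faithful Riemann-manifold implementation needs the metric tensor and its derivatives, so as a simplified stand-in the sketch below shows a Metropolis-adjusted Langevin step with a fixed preconditioning matrix M_inv (for example an estimate of the posterior covariance); in the manifold methods above, the metric would instead vary with position. Names are illustrative assumptions.

import numpy as np

rng = np.random.default_rng(5)

def preconditioned_mala_step(theta, log_post, grad_log_post, M_inv, step_size=0.2):
    # Metropolis-adjusted Langevin step with a fixed preconditioning matrix M_inv.
    # The proposal is N(theta + 0.5*eps*M_inv*grad, eps*M_inv), followed by a
    # Metropolis-Hastings correction with the asymmetric proposal densities.
    cov = step_size * M_inv
    cov_inv = np.linalg.inv(cov)
    chol = np.linalg.cholesky(cov)
    mean_fwd = theta + 0.5 * step_size * M_inv @ grad_log_post(theta)
    prop = mean_fwd + chol @ rng.normal(size=theta.shape)
    mean_rev = prop + 0.5 * step_size * M_inv @ grad_log_post(prop)
    log_q = lambda x, mean: -0.5 * (x - mean) @ cov_inv @ (x - mean)
    log_alpha = (log_post(prop) + log_q(theta, mean_rev)
                 - log_post(theta) - log_q(prop, mean_fwd))
    return prop if np.log(rng.uniform()) < log_alpha else theta

# Toy usage: correlated bivariate normal target, with M_inv set to its covariance
Sigma = np.array([[1.0, 0.9], [0.9, 1.0]])
Sigma_inv = np.linalg.inv(Sigma)
log_post = lambda th: -0.5 * th @ Sigma_inv @ th
grad_log_post = lambda th: -Sigma_inv @ th
theta = np.zeros(2)
for _ in range(2000):
    theta = preconditioned_mala_step(theta, log_post, grad_log_post, Sigma)
print(theta)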

Journal ArticleDOI
23 Apr 2014 - Chance
TL;DR: Cressie and Wikle, as mentioned in this paper, present a reference book about spatial and spatio-temporal statistical modeling.
Abstract: Noel Cressie and Christopher Wikle. Hardcover: 624 pages. Year: 2011. Publisher: John Wiley. ISBN-13: 978-0471692744. Here is the new reference book about spatial and spatio-temporal statistical modeling! No...

680 citations

Book ChapterDOI
01 Jan 2006
TL;DR: The incidence of skin cancer is increasing and nurses are in an ideal position to help patients prevent and identify the disease at an early stage.
Abstract: The incidence of skin cancer is increasing and nurses are in an ideal position to help patients prevent and identify the disease at an early stage.

363 citations

Journal ArticleDOI
TL;DR: The difficulties of modelling and then handling ever more complex datasets most likely call for a new type of tool for computational inference that dramatically reduces the dimension and size of the raw data while capturing its essential aspects.
Abstract: Recent decades have seen enormous improvements in computational inference for statistical models; there have been competitive continual enhancements in a wide range of computational tools. In Bayesian inference, first and foremost, MCMC techniques have continued to evolve, moving from random walk proposals to Langevin drift, to Hamiltonian Monte Carlo, and so on, with both theoretical and algorithmic innovations opening new opportunities to practitioners. However, this impressive evolution in capacity is confronted by an even steeper increase in the complexity of the datasets to be addressed. The difficulties of modelling and then handling ever more complex datasets most likely call for a new type of tool for computational inference that dramatically reduces the dimension and size of the raw data while capturing its essential aspects. Approximate models and algorithms may thus be at the core of the next computational revolution.

202 citations