Analyzing Hogwild Parallel Gaussian Gibbs Sampling
Citations
155 citations
Cites methods from "Analyzing Hogwild Parallel Gaussian..."
...Fast asynchronous variants of other algorithms have also been proposed, such as coordinate descent [27, 28] and Gibbs sampling [11, 19]....
[...]
139 citations
85 citations
Cites background or methods from "Analyzing Hogwild Parallel Gaussian..."
...Both papers [263, 179] address the question of how often these machines need to exchange boundary variables in order to ensure the distributed Gibbs process to converge properly....
[...]
...H-precision matrices were first considered in [263, 179] when studying a parallel implementation of the Gibbs process, known as Hogwild Gaussian Gibbs sampling....
[...]
79 citations
76 citations
Cites methods from "Analyzing Hogwild Parallel Gaussian..."
...The work [158] develops various variable partitioning strategies to achieve fast parallelization while maintaining the convergence to the target posterior and the work [159] analyzes the convergence and correctness of the asynchronous Gibbs sampler (a....
[...]
References
6,572 citations
4,335 citations
"Analyzing Hogwild Parallel Gaussian..." refers background in this paper
...In many problems [12] one has access to the pair (J, h) and must compute or estimate the moment parameters μ and Σ (or just the diagonal) or generate samples from N (μ,Σ)....
[...]
1,939 citations
1,413 citations
"Analyzing Hogwild Parallel Gaussian..." refers background or methods in this paper
...[1] provides both a motivation for Hogwild Gibbs sampling as well as the Hogwild name....
[...]
...We refer to this strategy as “Hogwild Gibbs sampling” in reference to recent work [1] in which sequential computations for computing gradient steps were applied in parallel (without global coordination) to great beneficial effect....
[...]
438 citations
"Analyzing Hogwild Parallel Gaussian..." refers methods in this paper
...The AD-LDA sampling algorithm is an instance of the strategy we have named Hogwild Gibbs, and Bekkerman et al. [5, Chapter 11] suggests applying the strategy to other latent variable models....
[...]
...The algorithms are supported by the standard Gibbs sampling analysis, and the authors point out that while heuristic parallel samplers such as the AD-LDA sampler offer easier implementation and often greater parallelism, they are currently not supported by much theoretical analysis....
[...]
...This Hogwild Gibbs sampling strategy has long been considered a useful hack, perhaps for preparing decent initial states for a proper serial Gibbs sampler, but extensive empirical work on Approximate Distributed Latent Dirichlet Allocation (AD-LDA) [2, 3, 4, 5, 6], which applies the strategy to generate samples from a collapsed LDA model, has demonstrated its effectiveness in sampling LDA models with the same predictive performance as those generated by standard serial Gibbs [2, Figure 3]....
[...]
...There have been recent advances in understanding some of the particular structure of AD-LDA [6], but a thorough theoretical explanation for the effectiveness and limitations of Hogwild Gibbs sampling is far from complete....
[...]
...The work of Ihler et al. [6] provides some understanding of the effectiveness of a variant of AD-LDA by bounding in terms of run-time quantities the one-step error probability induced by proceeding with sampling steps in parallel, thereby allowing an AD-LDA user to inspect the computed error bound after inference [6, Section 4.2]....
[...]