Topic

Maximum a posteriori estimation

About: Maximum a posteriori estimation is a research topic. Over the lifetime, 7486 publications have been published within this topic receiving 222291 citations. The topic is also known as: Maximum a posteriori, MAP & maximum a posteriori probability.

...read moreread less

Papers published on a yearly basis

1 / 2

Papers

PDF

Open Access

More filters

Proceedings Article•

V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control

[...]

H. Francis Song¹, Abbas Abdolmaleki¹, Jost Tobias Springenberg¹, Aidan Clark¹, Hubert Soyer², Jack W. Rae¹, Seb Noury³, Arun Ahuja¹, Siqi Liu¹, Dhruva Tirumala¹, Nicolas Heess¹, Dan Belov¹, Martin Riedmiller¹, Matthew Botvinick⁴ - Show less +10 more•Institutions (4)

Google¹, National Institute of Informatics², Palantir Technologies³, University College London⁴

30 Apr 2020

TL;DR: In this paper, an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO) that performs policy iteration based on a learned state-value function is proposed. But this method suffers from large variance that may limit performance and in practice requires carefully tuned entropy regularization to prevent policy collapse.

...read moreread less

Abstract: Some of the most successful applications of deep reinforcement learning to challenging domains in discrete and continuous control have used policy gradient methods in the on-policy setting. However, policy gradients can suffer from large variance that may limit performance, and in practice require carefully tuned entropy regularization to prevent policy collapse. As an alternative to policy gradient algorithms, we introduce V-MPO, an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO) that performs policy iteration based on a learned state-value function. We show that V-MPO surpasses previously reported scores for both the Atari-57 and DMLab-30 benchmark suites in the multi-task setting, and does so reliably without importance weighting, entropy regularization, or population-based tuning of hyperparameters. On individual DMLab and Atari levels, the proposed algorithm can achieve scores that are substantially higher than has previously been reported. V-MPO is also applicable to problems with high-dimensional, continuous action spaces, which we demonstrate in the context of learning to control simulated humanoids with 22 degrees of freedom from full state observations and 56 degrees of freedom from pixel observations, as well as example OpenAI Gym tasks where V-MPO achieves substantially higher asymptotic scores than previously reported.

...read moreread less

72 citations

Fast Bayesian Matching Pursuit: Model Uncertainty and Parameter Estimation for Sparse Linear Models

[...]

Philip Schniter, Justin Ziniel

01 Jan 2009

TL;DR: A low-complexity recursive procedure is presented for model selection and minimum mean squared error (MMSE) estimation in linear regression and returns both a set of high posterior probability models and an approximate MMSE estimate of the parameter vector.

...read moreread less

Abstract: A low-complexity recursive procedure is presented for model selection and minimum mean squared error (MMSE) estimation in linear regression. Emphasis is given to the case of a sparse parameter vector and fewer observations than unknown parameters. A Gaussian mixture is chosen as the prior on the unknown parameter vector. The algorithm returns both a set of high posterior probability models and an approximate MMSE estimate of the parameter vector. Exact ratios of posterior probabilities serve to reveal potential ambiguity among multiple candidate solutions that are ambiguous due to observation noise or correlation among columns in the regressor matrix. Algorithm complexity is O(MNK), with M observations, N coefficients, and K nonzero coefficients. For the case that hyperparameters are unknown, an approximate maximum likelihood estimator is proposed based on the generalized expectation-maximization algorithm. Numerical simulations demonstrate estimation performance and illustrate the distinctions between MMSE estimation and maximum a posteriori probability model selection.

...read moreread less

72 citations

Journal Article•DOI•

Block-sparse beamforming for spatially extended sources in a Bayesian formulation

[...]

Angeliki Xenaki¹, Efren Fernandez-Grande¹, Peter Gerstoft²•Institutions (2)

Technical University of Denmark¹, University of California, San Diego²

20 Sep 2016-Journal of the Acoustical Society of America

TL;DR: Simulations and experimental measurements show that a composite prior is introduced, which simultaneously promotes a piecewise constant profile and sparsity in the solution, provides high-resolution DOA estimation in a general framework, i.e., in the presence of spatially extended sources.

...read moreread less

Abstract: Direction-of-arrival (DOA) estimation refers to the localization of sound sources on an angular grid from noisy measurements of the associated wavefield with an array of sensors. For accurate localization, the number of angular look-directions is much larger than the number of sensors, hence, the problem is underdetermined and requires regularization. Traditional methods use an l2-norm regularizer, which promotes minimum-power (smooth) solutions, while regularizing with l1-norm promotes sparsity. Sparse signal reconstruction improves the resolution in DOA estimation in the presence of a few point sources, but cannot capture spatially extended sources. The DOA estimation problem is formulated in a Bayesian framework where regularization is imposed through prior information on the source spatial distribution which is then reconstructed as the maximum a posteriori estimate. A composite prior is introduced, which simultaneously promotes a piecewise constant profile and sparsity in the solution. Simulations and experimental measurements show that this choice of regularization provides high-resolution DOA estimation in a general framework, i.e., in the presence of spatially extended sources.

...read moreread less

72 citations

Proceedings Article•

Using Combinatorial Optimization within Max-Product Belief Propagation

[...]

Daniel Tarlow¹, Gal Elidan¹, Daphne Koller¹, John C. Duchi¹•Institutions (1)

Stanford University¹

04 Dec 2006

TL;DR: This paper presents a new method, called COMPOSE, for exploiting combinatorial optimization for sub-networks within the context of a max-product belief propagation algorithm, and describes highly efficient methods for computing max-marginals for subnetworks corresponding both to bipartite matchings and to regular networks.

...read moreread less

Abstract: In general, the problem of computing a maximum a posteriori (MAP) assignment in a Markov random field (MRF) is computationally intractable. However, in certain subclasses of MRF, an optimal or close-to-optimal assignment can be found very efficiently using combinatorial optimization algorithms: certain MRFs with mutual exclusion constraints can be solved using bipartite matching, and MRFs with regular potentials can be solved using minimum cut methods. However, these solutions do not apply to the many MRFs that contain such tractable components as sub-networks, but also other non-complying potentials. In this paper, we present a new method, called COMPOSE, for exploiting combinatorial optimization for sub-networks within the context of a max-product belief propagation algorithm. COMPOSE uses combinatorial optimization for computing exact max-marginals for an entire sub-network; these can then be used for inference in the context of the network as a whole. We describe highly efficient methods for computing max-marginals for subnetworks corresponding both to bipartite matchings and to regular networks. We present results on both synthetic and real networks encoding correspondence problems between images, which involve both matching constraints and pairwise geometric constraints. We compare to a range of current methods, showing that the ability of COMPOSE to transmit information globally across the network leads to improved convergence, decreased running time, and higher-scoring assignments.

...read moreread less

72 citations

Journal Article•DOI•

Efficient Huber-Markov Edge-Preserving Image Restoration

[...]

Ruimin Pan¹, Stanley J. Reeves¹•Institutions (1)

Auburn University¹

01 Dec 2006-IEEE Transactions on Image Processing

TL;DR: A decomposition-enabled edge-preserving image restoration algorithm for maximizing the likelihood function that exploits the sparsity of edges to define an FFT-based iteration that requires few iterations and is guaranteed to converge to the MAP estimate.

...read moreread less

Abstract: The regularization of the least-squares criterion is an effective approach in image restoration to reduce noise amplification. To avoid the smoothing of edges, edge-preserving regularization using a Gaussian Markov random field (GMRF) model is often used to allow realistic edge modeling and provide stable maximum a posteriori (MAP) solutions. However, this approach is computationally demanding because the introduction of a non-Gaussian image prior makes the restoration problem shift-variant. In this case, a direct solution using fast Fourier transforms (FFTs) is not possible, even when the blurring is shift-invariant. We consider a class of edge-preserving GMRF functions that are convex and have nonquadratic regions that impose less smoothing on edges. We propose a decomposition-enabled edge-preserving image restoration algorithm for maximizing the likelihood function. By decomposing the problem into two subproblems, with one shift-invariant and the other shift-variant, our algorithm exploits the sparsity of edges to define an FFT-based iteration that requires few iterations and is guaranteed to converge to the MAP estimate

...read moreread less

72 citations

Collapse

Network Information

Performance

Metrics

7,680

Papers

242,873

Citations

No. of papers in the topic in previous years
Year	Papers
2023	64
2022	125
2021	211
2020	244
2019	250
2018	236

Maximum a posteriori estimation

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics