Search or ask a question

Showing papers by "Michael I. Jordan published in 2000"

PDF

Open Access

Journal Article•DOI•

Bayesian parameter estimation via variational methods

[...]

Tommi S. Jaakkola¹, Michael I. Jordan²•Institutions (2)

Massachusetts Institute of Technology¹, University of California, Berkeley²

01 Jan 2000-Statistics and Computing

TL;DR: It is shown that an accurate variational transformation can be used to obtain a closed form approximation to the posterior distribution of the parameters thereby yielding an approximate posterior predictive model.

...read moreread less

Abstract: We consider a logistic regression model with a Gaussian prior distribution over the parameters. We show that an accurate variational transformation can be used to obtain a closed form approximation to the posterior distribution of the parameters thereby yielding an approximate posterior predictive model. This approach is readily extended to binary graphical model with complete observations. For graphical models with incomplete observations we utilize an additional variational transformation and again obtain a closed form approximation to the posterior. Finally, we show that the dual of the regression problem gives a latent variable density model, the variational formulation of which leads to exactly solvable EM updates.

...read moreread less

632 citations

Proceedings Article•

PEGASUS: A policy search method for large MDPs and POMDPs

[...]

Andrew Y. Ng¹, Michael I. Jordan¹•Institutions (1)

University of California, Berkeley¹

30 Jun 2000

TL;DR: This work proposes a new approach to the problem of searching a space of policies for a Markov decision process (MDP) or a partially observable Markov decisions process (POMDP), given a model, based on the following observation: Any (PO)MDP can be transformed into an "equivalent" POMDP in which all state transitions are deterministic.

...read moreread less

Abstract: We propose a new approach to the problem of searching a space of policies for a Markov decision process (MDP) or a partially observable Markov decision process (POMDP), given a model. Our approach is based on the following observation: Any (PO)MDP can be transformed into an "equivalent" POMDP in which all state transitions (given the current state and action) are deterministic. This reduces the general problem of policy search to one in which we need only consider POMDPs with deterministic transitions. We give a natural way of estimating the value of all policies in these transformed POMDPs. Policy search is then simply performed by searching for a policy with high estimated value. We also establish conditions under which our value estimates will be good, recovering theoretical results similar to those of Kearns, Mansour and Ng [7], but with "sample complexity" bounds that have only a polynomial rather than exponential dependence on the horizon time. Our method applies to arbitrary POMDPs, including ones with infinite state and action spaces. We also present empirical results for our approach on a small discrete problem, and on a complex continuous state/continuous action problem involving learning to ride a bicycle.

...read moreread less

397 citations

Journal Article•DOI•

Asymptotic Convergence Rate of the EM Algorithm for Gaussian Mixtures

[...]

Jinwen Ma¹, Lei Xu¹, Michael I. Jordan²•Institutions (2)

The Chinese University of Hong Kong¹, University of California, Berkeley²

01 Dec 2000-Neural Computation

TL;DR: It is proved that the asymptotic convergence rate of the EM algorithm for gaussian mixtures locally around the true solution is o(e0.5()), where > 0 is an arbitrarily small number and e() is a measure of the average overlap of gaussians in the mixture.

...read moreread less

Abstract: It is well known that the convergence rate of the expectation-maximization (EM) algorithm can be faster than those of convention first-order iterative algorithms when the overlap in the given mixture is small. But this argument has not been mathematically proved yet. This article studies this problem asymptotically in the setting of gaussian mixtures under the theoretical framework of Xu and Jordan (1996). It has been proved that the asymptotic convergence rate of the EM algorithm for gaussian mixtures locally around the true solution Θ* is o(e0.5-e(Θ*)), where e > 0 is an arbitrarily small number, o(x) means that it is a higher-order infinitesimal as x → 0, and e(Θ*) is a measure of the average overlap of gaussians in the mixture. In other words, the large sample local convergence rate for the EM algorithm tends to be asymptotically superlinear when e(Θ*) tends to zero.

...read moreread less

109 citations

Journal Article•DOI•

Attractor Dynamics in Feedforward Neural Networks

[...]

Lawrence K. Saul¹, Michael I. Jordan²•Institutions (2)

AT&T Labs¹, University of California, Berkeley²

01 Jun 2000-Neural Computation

TL;DR: This work establishes global convergence of the dynamics by providing a Lyapunov function and shows that the dynamics generate the signals required for unsupervised learning.

...read moreread less

Abstract: We study the probabilistic generative models parameterized by feedforward neural networks. An attractor dynamics for probabilistic inference in these models is derived from a mean field approximation for large, layered sigmoidal networks. Fixed points of the dynamics correspond to solutions of the mean field equations, which relate the statistics of each unit to those of its Markov blanket. We establish global convergence of the dynamics by providing a Lyapunov function and show that the dynamics generate the signals required for unsupervised learning. Our results for feedforward networks provide a counterpart to those of Cohen-Grossberg and Hopfield for symmetric networks.

...read moreread less

24 citations