Michael I. Jordan

Researcher at University of California, Berkeley

Publications - 1110

Citations - 241763

Michael I. Jordan is an academic researcher from University of California, Berkeley. The author has contributed to research in topics: Computer science & Inference. The author has an hindex of 176, co-authored 1016 publications receiving 216204 citations. Previous affiliations of Michael I. Jordan include Stanford University & Princeton University.

Papers

PDF

Open Access

More filters

Proceedings Article

Learning from eXtreme Bandit Feedback

Romain Lopez, +2 more

TL;DR: In this paper, a selective importance sampling estimator (sIS) is proposed for batch learning from extreme bandit feedback in the setting of extremely large action spaces, where the selected actions for the sIS estimator are the top-p actions of the logging policy, where p is adjusted from the data and is significantly smaller than the size of the action space.

...read moreread less

Proceedings Article

Fast Algorithms for Computational Optimal Transport and Wasserstein Barycenter.

Wenshuo Guo, +2 more

TL;DR: In this paper, the authors provide theoretical complexity analysis for new algorithms to compute the optimal transport distance between two discrete probability distributions, and demonstrate their favorable practical performance over state-of-the-art primal-dual algorithms and their capability in solving other problems in large-scale, such as the Wasserstein barycenter problem for multiple probability distributions.

...read moreread less

Proceedings Article

Uncertainty Sets for Image Classifiers using Conformal Prediction

Anastasios Angelopoulos, +3 more

TL;DR: In this article, the authors present an algorithm that modifies any classifier to output a predictive set containing the true label with a user-specified probability, such as 90%, which provides a formal finite sample coverage guarantee for every model and dataset.

...read moreread less

Proceedings ArticleDOI

A General Framework for Sample-Efficient Function Approximation in Reinforcement Learning

Zixiang Chen, +4 more

TL;DR: A general framework that unifies model-based and model-free RL, and an Admissible Bellman Characterization (ABC) class that subsumes nearly all Markov Decision Process (MDP) models in the literature for tractable RL is proposed, which provides a generic interface to design and analyze new RL models and algorithms.

...read moreread less

Posted Content

Beta processes, stick-breaking, and power laws

Tamara Broderick, +2 more

- 03 Jun 2011 -

arXiv: Methodology

TL;DR: A posterior inference algorithm is presented for the beta-Bernoulli process that exploits the stickbreaking representation, and experimental results for a discrete factor-analysis model are presented.

...read moreread less

Collapse