M
Michael L. Littman
Researcher at Brown University
Publications - 336
Citations - 46236
Michael L. Littman is an academic researcher from Brown University. The author has contributed to research in topics: Reinforcement learning & Markov decision process. The author has an hindex of 78, co-authored 323 publications receiving 41859 citations. Previous affiliations of Michael L. Littman include Telcordia Technologies & AT&T.
Papers
More filters
Proceedings ArticleDOI
An object-oriented representation for efficient reinforcement learning
TL;DR: Object-Oriented MDPs (OO-MDPs) are introduced, a representation based on objects and their interactions, which is a natural way of modeling environments and offers important generalization opportunities and a polynomial bound on its sample complexity is proved.
Journal ArticleDOI
Reinforcement Learning in Finite MDPs: PAC Analysis
TL;DR: The current state-of-the-art for near-optimal behavior in finite Markov Decision Processes with a polynomial number of samples is summarized by presenting bounds for the problem in a unified theoretical framework.
Journal ArticleDOI
Reinforcement learning improves behaviour from evaluative feedback.
TL;DR: Advances in fundamental technical areas such as generalization, planning, exploration and empirical methodology, leading to increasing applicability to real-life problems are seen, partly driven by the increasing availability of rich data.
Proceedings Article
A Generalized Reinforcement-Learning Model: Convergence and Applications
TL;DR: This paper shows how many of the important theoretical results concerning reinforcement learning in MDPs extend to a generalized MDP model that includes M DPs, two-player games and MDP’s under a worst-case optimality criterion as special cases.
Automatic Cross-Language Retrieval Using Latent Semantic Indexing
TL;DR: A method for fully automated cross-language document retrieval in which no query translation is required and this automatic method performs comparably to a retrieval method based on machine translation (MT-LSI).