scispace - formally typeset
M

Michael L. Littman

Researcher at Brown University

Publications -  336
Citations -  46236

Michael L. Littman is an academic researcher from Brown University. The author has contributed to research in topics: Reinforcement learning & Markov decision process. The author has an hindex of 78, co-authored 323 publications receiving 41859 citations. Previous affiliations of Michael L. Littman include Telcordia Technologies & AT&T.

Papers
More filters
Proceedings ArticleDOI

Knows what it knows: a framework for self-aware learning

TL;DR: The KWIK (knows what it knows) framework was designed particularly for its utility in learning settings where active exploration can impact the training examples the learner is exposed to, as is true in reinforcement-learning and active-learning problems.
Proceedings Article

Integrating sample-based planning and model-based reinforcement learning

TL;DR: This work defines the first complete RL solution for compactly represented (exponentially sized) state spaces with efficiently learnable dynamics that is both sample efficient and whose computation time does not grow rapidly with the number of states.
Proceedings Article

Exact Solutions to Time-Dependent MDPs

TL;DR: An extension of the Markov decision process model in which a continuous time dimension is included in the state space is described, which allows for the representation and exact solution of a wide range of problems in which transitions or rewards vary over time.
Journal ArticleDOI

Contingent planning under uncertainty via stochastic satisfiability

TL;DR: Two new probabilistic planning techniques are described-- c-MAXPLAN and ZANDER--that generate contingent plans in Probabilistic propositional domains that operate by transforming the planning problem into a stochastic satisfiability problem and solving that problem instead.
Journal Article

Coordinate to cooperate or compete: Abstract goals and joint intentions in social interaction

TL;DR: This work presents a meta-modelling framework for estimating goals and joint intentions in social interaction that combines explicit and implicit goals, as well as implications for future research in this area.