T
Thomas G. Dietterich
Researcher at Oregon State University
Publications - 286
Citations - 58937
Thomas G. Dietterich is an academic researcher from Oregon State University. The author has contributed to research in topics: Reinforcement learning & Markov decision process. The author has an hindex of 74, co-authored 279 publications receiving 51935 citations. Previous affiliations of Thomas G. Dietterich include University of Wyoming & Stanford University.
Papers
More filters
Book ChapterDOI
Machine Learning and Ecosystem Informatics: Challenges and Opportunities
TL;DR: The ways in which machine learning--in combination with novel sensors--can help transform the ecosystem sciences from small-scale hypothesis- driven science to global-scale data-driven science are discussed.
Two heuristics for solving POMDPs having a delayed need to observe
TL;DR: The chain MDP algorithm is described which in many cases is able to capture more of the sensing costs than the even odd POMDP approximation and both heuristics compute value functions that are upper bounded by i e bet ter than the value function of the underlying MDP and in the case of the even MDP also lower bounded by the POM DP s optimal value function.
Book ChapterDOI
Support Vectors for Reinforcement Learning
Thomas G. Dietterich,Xin Wang +1 more
TL;DR: Three ways of combining linear programming with kernel methods to find value function approximations for reinforcement learning are presented and the third seeks only to ensure that good actions have an advantage over bad actions.
Posted Content
Transductive Optimization of Top k Precision
TL;DR: Transductive Top K (TTK) as discussed by the authors minimizes the hinge loss over all training instances under the constraint that exactly $k$ test instances are predicted as positive, which is similar to our approach.
Journal ArticleDOI
Evaluating wildland fire liability standards - does regulation incentivise good management?
TL;DR: In this article, the effects of two different types of liability regulations are examined, strict liability and negligence standards, in a model of land manager decision-making about the timing and spatial location of timber harvest and fuel treatment.