Showing papers by "Thomas G. Dietterich published in 2011"

PDF

Open Access

Proceedings Article•

Learning probabilistic behavior models in real-time strategy games

[...]

Ethan W. Dereszynski¹, Jesse Hostetler¹, Alan Fern¹, Thomas G. Dietterich¹, Thao-Trang Hoang¹, Mark Udarbe¹ - Show less +2 more•Institutions (1)

Oregon State University¹

10 Oct 2011

TL;DR: The behavior model is based on the well-developed and generic paradigm of hidden Markov models, which supports a variety of uses for the design of AI players and human assistants and provides both a qualitative and quantitative assessment of the learned model's utility.

...read moreread less

Abstract: We study the problem of learning probabilistic models of high-level strategic behavior in the real-time strategy (RTS) game StarCraft. The models are automatically learned from sets of game logs and aim to capture the common strategic states and decision points that arise in those games. Unlike most work on behavior/strategy learning and prediction in RTS games, our data-centric approach is not biased by or limited to any set of preconceived strategic concepts. Further, since our behavior model is based on the well-developed and generic paradigm of hidden Markov models, it supports a variety of uses for the design of AI players and human assistants. For example, the learned models can be used to make probabilistic predictions of a player's future actions based on observations, to simulate possible future trajectories of a player, or to identify uncharacteristic or novel strategies in a game database. In addition, the learned qualitative structure of the model can be analyzed by humans in order to categorize common strategic elements. We demonstrate our approach by learning models from 331 expert-level games and provide both a qualitative and quantitative assessment of the learned model's utility.

...read moreread less

85 citations

Journal Article•DOI•

Spatiotemporal Models for Data-Anomaly Detection in Dynamic Environmental Monitoring Campaigns

[...]

Ethan W. Dereszynski¹, Thomas G. Dietterich¹•Institutions (1)

Oregon State University¹

01 Aug 2011-ACM Transactions on Sensor Networks

TL;DR: A method for real-time automated data quality control that exploits the spatial and temporal correlations in the data to distinguish sensor failures from valid observations and is able to flag faulty observations and predict the true values of the missing or corrupt readings.

...read moreread less

Abstract: The ecological sciences have benefited greatly from recent advances in wireless sensor technologies. These technologies allow researchers to deploy networks of automated sensors, which can monitor a landscape at very fine temporal and spatial scales. However, these networks are subject to harsh conditions, which lead to malfunctions in individual sensors and failures in network communications. The resulting data streams often exhibit incorrect data measurements and missing values. Identifying and correcting these is time-consuming and error-prone. We present a method for real-time automated data quality control (QC) that exploits the spatial and temporal correlations in the data to distinguish sensor failures from valid observations. The model adapts to each deployment site by learning a Bayesian network structure that captures spatial relationships between sensors, and it extends the structure to a dynamic Bayesian network to incorporate temporal correlations. This model is able to flag faulty observations and predict the true values of the missing or corrupt readings. The performance of the model is evaluated on data collected by the SensorScope Project. The results show that the spatiotemporal model demonstrates clear advantages over models that include only temporal or only spatial correlations, and that the model is capable of accurately imputing corrupted values.

...read moreread less

76 citations

Proceedings Article•

Collective Graphical Models

[...]

Daniel Sheldon¹, Thomas G. Dietterich¹•Institutions (1)

Oregon State University¹

12 Dec 2011

TL;DR: A highly-efficient Gibbs sampling algorithm for sampling from the posterior distribution of the sufficient statistics conditioned on noisy aggregate observations is derived and proved to be correctness and effectiveness experimentally.

...read moreread less

Abstract: There are many settings in which we wish to fit a model of the behavior of individuals but where our data consist only of aggregate information (counts or low-dimensional contingency tables). This paper introduces Collective Graphical Models—a framework for modeling and probabilistic inference that operates directly on the sufficient statistics of the individual model. We derive a highly-efficient Gibbs sampling algorithm for sampling from the posterior distribution of the sufficient statistics conditioned on noisy aggregate observations, prove its correctness, and demonstrate its effectiveness experimentally.

...read moreread less

61 citations

Proceedings Article•

Incorporating boosted regression trees into ecological latent variable models

[...]

Rebecca A. Hutchinson¹, Li-Ping Liu¹, Thomas G. Dietterich¹•Institutions (1)

Oregon State University¹

07 Aug 2011

TL;DR: A methodology for integrating non-parametric tree methods into probabilistic latent variable models by extending functional gradient boosting is presented in the context of occupancy-detection modeling, where the goal is to model the distribution of a species from imperfect detections.

...read moreread less

Abstract: Important ecological phenomena are often observed indirectly. Consequently, probabilistic latent variable models provide an important tool, because they can include explicit models of the ecological phenomenon of interest and the process by which it is observed. However, existing latent variable methods rely on hand-formulated parametric models, which are expensive to design and require extensive preprocessing of the data. Nonparametric methods (such as regression trees) automate these decisions and produce highly accurate models. However, existing tree methods learn direct mappings from inputs to outputs—they cannot be applied to latent variable models. This paper describes a methodology for integrating non-parametric tree methods into probabilistic latent variable models by extending functional gradient boosting. The approach is presented in the context of occupancy-detection (OD) modeling, where the goal is to model the distribution of a species from imperfect detections. Experiments on 12 real and 3 synthetic bird species compare standard and tree-boosted OD models (latent variable models) with standard and tree-boosted logistic regression models (without latent structure). All methods perform similarly when predicting the observed variables, but the OD models learn better representations of the latent process. Most importantly, tree-boosted OD models learn the best latent representations when non-linearities and interactions are present.

...read moreread less

51 citations

Proceedings Article•DOI•

Stacked spatial-pyramid kernel: An object-class recognition method to combine scores from random trees

[...]

Natalia Larios¹, Lin Junyuan², Mabel M. Zhang³, David A. Lytle², Andrew R. Moldenke², Linda G. Shapiro¹, Thomas G. Dietterich² - Show less +3 more•Institutions (3)

University of Washington¹, Oregon State University², University of California, San Diego³

05 Jan 2011

TL;DR: The object-class recognition method proposed in this paper combines different feature types in a new stacking framework that efficiently quantizes input data and boosts classification accuracy, while allowing the use of spatial information.

...read moreread less

Abstract: The combination of local features, complementary feature types, and relative position information has been successfully applied to many object-class recognition tasks. Stacking is a common classification approach that combines the results from multiple classifiers, having the added benefit of allowing each classifier to handle a different feature space. However, the standard stacking method by its own nature discards any spatial information contained in the features, because only the combination of raw classification scores are input to the final classifier. The object-class recognition method proposed in this paper combines different feature types in a new stacking framework that efficiently quantizes input data and boosts classification accuracy, while allowing the use of spatial information. This classification method is applied to the task of automated insect-species identification for biomonitoring purposes. The test data set for this work contains 4722 images with 29 insect species, belonging to the three most common orders used to measure stream water quality, several of which are closely related and very difficult to distinguish. The specimens are in different 3D positions, different orientations, and different developmental and degradation stages with wide intra-class variation. On this very challenging data set, our new algorithm outperforms other classifiers, showing the benefits of using spatial information in the stacking framework with multiple dissimilar feature types.

...read moreread less

26 citations

Proceedings Article•

Inverting Grice's Maxims to Learn Rules from Natural Language Extractions

[...]

Mohammad S. Sorower¹, Janardhan Rao Doppa¹, Walker Orr¹, Prasad Tadepalli¹, Thomas G. Dietterich¹, Xiaoli Z. Fern¹ - Show less +2 more•Institutions (1)

Oregon State University¹

12 Dec 2011

TL;DR: A mention model is introduced that models the probability of facts being mentioned in the text based on what other facts have already been mentioned and domain knowledge in the form of Horn clause rules and must simultaneously search the space of rules and learn the parameters of the mention model.

...read moreread less

Abstract: We consider the problem of learning rules from natural language text sources. These sources, such as news articles and web texts, are created by a writer to communicate information to a reader, where the writer and reader share substantial domain knowledge. Consequently, the texts tend to be concise and mention the minimum information necessary for the reader to draw the correct conclusions. We study the problem of learning domain knowledge from such concise texts, which is an instance of the general problem of learning in the presence of missing data. However, unlike standard approaches to missing data, in this setting we know that facts are more likely to be missing from the text in cases where the reader can infer them from the facts that are mentioned combined with the domain knowledge. Hence, we can explicitly model this "missingness" process and invert it via probabilistic inference to learn the underlying domain knowledge. This paper introduces a mention model that models the probability of facts being mentioned in the text based on what other facts have already been mentioned and domain knowledge in the form of Horn clause rules. Learning must simultaneously search the space of rules and learn the parameters of the mention model. We accomplish this via an application of Expectation Maximization within a Markov Logic framework. An experimental evaluation on synthetic and natural text data shows that the method can learn accurate rules and apply them to new texts to make correct inferences. Experiments also show that the method out-performs the standard EM approach that assumes mentions are missing at random.

...read moreread less

21 citations

Journal Article•DOI•

Automatic Discovery and Transfer of Task Hierarchies in Reinforcement Learning

[...]

Neville Mehta¹, Soumya Ray², Prasad Tadepalli¹, Thomas G. Dietterich¹•Institutions (2)

Oregon State University¹, Case Western Reserve University²

16 Mar 2011-Ai Magazine

TL;DR: It is demonstrated that causally motivated task hierarchies transfer more robustly than other kinds of detailed knowledge that depend on the idiosyncrasies of the source domain and are hence less transferable.

...read moreread less

Abstract: Sequential decision tasks present many opportunities for the study of transfer learning. A principal one among them is the existence of multiple domains that share the same underlying causal structure for actions. We describe an approach that exploits this shared causal structure to discover a hierarchical task structure in a source domain, which in turn speeds up learning of task execution knowledge in a new target domain. Our approach is theoretically justiﬁed and compares favorably to manually designed task hierarchies in learning efﬁciency in the target domain. We demonstrate that causally motivated task hierarchies transfer more robustly than other kinds of detailed knowledge that depend on the idiosyncrasies of the source domain and are hence less transferable.

...read moreread less

14 citations

Journal Article•DOI•

FolderPredictor: Reducing the cost of reaching the right folder

[...]

Xinlong Bao¹, Thomas G. Dietterich¹•Institutions (1)

Oregon State University¹

24 Jan 2011-ACM Transactions on Intelligent Systems and Technology

TL;DR: F FolderPredictor applies a cost-sensitive prediction algorithm to the user's previous file access information to predict the next folder that will be accessed, which reduces the number of clicks spent on locating a file by 50%.

...read moreread less

Abstract: Helping computer users rapidly locate files in their folder hierarchies is a practical research problem involving both intelligent systems and user interface design. This article reports on FolderPredictor, a software system that can reduce the cost of locating files in hierarchical folders. FolderPredictor applies a cost-sensitive prediction algorithm to the user's previous file access information to predict the next folder that will be accessed. Experimental results show that, on average, FolderPredictor reduces the number of clicks spent on locating a file by 50p. Several variations of the cost-sensitive prediction algorithm are discussed. An experimental study shows that the best algorithm among them is a mixture of the most recently used (MRU) folder and the cost-sensitive predictions. Furthermore, FolderPredictor does not require users to adapt to a new interface, but rather meshes with the existing interface for opening files on the Windows platform.

...read moreread less

7 citations

Proceedings Article•

Learning Rules from Incomplete Examples via Implicit Mention Models

[...]

Janardhan Rao Doppa¹, Mohammad S. Sorower¹, Mohammad NasrEsfahani¹, Jed Irvine¹, Walker Orr¹, Thomas G. Dietterich¹, Xiaoli Z. Fern¹, Prasad Tadepalli¹ - Show less +4 more•Institutions (1)

Oregon State University¹

17 Nov 2011

TL;DR: This paper proposes an approach that combines simultaneous learning of multiple predictive rules with differential scoring of evidence which adapts to a presumed model of data generation and explains the empirical results.

...read moreread less

Abstract: We study the problem of learning general rules from concrete facts extracted from natural data sources such as the newspaper stories and medical histories. Natural data sources present two challenges to automated learning, namely, radical incompleteness and systematic bias. In this paper, we propose an approach that combines simultaneous learning of multiple predictive rules with differential scoring of evidence which adapts to a presumed model of data generation. Learning multiple predicates simultaneously mitigates the problem of radical incompleteness, while the differential scoring would help reduce the effects of systematic bias. We evaluate our approach empirically on both textual and non-textual sources. We further present a theoretical analysis that elucidates our approach and explains the empirical results.

...read moreread less

6 citations

Journal Article•DOI•

Design of an Automated System for Imaging and Sorting Soil Mesofauna

[...]

Michael A. Chamblin¹, Robert Paasch², David A. Lytle, Andrew R. Moldenke², Linda G. Shapiro³, Thomas G. Dietterich² - Show less +2 more•Institutions (3)

Nike¹, Oregon State University², University of Washington³

01 Jan 2011

TL;DR: In this paper, a mechanical system was designed and developed to automate the process of identifying and population counts of soil mesofauna, which is very time consuming because of the large diversity and quantities of specimens in soil samples.

...read moreread less

Abstract: Identification and population counts of soil mesofauna can be an important tool for soil ecologists to determine soil biodiversity. The process of performing population counts, which includes classifying and sorting specimens, is very time consuming because of the large diversity and quantities of specimens in soil samples. A mechanical system was designed and developed to automate this process. The system featured automated imaging, positioning, and sorting. Images acquired with system were used by pattern recognition software to classify individual specimens from a soil sample. Positioning was a required function of the system for automation and was accomplished using a pair of linear stages mounted orthogonally to each other. The system also featured a six-axis robot attached with a pipette end effector for sorting classified specimens, a required function for validation. The final system was successful in creating high-quality images of 19 species of soil mesofauna and sorting them.

...read moreread less

1 citations

Journal Article•DOI•

Integrating Learning from Examples into the Search for Diagnostic Policies

[...]

Valentina Bayer-Zubek¹, Thomas G. Dietterich¹•Institutions (1)

Oregon State University¹

09 Sep 2011-arXiv: Artificial Intelligence

TL;DR: In this article, a new family of systematic search algorithms based on the AO* algorithm is proposed to solve the problem of learning diagnostic policies from training examples, which is a complete description of the decision-making actions of a diagnostician.

...read moreread less

Abstract: This paper studies the problem of learning diagnostic policies from training examples. A diagnostic policy is a complete description of the decision-making actions of a diagnostician (i.e., tests followed by a diagnostic decision) for all possible combinations of test results. An optimal diagnostic policy is one that minimizes the expected total cost, which is the sum of measurement costs and misdiagnosis costs. In most diagnostic settings, there is a tradeoff between these two kinds of costs. This paper formalizes diagnostic decision making as a Markov Decision Process (MDP). The paper introduces a new family of systematic search algorithms based on the AO* algorithm to solve this MDP. To make AO* efficient, the paper describes an admissible heuristic that enables AO* to prune large parts of the search space. The paper also introduces several greedy algorithms including some improvements over previously-published methods. The paper then addresses the question of learning diagnostic policies from examples. When the probabilities of diseases and test results are computed from training data, there is a great danger of overfitting. To reduce overfitting, regularizers are integrated into the search algorithms. Finally, the paper compares the proposed methods on five benchmark diagnostic data sets. The studies show that in most cases the systematic search methods produce better diagnostic policies than the greedy methods. In addition, the studies show that for training sets of realistic size, the systematic search algorithms are practical on todays desktop computers.

...read moreread less