
Showing papers by "Thomas G. Dietterich published in 2014"



Proceedings Article
21 Jun 2014
TL;DR: This paper analyzes Empirical Risk Minimizing (ERM) learners that use the superset error as the empirical risk measure and gives conditions for ERM learnability and sample complexity in the realizable case.
Abstract: In the Superset Label Learning (SLL) problem, weak supervision is provided in the form of a superset of labels that contains the true label. If the classifier predicts a label outside of the superset, it commits a superset error. Most existing SLL algorithms learn a multiclass classifier by minimizing the superset error. However, only limited theoretical analysis has been dedicated to this approach. In this paper, we analyze Empirical Risk Minimizing learners that use the superset error as the empirical risk measure. SLL data can arise either in the form of independent instances or as multiple-instance bags. For both scenarios, we give the conditions for ERM learnability and sample complexity for the realizable case.
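As a concrete illustration of the learning rule analyzed here, the following is a minimal Python sketch of ERM under the superset error; the toy dataset, threshold hypothesis class, and helper names are illustrative assumptions, not from the paper.

```python
# Minimal sketch of Empirical Risk Minimization with the superset error:
# a prediction counts as an error only if it falls outside the provided
# superset of candidate labels. Data and hypotheses are toy examples.

def superset_error(h, instances, label_supersets):
    """Empirical superset risk: fraction of instances whose predicted
    label lies outside the given label superset."""
    errors = sum(1 for x, S in zip(instances, label_supersets) if h(x) not in S)
    return errors / len(instances)

def erm_superset(hypothesis_class, instances, label_supersets):
    """Return the hypothesis minimizing the empirical superset risk."""
    return min(hypothesis_class,
               key=lambda h: superset_error(h, instances, label_supersets))

# Toy usage: 1-D threshold classifiers with ambiguous (superset) labels.
instances = [0.1, 0.4, 0.6, 0.9]
label_supersets = [{0}, {0, 1}, {1}, {1}]  # each superset contains the true label
hypothesis_class = [lambda x, t=t: int(x > t) for t in (0.25, 0.5, 0.75)]
best = erm_superset(hypothesis_class, instances, label_supersets)
print(superset_error(best, instances, label_supersets))  # 0.0
```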

85 citations


Proceedings ArticleDOI
01 Oct 2014
TL;DR: This work proposes a novel search-based approach to greedy coreference resolution in which mentions are processed in order and added to previous coreference clusters, and shows that the resulting Prune-and-Score approach is superior to using a single scoring function for both decisions and outperforms several state-of-the-art approaches on multiple benchmark corpora, including OntoNotes.
Abstract: We propose a novel search-based approach for greedy coreference resolution, where the mentions are processed in order and added to previous coreference clusters. Our method is distinguished by the use of two functions to make each coreference decision: a pruning function that prunes bad coreference decisions from further consideration, and a scoring function that then selects the best among the remaining decisions. Our framework reduces learning of these functions to rank learning, which helps leverage powerful off-the-shelf rank-learners. We show that our Prune-and-Score approach is superior to using a single scoring function to make both decisions and outperforms several state-of-the-art approaches on multiple benchmark corpora including OntoNotes.
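To make the two-function decision rule concrete, here is a hedged Python sketch of a greedy Prune-and-Score resolver; the `prune_score` and `select_score` arguments stand in for the learned rank functions, and the top-k pruning rule and all names are illustrative assumptions.

```python
# Illustrative sketch of Prune-and-Score greedy coreference resolution:
# a pruning function first discards bad candidate attachments, then a
# separate scoring function selects among the survivors.

def resolve(mentions, prune_score, select_score, keep_top=5):
    """Process mentions left to right; attach each to a previous cluster
    or start a new one (represented by None)."""
    clusters = []
    for mention in mentions:
        candidates = clusters + [None]  # None = start a new cluster
        # Stage 1: prune to the top-k candidates under the pruning function.
        survivors = sorted(candidates,
                           key=lambda c: prune_score(mention, c),
                           reverse=True)[:keep_top]
        # Stage 2: the scoring function picks the best surviving decision.
        best = max(survivors, key=lambda c: select_score(mention, c))
        if best is None:
            clusters.append([mention])
        else:
            best.append(mention)
    return clusters
```

In this sketch both score functions would be trained as rankers, following the reduction to rank learning described in the abstract.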

35 citations


Proceedings Article
27 Jul 2014
TL;DR: This paper studies state aggregation as a way of reducing stochastic branching in tree search, and finds that trajectory sampling algorithms like UCT can be adapted easily, but that sparse sampling algorithms present difficulties.
Abstract: Monte Carlo tree search (MCTS) algorithms are a popular approach to online decision-making in Markov decision processes (MDPs). These algorithms can, however, perform poorly in MDPs with high stochastic branching factors. In this paper, we study state aggregation as a way of reducing stochastic branching in tree search. Prior work has studied formal properties of MDP state aggregation in the context of dynamic programming and reinforcement learning, but little attention has been paid to state aggregation in MCTS. Our main result is a performance loss bound for a class of value function-based state aggregation criteria in expectimax search trees. We also consider how to construct MCTS algorithms that operate in the abstract state space but require a simulator of the ground dynamics only. We find that trajectory sampling algorithms like UCT can be adapted easily, but that sparse sampling algorithms present difficulties. As a proof of concept, we experimentally confirm that state aggregation can improve the finite-sample performance of UCT.
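The following Python sketch illustrates the adaptation the abstract describes for trajectory-sampling algorithms: UCT statistics are keyed by abstract states while transitions come from a ground-dynamics simulator. The value-bucketing aggregation rule and all function names are illustrative assumptions, not the paper's construction.

```python
# Sketch of UCT over aggregated states: tree statistics live on abstract
# states, but only a simulator of the ground dynamics is required.

import math
from collections import defaultdict

N = defaultdict(int)    # visit counts keyed by (abstract state, action)
Q = defaultdict(float)  # mean returns keyed by (abstract state, action)

def aggregate(state, value_estimate, width=0.5):
    """Value-based aggregation: ground states whose estimated values fall
    in the same bucket share an abstract state (a toy stand-in for the
    value function-based criteria the paper analyzes)."""
    return round(value_estimate(state) / width)

def simulate(state, step, actions, value_estimate, depth=20, c=1.4):
    """One UCT trajectory; step(state, action) -> (next_state, reward)
    is the ground simulator."""
    if depth == 0:
        return 0.0
    s = aggregate(state, value_estimate)
    total = sum(N[(s, a)] for a in actions) + 1
    act = max(actions, key=lambda a: Q[(s, a)]
              + c * math.sqrt(math.log(total) / (N[(s, a)] + 1)))
    next_state, reward = step(state, act)
    ret = reward + simulate(next_state, step, actions, value_estimate,
                            depth - 1, c)
    N[(s, act)] += 1
    Q[(s, act)] += (ret - Q[(s, act)]) / N[(s, act)]  # running mean update
    return ret
```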

34 citations


Proceedings Article
27 Jul 2014
TL;DR: This paper proposes the first formal framework for scripts based on Hidden Markov Models (HMMs) and develops an Expectation Maximization algorithm for structure and parameter learning that is superior to several informed baselines at predicting missing events in partial observation sequences.
Abstract: Scripts have been proposed to model the stereotypical event sequences found in narratives. They can be applied to make a variety of inferences including filling gaps in the narratives and resolving ambiguous references. This paper proposes the first formal framework for scripts based on Hidden Markov Models (HMMs). Our framework supports robust inference and learning algorithms, which are lacking in previous clustering models. We develop an algorithm for structure and parameter learning based on Expectation Maximization and evaluate it on a number of natural datasets. The results show that our algorithm is superior to several informed baselines for predicting missing events in partial observation sequences.
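As a much-simplified illustration of the gap-filling inference the abstract evaluates, the sketch below scores a missing event by how well it bridges its observed neighbors under a transition matrix; unlike the paper's HMM framework it collapses the hidden states into a plain Markov chain over observed events, and the event vocabulary and probabilities are invented.

```python
# Toy gap-filling in an event sequence: pick the event that best bridges
# its neighbors under a (hand-made) transition matrix T.

import numpy as np

events = ["enter", "order", "eat"]           # assumed toy vocabulary
T = np.array([[0.1, 0.8, 0.1],
              [0.1, 0.2, 0.7],
              [0.3, 0.3, 0.4]])              # T[i, j] = P(next=j | current=i)

def fill_gap(prev_event, next_event):
    """Most probable missing event between two observed neighbors."""
    i = events.index(prev_event)
    k = events.index(next_event)
    scores = T[i, :] * T[:, k]               # P(prev -> e) * P(e -> next)
    return events[int(np.argmax(scores))]

print(fill_gap("enter", "eat"))              # -> "order"
```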

29 citations


Journal Article
TL;DR: This paper considers active imitation learning, which reduces the expert effort of passive imitation learning by querying the expert about the desired action at individual states, selected based on answers to past queries and the learner's interactions with an environment simulator.
Abstract: In standard passive imitation learning, the goal is to learn a policy that performs as well as a target policy by passively observing full execution trajectories of it. Unfortunately, generating such trajectories can require substantial expert effort and be impractical in some cases. In this paper, we consider active imitation learning with the goal of reducing this effort by querying the expert about the desired action at individual states, which are selected based on answers to past queries and the learner's interactions with an environment simulator. We introduce a new approach based on reducing active imitation learning to active i.i.d. learning, which can leverage progress in the i.i.d. setting. Our first contribution is to analyze reductions for both non-stationary and stationary policies, showing for the first time that the label complexity (number of queries) of active imitation learning can be less than that of passive learning. Our second contribution is to introduce a practical algorithm inspired by the reductions, which is shown to be highly effective in five test domains compared to a number of alternatives.
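A hedged sketch of the query loop this abstract describes, with uncertainty sampling standing in for the active i.i.d. learner (the paper's reduction admits other choices); the simulator, expert, and learner interfaces below are illustrative assumptions.

```python
# Sketch of active imitation learning: roll out the current policy in a
# simulator, query the expert at the state the learner is least sure of,
# and retrain on the accumulated state-action pairs.

def active_imitation(simulator, expert, learner, rounds=50, horizon=100):
    dataset = []
    for _ in range(rounds):
        # Collect states by executing the learner's current policy.
        states, s = [], simulator.reset()
        for _ in range(horizon):
            states.append(s)
            s, done = simulator.step(learner.predict(s))
            if done:
                break
        # Active i.i.d. step: query the expert on the least-certain state.
        query = min(states, key=learner.confidence)
        dataset.append((query, expert(query)))
        learner.fit(dataset)   # retrain on all labeled state-action pairs
    return learner
```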

21 citations


Journal ArticleDOI
TL;DR: This paper describes recent work on an AI system that quantifies bird migration using radar data, part of the larger BirdCast project to model and forecast bird migration at large scales using radar, weather, and citizen science data.
Abstract: Bird migration occurs at the largest of global scales, but monitoring such movements can be challenging. In the US there is an operational network of weather radars providing freely accessible data for monitoring meteorological phenomena in the atmosphere. Individual radars are sensitive enough to detect birds, and can provide insight into migratory behaviors of birds at scales that are not possible using other sensors. Archived data from the WSR-88D network of US weather radars hold valuable and detailed information about the continent-scale migratory movements of birds over the last 20 years. However, significant technical challenges must be overcome to understand this information and harness its potential for science and conservation. We describe recent work on an AI system to quantify bird migration using radar data, which is part of the larger BirdCast project to model and forecast bird migration at large scales using radar, weather, and citizen science data.

16 citations


Proceedings Article
21 Jun 2014
TL;DR: This paper shows that, as the population grows large, the Collective Graphical Model distribution converges to a multivariate Gaussian distribution (GCGM) that maintains the conditional independence properties of the original CGM and supports efficient closed-form inference.
Abstract: The Collective Graphical Model (CGM) models a population of independent and identically distributed individuals when only collective statistics (i.e., counts of individuals) are observed. Exact inference in CGMs is intractable, and previous work has explored Markov Chain Monte Carlo (MCMC) and MAP approximations for learning and inference. This paper studies Gaussian approximations to the CGM. As the population grows large, we show that the CGM distribution converges to a multivariate Gaussian distribution (GCGM) that maintains the conditional independence properties of the original CGM. If the observations are exact marginals of the CGM or marginals that are corrupted by Gaussian noise, inference in the GCGM approximation can be computed efficiently in closed form. If the observations follow a different noise model (e.g., Poisson), then expectation propagation provides efficient and accurate approximate inference. The accuracy and speed of GCGM inference is compared to the MCMC and MAP methods on a simulated bird migration problem. The GCGM matches or exceeds the accuracy of the MAP method while being significantly faster.
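The closed-form step highlighted in the abstract is standard Gaussian conditioning; the sketch below shows that update for a toy count vector, with all quantities as stand-ins for the GCGM's mean and covariance.

```python
# Conditioning a Gaussian approximation on a noisy linear observation
# y = H x + N(0, R): the posterior is available in closed form.

import numpy as np

def condition_gaussian(mu, Sigma, H, y, R):
    """Posterior mean/covariance of x ~ N(mu, Sigma) given y = Hx + noise."""
    S = H @ Sigma @ H.T + R                   # innovation covariance
    K = Sigma @ H.T @ np.linalg.inv(S)        # gain
    return mu + K @ (y - H @ mu), Sigma - K @ H @ Sigma

# Toy usage: two-cell count vector, noisy reading of the first cell.
mu = np.array([10.0, 5.0])
Sigma = np.array([[4.0, 1.0],
                  [1.0, 2.0]])
H = np.array([[1.0, 0.0]])
mu_post, Sigma_post = condition_gaussian(mu, Sigma, H,
                                         y=np.array([12.0]),
                                         R=np.array([[1.0]]))
```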

3 citations