Showing papers in "arXiv: Artificial Intelligence in 2012"

PDF

Open Access

Posted Content•

Heuristic Search Value Iteration for POMDPs

[...]

Trey Smith¹, Reid Simmons¹•Institutions (1)

11 Jul 2012-arXiv: Artificial Intelligence

TL;DR: Heuristic search value iteration (HSVI) as mentioned in this paper is an anytime algorithm that returns a policy and a provable bound on its regret with respect to the optimal policy, which can be used to solve POMDP problems.

...read moreread less

Abstract: We present a novel POMDP planning algorithm called heuristic search value iteration (HSVI).HSVI is an anytime algorithm that returns a policy and a provable bound on its regret with respect to the optimal policy. HSVI gets its power by combining two well-known techniques: attention-focusing search heuristics and piecewise linear convex representations of the value function. HSVI's soundness and convergence have been proven. On some benchmark problems from the literature, HSVI displays speedups of greater than 100 with respect to other state-of-the-art POMDP value iteration algorithms. We also apply HSVI to a new rover exploration problem 10 times larger than most POMDP problems in the literature.

...read moreread less

439 citations

Posted Content•

Near-optimal Nonmyopic Value of Information in Graphical Models

[...]

Andreas Krause¹, Carlos Guestrin¹•Institutions (1)

Carnegie Mellon University¹

04 Jul 2012-arXiv: Artificial Intelligence

TL;DR: In this paper, the authors presented an efficient randomized algorithm providing a constant factor (1-1/e-epsilon) approximation guarantee for any epsilon > 0 with high confidence.

...read moreread less

Abstract: A fundamental issue in real-world systems, such as sensor networks, is the selection of observations which most effectively reduce uncertainty. More specifically, we address the long standing problem of nonmyopically selecting the most informative subset of variables in a graphical model. We present the first efficient randomized algorithm providing a constant factor (1-1/e-epsilon) approximation guarantee for any epsilon > 0 with high confidence. The algorithm leverages the theory of submodular functions, in combination with a polynomial bound on sample complexity. We furthermore prove that no polynomial time algorithm can provide a constant factor approximation better than (1 - 1/e) unless P = NP. Finally, we provide extensive evidence of the effectiveness of our method on two complex real-world datasets.

...read moreread less

360 citations

Posted Content•

Continuous Time Bayesian Networks

[...]

Uri Nodelman¹, Christian R. Shelton¹, Daphne Koller¹•Institutions (1)

Stanford University¹

12 Dec 2012-arXiv: Artificial Intelligence

TL;DR: A probabilistic semantics for the language in terms of the generative model a CTBN defines over sequences of events is presented, and an algorithm for approximate inference which takes advantage of the structure within the process is provided.

...read moreread less

Abstract: In this paper we present a language for finite state continuous time Bayesian networks (CTBNs), which describe structured stochastic processes that evolve over continuous time. The state of the system is decomposed into a set of local variables whose values change over time. The dynamics of the system are described by specifying the behavior of each local variable as a function of its parents in a directed (possibly cyclic) graph. The model specifies, at any given point in time, the distribution over two aspects: when a local variable changes its value and the next value it takes. These distributions are determined by the variable s CURRENT value AND the CURRENT VALUES OF its parents IN the graph.More formally, each variable IS modelled AS a finite state continuous time Markov process whose transition intensities are functions OF its parents.We present a probabilistic semantics FOR the language IN terms OF the generative model a CTBN defines OVER sequences OF events.We list types OF queries one might ask OF a CTBN, discuss the conceptual AND computational difficulties associated WITH exact inference, AND provide an algorithm FOR approximate inference which takes advantage OF the structure within the process.

...read moreread less

296 citations

Journal Article•DOI•

The Arcade Learning Environment: An Evaluation Platform for General Agents

[...]

Marc G. Bellemare¹, Yavar Naddaf, Joel Veness¹, Michael Bowling¹•Institutions (1)

University of Alberta¹

19 Jul 2012-arXiv: Artificial Intelligence

TL;DR: The Arcade Learning Environment (ALE) as mentioned in this paper is a platform for evaluating the development of general, domain-independent AI technology, which provides an interface to hundreds of Atari 2600 game environments, each one different, interesting, and designed to be a challenge for human players.

...read moreread less

Abstract: In this article we introduce the Arcade Learning Environment (ALE): both a challenge problem and a platform and methodology for evaluating the development of general, domain-independent AI technology. ALE provides an interface to hundreds of Atari 2600 game environments, each one different, interesting, and designed to be a challenge for human players. ALE presents significant research challenges for reinforcement learning, model learning, model-based planning, imitation learning, transfer learning, and intrinsic motivation. Most importantly, it provides a rigorous testbed for evaluating and comparing approaches to these problems. We illustrate the promise of ALE by developing and benchmarking domain-independent agents designed using well-established AI techniques for both reinforcement learning and planning. In doing so, we also propose an evaluation methodology made possible by ALE, reporting empirical results on over 55 different games. All of the software, including the benchmark agents, is publicly available.

...read moreread less

254 citations

Posted Content•

A Description Logic Primer

[...]

Markus Krötzsch

19 Jan 2012-arXiv: Artificial Intelligence

TL;DR: The main concepts and features are explained with examples before syn- tax and semantics of the DLSROIQ are defined in detail.

...read moreread less

Abstract: This paper provides a self-contained first introduction to d escription log- ics (DLs). The main concepts and features are explained with examples before syn- tax and semantics of the DLSROIQ are defined in detail. Additional sections review light-weight DL languages, discuss the relationship to the Web Ontology Language OWL and give pointers to further reading.

...read moreread less

204 citations

Posted Content•

A Linear-Programming Approximation of AC Power Flows

[...]

Carleton Coffrin¹, Pascal Van Hentenryck²•Institutions (2)

NICTA¹, Australian National University²

16 Jun 2012-arXiv: Artificial Intelligence

TL;DR: In this article, the authors proposed linear programming models (LPAC) that incorporate reactive power and voltage magnitudes in a linear power flow approximation to ensure voltage stability and AC power flow feasibility.

...read moreread less

Abstract: Linear active-power-only DC power flow approximations are pervasive in the planning and control of power systems. However, these approximations fail to capture reactive power and voltage magnitudes, both of which are necessary in many applications to ensure voltage stability and AC power flow feasibility. This paper proposes linear-programming models (the LPAC models) that incorporate reactive power and voltage magnitudes in a linear power flow approximation. The LPAC models are built on a convex approximation of the cosine terms in the AC equations, as well as Taylor approximations of the remaining nonlinear terms. Experimental comparisons with AC solutions on a variety of standard IEEE and MatPower benchmarks show that the LPAC models produce accurate values for active and reactive power, phase angles, and voltage magnitudes. The potential benefits of the LPAC models are illustrated on two "proof-of-concept" studies in power restoration and capacitor placement.

...read moreread less

203 citations

Posted Content•

Metrics for Finite Markov Decision Processes

[...]

Norm Ferns¹, Prakash Panangaden¹, Doina Precup¹•Institutions (1)

McGill University¹

11 Jul 2012-arXiv: Artificial Intelligence

TL;DR: In this paper, the authors present metrics for measuring the similarity of states in a finite Markov decision process (MDP) based on the notion of bisimulation, with an aim towards solving discounted infinite horizon reinforcement learning tasks.

...read moreread less

Abstract: We present metrics for measuring the similarity of states in a finite Markov decision process (MDP). The formulation of our metrics is based on the notion of bisimulation for MDPs, with an aim towards solving discounted infinite horizon reinforcement learning tasks. Such metrics can be used to aggregate states, as well as to better structure other value function approximators (e.g., memory-based or nearest-neighbor approximators). We provide bounds that relate our metric distances to the optimal values of states in the given MDP.

...read moreread less

201 citations

Posted Content•

Decentralized Sensor Fusion With Distributed Particle Filters

[...]

Matthew Rosencrantz¹, Geoffrey J. Gordon¹, Sebastian Thrun¹•Institutions (1)

University of Pittsburgh¹

19 Oct 2012-arXiv: Artificial Intelligence

TL;DR: In this paper, a scalable Bayesian technique for decentralized state estimation from multiple platforms in dynamic environments is presented. But the authors do so through an interactive communication protocol aimed at maximizing information flow, which is evaluated in a distributed surveillance scenario that arises in a robotic system for playing the game of laser tag.

...read moreread less

Abstract: This paper presents a scalable Bayesian technique for decentralized state estimation from multiple platforms in dynamic environments. As has long been recognized, centralized architectures impose severe scaling limitations for distributed systems due to the enormous communication overheads. We propose a strictly decentralized approach in which only nearby platforms exchange information. They do so through an interactive communication protocol aimed at maximizing information flow. Our approach is evaluated in the context of a distributed surveillance scenario that arises in a robotic system for playing the game of laser tag. Our results, both from simulation and using physical robots, illustrate an unprecedented scaling capability to large teams of vehicles.

...read moreread less

173 citations

Journal Article•DOI•

Concepts and Their Dynamics: A Quantum-Theoretic Modeling of Human Thought

[...]

Diederik Aerts, Liane Gabora¹, Sandro Sozzo•Institutions (1)

University of British Columbia¹

05 Jun 2012-arXiv: Artificial Intelligence

TL;DR: The relevance of complex numbers, the appearance of entanglement, and the role of Fock space in explaining contextual emergence, all as unique features of the quantum modeling are explicitly revealed in this article by analyzing human concepts and their dynamics.

...read moreread less

Abstract: We analyze different aspects of our quantum modeling approach of human concepts, and more specifically focus on the quantum effects of contextuality, interference, entanglement and emergence, illustrating how each of them makes its appearance in specific situations of the dynamics of human concepts and their combinations. We point out the relation of our approach, which is based on an ontology of a concept as an entity in a state changing under influence of a context, with the main traditional concept theories, i.e. prototype theory, exemplar theory and theory theory. We ponder about the question why quantum theory performs so well in its modeling of human concepts, and shed light on this question by analyzing the role of complex amplitudes, showing how they allow to describe interference in the statistics of measurement outcomes, while in the traditional theories statistics of outcomes originates in classical probability weights, without the possibility of interference. The relevance of complex numbers, the appearance of entanglement, and the role of Fock space in explaining contextual emergence, all as unique features of the quantum modeling, are explicitly revealed in this paper by analyzing human concepts and their dynamics.

...read moreread less

171 citations

Posted Content•

Prediction, Expectation, and Surprise: Methods, Designs, and Study of a Deployed Traffic Forecasting Service

[...]

Eric Horvitz¹, Johnson T. Apacible¹, Raman K. Sarin¹, Lin Liao²•Institutions (2)

Microsoft¹, University of Washington²

04 Jul 2012-arXiv: Artificial Intelligence

TL;DR: Research on developing models that forecast traffic flow and congestion in the Greater Seattle area is presented, which has led to the deployment of a service named JamBayes, that is being actively used by over 2,500 users via smartphones and desktop versions of the system.

...read moreread less

Abstract: We present research on developing models that forecast traffic flow and congestion in the Greater Seattle area. The research has led to the deployment of a service named JamBayes, that is being actively used by over 2,500 users via smartphones and desktop versions of the system. We review the modeling effort and describe experiments probing the predictive accuracy of the models. Finally, we present research on building models that can identify current and future surprises, via efforts on modeling and forecasting unexpected situations.

...read moreread less

167 citations

Posted Content•

Pearl's Calculus of Intervention Is Complete

[...]

Yimin Huang¹, Marco Valtorta¹•Institutions (1)

University of South Carolina¹

27 Jun 2012-arXiv: Artificial Intelligence

TL;DR: It is proved that the three basic do-calculus rules that Pearl presents are complete, in the sense that, if a causal effect is identifiable, there exists a sequence of applications of the rules of the do-Calculus that transforms the causal effect formula into a formula that only includes observational quantities.

...read moreread less

Abstract: This paper is concerned with graphical criteria that can be used to solve the problem of identifying casual effects from nonexperimental data in a causal Bayesian network structure, i.e., a directed acyclic graph that represents causal relationships. We first review Pearl's work on this topic [Pearl, 1995], in which several useful graphical criteria are presented. Then we present a complete algorithm [Huang and Valtorta, 2006b] for the identifiability problem. By exploiting the completeness of this algorithm, we prove that the three basic do-calculus rules that Pearl presents are complete, in the sense that, if a causal effect is identifiable, there exists a sequence of applications of the rules of the do-calculus that transforms the causal effect formula into a formula that only includes observational quantities.

...read moreread less

Posted Content•

Knapsack based Optimal Policies for Budget-Limited Multi-Armed Bandits

[...]

Long Tran-Thanh¹, Archie C. Chapman², Alex Rogers¹, Nicholas R. Jennings¹•Institutions (2)

University of Southampton¹, University of Sydney²

09 Apr 2012-arXiv: Artificial Intelligence

TL;DR: Two pulling policies are developed, namely: (i) KUBE; and (ii) fractional KUBe, which are computationally less expensive and prove logarithmic upper bounds for the regret of both policies, and show that these bounds are asymptotically optimal.

...read moreread less

Abstract: In budget-limited multi-armed bandit (MAB) problems, the learner's actions are costly and constrained by a fixed budget. Consequently, an optimal exploitation policy may not be to pull the optimal arm repeatedly, as is the case in other variants of MAB, but rather to pull the sequence of different arms that maximises the agent's total reward within the budget. This difference from existing MABs means that new approaches to maximising the total reward are required. Given this, we develop two pulling policies, namely: (i) KUBE; and (ii) fractional KUBE. Whereas the former provides better performance up to 40% in our experimental settings, the latter is computationally less expensive. We also prove logarithmic upper bounds for the regret of both policies, and show that these bounds are asymptotically optimal (i.e. they only differ from the best possible regret by a constant factor).

...read moreread less

Posted Content•

Probabilistic Similarity Logic

[...]

Matthias Bröcheler¹, Lilyana Mihalkova¹, Lise Getoor¹•Institutions (1)

University of Maryland, College Park¹

15 Mar 2012-arXiv: Artificial Intelligence

TL;DR: Probabilistic similarity logic (PSL) as discussed by the authors is a general-purpose framework for joint reasoning about similarity in relational domains that incorporates probabilistic reasoning about similarities and relational structure in a principled way.

...read moreread less

Abstract: Many machine learning applications require the ability to learn from and reason about noisy multi-relational data. To address this, several effective representations have been developed that provide both a language for expressing the structural regularities of a domain, and principled support for probabilistic inference. In addition to these two aspects, however, many applications also involve a third aspect-the need to reason about similarities-which has not been directly supported in existing frameworks. This paper introduces probabilistic similarity logic (PSL), a general-purpose framework for joint reasoning about similarity in relational domains that incorporates probabilistic reasoning about similarities and relational structure in a principled way. PSL can integrate any existing domain-specific similarity measures and also supports reasoning about similarities between sets of entities. We provide efficient inference and learning techniques for PSL and demonstrate its effectiveness both in common relational tasks and in settings that require reasoning about similarity.

...read moreread less

Posted Content•

Counting Belief Propagation

[...]

Kristian Kersting, Babak Ahmadi, Sriraam Natarajan¹•Institutions (1)

University of Wisconsin-Madison¹

09 May 2012-arXiv: Artificial Intelligence

TL;DR: The experiments show that counting BP is applicable to a variety of important AI tasks such as (dynamic) relational models and boolean model counting, and that significant efficiency gains are obtainable, often by orders of magnitude.

...read moreread less

Abstract: A major benefit of graphical models is that most knowledge is captured in the model structure. Many models, however, produce inference problems with a lot of symmetries not reflected in the graphical structure and hence not exploitable by efficient inference techniques such as belief propagation (BP). In this paper, we present a new and simple BP algorithm, called counting BP, that exploits such additional symmetries. Starting from a given factor graph, counting BP first constructs a compressed factor graph of clusternodes and clusterfactors, corresponding to sets of nodes and factors that are indistinguishable given the evidence. Then it runs a modified BP algorithm on the compressed graph that is equivalent to running BP on the original factor graph. Our experiments show that counting BP is applicable to a variety of important AI tasks such as (dynamic) relational models and boolean model counting, and that significant efficiency gains are obtainable, often by orders of magnitude.

...read moreread less

Posted Content•

Introducing Variable Importance Tradeoffs into CP-Nets

[...]

Ronen I. Brafman¹, Carmel Domshlak¹•Institutions (1)

Ben-Gurion University of the Negev¹

12 Dec 2012-arXiv: Artificial Intelligence

TL;DR: In this paper, the use of TCP-nets, an enhancement of CP-networks, as a tool for representing, reasoning about qualitative preference statements is discussed. But TCP nets are not suitable for the problem of preference elicitation, as they do not have the time, knowledge or expert support required to specify complex multi-attribute utility functions.

...read moreread less

Abstract: The ability to make decisions and to assess potential courses of action is a corner-stone of many AI applications, and usually this requires explicit information about the decision-maker s preferences. IN many applications, preference elicitation IS a serious bottleneck.The USER either does NOT have the time, the knowledge, OR the expert support required TO specify complex multi - attribute utility functions. IN such cases, a method that IS based ON intuitive, yet expressive, preference statements IS required. IN this paper we suggest the USE OF TCP - nets, an enhancement OF CP - nets, AS a tool FOR representing, AND reasoning about qualitative preference statements.We present AND motivate this framework, define its semantics, AND show how it can be used TO perform constrained optimization.

...read moreread less

Posted Content•

Improved Memory-Bounded Dynamic Programming for Decentralized POMDPs

[...]

Sven Seuken¹, Shlomo Zilberstein²•Institutions (2)

Harvard University¹, University of Massachusetts Amherst²

20 Jun 2012-arXiv: Artificial Intelligence

TL;DR: Memory-Bounded Dynamic Programming is generalized and its scalability is improved by reducing the complexity with respect to the number of observations from exponential to polynomial, and error bounds on solution quality are derived.

...read moreread less

Abstract: Memory-Bounded Dynamic Programming (MBDP) has proved extremely effective in solving decentralized POMDPs with large horizons. We generalize the algorithm and improve its scalability by reducing the complexity with respect to the number of observations from exponential to polynomial. We derive error bounds on solution quality with respect to this new approximation and analyze the convergence behavior. To evaluate the effectiveness of the improvements, we introduce a new, larger benchmark problem. Experimental results show that despite the high complexity of decentralized POMDPs, scalable solution techniques such as MBDP perform surprisingly well.

...read moreread less

Posted Content•

Recognizing Activities and Spatial Context Using Wearable Sensors

[...]

Amarnag Subramanya¹, Alvin Raj¹, Jeff A. Bilmes¹, Dieter Fox¹•Institutions (1)

University of Washington¹

27 Jun 2012-arXiv: Artificial Intelligence

TL;DR: This work introduces a new dynamic model with the capability of recognizing both activities that an individual is performing as well as where that individual is located, and applies virtual evidence to improve data annotation, giving the user high flexibility when labeling training data.

...read moreread less

Abstract: We introduce a new dynamic model with the capability of recognizing both activities that an individual is performing as well as where that ndividual is located. Our model is novel in that it utilizes a dynamic graphical model to jointly estimate both activity and spatial context over time based on the simultaneous use of asynchronous observations consisting of GPS measurements, and measurements from a small mountable sensor board. Joint inference is quite desirable as it has the ability to improve accuracy of the model. A key goal, however, in designing our overall system is to be able to perform accurate inference decisions while minimizing the amount of hardware an individual must wear. This minimization leads to greater comfort and flexibility, decreased power requirements and therefore increased battery life, and reduced cost. We show results indicating that our joint measurement model outperforms measurements from either the sensor board or GPS alone, using two types of probabilistic inference procedures, namely particle filtering and pruned exact inference.

...read moreread less

Posted Content•

OpenGM: A C++ Library for Discrete Graphical Models

[...]

Bjoern Andres¹, Thorsten Beier², Joerg H. Kappes²•Institutions (2)

Harvard University¹, Heidelberg University²

01 Jun 2012-arXiv: Artificial Intelligence

TL;DR: OpenGM is a C++ template library for defining discrete graphical models and performing inference on these models, using a wide range of state-of-the-art algorithms, and its algorithms, HDF5 file format and command line tools are modular and extendible.

...read moreread less

Abstract: OpenGM is a C++ template library for defining discrete graphical models and performing inference on these models, using a wide range of state-of-the-art algorithms. No restrictions are imposed on the factor graph to allow for higher-order factors and arbitrary neighborhood structures. Large models with repetitive structure are handled efficiently because (i) functions that occur repeatedly need to be stored only once, and (ii) distinct functions can be implemented differently, using different encodings alongside each other in the same model. Several parametric functions (e.g. metrics), sparse and dense value tables are provided and so is an interface for custom C++ code. Algorithms are separated by design from the representation of graphical models and are easily exchangeable. OpenGM, its algorithms, HDF5 file format and command line tools are modular and extendible.

...read moreread less

Posted Content•

Sensitivity Analysis in Bayesian Networks: From Single to Multiple Parameters

[...]

Hei Chan¹, Adnan Darwiche¹•Institutions (1)

University of California, Los Angeles¹

11 Jul 2012-arXiv: Artificial Intelligence

TL;DR: In this article, the authors identify the solution space of multiple parameter changes that would be needed to enforce a query constraint and find the optimal solution, that is, the one which disturbs the current probability distribution the least.

...read moreread less

Abstract: Previous work on sensitivity analysis in Bayesian networks has focused on single parameters, where the goal is to understand the sensitivity of queries to single parameter changes, and to identify single parameter changes that would enforce a certain query constraint. In this paper, we expand the work to multiple parameters which may be in the CPT of a single variable, or the CPTs of multiple variables. Not only do we identify the solution space of multiple parameter changes that would be needed to enforce a query constraint, but we also show how to find the optimal solution, that is, the one which disturbs the current probability distribution the least (with respect to a specific measure of disturbance). We characterize the computational complexity of our new techniques and discuss their applications to developing and debugging Bayesian networks, and to the problem of reasoning about the value (reliability) of new information.

...read moreread less

Posted Content•

Dynamic Programming for Structured Continuous Markov Decision Problems

[...]

Zhengzhu Feng¹, Richard Dearden², Nicolas Meuleau², Richard Washington²•Institutions (2)

University of Massachusetts Amherst¹, Ames Research Center²

11 Jul 2012-arXiv: Artificial Intelligence

TL;DR: In this article, the state space is dynamically partitioned into regions where the value function is the same throughout the region, where the state variables can be expressed by piecewise constant representations.

...read moreread less

Abstract: We describe an approach for exploiting structure in Markov Decision Processes with continuous state variables. At each step of the dynamic programming, the state space is dynamically partitioned into regions where the value function is the same throughout the region. We first describe the algorithm for piecewise constant representations. We then extend it to piecewise linear representations, using techniques from POMDPs to represent and reason about linear surfaces efficiently. We show that for complex, structured problems, our approach exploits the natural structure so that optimal solutions can be computed efficiently.

...read moreread less

Posted Content•

Causal Inference by Surrogate Experiments: z-Identifiability

[...]

Elias Bareinboim¹, Judea Pearl¹•Institutions (1)

University of California, Los Angeles¹

16 Oct 2012-arXiv: Artificial Intelligence

TL;DR: In this paper, the problem of estimating the effect of intervening on a set of variables X from experiments on a different set, Z, that is more accessible to manipulation is addressed, which reduces to ordinary identifiability when Z = empty and can be given syntactic characterization using the do-calculus.

...read moreread less

Abstract: We address the problem of estimating the effect of intervening on a set of variables X from experiments on a different set, Z, that is more accessible to manipulation. This problem, which we call z-identifiability, reduces to ordinary identifiability when Z = empty and, like the latter, can be given syntactic characterization using the do-calculus [Pearl, 1995; 2000]. We provide a graphical necessary and sufficient condition for z-identifiability for arbitrary sets X,Z, and Y (the outcomes). We further develop a complete algorithm for computing the causal effect of X on Y using information provided by experiments on Z. Finally, we use our results to prove completeness of do-calculus relative to z-identifiability, a result that does not follow from completeness relative to ordinary identifiability.

...read moreread less

Posted Content•

Software Verification and Graph Similarity for Automated Evaluation of Students' Assignments

[...]

Milena Vujosevic-Janicic¹, Mladen Nikolić¹, Dušan Tošić¹, Viktor Kuncak²•Institutions (2)

University of Belgrade¹, École Polytechnique Fédérale de Lausanne²

29 Jun 2012-arXiv: Artificial Intelligence

TL;DR: Results of the evaluation show that the synergy of proposed approaches improves the quality and precision of automated grading and that automatically generated grades are highly correlated with instructor-assigned grades.

...read moreread less

Abstract: In this paper we promote introducing software verification and control flow graph similarity measurement in automated evaluation of students' programs. We present a new grading framework that merges results obtained by combination of these two approaches with results obtained by automated testing, leading to improved quality and precision of automated grading. These two approaches are also useful in providing a comprehensible feedback that can help students to improve the quality of their programs We also present our corresponding tools that are publicly available and open source. The tools are based on LLVM low-level intermediate code representation, so they could be applied to a number of programming languages. Experimental evaluation of the proposed grading framework is performed on a corpus of university students' programs written in programming language C. Results of the experiments show that automatically generated grades are highly correlated with manually determined grades suggesting that the presented tools can find real-world applications in studying and grading.

...read moreread less

Posted Content•

Learning Arithmetic Circuits

[...]

Daniel Lowd¹, Pedro Domingos¹•Institutions (1)

University of Washington¹

13 Jun 2012-arXiv: Artificial Intelligence

TL;DR: In this article, the authors propose to learn a Bayesian network with a score function that directly penalizes the cost of inference by greedy splitting conditional distributions, at each step scoring the candidates by compiling the resulting network into an arithmetic circuit and using its size as the penalty.

...read moreread less

Abstract: Graphical models are usually learned without regard to the cost of doing inference with them. As a result, even if a good model is learned, it may perform poorly at prediction, because it requires approximate inference. We propose an alternative: learning models with a score function that directly penalizes the cost of inference. Specifically, we learn arithmetic circuits with a penalty on the number of edges in the circuit (in which the cost of inference is linear). Our algorithm is equivalent to learning a Bayesian network with context-specific independence by greedily splitting conditional distributions, at each step scoring the candidates by compiling the resulting network into an arithmetic circuit, and using its size as the penalty. We show how this can be done efficiently, without compiling a circuit from scratch for each candidate. Experiments on several real-world domains show that our algorithm is able to learn tractable models with very large treewidth, and yields more accurate predictions than a standard context-specific Bayesian network learner, in far less time.

...read moreread less

Posted Content•

Empowerment for Continuous Agent-Environment Systems

[...]

Tobias Jung¹, Daniel Polani², Peter Stone¹•Institutions (2)

University of Texas at Austin¹, University of Hertfordshire²

31 Jan 2012-arXiv: Artificial Intelligence

TL;DR: The goal of this article is to extend empowerment to the significantly more important and relevant case of continuous vector-valued state spaces and initially unknown state transition probabilities.

...read moreread less

Abstract: This paper develops generalizations of empowerment to continuous states. Empowerment is a recently introduced information-theoretic quantity motivated by hypotheses about the efficiency of the sensorimotor loop in biological organisms, but also from considerations stemming from curiosity-driven learning. Empowemerment measures, for agent-environment systems with stochastic transitions, how much influence an agent has on its environment, but only that influence that can be sensed by the agent sensors. It is an information-theoretic generalization of joint controllability (influence on environment) and observability (measurement by sensors) of the environment by the agent, both controllability and observability being usually defined in control theory as the dimensionality of the control/observation spaces. Earlier work has shown that empowerment has various interesting and relevant properties, e.g., it allows us to identify salient states using only the dynamics, and it can act as intrinsic reward without requiring an external reward. However, in this previous work empowerment was limited to the case of small-scale and discrete domains and furthermore state transition probabilities were assumed to be known. The goal of this paper is to extend empowerment to the significantly more important and relevant case of continuous vector-valued state spaces and initially unknown state transition probabilities. The continuous state space is addressed by Monte-Carlo approximation; the unknown transitions are addressed by model learning and prediction for which we apply Gaussian processes regression with iterated forecasting. In a number of well-known continuous control tasks we examine the dynamics induced by empowerment and include an application to exploration and online model learning.

...read moreread less

Posted Content•

Hierarchical POMDP Controller Optimization by Likelihood Maximization

[...]

Marc Toussaint¹, Laurent Charlin², Pascal Poupart³•Institutions (3)

Technical University of Berlin¹, University of Toronto², University of Waterloo³

13 Jun 2012-arXiv: Artificial Intelligence

TL;DR: In this article, a hierarchical discovery problem in partially observable domains can be tackled using a similar maximum likelihood approach, which transforms the problem into a dynamic Bayesian network through which a hierarchical structure can naturally be discovered while optimizing the policy.

...read moreread less

Abstract: Planning can often be simpli ed by decomposing the task into smaller tasks arranged hierarchically. Charlin et al. [4] recently showed that the hierarchy discovery problem can be framed as a non-convex optimization problem. However, the inherent computational di culty of solving such an optimization problem makes it hard to scale to realworld problems. In another line of research, Toussaint et al. [18] developed a method to solve planning problems by maximumlikelihood estimation. In this paper, we show how the hierarchy discovery problem in partially observable domains can be tackled using a similar maximum likelihood approach. Our technique rst transforms the problem into a dynamic Bayesian network through which a hierarchical structure can naturally be discovered while optimizing the policy. Experimental results demonstrate that this approach scales better than previous techniques based on non-convex optimization.

...read moreread less

Posted Content•

The Dynamic Controllability of Conditional STNs with Uncertainty

[...]

Luke Hunsberger¹, Roberto Posenato², Carlo Combi²•Institutions (2)

Vassar College¹, University of Verona²

10 Dec 2012-arXiv: Artificial Intelligence

TL;DR: In this paper, the authors define a Conditional Simple Temporal Network with Uncertainty (CSTNU) that combines the simple temporal constraints from a Simple-Temporal Network (STN) with the conditional nodes from a CTP and the contingent links from a STNU.

...read moreread less

Abstract: Recent attempts to automate business processes and medical-treatment processes have uncovered the need for a formal framework that can accommodate not only temporal constraints, but also observations and actions with uncontrollable durations To meet this need, this paper defines a Conditional Simple Temporal Network with Uncertainty (CSTNU) that combines the simple temporal constraints from a Simple Temporal Network (STN) with the conditional nodes from a Conditional Simple Temporal Problem (CSTP) and the contingent links from a Simple Temporal Network with Uncertainty (STNU) A notion of dynamic controllability for a CSTNU is defined that generalizes the dynamic consistency of a CTP and the dynamic controllability of an STNU The paper also presents some sound constraint-propagation rules for dynamic controllability that are expected to form the backbone of a dynamic-controllability-checking algorithm for CSTNUs

...read moreread less

Posted Content•

Optimization in SMT with LA(Q) Cost Functions

[...]

Roberto Sebastiani¹, Silvia Tomasi¹•Institutions (1)

University of Trento¹

07 Feb 2012-arXiv: Artificial Intelligence

TL;DR: This paper presents and discusses two general procedures for leveraging SMT to handle the minimization of LA(Q) cost functions, combining SMT with standard minimization techniques and implemented the proposed approach within the MathSAT SMT solver.

...read moreread less

Abstract: In the contexts of automated reasoning and formal verification, important decision problems are effectively encoded into Satisfiability Modulo Theories (SMT). In the last decade efficient SMT solvers have been developed for several theories of practical interest (e.g., linear arithmetic, arrays, bit-vectors). Surprisingly, very few work has been done to extend SMT to deal with optimization problems; in particular, we are not aware of any work on SMT solvers able to produce solutions which minimize cost functions over arithmetical variables. This is unfortunate, since some problems of interest require this functionality. In this paper we start filling this gap. We present and discuss two general procedures for leveraging SMT to handle the minimization of LA(Q) cost functions, combining SMT with standard minimization techniques. We have implemented the proposed approach within the MathSAT SMT solver. Due to the lack of competitors in AR and SMT domains, we experimentally evaluated our implementation against state-of-the-art tools for the domain of linear generalized disjunctive programming (LGDP), which is closest in spirit to our domain, on sets of problems which have been previously proposed as benchmarks for the latter tools. The results show that our tool is very competitive with, and often outperforms, these tools on these problems, clearly demonstrating the potential of the approach.

...read moreread less

Journal Article•DOI•

The third open Answer Set Programming competition

[...]

Francesco Calimeri¹, Giovambattista Ianni¹, Francesco Ricca¹•Institutions (1)

University of Calabria¹

14 Jun 2012-arXiv: Artificial Intelligence

TL;DR: The format of the competition and the rationale behind it are discussed, the results for both tracks are reported, and comparison with the second ASP competition and state-of-the-art solutions for some of the benchmark domains are discussed.

...read moreread less

Abstract: Answer Set Programming (ASP) is a well-established paradigm of declarative programming in close relationship with other declarative formalisms such as SAT Modulo Theories, Constraint Handling Rules, FO(.), PDDL and many others. Since its first informal editions, ASP systems have been compared in the now well-established ASP Competition. The Third (Open) ASP Competition, as the sequel to the ASP Competitions Series held at the University of Potsdam in Germany (2006-2007) and at the University of Leuven in Belgium in 2009, took place at the University of Calabria (Italy) in the first half of 2011. Participants competed on a pre-selected collection of benchmark problems, taken from a variety of domains as well as real world applications. The Competition ran on two tracks: the Model and Solve (MS and the System Track, run on the basis of fixed, public problem encodings, written in a standard ASP language. This paper discusses the format of the Competition and the rationale behind it, then reports the results for both tracks. Comparison with the second ASP competition and state-of-the-art solutions for some of the benchmark domains is eventually discussed. To appear in Theory and Practice of Logic Programming (TPLP).

...read moreread less

Posted Content•

Iterative Join-Graph Propagation

[...]

Rina Dechter¹, Kalev Kask¹, Robert Mateescu¹•Institutions (1)

University of California, Irvine¹

12 Dec 2012-arXiv: Artificial Intelligence

TL;DR: In this paper, an iterative version of join-tree clustering that applies the message passing of join tree clustering algorithm to joingraphs rather than to join-trees, iteratively.

...read moreread less

Abstract: The paper presents an iterative version of join-tree clustering that applies the message passing of join-tree clustering algorithm to join-graphs rather than to join-trees, iteratively. It is inspired by the success of Pearl's belief propagation algorithm as an iterative approximation scheme on one hand, and by a recently introduced mini-clustering i. success as an anytime approximation method, on the other. The proposed Iterative Join-graph Propagation IJGP belongs to the class of generalized belief propagation methods, recently proposed using analogy with algorithms in statistical physics. Empirical evaluation of this approach on a number of problem classes demonstrates that even the most time-efficient variant is almost always superior to IBP and MC i, and is sometimes more accurate by as much as several orders of magnitude.

...read moreread less

Posted Content•

Expectation Maximization and Complex Duration Distributions for Continuous Time Bayesian Networks

[...]

Uri Nodelman¹, Christian R. Shelton², Daphne Koller¹•Institutions (2)

Stanford University¹, University of California, Riverside²

04 Jul 2012-arXiv: Artificial Intelligence

TL;DR: The EM algorithm is used to extend the representation of CTBNs to allow a much richer class of transition durations distributions, known as phase distributions, which are a highly expressive semi-parametric representation, which can approximate any duration distribution arbitrarily closely.

...read moreread less

Abstract: Continuous time Bayesian networks (CTBNs) describe structured stochastic processes with finitely many states that evolve over continuous time. A CTBN is a directed (possibly cyclic) dependency graph over a set of variables, each of which represents a finite state continuous time Markov process whose transition model is a function of its parents. We address the problem of learning the parameters and structure of a CTBN from partially observed data. We show how to apply expectation maximization (EM) and structural expectation maximization (SEM) to CTBNs. The availability of the EM algorithm allows us to extend the representation of CTBNs to allow a much richer class of transition durations distributions, known as phase distributions. This class is a highly expressive semi-parametric representation, which can approximate any duration distribution arbitrarily closely. This extension to the CTBN framework addresses one of the main limitations of both CTBNs and DBNs - the restriction to exponentially / geometrically distributed duration. We present experimental results on a real data set of people's life spans, showing that our algorithm learns reasonable models - structure and parameters - from partially observed data, and, with the use of phase distributions, achieves better performance than DBNs.

...read moreread less

Collapse