
Showing papers in "Journal of Artificial Intelligence Research in 2005"


Journal ArticleDOI
TL;DR: This work presents a randomized point-based value iteration algorithm called PERSEUS, which backs up only a (randomly selected) subset of points in the belief set, sufficient for improving the value of each belief point in the set.
Abstract: Partially observable Markov decision processes (POMDPs) form an attractive and principled framework for agent planning under uncertainty. Point-based approximate techniques for POMDPs compute a policy based on a finite set of points collected in advance from the agent's belief space. We present a randomized point-based value iteration algorithm called PERSEUS. The algorithm performs approximate value backup stages, ensuring that in each backup stage the value of each point in the belief set is improved; the key observation is that a single backup may improve the value of many belief points. Contrary to other point-based methods, PERSEUS backs up only a (randomly selected) subset of points in the belief set, sufficient for improving the value of each belief point in the set. We show how the same idea can be extended to dealing with continuous action spaces. Experimental results show the potential of PERSEUS in large scale POMDP problems.

674 citations
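For readers who want to see the shape of the randomized backup stage described in the abstract above, here is a minimal Python sketch. The flat-array POMDP layout (per-action matrices T, O, R) and the function names are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def point_backup(b, V, T, O, R, gamma):
    """Back up a single belief b against the current vector set V.
    Assumed layout: T[a] is an |S|x|S| transition matrix, O[a] an
    |S|x|Obs| observation matrix, R[a] an |S| reward vector."""
    best_alpha, best_val = None, -np.inf
    for a in range(len(T)):
        alpha_a = R[a].astype(float)
        for o in range(O[a].shape[1]):
            # g_{a,o} for each alpha in V; keep the one that is best for b
            gs = [T[a] @ (O[a][:, o] * alpha) for alpha in V]
            alpha_a += gamma * max(gs, key=lambda g: b @ g)
        if b @ alpha_a > best_val:
            best_alpha, best_val = alpha_a, b @ alpha_a
    return best_alpha

def perseus_backup_stage(B, V, T, O, R, gamma, rng):
    """One PERSEUS stage: improve (or preserve) the value of every belief
    in B while backing up only a randomly chosen subset of them."""
    value = lambda b, vecs: max(b @ a for a in vecs)
    V_new, todo = [], list(range(len(B)))
    while todo:
        b = B[rng.choice(todo)]
        alpha = point_backup(b, V, T, O, R, gamma)
        if b @ alpha < value(b, V):          # fall back on the old best vector
            alpha = max(V, key=lambda a: b @ a)
        V_new.append(alpha)
        # beliefs whose value is already at least as good need no backup
        todo = [i for i in todo if value(B[i], V_new) < value(B[i], V)]
    return V_new
```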


Journal ArticleDOI
TL;DR: A novel approach to the automatic acquisition of taxonomies or concept hierarchies from a text corpus, based on Formal Concept Analysis, which models the context of a term as a vector of syntactic dependencies automatically acquired from the text corpus with a linguistic parser.
Abstract: We present a novel approach to the automatic acquisition of taxonomies or concept hierarchies from a text corpus. The approach is based on Formal Concept Analysis (FCA), a method mainly used for the analysis of data, i.e. for investigating and processing explicitly given information. We follow Harris' distributional hypothesis and model the context of a certain term as a vector representing syntactic dependencies which are automatically acquired from the text corpus with a linguistic parser. On the basis of this context information, FCA produces a lattice that we convert into a special kind of partial order constituting a concept hierarchy. The approach is evaluated by comparing the resulting concept hierarchies with hand-crafted taxonomies for two domains: tourism and finance. We also directly compare our approach with hierarchical agglomerative clustering as well as with Bi-Section-KMeans as an instance of a divisive clustering algorithm. Furthermore, we investigate the impact of using different measures weighting the contribution of each attribute as well as of applying a particular smoothing technique to cope with data sparseness.

587 citations
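As a toy illustration of the FCA step only (not the authors' system, which builds its attributes from parsed syntactic dependencies), the formal concepts of a small context can be computed by closing the object intents under intersection:

```python
def formal_concepts(context):
    """Formal concepts of a context given as {object: set of attributes}.
    Every concept intent is an intersection of object intents (the empty
    intersection being the full attribute set), so closing the object
    intents under pairwise intersection yields all concept intents."""
    all_attrs = frozenset().union(*context.values())
    intents = {all_attrs}
    new = {frozenset(a) for a in context.values()}
    while new:
        intents |= new
        new = {a & b for a in intents for b in intents} - intents
    return [(frozenset(o for o, attrs in context.items() if intent <= attrs),
             intent) for intent in intents]

# Tiny hypothetical example in the spirit of the tourism domain:
# formal_concepts({"hotel": {"bookable", "accommodation"},
#                  "apartment": {"bookable", "accommodation", "self-catering"},
#                  "excursion": {"bookable", "activity"}})
# contains, among others, ({hotel, apartment}, {bookable, accommodation}).
```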


Journal ArticleDOI
TL;DR: In this paper, the authors extend the framework of partially observable Markov decision processes (POMDPs) to multi-agent settings by incorporating the notion of agent models into the state space.
Abstract: This paper extends the framework of partially observable Markov decision processes (POMDPs) to multi-agent settings by incorporating the notion of agent models into the state space. Agents maintain beliefs over physical states of the environment and over models of other agents, and they use Bayesian updates to maintain their beliefs over time. The solutions map belief states to actions. Models of other agents may include their belief states and are related to agent types considered in games of incomplete information. We express the agents' autonomy by postulating that their models are not directly manipulable or observable by other agents. We show that important properties of POMDPs, such as convergence of value iteration, the rate of convergence, and piece-wise linearity and convexity of the value functions carry over to our framework. Our approach complements a more traditional approach to interactive settings which uses Nash equilibria as a solution paradigm. We seek to avoid some of the drawbacks of equilibria which may be non-unique and do not capture off-equilibrium behaviors. We do so at the cost of having to represent, process and continuously revise models of other agents. Since the agent's beliefs may be arbitrarily nested, the optimal solutions to decision making problems are only asymptotically computable. However, approximate belief updates and approximately optimal plans are computable. We illustrate our framework using a simple application domain, and we show examples of belief updates and value functions.

315 citations


Journal ArticleDOI
TL;DR: A model free, heuristic reinforcement learning algorithm that aims at finding good deterministic policies based on weighting the original value function and the risk, which was successfully applied to the control of a feed tank with stochastic inflows that lies upstream of a distillation column.
Abstract: In this paper, we consider Markov Decision Processes (MDPs) with error states. Error states are those states entering which is undesirable or dangerous. We define the risk with respect to a policy as the probability of entering such a state when the policy is pursued. We consider the problem of finding good policies whose risk is smaller than some user-specified threshold, and formalize it as a constrained MDP with two criteria. The first criterion corresponds to the value function originally given. We will show that the risk can be formulated as a second criterion function based on a cumulative return, whose definition is independent of the original value function. We present a model free, heuristic reinforcement learning algorithm that aims at finding good deterministic policies. It is based on weighting the original value function and the risk. The weight parameter is adapted in order to find a feasible solution for the constrained problem that has a good performance with respect to the value function. The algorithm was successfully applied to the control of a feed tank with stochastic inflows that lies upstream of a distillation column. This control task was originally formulated as an optimal control problem with chance constraints, and it was solved under certain assumptions on the model to obtain an optimal solution. The power of our learning algorithm is that it can be used even when some of these restrictive assumptions are relaxed.

283 citations
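A rough Python sketch of the kind of weighted, model-free scheme the abstract describes is given below. The environment interface (env.reset(), env.actions(s), env.step(a) returning (next_state, reward, done, entered_error_state)) and the weight-adaptation rule are assumptions for illustration, not the paper's exact algorithm.

```python
import random
from collections import defaultdict, deque

def risk_weighted_q_learning(env, episodes, gamma=0.99, lr=0.1, omega=0.5,
                             eta=0.01, risk_threshold=0.05, epsilon=0.1):
    """Sketch: learn Q for the original return and R for the risk criterion
    (probability of ever entering an error state), act greedily on the
    weighted combination Q - omega * R, and adapt omega from the empirical
    risk of recent episodes."""
    Q, R = defaultdict(float), defaultdict(float)
    recent = deque(maxlen=100)        # 1 if an episode hit an error state
    for _ in range(episodes):
        s, done, hit_error = env.reset(), False, False
        while not done:
            acts = env.actions(s)
            score = lambda x: Q[(s, x)] - omega * R[(s, x)]
            a = random.choice(acts) if random.random() < epsilon else max(acts, key=score)
            s2, reward, done, is_error = env.step(a)
            hit_error = hit_error or is_error
            if done:
                v_next, r_next = 0.0, 0.0
            else:
                nxt = max(env.actions(s2), key=lambda x: Q[(s2, x)] - omega * R[(s2, x)])
                v_next, r_next = Q[(s2, nxt)], R[(s2, nxt)]
            Q[(s, a)] += lr * (reward + gamma * v_next - Q[(s, a)])
            R[(s, a)] += lr * ((1.0 if is_error else 0.0) + r_next - R[(s, a)])
            s = s2
        recent.append(1.0 if hit_error else 0.0)
        # raise the weight on risk while the policy is still too risky
        omega = max(0.0, omega + eta * (sum(recent) / len(recent) - risk_threshold))
    return Q, R, omega
```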


Journal ArticleDOI
TL;DR: This work describes a scalable approach to POMDP planning which uses low-dimensional representations of the belief space and demonstrates how a variant of Principal Components Analysis (PCA) called Exponential family PCA can be used to compress certain kinds of large real-world POMDPs and find policies for these problems.
Abstract: Standard value function approaches to finding policies for Partially Observable Markov Decision Processes (POMDPs) are generally considered to be intractable for large models. The intractability of these algorithms is to a large extent a consequence of computing an exact, optimal policy over the entire belief space. However, in real-world POMDP problems, computing the optimal policy for the full belief space is often unnecessary for good control even for problems with complicated policy classes. The beliefs experienced by the controller often lie near a structured, low-dimensional subspace embedded in the high-dimensional belief space. Finding a good approximation to the optimal value function for only this subspace can be much easier than computing the full value function. We introduce a new method for solving large-scale POMDPs by reducing the dimensionality of the belief space. We use Exponential family Principal Components Analysis (Collins, Dasgupta, & Schapire, 2002) to represent sparse, high-dimensional belief spaces using small sets of learned features of the belief state. We then plan only in terms of the low-dimensional belief features. By planning in this low-dimensional space, we can find policies for POMDP models that are orders of magnitude larger than models that can be handled by conventional techniques. We demonstrate the use of this algorithm on a synthetic problem and on mobile robot navigation tasks.

244 citations
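A much-simplified stand-in for the compression step, using plain linear PCA instead of the Exponential family PCA the paper relies on (linear PCA ignores the fact that beliefs are non-negative and sum to one, which is exactly what E-PCA is designed to handle):

```python
import numpy as np

def belief_compression_pca(beliefs, n_features):
    """Learn a low-dimensional feature basis from sampled beliefs and return
    compress/reconstruct functions.  `beliefs` is an (n_samples, n_states)
    array of belief vectors."""
    B = np.asarray(beliefs, dtype=float)
    mean = B.mean(axis=0)
    _, _, vt = np.linalg.svd(B - mean, full_matrices=False)
    basis = vt[:n_features]                      # learned belief features

    def compress(b):
        return (np.asarray(b) - mean) @ basis.T

    def reconstruct(z):
        b = np.clip(np.asarray(z) @ basis + mean, 0.0, None)
        return b / b.sum()                       # project back onto the simplex

    return compress, reconstruct
```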


Journal ArticleDOI
TL;DR: A crossover operator for evolutionary algorithms with real values, based on the statistical theory of population distributions, that takes into account the localization and dispersion features of the best individuals of the population so that these features are inherited by the offspring.
Abstract: In this paper we propose a crossover operator for evolutionary algorithms with real values that is based on the statistical theory of population distributions. The operator is based on the theoretical distribution of the values of the genes of the best individuals in the population. The proposed operator takes into account the localization and dispersion features of the best individuals of the population with the objective that these features would be inherited by the offspring. Our aim is the optimization of the balance between exploration and exploitation in the search process. In order to test the efficiency and robustness of this crossover, we have used a set of functions to be optimized with regard to different criteria, such as multimodality, separability, regularity and epistasis. With this set of functions we can extract conclusions as a function of the problem at hand. We analyze the results using ANOVA and multiple comparison statistical tests. As an example of how our crossover can be used to solve artificial intelligence problems, we have applied the proposed model to the problem of obtaining the weight of each network in an ensemble of neural networks. The results obtained are above the performance of standard methods.

225 citations
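A hedged sketch of how a crossover guided by the statistics of the best individuals might look; the particular interval construction and recombination rule below are illustrative simplifications, not the exact operator proposed in the paper.

```python
import numpy as np

def statistics_based_crossover(population, fitness, parent, n_best=5,
                               z=1.96, rng=None):
    """Estimate the localisation (mean) and dispersion (std) of each gene
    from the n_best individuals, form a confidence-style interval around the
    mean, and pull the parent toward a point sampled from that interval.
    `population` is an (n, d) array; fitness is assumed to be maximised."""
    rng = rng or np.random.default_rng()
    best = population[np.argsort(fitness)[-n_best:]]
    mean = best.mean(axis=0)
    half = z * best.std(axis=0, ddof=1) / np.sqrt(n_best)
    anchor = rng.uniform(mean - half, mean + half)   # per-gene sample
    beta = rng.random(parent.shape)                  # per-gene mixing factor
    return parent + beta * (anchor - parent)

# Usage: offspring = statistics_based_crossover(pop, fit, pop[i])
```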


Journal ArticleDOI
TL;DR: Two automated methods that learn relevant information from previous experience in a domain and use it to solve new problem instances are presented and compared; the results indicate a large reduction in search effort in those complex domains where structural information can be inferred.
Abstract: Despite recent progress in AI planning, many benchmarks remain challenging for current planners. In many domains, the performance of a planner can greatly be improved by discovering and exploiting information about the domain structure that is not explicitly encoded in the initial PDDL formulation. In this paper we present and compare two automated methods that learn relevant information from previous experience in a domain and use it to solve new problem instances. Our methods share a common four-step strategy. First, a domain is analyzed and structural information is extracted, then macro-operators are generated based on the previously discovered structure. A filtering and ranking procedure selects the most useful macro-operators. Finally, the selected macros are used to speed up future searches. We have successfully used such an approach in the fourth international planning competition IPC-4. Our system, Macro-FF, extends Hoffmann's state-of-the-art planner FF 2.3 with support for two kinds of macro-operators, and with engineering enhancements. We demonstrate the effectiveness of our ideas on benchmarks from international planning competitions. Our results indicate a large reduction in search effort in those complex domains where structural information can be inferred.

212 citations
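The core object here, a macro-operator, is easy to make concrete. Below is the standard STRIPS composition of two operators into a macro; Macro-FF's contribution lies in the domain analysis, filtering and ranking built on top of such macros, not in this construction itself.

```python
def compose_macro(op1, op2):
    """Compose two STRIPS operators into a macro-operator; op2 is applied
    immediately after op1.  Each operator is (name, pre, add, delete) with
    sets of facts.  Returns None if op1 destroys a precondition of op2
    without re-adding it, in which case the pair cannot form a macro."""
    n1, pre1, add1, del1 = op1
    n2, pre2, add2, del2 = op2
    if del1 & (pre2 - add1):
        return None
    pre = pre1 | (pre2 - add1)        # what op2 needs and op1 does not provide
    add = (add1 - del2) | add2        # effects of op1 that survive op2, plus op2's
    delete = (del1 - add2) | del2
    return (f"{n1}+{n2}", pre, add, delete)
```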


Journal ArticleDOI
TL;DR: This paper presents an empirical study of various semi-supervised learning techniques on a variety of datasets, and attempts to answer various questions such as the effect of independence or relevance amongst features, the effect of the size of the labeled and unlabeled sets and the effect of noise.
Abstract: There has been increased interest in devising learning techniques that combine unlabeled data with labeled data -- i.e. semi-supervised learning. However, to the best of our knowledge, no study has been performed across various techniques and different types and amounts of labeled and unlabeled data. Moreover, most of the published work on semi-supervised learning techniques assumes that the labeled and unlabeled data come from the same distribution. It is possible for the labeling process to be associated with a selection bias such that the distributions of data points in the labeled and unlabeled sets are different. Not correcting for such bias can result in biased function approximation with potentially poor performance. In this paper, we present an empirical study of various semi-supervised learning techniques on a variety of datasets. We attempt to answer various questions such as the effect of independence or relevance amongst features, the effect of the size of the labeled and unlabeled sets and the effect of noise. We also investigate the impact of sample-selection bias on the semi-supervised learning techniques under study and implement a bivariate probit technique particularly designed to correct for such bias.

182 citations


Journal ArticleDOI
TL;DR: An overview of the organization and results of the deterministic part of the 4th International Planning Competition, i.e., of the part concerned with evaluating systems doing deterministic planning, is provided.
Abstract: We provide an overview of the organization and results of the deterministic part of the 4th International Planning Competition, i.e., of the part concerned with evaluating systems doing deterministic planning. IPC-4 attracted even more competing systems than its already large predecessors, and the competition event was revised in several important respects. After giving an introduction to the IPC, we briefly explain the main differences between the deterministic part of IPC-4 and its predecessors. We then formally introduce the language used, called PDDL2.2, which extends PDDL2.1 with derived predicates and timed initial literals. We list the competing systems and overview the results of the competition. The entire set of data is far too large to be presented in full. We provide a detailed summary; the complete data is available in an online appendix. We explain how we awarded the competition prizes.

163 citations


Journal ArticleDOI
TL;DR: The purpose is to introduce "graduality" in the selection of the best arguments, i.e. to be able to partition the set of arguments into more than the two usual subsets of "selected" and "non-selected" arguments in order to represent different levels of selection.
Abstract: Argumentation is based on the exchange and valuation of interacting arguments, followed by the selection of the most acceptable of them (for example, in order to make a decision or a choice). Starting from the framework proposed by Dung in 1995, our purpose is to introduce "graduality" in the selection of the best arguments, i.e. to be able to partition the set of arguments into more than the two usual subsets of "selected" and "non-selected" arguments in order to represent different levels of selection. Our basic idea is that an argument is all the more acceptable if it can be preferred to its attackers. First, we discuss general principles underlying a "gradual" valuation of arguments based on their interactions. Following these principles, we define several valuation models for an abstract argumentation system. Then, we introduce "graduality" in the concept of acceptability of arguments. We propose new acceptability classes and a refinement of existing classes taking advantage of an available "gradual" valuation.

163 citations
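One concrete valuation in the spirit described above (an argument is weakened by strong attackers and indirectly strengthened when its attackers are themselves attacked) can be computed by a simple fixed-point iteration. This is an illustrative instance only, not the full family of valuations studied in the paper.

```python
def gradual_valuation(arguments, attacks, iterations=200):
    """Fixed-point sketch of a gradual valuation on an abstract argumentation
    graph: an argument's value decreases with the strength of its attackers,
    and unattacked arguments receive the maximal value 1.
    `attacks` maps each argument to the set of its attackers."""
    value = {a: 1.0 for a in arguments}
    for _ in range(iterations):
        value = {a: 1.0 / (1.0 + sum(value[b] for b in attacks.get(a, ())))
                 for a in arguments}
    return value

# Example: c attacks b, b attacks a.
# gradual_valuation(['a', 'b', 'c'], {'a': {'b'}, 'b': {'c'}})
# c (unattacked) gets 1.0, b is weakened, and a is partially reinstated,
# giving three distinct levels of selection rather than a binary split.
```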


Journal ArticleDOI
TL;DR: In this article, the authors investigated complexity issues related to pure Nash equilibria of strategic games and showed that, even in very restrictive settings, determining whether a game has a pure Nash equilibrium is NP-hard.
Abstract: We investigate complexity issues related to pure Nash equilibria of strategic games. We show that, even in very restrictive settings, determining whether a game has a pure Nash Equilibrium is NP-hard, while deciding whether a game has a strong Nash equilibrium is Σ2P-complete. We then study practically relevant restrictions that lower the complexity. In particular, we are interested in quantitative and qualitative restrictions of the way each player's payoff depends on moves of other players. We say that a game has small neighborhood if the utility function for each player depends only on (the actions of) a logarithmically small number of other players. The dependency structure of a game G can be expressed by a graph G(G) or by a hypergraph H(G). By relating Nash equilibrium problems to constraint satisfaction problems (CSPs), we show that if G has small neighborhood and if H(G) has bounded hypertree width (or if G(G) has bounded treewidth), then finding pure Nash and Pareto equilibria is feasible in polynomial time. If the game is graphical, then these problems are LOGCFL-complete and thus in the class NC2 of highly parallelizable problems.
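To make the objects of study concrete, a brute-force enumeration of pure Nash equilibria is easy to write; its exponentially large profile space is precisely what the paper's restrictions (small neighborhoods, bounded treewidth or hypertree width) are designed to tame. The game interface below is an assumption for illustration.

```python
from itertools import product

def pure_nash_equilibria(n_players, strategies, utility):
    """Brute-force search for pure Nash equilibria of a strategic game.
    `strategies[i]` is the list of player i's strategies and
    `utility(i, profile)` returns player i's payoff for a strategy profile.
    A profile is an equilibrium if no player has a profitable unilateral
    deviation."""
    equilibria = []
    for profile in product(*strategies):
        stable = True
        for i in range(n_players):
            current = utility(i, profile)
            for s in strategies[i]:
                deviation = profile[:i] + (s,) + profile[i + 1:]
                if utility(i, deviation) > current:
                    stable = False
                    break
            if not stable:
                break
        if stable:
            equilibria.append(profile)
    return equilibria
```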

Journal ArticleDOI
TL;DR: The overall investigation gives a rare example of a successful analysis of the connections between typical-case problem structure and search performance, and gives hints on how the topological phenomena might be automatically recognizable by domain analysis techniques.
Abstract: Between 1998 and 2004, the planning community has seen vast progress in terms of the sizes of benchmark examples that domain-independent planners can tackle successfully. The key technique behind this progress is the use of heuristic functions based on relaxing the planning task at hand, where the relaxation is to assume that all delete lists are empty. The unprecedented success of such methods, in many commonly used benchmark examples, calls for an understanding of what classes of domains these methods are well suited for. In the investigation at hand, we derive a formal background to such an understanding. We perform a case study covering a range of 30 commonly used STRIPS and ADL benchmark domains, including all examples used in the first four international planning competitions. We prove connections between domain structure and local search topology (heuristic cost surface properties) under an idealized version of the heuristic functions used in modern planners. The idealized heuristic function is called h+, and differs from the practically used functions in that it returns the length of an optimal relaxed plan, which is NP-hard to compute. We identify several key characteristics of the topology under h+, concerning the existence/non-existence of unrecognized dead ends, as well as the existence/non-existence of constant upper bounds on the difficulty of escaping local minima and benches. These distinctions divide the (set of all) planning domains into a taxonomy of classes of varying h+ topology. As it turns out, many of the 30 investigated domains lie in classes with a relatively easy topology. Most particularly, 12 of the domains lie in classes where FF's search algorithm, provided with h+, is a polynomial solving mechanism. We also present results relating h+ to its approximation as implemented in FF. The behavior regarding dead ends is provably the same. We summarize the results of an empirical investigation showing that, in many domains, the topological qualities of h+ are largely inherited by the approximation. The overall investigation gives a rare example of a successful analysis of the connections between typical-case problem structure and search performance. The theoretical investigation also gives hints on how the topological phenomena might be automatically recognizable by domain analysis techniques. We outline some preliminary steps we have made in that direction.
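For concreteness, here is a small sketch of the delete relaxation underlying both h+ and FF's approximation: ignore delete lists, grow a relaxed planning graph, and extract a (not necessarily optimal) relaxed plan backwards. Computing h+ exactly would require an optimal relaxed plan, which is NP-hard; the greedy extraction below stands in for the practical approximation and is not the authors' code.

```python
def relaxed_plan_length(init, goal, actions):
    """FF-style approximation of h+ for a STRIPS task.  `actions` is a list
    of (name, preconditions, add_effects) with delete lists dropped; `init`
    and `goal` are sets of facts.  Returns None on a relaxed dead end."""
    layers, facts, applied = [frozenset(init)], set(init), []
    while not goal <= facts:
        newly, layer_acts = set(), []
        for name, pre, add in actions:
            if pre <= facts and not add <= facts:
                newly |= add - facts
                layer_acts.append((name, pre, add))
        if not newly:
            return None                  # goal unreachable even when relaxed
        facts |= newly
        layers.append(frozenset(facts))
        applied.append(layer_acts)
    # backward extraction of a relaxed plan
    plan, needed = [], set(goal)
    for depth in range(len(applied) - 1, -1, -1):
        for name, pre, add in applied[depth]:
            achieved = needed & (add - layers[depth])
            if achieved:                 # this action achieves a needed fact here
                plan.append(name)
                needed -= achieved
                needed |= pre
    return len(plan)
```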

Journal ArticleDOI
TL;DR: The 2004 International Planning Competition, IPC-4, included a probabilistic planning track for the first time; this paper describes the new domain specification language created for the track, the evaluation methodology, the competition domains developed, and the results of the participating teams.
Abstract: The 2004 International Planning Competition, IPC-4, included a probabilistic planning track for the first time. We describe the new domain specification language we created for the track, our evaluation methodology, the competition domains we developed, and the results of the participating teams.

Journal ArticleDOI
TL;DR: This work introduces a number of natural description logics and performs a detailed analysis of their decidability and computational complexity, finding that naive extensions with key constraints easily lead to undecidability, whereas more careful extensions yield NExpTime-complete DLs for a variety of useful concrete domains.
Abstract: Many description logics (DLs) combine knowledge representation on an abstract, logical level with an interface to "concrete" domains like numbers and strings with built-in predicates such as <, +, and prefix-of. These hybrid DLs have turned out to be useful in several application areas, such as reasoning about conceptual database models. We propose to further extend such DLs with key constraints that allow the expression of statements like "US citizens are uniquely identified by their social security number". Based on this idea, we introduce a number of natural description logics and perform a detailed analysis of their decidability and computational complexity. It turns out that naive extensions with key constraints easily lead to undecidability, whereas more careful extensions yield NExpTime-complete DLs for a variety of useful concrete domains.

Journal ArticleDOI
TL;DR: The version of the GPT planner used in the probabilistic track of the 4th International Planning Competition (IPC-4), called mGPT, solves Markov Decision Processes specified in the PPDDL language by extracting and using different classes of lower bounds along with various heuristic-search algorithms.
Abstract: We describe the version of the GPT planner used in the probabilistic track of the 4th International Planning Competition (IPC-4). This version, called mGPT, solves Markov Decision Processes specified in the PPDDL language by extracting and using different classes of lower bounds along with various heuristic-search algorithms. The lower bounds are extracted from deterministic relaxations where the alternative probabilistic effects of an action are mapped into different, independent, deterministic actions. The heuristic-search algorithms use these lower bounds for focusing the updates and delivering a consistent value function over all states reachable from the initial state and the greedy policy.
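The relaxation mentioned above can be sketched in a few lines: each probabilistic outcome becomes its own deterministic action, so a heuristic computed on the determinized problem lower-bounds the expected cost-to-go in the original MDP (the best-case trajectory can never cost more than the expectation). The data layout is an assumption for illustration.

```python
def all_outcomes_determinization(prob_actions):
    """Split every probabilistic action, given as (name, precondition,
    [(prob, effect), ...]), into one deterministic action per outcome,
    discarding the probabilities.  Shortest-path costs in the resulting
    deterministic problem are lower bounds on the probabilistic cost-to-go
    (unit action costs assumed in this sketch)."""
    det_actions = []
    for name, precondition, outcomes in prob_actions:
        for i, (_prob, effect) in enumerate(outcomes):
            det_actions.append((f"{name}_o{i}", precondition, effect))
    return det_actions
```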

Journal ArticleDOI
TL;DR: This paper uses the annotated COCONUT corpus of task-oriented design dialogues to develop feature sets based on Dale and Reiter's incremental model, Brennan and Clark's conceptual pact model, and Jordan's intentional influences model and uses these feature sets in a machine learning experiment to automatically learn a model of content selection for object descriptions.
Abstract: A fundamental requirement of any task-oriented dialogue system is the ability to generate object descriptions that refer to objects in the task domain. The subproblem of content selection for object descriptions in task-oriented dialogue has been the focus of much previous work and a large number of models have been proposed. In this paper, we use the annotated COCONUT corpus of task-oriented design dialogues to develop feature sets based on Dale and Reiter's (1995) incremental model, Brennan and Clark's (1996) conceptual pact model, and Jordan's (2000b) intentional influences model, and use these feature sets in a machine learning experiment to automatically learn a model of content selection for object descriptions. Since Dale and Reiter's model requires a representation of discourse structure, the corpus annotations are used to derive a representation based on Grosz and Sidner's (1986) theory of the intentional structure of discourse, as well as two very simple representations of discourse structure based purely on recency. We then apply the rule-induction program RIPPER to train and test the content selection component of an object description generator on a set of 393 object descriptions from the corpus. To our knowledge, this is the first reported experiment of a trainable content selection component for object description generation in dialogue. Three separate content selection models that are based on the three theoretical models, all independently achieve accuracies significantly above the MAJORITY CLASS baseline (17%) on unseen test data, with the intentional influences model (42.4%) performing significantly better than either the incremental model (30.4%) or the conceptual pact model (28.9%). But the best performing models combine all the feature sets, achieving accuracies near 60%. Surprisingly, a simple recency-based representation of discourse structure does as well as one based on intentional structure. To our knowledge, this is also the first empirical comparison of a representation of Grosz and Sidner's model of discourse structure with a simpler model for any generation task.

Journal ArticleDOI
TL;DR: This article introduces RMTDP (Role-based Markov Team Decision Problem), a new distributed POMDP model for analysis of role allocations, and describes a role allocation technique that takes into account future uncertainties in the domain; prior work in multiagent role allocation has failed to address such uncertainties.
Abstract: Many current large-scale multiagent team implementations can be characterized as following the "belief-desire-intention" (BDI) paradigm, with explicit representation of team plans. Despite their promise, current BDI team approaches lack tools for quantitative performance analysis under uncertainty. Distributed partially observable Markov decision problems (POMDPs) are well suited for such analysis, but the complexity of finding optimal policies in such models is highly intractable. The key contribution of this article is a hybrid BDI-POMDP approach, where BDI team plans are exploited to improve POMDP tractability and POMDP analysis improves BDI team plan performance. Concretely, we focus on role allocation, a fundamental problem in BDI teams: which agents to allocate to the different roles in the team. The article provides three key contributions. First, we describe a role allocation technique that takes into account future uncertainties in the domain; prior work in multiagent role allocation has failed to address such uncertainties. To that end, we introduce RMTDP (Role-based Markov Team Decision Problem), a new distributed POMDP model for analysis of role allocations. Our technique gains in tractability by significantly curtailing RMTDP policy search; in particular, BDI team plans provide incomplete RMTDP policies, and the RMTDP policy search fills the gaps in such incomplete policies by searching for the best role allocation. Our second key contribution is a novel decomposition technique to further improve RMTDP policy search efficiency. Even though limited to searching role allocations, there are still combinatorially many role allocations, and evaluating each in RMTDP to identify the best is extremely difficult. Our decomposition technique exploits the structure in the BDI team plans to significantly prune the search space of role allocations. Our third key contribution is a significantly faster policy evaluation algorithm suited for our BDI-POMDP hybrid approach. Finally, we also present experimental results from two domains: mission rehearsal simulation and RoboCupRescue disaster rescue simulation.

Journal ArticleDOI
TL;DR: The hypothesis is that word-sense disambiguation requires several knowledge sources in order to solve the semantic ambiguity of the words.
Abstract: In this paper we concentrate on the resolution of the lexical ambiguity that arises when a given word has several different meanings. This specific task is commonly referred to as word sense disambiguation (WSD). The task of WSD consists of assigning the correct sense to words using an electronic dictionary as the source of word definitions. We present two WSD methods based on two main methodological approaches in this research area: a knowledge-based method and a corpus-based method. Our hypothesis is that word-sense disambiguation requires several knowledge sources in order to solve the semantic ambiguity of the words. These sources can be of different kinds-- for example, syntagmatic, paradigmatic or statistical information. Our approach combines various sources of knowledge, through combinations of the two WSD methods mentioned above. Mainly, the paper concentrates on how to combine these methods and sources of information in order to achieve good results in the disambiguation. Finally, this paper presents a comprehensive study and experimental work on evaluation of the methods and their combinations.

Journal ArticleDOI
TL;DR: It is demonstrated how different combining principles as well as spatial and temporal primitives can produce NP-, PSPACE-, EXPSPACE-, 2EXPSPACE-complete, and even undecidable spatio-temporal logics out of components that are at most NP- or PSPACE-complete.
Abstract: In this paper, we construct and investigate a hierarchy of spatio-temporal formalisms that result from various combinations of propositional spatial and temporal logics such as the propositional temporal logic PTL, the spatial logics RCC-8, BRCC-8, S4u and their fragments. The obtained results give a clear picture of the trade-off between expressiveness and 'computational realisability' within the hierarchy. We demonstrate how different combining principles as well as spatial and temporal primitives can produce NP-, PSPACE-, EXPSPACE-, 2EXPSPACE-complete, and even undecidable spatio-temporal logics out of components that are at most NP- or PSPACE-complete.

Journal ArticleDOI
TL;DR: It is shown that it is possible to construct an efficient set domain propagator which compactly represents many set domains and set constraints using ROBDDs and how to incorporate less strict consistency notions into the ROBDD framework, such as set bounds, cardinality bounds and lexicographic bounds consistency.
Abstract: In this paper we present a new approach to modeling finite set domain constraint problems using Reduced Ordered Binary Decision Diagrams (ROBDDs). We show that it is possible to construct an efficient set domain propagator which compactly represents many set domains and set constraints using ROBDDs. We demonstrate that the ROBDD-based approach provides unprecedented flexibility in modeling constraint satisfaction problems, leading to performance improvements. We also show that the ROBDD-based modeling approach can be extended to the modeling of integer and multiset constraint problems in a straightforward manner. Since domain propagation is not always practical, we also show how to incorporate less strict consistency notions into the ROBDD framework, such as set bounds, cardinality bounds and lexicographic bounds consistency. Finally, we present experimental results that demonstrate the ROBDD-based solver performs better than various more conventional constraint solvers on several standard set constraint problems.
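As an example of the weaker consistency notions mentioned in the abstract, set-bounds propagation for a single X ⊆ Y constraint takes only a few lines; the paper's own propagators instead manipulate ROBDD representations of the full set domains, which this sketch does not attempt to reproduce.

```python
def propagate_subset(x_lb, x_ub, y_lb, y_ub):
    """Set-bounds propagation for the constraint X ⊆ Y.  Each set variable is
    a (lower_bound, upper_bound) pair of Python sets: elements known to be in
    the set, and elements still allowed to be in it."""
    x_ub = x_ub & y_ub          # X can only contain what Y may contain
    y_lb = y_lb | x_lb          # Y must contain everything X must contain
    if not x_lb <= x_ub or not y_lb <= y_ub:
        raise ValueError("inconsistent: X ⊆ Y is unsatisfiable")
    return (x_lb, x_ub), (y_lb, y_ub)

# Example: X ⊇ {1}, X ⊆ {1, 2, 3}, Y ⊆ {1, 2}
# propagate_subset({1}, {1, 2, 3}, set(), {1, 2})
# narrows X to bounds ({1}, {1, 2}) and tightens Y to ({1}, {1, 2}).
```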

Journal ArticleDOI
TL;DR: This paper addresses the issue of the number of restricted rational deals that may be required to implement a particular reallocation when it is possible to do so and constructs examples showing that this number may be exponential, even when all of the agent utility functions are monotonic.
Abstract: We examine properties of a model of resource allocation in which several agents exchange resources in order to optimise their individual holdings. The schemes discussed relate to well-known negotiation protocols proposed in earlier work and we consider a number of alternative notions of "rationality" covering both quantitative measures, e.g. cooperative and individual rationality and more qualitative forms, e.g. Pigou-Dalton transfers. While it is known that imposing particular rationality and structural restrictions may result in some reallocations of the resource set becoming unrealisable, in this paper we address the issue of the number of restricted rational deals that may be required to implement a particular reallocation when it is possible to do so. We construct examples showing that this number may be exponential (in the number of resources m), even when all of the agent utility functions are monotonic. We further show that k agents may achieve in a single deal a reallocation requiring exponentially many rational deals if at most k - 1 agents can participate, this same reallocation being unrealisable by any sequences of rational deals in which at most k - 2 agents are involved.

Journal ArticleDOI
TL;DR: RDBNs are an extension of dynamic Bayesian networks (DBNs) to first-order logic and a generalization of dynamic probabilistic relational models (DPRMs); two new forms of particle filtering for RDBNs are proposed.
Abstract: Stochastic processes that involve the creation of objects and relations over time are widespread, but relatively poorly studied. For example, accurate fault diagnosis in factory assembly processes requires inferring the probabilities of erroneous assembly operations, but doing this efficiently and accurately is difficult. Modeled as dynamic Bayesian networks, these processes have discrete variables with very large domains and extremely high dimensionality. In this paper, we introduce relational dynamic Bayesian networks (RDBNs), which are an extension of dynamic Bayesian networks (DBNs) to first-order logic. RDBNs are a generalization of dynamic probabilistic relational models (DPRMs), which we had proposed in our previous work to model dynamic uncertain domains. We first extend the Rao-Blackwellised particle filtering described in our earlier work to RDBNs. Next, we lift the assumptions associated with Rao-Blackwellization in RDBNs and propose two new forms of particle filtering. The first one uses abstraction hierarchies over the predicates to smooth the particle filter's estimates. The second employs kernel density estimation with a kernel function specifically designed for relational domains. Experiments show these two methods greatly outperform standard particle filtering on the task of assembly plan execution monitoring.
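For reference, the plain bootstrap particle filter that the paper's relational and Rao-Blackwellised variants build on looks like this; the callable-based model interface is an assumption for illustration.

```python
import numpy as np

def bootstrap_particle_filter(observations, n_particles, sample_prior,
                              sample_transition, obs_likelihood, rng=None):
    """Standard bootstrap particle filter.  The model is supplied as three
    callables: sample_prior(rng) -> state, sample_transition(state, rng) ->
    state, and obs_likelihood(obs, state) -> float."""
    rng = rng or np.random.default_rng()
    particles = [sample_prior(rng) for _ in range(n_particles)]
    filtered = []
    for obs in observations:
        # propagate each particle through the transition model
        particles = [sample_transition(p, rng) for p in particles]
        # weight by the observation likelihood
        weights = np.array([obs_likelihood(obs, p) for p in particles])
        weights = weights / weights.sum()
        # resample to concentrate particles on likely states
        idx = rng.choice(n_particles, size=n_particles, p=weights)
        particles = [particles[i] for i in idx]
        filtered.append(list(particles))
    return filtered
```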

Journal ArticleDOI
TL;DR: It is proved that it is computationally hard to simulate Winnow's behavior for learning DNF over an expanded feature space of exponentially many conjunctions, and thus that such kernel functions for Winnow are not efficiently computable.
Abstract: The paper studies machine learning problems where each example is described using a set of Boolean features and where hypotheses are represented by linear threshold elements. One method of increasing the expressiveness of learned hypotheses in this context is to expand the feature set to include conjunctions of basic features. This can be done explicitly or where possible by using a kernel function. Focusing on the well known Perceptron and Winnow algorithms, the paper demonstrates a tradeoff between the computational efficiency with which the algorithm can be run over the expanded feature space and the generalization ability of the corresponding learning algorithm. We first describe several kernel functions which capture either limited forms of conjunctions or all conjunctions. We show that these kernels can be used to efficiently run the Perceptron algorithm over a feature space of exponentially many conjunctions; however we also show that using such kernels, the Perceptron algorithm can provably make an exponential number of mistakes even when learning simple functions. We then consider the question of whether kernel functions can analogously be used to run the multiplicative-update Winnow algorithm over an expanded feature space of exponentially many conjunctions. Known upper bounds imply that the Winnow algorithm can learn Disjunctive Normal Form (DNF) formulae with a polynomial mistake bound in this setting. However, we prove that it is computationally hard to simulate Winnow's behavior for learning DNF over such a feature set. This implies that the kernel functions which correspond to running Winnow for this problem are not efficiently computable, and that there is no general construction that can run Winnow with kernels.
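The Perceptron side of the story is easy to make concrete: the kernel below counts the conjunctions of literals satisfied by both Boolean examples, and the standard dual Perceptron then implicitly operates over that exponentially large feature space. A minimal sketch under those assumptions, not the paper's code:

```python
import numpy as np

def conjunctions_kernel(x, y):
    """Number of conjunctions of literals satisfied by both Boolean vectors:
    2 ** (number of coordinates on which x and y agree)."""
    return 2.0 ** int(np.sum(np.asarray(x) == np.asarray(y)))

def kernel_perceptron(X, y, kernel=conjunctions_kernel, epochs=5):
    """Dual (kernel) Perceptron.  X is an (n, d) 0/1 array, y a vector of
    +/-1 labels; returns the dual mistake counts alpha."""
    X, y = np.asarray(X), np.asarray(y, dtype=float)
    n = len(X)
    alpha = np.zeros(n)
    gram = np.array([[kernel(X[i], X[j]) for j in range(n)] for i in range(n)])
    for _ in range(epochs):
        for i in range(n):
            if y[i] * np.dot(alpha * y, gram[:, i]) <= 0:   # mistake: update
                alpha[i] += 1.0
    return alpha

def kernel_predict(alpha, X, y, x, kernel=conjunctions_kernel):
    score = sum(a * yi * kernel(xi, x) for a, xi, yi in zip(alpha, X, y) if a)
    return 1 if score >= 0 else -1
```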

Journal ArticleDOI
TL;DR: In this article, the applicability of variable elimination to the problem of finding still-lifes is studied, considering several alternatives: variable elimination as a stand-alone algorithm, interleaved with search, and as a source of good-quality lower bounds.
Abstract: Variable elimination is a general technique for constraint processing. It is often discarded because of its high space complexity. However, it can be extremely useful when combined with other techniques. In this paper we study the applicability of variable elimination to the challenging problem of finding still-lifes. We illustrate several alternatives: variable elimination as a stand-alone algorithm, interleaved with search, and as a source of good quality lower bounds. We show that these techniques are the best known option both theoretically and empirically. In our experiments we have been able to solve the n = 20 instance, which is far beyond reach with alternative approaches.
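A generic bucket-elimination step for a max-sum problem conveys the idea behind the stand-alone algorithm; the still-life-specific modelling and the hybrid with search from the paper are not reproduced here.

```python
from itertools import product

def eliminate_variable(functions, var, domains):
    """One bucket-elimination step for a max-sum problem.  Each function is a
    pair (scope, table) where scope is a tuple of variables and table maps a
    tuple of their values to a score; `var` is maximised out."""
    bucket = [f for f in functions if var in f[0]]
    rest = [f for f in functions if var not in f[0]]
    if not bucket:
        return functions
    new_scope = tuple(sorted({v for scope, _ in bucket for v in scope} - {var}))

    def evaluate(assignment):
        return sum(table[tuple(assignment[v] for v in scope)]
                   for scope, table in bucket)

    new_table = {}
    for values in product(*(domains[v] for v in new_scope)):
        assignment = dict(zip(new_scope, values))
        new_table[values] = max(evaluate({**assignment, var: x})
                                for x in domains[var])
    return rest + [(new_scope, new_table)]

def variable_elimination(functions, order, domains):
    """Eliminate every variable in `order`; the optimum is the sum of the
    remaining constant (empty-scope) functions."""
    for var in order:
        functions = eliminate_variable(functions, var, domains)
    return sum(table[()] for _, table in functions)
```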

Journal ArticleDOI
TL;DR: In this article, the authors introduce a simple twist on the standard hidden-assignment model for generating random k-SAT instances, which appears to dramatically increase its hardness, making it much harder for algorithms to find any truth assignment, and they give theoretical and experimental evidence supporting this assertion.
Abstract: The evaluation of incomplete satisfiability solvers depends critically on the availability of hard satisfiable instances. A plausible source of such instances consists of random k-SAT formulas whose clauses are chosen uniformly from among all clauses satisfying some randomly chosen truth assignment A. Unfortunately, instances generated in this manner tend to be relatively easy and can be solved efficiently by practical heuristics. Roughly speaking, for a number of different algorithms, A acts as a stronger and stronger attractor as the formula's density increases. Motivated by recent results on the geometry of the space of satisfying truth assignments of random k-SAT and NAE-k-SAT formulas, we introduce a simple twist on this basic model, which appears to dramatically increase its hardness. Namely, in addition to forbidding the clauses violated by the hidden assignment A, we also forbid the clauses violated by its complement, so that both A and its complement are satisfying. It appears that under this "symmetrization" the effects of the two attractors largely cancel out, making it much harder for algorithms to find any truth assignment. We give theoretical and experimental evidence supporting this assertion.
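A generator for the symmetrized model is straightforward to sketch via rejection sampling: draw candidate clauses uniformly and keep only those satisfied by both the hidden assignment and its complement. The literal encoding below (signed integers) is a common convention, assumed here for illustration.

```python
import random

def symmetric_hidden_assignment_ksat(n_vars, n_clauses, k=3, seed=None):
    """Pick a hidden assignment A, then draw k-clauses uniformly among those
    satisfied by both A and its complement (equivalently, reject any clause
    violated by either).  Literals are signed integers: -3 means "not x3"."""
    rng = random.Random(seed)
    A = [rng.choice([True, False]) for _ in range(n_vars + 1)]  # 1-indexed
    complement = [not v for v in A]

    def satisfies(assignment, clause):
        return any((lit > 0) == assignment[abs(lit)] for lit in clause)

    clauses = []
    while len(clauses) < n_clauses:
        variables = rng.sample(range(1, n_vars + 1), k)
        clause = [v if rng.random() < 0.5 else -v for v in variables]
        if satisfies(A, clause) and satisfies(complement, clause):
            clauses.append(clause)
    return clauses, A
```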

Journal ArticleDOI
TL;DR: It is demonstrated that the use of standard techniques for binary CSPs in the encodings of non-binary problems is problematic and results in models that are very rarely competitive with the non-binary representation.
Abstract: A non-binary Constraint Satisfaction Problem (CSP) can be solved directly using extended versions of binary techniques. Alternatively, the non-binary problem can be translated into an equivalent binary one. In this case, it is generally accepted that the translated problem can be solved by applying well-established techniques for binary CSPs. In this paper we evaluate the applicability of the latter approach. We demonstrate that the use of standard techniques for binary CSPs in the encodings of non-binary problems is problematic and results in models that are very rarely competitive with the non-binary representation. To overcome this, we propose specialized arc consistency and search algorithms for binary encodings, and we evaluate them theoretically and empirically. We consider three binary representations: the hidden variable encoding, the dual encoding, and the double encoding. Theoretical and empirical results show that, for certain classes of non-binary constraints, binary encodings are a competitive option, and in many cases, a better one than the non-binary representation.
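Of the three encodings studied, the hidden variable encoding is the easiest to sketch: each non-binary constraint becomes a new variable ranging over its allowed tuples, linked to the original variables by binary compatibility constraints. The data layout below is an illustrative assumption.

```python
def hidden_variable_encoding(domains, constraints):
    """Hidden variable encoding of a non-binary CSP.  `domains` maps each
    variable to its value list; `constraints` is a list of (scope,
    allowed_tuples) pairs.  Each constraint becomes a hidden variable whose
    domain is its set of allowed tuples, plus one binary constraint per
    original variable stating "component i of the tuple equals the value"."""
    new_domains = dict(domains)
    binary_constraints = []                 # (var_a, var_b, allowed_pairs)
    for idx, (scope, allowed) in enumerate(constraints):
        hidden = f"h{idx}"
        new_domains[hidden] = list(allowed)
        for pos, var in enumerate(scope):
            pairs = {(tup, tup[pos]) for tup in allowed}
            binary_constraints.append((hidden, var, pairs))
    return new_domains, binary_constraints

# Example: a ternary constraint x + y = z over small domains becomes one
# hidden variable ranging over the allowed (x, y, z) triples plus three
# binary compatibility links.
```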

Journal ArticleDOI
TL;DR: The architecture of Optiplan is described and the integer programming formulation that enabled it to perform reasonably well in the international planning competition is provided.
Abstract: The Optiplan planning system is the first integer programming-based planner that successfully participated in the international planning competition. This engineering note describes the architecture of Optiplan and provides the integer programming formulation that enabled it to perform reasonably well in the competition. We also touch upon some recent developments that make integer programming encodings significantly more competitive.

Journal ArticleDOI
TL;DR: This work proposes a novel information-sharing protocol, post-task-completion sharing, for the distribution of state information between agents, and shows the improvement in the quality of estimates produced using this strategy over the widely used protocol of sharing information between nearest neighbours.
Abstract: Effective coordination of agents' actions in partially-observable domains is a major challenge of multi-agent systems research. To address this, many researchers have developed techniques that allow the agents to make decisions based on estimates of the states and actions of other agents that are typically learnt using some form of machine learning algorithm. Nevertheless, many of these approaches fail to provide an actual means by which the necessary information is made available so that the estimates can be learnt. To this end, we argue that cooperative communication of state information between agents is one such mechanism. However, in a dynamically changing environment, the accuracy and timeliness of this communicated information determine the fidelity of the learned estimates and the usefulness of the actions taken based on these. Given this, we propose a novel information-sharing protocol, post-task-completion sharing, for the distribution of state information. We then show, through a formal analysis, the improvement in the quality of estimates produced using our strategy over the widely used protocol of sharing information between nearest neighbours. Moreover, communication heuristics designed around our information-sharing principle are subjected to empirical evaluation along with other benchmark strategies (including Littman's Q-routing and Stone's TPOT-RL) in a simulated call-routing application. These studies, conducted across a range of environmental settings, show that, compared to the different benchmarks used, our strategy generates an improvement of up to 60% in the call connection rate; of more than 1000% in the ability to connect long-distance calls; and incurs as little as 0.25 of the message overhead.

Journal ArticleDOI
TL;DR: An algorithm is presented that outperforms one of the currently most successful algorithms for optimal multiple sequence alignments, Partial Expansion A*, both in time and memory and is able to calculate for the first time the optimal alignment for almost all of the problems in Reference 1 of the benchmark database BAliBASE.
Abstract: Multiple sequence alignment (MSA) is a ubiquitous problem in computational biology. Although it is NP-hard to find an optimal solution for an arbitrary number of sequences, due to the importance of this problem researchers are trying to push the limits of exact algorithms further. Since MSA can be cast as a classical path finding problem, it is attracting a growing number of AI researchers interested in heuristic search algorithms as a challenge with actual practical relevance. In this paper, we first review two previous, complementary lines of research. Based on Hirschberg's algorithm, Dynamic Programming needs O(kN^(k-1)) space to store both the search frontier and the nodes needed to reconstruct the solution path, for k sequences of length N. Best first search, on the other hand, has the advantage of bounding the search space that has to be explored using a heuristic. However, it is necessary to maintain all explored nodes up to the final solution in order to prevent the search from re-expanding them at higher cost. Earlier approaches to reduce the Closed list are either incompatible with pruning methods for the Open list, or must retain at least the boundary of the Closed list. In this article, we present an algorithm that attempts to combine the respective advantages; like A* it uses a heuristic for pruning the search space, but reduces both the maximum Open and Closed size to O(kN^(k-1)), as in Dynamic Programming. The underlying idea is to conduct a series of searches with successively increasing upper bounds, but using the DP ordering as the key for the Open priority queue. With a suitable choice of thresholds, in practice, a running time below four times that of A* can be expected. In our experiments we show that our algorithm outperforms one of the currently most successful algorithms for optimal multiple sequence alignments, Partial Expansion A*, both in time and memory. Moreover, we apply a refined heuristic based on optimal alignments not only of pairs of sequences, but of larger subsets. This idea is not new; however, to make it practically relevant we show that it is equally important to bound the heuristic computation appropriately, or the overhead can obliterate any possible gain. Furthermore, we discuss a number of improvements in time and space efficiency with regard to practical implementations. Our algorithm, used in conjunction with higher-dimensional heuristics, is able to calculate for the first time the optimal alignment for almost all of the problems in Reference 1 of the benchmark database BAliBASE.

Journal ArticleDOI
TL;DR: In this article, a simple variant of a straightforward random walk is used to model the run-time behavior of tabu search in the context of job-shop scheduling, and it is shown that the random walk model accounts for nearly all of the variability in the cost required to locate both optimal and sub-optimal solutions to random JSPs.
Abstract: Tabu search is one of the most effective heuristics for locating high-quality solutions to a diverse array of NP-hard combinatorial optimization problems. Despite the widespread success of tabu search, researchers have a poor understanding of many key theoretical aspects of this algorithm, including models of the high-level run-time dynamics and identification of those search space features that influence problem difficulty. We consider these questions in the context of the job-shop scheduling problem (JSP), a domain where tabu search algorithms have been shown to be remarkably effective. Previously, we demonstrated that the mean distance between random local optima and the nearest optimal solution is highly correlated with problem difficulty for a well-known tabu search algorithm for the JSP introduced by Taillard. In this paper, we discuss various shortcomings of this measure and develop a new model of problem difficulty that corrects these deficiencies. We show that Taillard's algorithm can be modeled with high fidelity as a simple variant of a straightforward random walk. The random walk model accounts for nearly all of the variability in the cost required to locate both optimal and sub-optimal solutions to random JSPs, and provides an explanation for differences in the difficulty of random versus structured JSPs. Finally, we discuss and empirically substantiate two novel predictions regarding tabu search algorithm behavior. First, the method for constructing the initial solution is highly unlikely to impact the performance of tabu search. Second, tabu tenure should be selected to be as small as possible while simultaneously avoiding search stagnation; values larger than necessary lead to significant degradations in performance.
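For orientation, a generic tabu search skeleton is shown below; Taillard's JSP algorithm instantiates this pattern with a specialized move neighbourhood and makespan evaluation, and the tenure parameter is exactly the quantity whose sizing the paper analyses. The interface is an assumption for illustration.

```python
from collections import deque

def tabu_search(initial, neighbours, cost, tenure=8, iterations=10000):
    """Generic tabu search skeleton (not Taillard's JSP-specific algorithm).
    `neighbours(solution)` yields (move, candidate) pairs and `cost` is
    minimised.  Recently applied moves are tabu unless they beat the best
    solution found so far (the usual aspiration criterion)."""
    current = best = initial
    best_cost = cost(initial)
    tabu = deque(maxlen=tenure)          # fixed-length tabu list
    for _ in range(iterations):
        candidates = [(m, c) for m, c in neighbours(current)
                      if m not in tabu or cost(c) < best_cost]
        if not candidates:
            break
        move, current = min(candidates, key=lambda mc: cost(mc[1]))
        tabu.append(move)
        if cost(current) < best_cost:
            best, best_cost = current, cost(current)
    return best, best_cost
```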