Showing papers in "Journal of Artificial Intelligence Research in 2009"


Journal ArticleDOI
TL;DR: This article carries out a thorough and systematic investigation of inference in extensions of the original DL-Lite logics along five axes, by adding the Boolean connectives and number restrictions to concept constructs and adopting or dropping the unique name assumption.
Abstract: The recently introduced series of description logics under the common moniker 'DL-Lite' has attracted the attention of the description logic and semantic web communities due to the low computational complexity of inference, on the one hand, and the ability to represent conceptual modeling formalisms, on the other. The main aim of this article is to carry out a thorough and systematic investigation of inference in extensions of the original DL-Lite logics along five axes: by (i) adding the Boolean connectives and (ii) number restrictions to concept constructs, (iii) allowing role hierarchies, (iv) allowing role disjointness, symmetry, asymmetry, reflexivity, irreflexivity and transitivity constraints, and (v) adopting or dropping the unique name assumption. We analyze the combined complexity of satisfiability for the resulting logics, as well as the data complexity of instance checking and answering positive existential queries. Our approach is based on embedding DL-Lite logics in suitable fragments of the one-variable first-order logic, which provides useful insights into their properties and, in particular, computational behavior.

592 citations


Journal ArticleDOI
TL;DR: This work presents a novel reasoning calculus for the description logic SHOIQ+, a knowledge representation formalism with applications in areas such as the Semantic Web, and shows significant performance improvements over state-of-the-art reasoners on several well-known ontologies.
Abstract: We present a novel reasoning calculus for the description logic SHOIQ+, a knowledge representation formalism with applications in areas such as the Semantic Web. Unnecessary nondeterminism and the construction of large models are two primary sources of inefficiency in the tableau-based reasoning calculi used in state-of-the-art reasoners. In order to reduce nondeterminism, we base our calculus on hypertableau and hyperresolution calculi, which we extend with a blocking condition to ensure termination. In order to reduce the size of the constructed models, we introduce anywhere pairwise blocking. We also present an improved nominal introduction rule that ensures termination in the presence of nominals, inverse roles, and number restrictions, a combination of DL constructs that has proven notoriously difficult to handle. Our implementation shows significant performance improvements over state-of-the-art reasoners on several well-known ontologies.

450 citations


Journal ArticleDOI
TL;DR: This work proposes a novel method, called Explicit Semantic Analysis (ESA), for fine-grained semantic interpretation of unrestricted natural language texts, which represents meaning in a high-dimensional space of concepts derived from Wikipedia, the largest encyclopedia in existence.
Abstract: Adequate representation of natural language semantics requires access to vast amounts of common sense and domain-specific world knowledge. Prior work in the field was based on purely statistical techniques that did not make use of background knowledge, on limited lexicographic knowledge bases such as WordNet, or on huge manual efforts such as the CYC project. Here we propose a novel method, called Explicit Semantic Analysis (ESA), for fine-grained semantic interpretation of unrestricted natural language texts. Our method represents meaning in a high-dimensional space of concepts derived from Wikipedia, the largest encyclopedia in existence. We explicitly represent the meaning of any text in terms of Wikipedia-based concepts. We evaluate the effectiveness of our method on text categorization and on computing the degree of semantic relatedness between fragments of natural language text. Using ESA results in significant improvements over the previous state of the art in both tasks. Importantly, due to the use of natural concepts, the ESA model is easy to explain to human users.
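
As a rough, self-contained illustration of the idea (not the authors' implementation), the following sketch builds a TF-IDF index over a three-article toy stand-in for Wikipedia, maps texts to weighted concept vectors, and scores semantic relatedness as the cosine between those vectors. All article texts are invented for the example.

```python
import math
from collections import Counter, defaultdict

# Toy stand-in for the Wikipedia collection that ESA indexes; in the real
# system each concept corresponds to a full Wikipedia article.
CONCEPTS = {
    "Computer science": "algorithm computation software data programming",
    "Dog": "dog domestic animal pet bark companion",
    "Machine learning": "algorithm data model training prediction learning",
}

def build_index(docs):
    """Inverted index: word -> {concept: tf-idf weight}."""
    n = len(docs)
    df = Counter(w for text in docs.values() for w in set(text.split()))
    index = defaultdict(dict)
    for concept, text in docs.items():
        for w, tf in Counter(text.split()).items():
            index[w][concept] = tf * math.log(n / df[w])
    return index

INDEX = build_index(CONCEPTS)

def interpret(text):
    """Represent a text fragment as a weighted vector of concepts."""
    vec = Counter()
    for w in text.lower().split():
        for concept, weight in INDEX.get(w, {}).items():
            vec[concept] += weight
    return vec

def relatedness(text1, text2):
    """Semantic relatedness as the cosine between two concept vectors."""
    v1, v2 = interpret(text1), interpret(text2)
    dot = sum(v1[c] * v2[c] for c in v1.keys() & v2.keys())
    norm = lambda v: math.sqrt(sum(x * x for x in v.values()))
    return dot / (norm(v1) * norm(v2)) if v1 and v2 else 0.0

print(relatedness("training data for a model", "software and programming"))
```

Because each dimension is a named Wikipedia concept rather than a latent factor, such a vector can be shown to a user directly, which is the sense in which the abstract calls the ESA model easy to explain.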

420 citations


Journal ArticleDOI
TL;DR: eSIP (efficient Single-robot Informative Path planning) is an approximation algorithm for optimizing the path of a single robot, and sequential allocation is a general technique that can extend any single-robot planning algorithm, such as eSIP, to the multi-robot problem.
Abstract: The need for efficient monitoring of spatio-temporal dynamics in large environmental applications, such as the water quality monitoring in rivers and lakes, motivates the use of robotic sensors in order to achieve sufficient spatial coverage. Typically, these robots have bounded resources, such as limited battery or limited amounts of time to obtain measurements. Thus, careful coordination of their paths is required in order to maximize the amount of information collected, while respecting the resource constraints. In this paper, we present an efficient approach for near-optimally solving the NP-hard optimization problem of planning such informative paths. In particular, we first develop eSIP (efficient Single-robot Informative Path planning), an approximation algorithm for optimizing the path of a single robot. Hereby, we use a Gaussian Process to model the underlying phenomenon, and use the mutual information between the visited locations and the remainder of the space to quantify the amount of information collected. We prove that the mutual information collected using paths obtained by using eSIP is close to the information obtained by an optimal solution. We then provide a general technique, sequential allocation, which can be used to extend any single-robot planning algorithm, such as eSIP, to the multi-robot problem. This procedure approximately generalizes any guarantees for the single-robot problem to the multi-robot case. We extensively evaluate the effectiveness of our approach on several experiments performed in the field for two important environmental sensing applications, lake and river monitoring, and simulation experiments performed using several real-world sensor network data sets.
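
The sequential allocation technique itself fits in a few lines. The sketch below is illustrative only: single_robot_planner is a hypothetical stand-in for an eSIP-style routine that returns a near-optimal path given the observation locations already committed by previously planned robots.

```python
def sequential_allocation(single_robot_planner, num_robots, budget):
    """Extend a single-robot informative path planner to multiple robots.

    `single_robot_planner(budget, conditioned_on)` is assumed to return an
    (approximately) most informative path within the resource budget, given
    the set of locations whose observations are already committed; this is
    a hypothetical interface standing in for eSIP.
    """
    committed = set()
    paths = []
    for _ in range(num_robots):
        # Each robot maximizes the residual mutual information: the value
        # of its path given everything earlier robots will already observe.
        path = single_robot_planner(budget, conditioned_on=frozenset(committed))
        paths.append(path)
        committed.update(path)
    return paths
```

Because each stage optimizes only the residual objective, guarantees for the single-robot planner carry over approximately to the joint multi-robot solution, which is the paper's claim for this procedure.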

352 citations


Journal ArticleDOI
TL;DR: Among systems with a polynomial-time winner problem, Copeland voting is the first natural election system proven to have full resistance to constructive control, and vulnerability results for microbribery are proven via a novel technique involving min-cost network flow.
Abstract: Control and bribery are settings in which an external agent seeks to influence the outcome of an election. Constructive control of elections refers to attempts by an agent to, via such actions as addition/deletion/partition of candidates or voters, ensure that a given candidate wins. Destructive control refers to attempts by an agent to, via the same actions, preclude a given candidate's victory. An election system in which an agent can sometimes affect the result and it can be determined in polynomial time on which inputs the agent can succeed is said to be vulnerable to the given type of control. An election system in which an agent can sometimes affect the result, yet in which it is NP-hard to recognize the inputs on which the agent can succeed, is said to be resistant to the given type of control. Aside from election systems with an NP-hard winner problem, the only systems previously known to be resistant to all the standard control types were highly artificial election systems created by hybridization. This paper studies a parameterized version of Copeland voting, denoted by Copeland^α, where the parameter α is a rational number between 0 and 1 that specifies how ties are valued in the pairwise comparisons of candidates. In every previously studied constructive or destructive control scenario, we determine which of resistance or vulnerability holds for Copeland^α for each rational α, 0 ≤ α ≤ 1. In particular, we prove that Copeland^0.5, the system commonly referred to as "Copeland voting," provides full resistance to constructive control, and we prove the same for Copeland^α, for all rational α, 0 < α < 1. Among systems with a polynomial-time winner problem, Copeland voting is the first natural election system proven to have full resistance to constructive control. In addition, we prove that both Copeland^0 and Copeland^1 (interestingly, Copeland^1 is an election system developed by the thirteenth-century mystic Llull) are resistant to all standard types of constructive control other than one variant of addition of candidates. Moreover, we show that for each rational α, 0 ≤ α ≤ 1, Copeland^α voting is fully resistant to bribery attacks, and we establish fixed-parameter tractability of bounded-case control for Copeland^α. We also study Copeland^α elections under more flexible models such as microbribery and extended control, we integrate the potential irrationality of voter preferences into many of our results, and we prove our results in both the unique-winner model and the nonunique-winner model. Our vulnerability results for microbribery are proven via a novel technique involving min-cost network flow.
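
The scoring rule at the heart of Copeland^α is simple to state. A minimal sketch (with an invented four-vote profile): each candidate earns one point per pairwise victory and α points per pairwise tie.

```python
from fractions import Fraction
from itertools import combinations

def copeland_scores(candidates, votes, alpha=Fraction(1, 2)):
    """Copeland^alpha: 1 point per pairwise win, alpha points per tie.

    Each vote is a ranking of the candidates, most preferred first.
    alpha = 1/2 is the system commonly called Copeland voting; alpha = 1
    is the thirteenth-century system attributed to Llull.
    """
    position = [{c: i for i, c in enumerate(v)} for v in votes]
    score = {c: Fraction(0) for c in candidates}
    for a, b in combinations(candidates, 2):
        a_wins = sum(1 for pos in position if pos[a] < pos[b])
        b_wins = len(votes) - a_wins
        if a_wins > b_wins:
            score[a] += 1
        elif b_wins > a_wins:
            score[b] += 1
        else:                      # pairwise tie: both earn alpha
            score[a] += alpha
            score[b] += alpha
    return score

votes = [["a", "b", "c"], ["b", "c", "a"], ["c", "a", "b"], ["a", "c", "b"]]
print(copeland_scores(["a", "b", "c"], votes))   # a wins with score 3/2
```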

288 citations


Journal ArticleDOI
TL;DR: The algorithm selects demonstrations based on a measure of action selection confidence, and results show that using Confident Execution the agent requires fewer demonstrations to learn the policy than when demonstrations are selected by a human teacher.
Abstract: We present Confidence-Based Autonomy (CBA), an interactive algorithm for policy learning from demonstration. The CBA algorithm consists of two components which take advantage of the complementary abilities of humans and computer agents. The first component, Confident Execution, enables the agent to identify states in which demonstration is required, to request a demonstration from the human teacher and to learn a policy based on the acquired data. The algorithm selects demonstrations based on a measure of action selection confidence, and our results show that using Confident Execution the agent requires fewer demonstrations to learn the policy than when demonstrations are selected by a human teacher. The second algorithmic component, Corrective Demonstration, enables the teacher to correct any mistakes made by the agent through additional demonstrations in order to improve the policy and future task performance. CBA and its individual components are compared and evaluated in a complex simulated driving domain. The complete CBA algorithm results in the best overall learning performance, successfully reproducing the behavior of the teacher while balancing the tradeoff between number of demonstrations and number of incorrect actions during learning.
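
A schematic of the Confident Execution loop, under assumed interfaces: in the paper the policy is a learned classifier with an action-selection confidence measure, while agent, teacher, and env below are hypothetical stand-ins for that machinery.

```python
def confident_execution(agent, teacher, env, threshold=0.9):
    """Run one episode, requesting demonstrations only where needed.

    `agent.predict(state)` is assumed to return (action, confidence) and
    `teacher.demonstrate(state)` the correct action; both are illustrative
    interfaces, not the paper's actual API.
    """
    state = env.reset()
    demonstrations = 0
    while not env.done():
        action, confidence = agent.predict(state)
        if confidence < threshold:
            # Low action-selection confidence: this state is one the agent
            # identifies as requiring a demonstration.
            action = teacher.demonstrate(state)
            agent.update(state, action)   # retrain the policy on the example
            demonstrations += 1
        state = env.step(action)
    return demonstrations
```

Corrective Demonstration would add a second channel in which the teacher overrides an action the agent just took; the loop above covers only the first component.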

255 citations


Journal ArticleDOI
TL;DR: An anytime algorithm to solve the coalition structure generation problem, using a novel representation of the search space that partitions the space of possible solutions into sub-spaces such that it is possible to compute upper and lower bounds on the values of the best coalition structures within them.
Abstract: Coalition formation is a fundamental type of interaction that involves the creation of coherent groupings of distinct, autonomous agents in order to efficiently achieve their individual or collective goals. Forming effective coalitions is a major research challenge in the field of multi-agent systems. Central to this endeavour is the problem of determining which of the many possible coalitions to form in order to achieve some goal. This usually requires calculating a value for every possible coalition, known as the coalition value, which indicates how beneficial that coalition would be if it was formed. Once these values are calculated, the agents usually need to find a combination of coalitions, in which every agent belongs to exactly one coalition, and by which the overall outcome of the system is maximized. However, this coalition structure generation problem is extremely challenging due to the number of possible solutions that need to be examined, which grows exponentially with the number of agents involved. To date, therefore, many algorithms have been proposed to solve this problem using different techniques -- ranging from dynamic programming, to integer programming, to stochastic search -- all of which suffer from major limitations relating to execution time, solution quality, and memory requirements. With this in mind, we develop an anytime algorithm to solve the coalition structure generation problem. Specifically, the algorithm uses a novel representation of the search space, which partitions the space of possible solutions into sub-spaces such that it is possible to compute upper and lower bounds on the values of the best coalition structures in them. These bounds are then used to identify the sub-spaces that have no potential of containing the optimal solution so that they can be pruned. The algorithm then searches through the remaining sub-spaces very efficiently using a branch-and-bound technique to avoid examining all the solutions within the searched subspace(s). In this setting, we prove that our algorithm enumerates all coalition structures efficiently by avoiding redundant and invalid solutions automatically. Moreover, in order to effectively test our algorithm we develop a new type of input distribution which allows us to generate more reliable benchmarks compared to the input distributions previously used in the field. Given this new distribution, we show that for 27 agents our algorithm is able to find solutions that are optimal in 0.175% of the time required by the fastest available algorithm in the literature. The algorithm is anytime, and if interrupted before it would have normally terminated, it can still provide a solution that is guaranteed to be within a bound from the optimal one. Moreover, the guarantees we provide on the quality of the solution are significantly better than those provided by the previous state-of-the-art algorithms designed for this purpose. For example, for the worst case distribution given 25 agents, our algorithm is able to find a 90% efficient solution in around 10% of the time it takes to find the optimal solution.
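
To make the representation concrete: sub-spaces are indexed by integer partitions of the number of agents (the multiset of coalition sizes in a structure), and bounds come from per-size statistics of the coalition values. The brute-force sketch below is illustrative only; the maximum value per size yields an upper bound, while the average yields a lower bound (the latter rests on an averaging argument from the paper).

```python
from itertools import combinations

def integer_partitions(n, max_part=None):
    """Multisets of coalition sizes summing to n, one per sub-space."""
    max_part = max_part or n
    if n == 0:
        yield []
        return
    for k in range(min(n, max_part), 0, -1):
        for rest in integer_partitions(n - k, k):
            yield [k] + rest

def subspace_bounds(agents, value):
    """Map each sub-space to (lower, upper) bounds on its best structure.

    `value` maps frozenset-of-agents to coalition value. Sub-spaces whose
    upper bound falls below the best lower bound can be pruned before the
    branch-and-bound search ever examines them.
    """
    n = len(agents)
    by_size = {s: [value[frozenset(c)] for c in combinations(agents, s)]
               for s in range(1, n + 1)}
    max_v = {s: max(vs) for s, vs in by_size.items()}
    avg_v = {s: sum(vs) / len(vs) for s, vs in by_size.items()}
    return {tuple(p): (sum(avg_v[s] for s in p), sum(max_v[s] for s in p))
            for p in integer_partitions(n)}

# Tiny example: 3 agents and an arbitrary characteristic function.
agents = ("a", "b", "c")
value = {frozenset("a"): 1, frozenset("b"): 2, frozenset("c"): 1,
         frozenset("ab"): 4, frozenset("ac"): 3, frozenset("bc"): 3,
         frozenset("abc"): 6}
print(subspace_bounds(agents, value))
```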

254 citations


Journal ArticleDOI
TL;DR: This work obtains both polynomial-time bribery algorithms and proofs of the intractability of bribery, and the results show that the complexity of bribery is extremely sensitive to the setting.
Abstract: We study the complexity of influencing elections through bribery: How computationally complex is it for an external actor to determine whether by paying certain voters to change their preferences a specified candidate can be made the election's winner? We study this problem for election systems as varied as scoring protocols and Dodgson voting, and in a variety of settings regarding homogeneous-vs.-nonhomogeneous electorate bribability, bounded-size-vs.-arbitrary-sized candidate sets, weighted-vs.-unweighted voters, and succinct-vs.-nonsuccinct input specification. We obtain both polynomial-time bribery algorithms and proofs of the intractability of bribery, and indeed our results show that the complexity of bribery is extremely sensitive to the setting. For example, we find settings in which bribery is NP-complete but manipulation (by voters) is in P, and we find settings in which bribing weighted voters is NP-complete but bribing voters with individual bribe thresholds is in P. For the broad class of elections (including plurality, Borda, k-approval, and veto) known as scoring protocols, we prove a dichotomy result for bribery of weighted voters: We find a simple-to-evaluate condition that classifies every case as either NP-complete or in P.

233 citations


Journal ArticleDOI
TL;DR: An experimental evaluation on an English-German parallel corpus is provided which demonstrates the feasibility of inducing high-precision German semantic role annotation for both manually and automatically annotated English data.
Abstract: This article considers the task of automatically inducing role-semantic annotations in the FrameNet paradigm for new languages. We propose a general framework that is based on annotation projection, phrased as a graph optimization problem. It is relatively inexpensive and has the potential to reduce the human effort involved in creating role-semantic resources. Within this framework, we present projection models that exploit lexical and syntactic information. We provide an experimental evaluation on an English-German parallel corpus which demonstrates the feasibility of inducing high-precision German semantic role annotation for both manually and automatically annotated English data.

185 citations


Journal ArticleDOI
TL;DR: An automatic framework for the algorithm configuration problem is described, providing methods for optimizing a target algorithm's performance on a given class of problem instances by varying a set of ordinal and/or categorical parameters.
Abstract: The identification of performance-optimizing parameter settings is an important part of the development and application of algorithms. We describe an automatic framework for this algorithm configuration problem. More formally, we provide methods for optimizing a target algorithm's performance on a given class of problem instances by varying a set of ordinal and/or categorical parameters. We review a family of local-search-based algorithm configuration procedures and present novel techniques for accelerating them by adaptively limiting the time spent for evaluating individual configurations. We describe the results of a comprehensive experimental evaluation of our methods, based on the configuration of prominent complete and incomplete algorithms for SAT. We also present what is, to our knowledge, the first published work on automatically configuring the CPLEX mixed integer programming solver. All the algorithms we considered had default parameter settings that were manually identified with considerable effort. Nevertheless, using our automated algorithm configuration procedures, we achieved substantial and consistent performance improvements.
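
A minimal sketch of the core loop in the spirit of the local-search procedures reviewed, including the adaptive capping idea; run_target is a hypothetical interface that runs the target algorithm and reports its runtime, truncated at the given cutoff.

```python
import random

def configure(run_target, param_space, instances, iterations=100, seed=0):
    """Local search over parameter configurations with adaptive capping.

    `run_target(config, instance, cutoff)` is assumed to return the target
    algorithm's runtime on `instance`, or `cutoff` if the run was capped;
    this is an illustrative interface, not the framework's actual API.
    """
    rng = random.Random(seed)

    def neighbour(config):
        p = rng.choice(list(param_space))
        return {**config, p: rng.choice(param_space[p])}

    incumbent = {p: rng.choice(vals) for p, vals in param_space.items()}
    inc_cost = sum(run_target(incumbent, i, float("inf")) for i in instances)
    for _ in range(iterations):
        challenger = neighbour(incumbent)
        cost = 0.0
        for inst in instances:
            # Adaptive capping: never spend more time evaluating the
            # challenger than the incumbent needed in total.
            cost += run_target(challenger, inst, inc_cost - cost)
            if cost >= inc_cost:
                break                 # challenger provably no better
        if cost < inc_cost:
            incumbent, inc_cost = challenger, cost
    return incumbent
```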

163 citations


Journal ArticleDOI
TL;DR: This paper presents a tree-to-tree transduction method for sentence compression based on synchronous tree substitution grammar, a formalism that allows local distortion of the tree topology and can naturally capture structural mismatches.
Abstract: This paper presents a tree-to-tree transduction method for sentence compression. Our model is based on synchronous tree substitution grammar, a formalism that allows local distortion of the tree topology and can thus naturally capture structural mismatches. We describe an algorithm for decoding in this framework and show how the model can be trained discriminatively within a large margin framework. Experimental results on sentence compression bring significant improvements over a state-of-the-art model.

Journal ArticleDOI
TL;DR: This work lays out a general translation scheme that is sound and establishes the conditions under which the translation is also complete, and shows that the complexity of the complete translation is exponential in a parameter of the problem called the conformant width, which for most benchmarks is bounded.
Abstract: Conformant planning is the problem of finding a sequence of actions for achieving a goal in the presence of uncertainty in the initial state or action effects. The problem has been approached as a path-finding problem in belief space where good belief representations and heuristics are critical for scaling up. In this work, a different formulation is introduced for conformant problems with deterministic actions where they are automatically converted into classical ones and solved by an off-the-shelf classical planner. The translation maps literals L and sets of assumptions t about the initial situation, into new literals KL/t that represent that L must be true if t is initially true. We lay out a general translation scheme that is sound and establish the conditions under which the translation is also complete. We show that the complexity of the complete translation is exponential in a parameter of the problem called the conformant width, which for most benchmarks is bounded. The planner based on this translation exhibits good performance in comparison with existing planners, and is the basis for T0, the best performing planner in the Conformant Track of the 2006 International Planning Competition.

Journal ArticleDOI
TL;DR: This paper studies DLs extended with circumscription under different language restrictions and under different constraints on the sets of minimized, fixed, and varying predicates, and pinpoint the exact computational complexity of reasoning for DLs ranging from ALC to ALCIO and ALCQO.
Abstract: As fragments of first-order logic, description logics (DLs) do not provide nonmonotonic features such as defeasible inheritance and default rules. Since many applications would benefit from the availability of such features, several families of nonmonotonic DLs have been developed that are mostly based on default logic and autoepistemic logic. In this paper, we consider circumscription as an interesting alternative approach to nonmonotonic DLs that, in particular, supports defeasible inheritance in a natural way. We study DLs extended with circumscription under different language restrictions and under different constraints on the sets of minimized, fixed, and varying predicates, and pinpoint the exact computational complexity of reasoning for DLs ranging from ALC to ALCIO and ALCQO. When the minimized and fixed predicates include only concept names but no role names, then reasoning is complete for NExpTime^NP. It becomes complete for NP^NExpTime when the number of minimized and fixed predicates is bounded by a constant. If roles can be minimized or fixed, then complexity ranges from NExpTime^NP to undecidability.

Journal ArticleDOI
TL;DR: The correlation between the true knowledge of an agent and her beliefs regarding the likelihoods of reports of other agents can be exploited to make honest reporting a Nash equilibrium.
Abstract: We consider schemes for obtaining truthful reports on a common but hidden signal from large groups of rational, self-interested agents. One example is online feedback mechanisms, where users provide observations about the quality of a product or service so that other users can have an accurate idea of what quality they can expect. However, (i) providing such feedback is costly, and (ii) there are many motivations for providing incorrect feedback. Both problems can be addressed by reward schemes which (i) cover the cost of obtaining and reporting feedback, and (ii) maximize the expected reward of a rational agent who reports truthfully. We address the design of such incentive-compatible rewards for feedback generated in environments with pure adverse selection. Here, the correlation between the true knowledge of an agent and her beliefs regarding the likelihoods of reports of other agents can be exploited to make honest reporting a Nash equilibrium. In this paper we extend existing methods for designing incentive-compatible rewards by also considering collusion. We analyze different scenarios, where, for example, some or all of the agents collude. For each scenario we investigate whether a collusion-resistant, incentive-compatible reward scheme exists, and use automated mechanism design to specify an algorithm for deriving an efficient reward mechanism.
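
The base scheme that the paper extends can be illustrated with a toy joint model (all numbers below are invented): an agent's report is rewarded by how well it predicts a reference agent's report, using a proper scoring rule applied to the posterior it induces, so that truthful reporting maximizes expected payment.

```python
import math

PRIOR = {"high": 0.6, "low": 0.4}                 # P(true quality)
LIKELIHOOD = {"high": {"good": 0.9, "bad": 0.1},  # P(observation | quality)
              "low":  {"good": 0.2, "bad": 0.8}}

def reference_posterior(report):
    """P(reference agent observes o | my own observation was `report`)."""
    joint = {q: PRIOR[q] * LIKELIHOOD[q][report] for q in PRIOR}
    z = sum(joint.values())
    return {o: sum(joint[q] / z * LIKELIHOOD[q][o] for q in PRIOR)
            for o in ("good", "bad")}

def payment(report, reference_report):
    """Logarithmic proper score of the induced prediction. Up to scaling
    and shifting (e.g. to cover reporting costs), honest reporting is a
    best response when the reference agent reports honestly."""
    return math.log(reference_posterior(report)[reference_report])

print(payment("good", "good"), payment("good", "bad"))
```

The paper's collusion analysis asks when rewards of this general shape can be chosen so that no coalition of reporters gains by coordinating on a lie, which is where the automated mechanism design comes in.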

Journal ArticleDOI
TL;DR: This paper presents the first efficient optimal algorithms for selecting observations for a class of probabilistic graphical models, and proves that optimizing value of information is NP^PP-hard even for polytrees.
Abstract: Many real-world decision making tasks require us to choose among several expensive observations. In a sensor network, for example, it is important to select the subset of sensors that is expected to provide the strongest reduction in uncertainty. In medical decision making tasks, one needs to select which tests to administer before deciding on the most effective treatment. It has been general practice to use heuristic-guided procedures for selecting observations. In this paper, we present the first efficient optimal algorithms for selecting observations for a class of probabilistic graphical models. For example, our algorithms allow us to optimally label hidden variables in Hidden Markov Models (HMMs). We provide results for both selecting the optimal subset of observations, and for obtaining an optimal conditional observation plan. Furthermore, we prove a surprising result: in most graphical model tasks, if one designs an efficient algorithm for chain graphs, such as HMMs, this procedure can be generalized to poly-tree graphical models. We prove that optimizing value of information is NP^PP-hard even for polytrees. It also follows from our results that just computing decision theoretic value of information objective functions, which are commonly used in practice, is a #P-complete problem even on Naive Bayes models (a simple special case of polytrees). In addition, we consider several extensions, such as using our algorithms for scheduling observation selection for multiple sensors. We demonstrate the effectiveness of our approach on several real-world datasets, including a prototype sensor network deployment for energy conservation in buildings.

Journal ArticleDOI
TL;DR: This paper presents a scalable, fully-implemented system that runs in O(KN log N) time in the number of extractions, N, and the maximum number of synonyms per word, K, and introduces a probabilistic relational model for predicting whether two strings are co-referential based on the similarity of the assertions containing them.
Abstract: The task of identifying synonymous relations and objects, or synonym resolution, is critical for high-quality information extraction. This paper investigates synonym resolution in the context of unsupervised information extraction, where neither hand-tagged training examples nor domain knowledge is available. The paper presents a scalable, fully-implemented system that runs in O(KN log N) time in the number of extractions, N, and the maximum number of synonyms per word, K. The system, called RESOLVER, introduces a probabilistic relational model for predicting whether two strings are co-referential based on the similarity of the assertions containing them. On a set of two million assertions extracted from the Web, RESOLVER resolves objects with 78% precision and 68% recall, and resolves relations with 90% precision and 35% recall. Several variations of RESOLVER's probabilistic model are explored, and experiments demonstrate that under appropriate conditions these variations can improve F1 by 5%. An extension to the basic RESOLVER system allows it to handle polysemous names with 97% precision and 95% recall on a data set from the TREC corpus.
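
The flavor of the similarity computation can be shown with a toy example. The sketch below uses plain Jaccard overlap of assertion contexts as a stand-in for RESOLVER's probabilistic model over shared properties; the triples are invented.

```python
from collections import defaultdict

# Toy (arg1, relation, arg2) extractions standing in for Web-scale data.
assertions = [
    ("Kennedy", "was-born-in", "Massachusetts"),
    ("JFK", "was-born-in", "Massachusetts"),
    ("Kennedy", "was-elected-in", "1960"),
    ("JFK", "was-elected-in", "1960"),
    ("Nixon", "was-elected-in", "1968"),
]

def property_sets(triples):
    """Each string -> the set of assertion contexts it occurs in; shared
    contexts are the evidence that two strings are co-referential."""
    props = defaultdict(set)
    for a1, rel, a2 in triples:
        props[a1].add((rel, "arg2", a2))
        props[a2].add((rel, "arg1", a1))
    return props

def similarity(props, s1, s2):
    """Jaccard overlap of contexts, a simple stand-in for the paper's
    probabilistic model of string co-reference."""
    p1, p2 = props[s1], props[s2]
    return len(p1 & p2) / len(p1 | p2)

props = property_sets(assertions)
print(similarity(props, "Kennedy", "JFK"))    # 1.0: likely synonyms
print(similarity(props, "Kennedy", "Nixon"))  # 0.0: unrelated
```

A clustering pass that merges high-similarity strings and recomputes shared properties as it goes gives the overall shape of the system.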

Journal ArticleDOI
Vincent Conitzer
TL;DR: This paper focuses on single-peaked preferences, and shows that such preferences can be elicited using only a linear number of comparison queries, if either the order with respect to which preferences are single-peaked is known, or at least one other agent's complete preferences are known.
Abstract: Voting is a general method for aggregating the preferences of multiple agents. Each agent ranks all the possible alternatives, and based on this, an aggregate ranking of the alternatives (or at least a winning alternative) is produced. However, when there are many alternatives, it is impractical to simply ask agents to report their complete preferences. Rather, the agents' preferences, or at least the relevant parts thereof, need to be elicited. This is done by asking the agents a (hopefully small) number of simple queries about their preferences, such as comparison queries, which ask an agent to compare two of the alternatives. Prior work on preference elicitation in voting has focused on the case of unrestricted preferences. It has been shown that in this setting, it is sometimes necessary to ask each agent (almost) as many queries as would be required to determine an arbitrary ranking of the alternatives. In contrast, in this paper, we focus on single-peaked preferences. We show that such preferences can be elicited using only a linear number of comparison queries, if either the order with respect to which preferences are single-peaked is known, or at least one other agent's complete preferences are known. We show that using a sublinear number of queries does not suffice. We also consider the case of cardinally single-peaked preferences. For this case, we show that if the alternatives' cardinal positions are known, then an agent's preferences can be elicited using only a logarithmic number of queries; however, we also show that if the cardinal positions are not known, then a sublinear number of queries does not suffice. We present experimental results for all elicitation algorithms. We also consider the problem of only eliciting enough information to determine the aggregate ranking, and show that even for this more modest objective, a sublinear number of queries per agent does not suffice for known ordinal or unknown cardinal positions. Finally, we discuss whether and how these techniques can be applied when preferences are almost single-peaked.
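
For ordinal preferences with a known single-peaked axis, one scheme achieving the linear bound is easy to state: preferences fall off on either side of the peak, so the agent's least-preferred alternative is always one of the two endpoints of the remaining interval. The sketch below elicits a full ranking of m alternatives with exactly m - 1 comparison queries; the utility table is an invented example.

```python
def elicit_single_peaked(axis, prefers):
    """Elicit a full ranking with len(axis) - 1 comparison queries.

    `axis` lists the alternatives in the order with respect to which the
    preferences are single-peaked; `prefers(a, b)` is a comparison oracle
    returning True iff the agent prefers a to b.
    """
    remaining = list(axis)
    worst_first = []
    while len(remaining) > 1:
        left, right = remaining[0], remaining[-1]
        # The worst of the remaining interval is one of its two ends.
        if prefers(left, right):
            worst_first.append(remaining.pop())      # right end is worse
        else:
            worst_first.append(remaining.pop(0))     # left end is worse
    worst_first.append(remaining[0])                 # the peak remains
    return list(reversed(worst_first))               # best to worst

# Utilities single-peaked around "c" on the axis a..e (invented numbers).
utility = {"a": 1, "b": 2, "c": 5, "d": 4, "e": 0}
ranking = elicit_single_peaked("abcde", lambda x, y: utility[x] > utility[y])
print(ranking)   # ['c', 'd', 'b', 'a', 'e'], using only 4 queries
```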

Journal ArticleDOI
TL;DR: The asynchronous forward-bounding (AFB) algorithm is a distributed optimization search algorithm that keeps one consistent partial assignment at all times, and is further enhanced by the addition of a backjumping mechanism.
Abstract: A new search algorithm for solving distributed constraint optimization problems (DisCOPs) is presented. Agents assign variables sequentially and compute bounds on partial assignments asynchronously. The asynchronous bounds computation is based on the propagation of partial assignments. The asynchronous forward-bounding algorithm (AFB) is a distributed optimization search algorithm that keeps one consistent partial assignment at all times. The algorithm is described in detail and its correctness proven. Experimental evaluation shows that AFB outperforms synchronous branch and bound by many orders of magnitude, and produces a phase transition as the tightness of the problem increases. This is an analogous effect to the phase transition that has been observed when local consistency maintenance is applied to MaxCSPs. The AFB algorithm is further enhanced by the addition of a backjumping mechanism, resulting in the AFB-BJ algorithm. Distributed backjumping is based on accumulated information on bounds of all values and on processing concurrently a queue of candidate goals for the next move back. The AFB-BJ algorithm is compared experimentally to other DisCOP algorithms (ADOPT, DPOP, OptAPO) and is shown to be a very efficient algorithm for DisCOPs.

Journal ArticleDOI
TL;DR: In this article, the authors present an optimal policy iteration algorithm for solving DEC-POMDPs, which alternates between expanding the controller and performing value-preserving transformations.
Abstract: Coordination of distributed agents is required for problems arising in many areas, including multi-robot systems, networking and e-commerce. As a formal framework for such problems, we use the decentralized partially observable Markov decision process (DEC-POMDP). Though much work has been done on optimal dynamic programming algorithms for the single-agent version of the problem, optimal algorithms for the multiagent case have been elusive. The main contribution of this paper is an optimal policy iteration algorithm for solving DEC-POMDPs. The algorithm uses stochastic finite-state controllers to represent policies. The solution can include a correlation device, which allows agents to correlate their actions without communicating. This approach alternates between expanding the controller and performing value-preserving transformations, which modify the controller without sacrificing value. We present two efficient value-preserving transformations: one can reduce the size of the controller and the other can improve its value while keeping the size fixed. Empirical results demonstrate the usefulness of value-preserving transformations in increasing value while keeping controller size to a minimum. To broaden the applicability of the approach, we also present a heuristic version of the policy iteration algorithm, which sacrifices convergence to optimality. This algorithm further reduces the size of the controllers at each step by assuming that probability distributions over the other agents' actions are known. While this assumption may not hold in general, it helps produce higher quality solutions in our test problems.

Journal ArticleDOI
TL;DR: A novel module theorem is established which enables the decomposition of DLP-functions given their strongly connected components based on positive dependencies induced by rules and the concept of modular equivalence is introduced for the mutual comparison of DLPs using a generalization of a translation-based verification method.
Abstract: Practically all programming languages allow the programmer to split a program into several modules, which brings along several advantages in software development. In this paper, we are interested in the area of answer-set programming, where fully declarative and nonmonotonic languages are applied. In this context, obtaining a modular structure for programs is by no means straightforward, since the output of an entire program cannot in general be composed from the output of its components. To better understand the effects of disjunctive information on modularity, we restrict the scope of analysis to the case of disjunctive logic programs (DLPs) subject to stable-model semantics. We define the notion of a DLP-function, where a well-defined input/output interface is provided, and establish a novel module theorem which indicates the compositionality of stable-model semantics for DLP-functions. The module theorem extends the well-known splitting-set theorem and enables the decomposition of DLP-functions given their strongly connected components based on positive dependencies induced by rules. In this setting, it is also possible to split shared disjunctive rules among components using a generalized shifting technique. The concept of modular equivalence is introduced for the mutual comparison of DLP-functions using a generalization of a translation-based verification method.

Journal ArticleDOI
TL;DR: A general method for obtaining approximate solutions of I-POMDPs based on particle filtering (PF) is described and the interactive PF is introduced, which descends the levels of the interactive belief hierarchies and samples and propagates beliefs at each level.
Abstract: Partially observable Markov decision processes (POMDPs) provide a principled framework for sequential planning in uncertain single agent settings. An extension of POMDPs to multiagent settings, called interactive POMDPs (I-POMDPs), replaces POMDP belief spaces with interactive hierarchical belief systems which represent an agent's belief about the physical world, about beliefs of other agents, and about their beliefs about others' beliefs. This modification makes the difficulties of obtaining solutions due to complexity of the belief and policy spaces even more acute. We describe a general method for obtaining approximate solutions of I-POMDPs based on particle filtering (PF). We introduce the interactive PF, which descends the levels of the interactive belief hierarchies and samples and propagates beliefs at each level. The interactive PF is able to mitigate the belief space complexity, but it does not address the policy space complexity. To mitigate the policy space complexity - sometimes also called the curse of history - we utilize a complementary method based on sampling likely observations while building the look-ahead reachability tree. While this approach does not completely address the curse of history, it beats back the curse's impact substantially. We provide experimental results and chart future work.

Journal ArticleDOI
TL;DR: It is shown that standard backtracking search, when augmented with a simple memoization scheme (caching), can solve any sum-of-products problem with time complexity that is at least as good as any other state-of-the-art exact algorithm, and that it can also achieve the best known time-space tradeoff.
Abstract: Inference in Bayes Nets (BAYES) is an important problem with numerous applications in probabilistic reasoning. Counting the number of satisfying assignments of a propositional formula (#SAT) is a closely related problem of fundamental theoretical importance. Both these problems, and others, are members of the class of sum-of-products (SUMPROD) problems. In this paper we show that standard backtracking search when augmented with a simple memoization scheme (caching) can solve any sum-of-products problem with time complexity that is at least as good as any other state-of-the-art exact algorithm, and that it can also achieve the best known time-space tradeoff. Furthermore, backtracking's ability to utilize more flexible variable orderings allows us to prove that it can achieve an exponential speedup over other standard algorithms for SUMPROD on some instances. The ideas presented here have been utilized in a number of solvers that have been applied to various types of sum-of-products problems. These systems have exploited the fact that backtracking can naturally exploit more of the problem's structure to achieve improved performance on a range of problem instances. Empirical evidence of this performance gain has appeared in published works describing these solvers, and we provide references to these works.
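
A minimal #SAT counter in this style makes the point concrete: ordinary DPLL-like backtracking over a CNF (literals as signed integers), where memoizing the model count of each simplified subformula supplies the caching. Real solvers cache independent components and choose variable orderings carefully; this sketch keeps only the essential mechanism.

```python
from functools import lru_cache

def count_models(clauses, variables):
    """Count satisfying assignments by backtracking with caching."""

    def simplify(cnf, var, val):
        out = []
        for clause in cnf:
            if (var if val else -var) in clause:
                continue                        # clause satisfied, drop it
            reduced = tuple(l for l in clause if abs(l) != var)
            if not reduced:
                return None                     # empty clause: contradiction
            out.append(reduced)
        return frozenset(out)

    @lru_cache(maxsize=None)
    def count(cnf, free):
        if cnf is None:
            return 0
        if not cnf:
            return 2 ** len(free)               # remaining vars unconstrained
        var = abs(next(iter(next(iter(cnf)))))  # split on some clause's var
        rest = tuple(v for v in free if v != var)
        return (count(simplify(cnf, var, True), rest) +
                count(simplify(cnf, var, False), rest))

    cnf = frozenset(tuple(c) for c in clauses)
    return count(cnf, tuple(sorted(variables)))

# (x1 or x2) and (not x1 or x3): 4 of the 8 assignments are models.
print(count_models([[1, 2], [-1, 3]], {1, 2, 3}))
```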

Journal ArticleDOI
TL;DR: It is shown that optimal and satisficing cost-based planners do better on the compiled problems than optimal and satisficing net-benefit planners on the original problems with explicit soft goals, and that penalties, or negative preferences expressing conditions to avoid, can also be compiled away using a similar idea.
Abstract: Soft goals extend the classical model of planning with a simple model of preferences. The best plans are then not the ones with least cost but the ones with maximum utility, where the utility of a plan is the sum of the utilities of the soft goals achieved minus the plan cost. Finding plans with high utility appears to involve two linked problems: choosing a subset of soft goals to achieve and finding a low-cost plan to achieve them. New search algorithms and heuristics have been developed for planning with soft goals, and a new track has been introduced in the International Planning Competition (IPC) to test their performance. In this note, we show however that these extensions are not needed: soft goals do not increase the expressive power of the basic model of planning with action costs, as they can easily be compiled away. We apply this compilation to the problems of the net-benefit track of the most recent IPC, and show that optimal and satisficing cost-based planners do better on the compiled problems than optimal and satisficing net-benefit planners on the original problems with explicit soft goals. Furthermore, we show that penalties, or negative preferences expressing conditions to avoid, can also be compiled away using a similar idea.
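
A simplified version of the compilation is easy to sketch (it omits the bookkeeping the paper uses to confine collect/forgo actions to the end of the plan). Each soft goal p with utility u(p) becomes a new hard goal that can be claimed either for free once p holds, or at cost u(p) by giving it up; every plan then pays exactly for the soft goals it fails to achieve, so minimizing cost is equivalent to maximizing net benefit.

```python
# Actions are (name, preconditions, add-effects, cost) over sets of fluents.

def compile_soft_goals(actions, hard_goals, soft_goals):
    """Compile soft goals with utilities into a pure action-cost problem.

    `soft_goals` maps fluent p -> utility u(p). Each p gets a fresh hard
    goal "p-claimed" and two actions: collect claims it for free when p
    holds, and forgo claims it at cost u(p). A simplified sketch of the
    paper's transformation, not the full construction.
    """
    new_actions = list(actions)
    new_goals = set(hard_goals)
    for p, utility in soft_goals.items():
        claimed = f"{p}-claimed"
        new_goals.add(claimed)
        new_actions.append((f"collect-{p}", {p}, {claimed}, 0))
        new_actions.append((f"forgo-{p}", set(), {claimed}, utility))
    return new_actions, new_goals

actions = [("move-a-b", {"at-a"}, {"at-b"}, 3)]
compiled, goals = compile_soft_goals(actions, {"at-b"}, {"visited-c": 5})
for action in compiled:
    print(action)
print("hard goals:", goals)
```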

Journal ArticleDOI
TL;DR: Experimental comparisons on artificial and real-world data sets show GSIMN can yield significant savings with respect to GSMN*, while generating a Markov network with comparable or in some cases improved quality.
Abstract: We present two algorithms for learning the structure of a Markov network from data: GSMN* and GSIMN. Both algorithms use statistical independence tests to infer the structure by successively constraining the set of structures consistent with the results of these tests. Until very recently, algorithms for structure learning were based on maximum likelihood estimation, which has been proved to be NP-hard for Markov networks due to the difficulty of estimating the parameters of the network, needed for the computation of the data likelihood. The independence-based approach does not require the computation of the likelihood, and thus both GSMN* and GSIMN can compute the structure efficiently (as shown in our experiments). GSMN* is an adaptation of the Grow-Shrink algorithm of Margaritis and Thrun for learning the structure of Bayesian networks. GSIMN extends GSMN* by additionally exploiting Pearl's well-known properties of the conditional independence relation to infer novel independences from known ones, thus avoiding the performance of statistical tests to estimate them. To accomplish this efficiently, GSIMN uses the Triangle theorem, also introduced in this work, which is a simplified version of the set of Markov axioms. Experimental comparisons on artificial and real-world data sets show that GSIMN can yield significant savings with respect to GSMN*, while generating a Markov network with comparable or in some cases improved quality. We also compare GSIMN to a forward-chaining implementation, called GSIMN-FCH, that produces all possible conditional independences resulting from repeatedly applying Pearl's theorems on the known conditional independence tests. The results of this comparison show that GSIMN, by the sole use of the Triangle theorem, is nearly optimal in terms of the set of independence tests that it infers.

Journal ArticleDOI
TL;DR: In this paper, the authors investigate the design of novel mechanisms that take into account the trust between agents when allocating tasks to selfish and rational agents, and develop a new class of mechanisms, called trust-based mechanisms, that can take into account multiple subjective measures of the probability of an agent succeeding at a given task and produce allocations that maximise social utility, whilst ensuring that no agent obtains a negative utility.
Abstract: Vickrey-Clarke-Groves (VCG) mechanisms are often used to allocate tasks to selfish and rational agents. VCG mechanisms are incentive-compatible, direct mechanisms that are efficient (i.e., maximise social utility) and individually rational (i.e., agents prefer to join rather than opt out). However, an important assumption of these mechanisms is that the agents will always successfully complete their allocated tasks. Clearly, this assumption is unrealistic in many real-world applications, where agents can, and often do, fail in their endeavours. Moreover, whether an agent is deemed to have failed may be perceived differently by different agents. Such subjective perceptions about an agent's probability of succeeding at a given task are often captured and reasoned about using the notion of trust. Given this background, in this paper we investigate the design of novel mechanisms that take into account the trust between agents when allocating tasks. Specifically, we develop a new class of mechanisms, called trust-based mechanisms, that can take into account multiple subjective measures of the probability of an agent succeeding at a given task and produce allocations that maximise social utility, whilst ensuring that no agent obtains a negative utility. We then show that such mechanisms pose a challenging new combinatorial optimisation problem (that is NP-complete), devise a novel representation for solving the problem, and develop an effective integer programming solution (that can solve instances with about 2×10^5 possible allocations in 40 seconds).

Journal ArticleDOI
TL;DR: This work considers two ways of applying this intuition to the problem of unsupervised part-of-speech tagging: a model that directly merges tag structures for a pair of languages into a single sequence and a second model which instead incorporates multilingual context using latent variables.
Abstract: We demonstrate the effectiveness of multilingual learning for unsupervised part-of-speech tagging. The central assumption of our work is that by combining cues from multiple languages, the structure of each becomes more apparent. We consider two ways of applying this intuition to the problem of unsupervised part-of-speech tagging: a model that directly merges tag structures for a pair of languages into a single sequence and a second model which instead incorporates multilingual context using latent variables. Both approaches are formulated as hierarchical Bayesian models, using Markov Chain Monte Carlo sampling techniques for inference. Our results demonstrate that by incorporating multilingual evidence we can achieve impressive performance gains across a range of scenarios. We also found that performance improves steadily as the number of available languages increases.

Journal ArticleDOI
TL;DR: A new algorithm called ACO-E is proposed, to learn the structure of a Bayesian network by conducting a search through the space of equivalence classes of Bayesian networks using Ant Colony Optimization (ACO).
Abstract: Bayesian networks are a useful tool in the representation of uncertain knowledge. This paper proposes a new algorithm called ACO-E, to learn the structure of a Bayesian network. It does this by conducting a search through the space of equivalence classes of Bayesian networks using Ant Colony Optimization (ACO). To this end, two novel extensions of traditional ACO techniques are proposed and implemented. Firstly, multiple types of moves are allowed. Secondly, moves can be given in terms of indices that are not based on construction graph nodes. The results of testing show that ACO-E performs better than a greedy search and other state-of-the-art and metaheuristic algorithms whilst searching in the space of equivalence classes.

Journal ArticleDOI
TL;DR: This paper presents a new method for inferring the semantic properties of documents by leveraging free-text keyphrase annotations, and finds that joint inference increases the robustness of the keyphrase clustering and encourages the latent topics to correlate with semantically meaningful properties.
Abstract: This paper presents a new method for inferring the semantic properties of documents by leveraging free-text keyphrase annotations. Such annotations are becoming increasingly abundant due to the recent dramatic growth in semi-structured, user-generated online content. One especially relevant domain is product reviews, which are often annotated by their authors with pros/cons keyphrases such as "a real bargain" or "good value." These annotations are representative of the underlying semantic properties; however, unlike expert annotations, they are noisy: lay authors may use different labels to denote the same property, and some labels may be missing. To learn using such noisy annotations, we find a hidden paraphrase structure which clusters the keyphrases. The paraphrase structure is linked with a latent topic model of the review texts, enabling the system to predict the properties of unannotated documents and to effectively aggregate the semantic properties of multiple reviews. Our approach is implemented as a hierarchical Bayesian model with joint inference. We find that joint inference increases the robustness of the keyphrase clustering and encourages the latent topics to correlate with semantically meaningful properties. Multiple evaluations demonstrate that our model substantially outperforms alternative approaches for summarizing single and multiple documents into a set of semantically salient keyphrases.

Journal ArticleDOI
TL;DR: The authors propose a Bayesian topic model that uses the Generalized Mallows Model over topic orderings to constrain latent topic assignments in a way that reflects the underlying organization of document topics, and apply it to cross-document alignment, document segmentation, and information ordering.
Abstract: We present a novel Bayesian topic model for learning discourse-level document structure. Our model leverages insights from discourse theory to constrain latent topic assignments in a way that reflects the underlying organization of document topics. We propose a global model in which both topic selection and ordering are biased to be similar across a collection of related documents. We show that this space of orderings can be effectively represented using a distribution over permutations called the Generalized Mallows Model. We apply our method to three complementary discourse-level tasks: cross-document alignment, document segmentation, and information ordering. Our experiments show that incorporating our permutation-based model in these applications yields substantial improvements in performance over previously proposed methods.
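
As a concrete picture of the distribution at the model's core: the Generalized Mallows Model factors a permutation into independent inversion counts, each penalized geometrically by its own dispersion parameter, so probability mass concentrates around a canonical ordering. The sampler below is one standard construction, not the paper's inference code.

```python
import math, random

def sample_gmm(n, rho, rng=random):
    """Draw a permutation of range(n) from a Generalized Mallows Model.

    Inversion count v_j ranges over {0, ..., n-1-j} with
    P(v_j = k) proportional to exp(-rho[j] * k); larger rho[j] pulls
    samples toward the canonical order 0, 1, ..., n-1.
    """
    order = []
    for j in range(n - 1, -1, -1):        # insert items largest-first
        m = n - 1 - j                     # v_j can be at most m
        weights = [math.exp(-rho[j] * k) for k in range(m + 1)]
        r = rng.uniform(0, sum(weights))
        v = 0
        while v < m and r > weights[v]:
            r -= weights[v]
            v += 1
        # All items already placed are larger than j, so inserting at
        # index v puts exactly v_j = v larger items before item j.
        order.insert(v, j)
    return order

print(sample_gmm(5, rho=[2.0] * 5))    # stays close to [0, 1, 2, 3, 4]
print(sample_gmm(5, rho=[0.01] * 5))   # nearly uniform over permutations
```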