scispace - formally typeset
Search or ask a question

Showing papers by "Phokion G. Kolaitis published in 2005"


Journal ArticleDOI
TL;DR: This paper gives an algebraic specification that selects, among all solutions to the data exchange problem, a special class of solutions that is called universal and shows that a universal solution has no more and no less data than required for data exchange and that it represents the entire space of possible solutions.

1,221 citations


Proceedings ArticleDOI
Phokion G. Kolaitis1
13 Jun 2005
TL;DR: The main aim in this paper is to present an overview of recent advances in data exchange and metadata management, where the schema mappings are between relational schemas.
Abstract: Schema mappings are high-level specifications that describe the relationship between database schemas. Schema mappings are prominent in several different areas of database management, including database design, information integration, data exchange, metadata management, and peer-to-peer data management systems. Our main aim in this paper is to present an overview of recent advances in data exchange and metadata management, where the schema mappings are between relational schemas. In addition, we highlight some research issues and directions for future work.

310 citations


Proceedings ArticleDOI
13 Jun 2005
TL;DR: The boundary between tractability andintractability for the problem of deciding the existence of a solution is explored, and broad syntactic conditions on the constraints between the peers under which testing for solutions is solvable in polynomial time are identified.
Abstract: In this article, we introduce and study a framework, called peer data exchange, for sharing and exchanging data between peers. This framework is a special case of a full-fledged peer data management system and a generalization of data exchange between a source schema and a target schema. The motivation behind peer data exchange is to model authority relationships between peers, where a source peer may contribute data to a target peer, specified using source-to-target constraints, and a target peer may use target-to-source constraints to restrict the data it is willing to receive, but cannot modify the data of the source peer.A fundamental algorithmic problem in this framework is that of deciding the existence of a solution: given a source instance and a target instance for a fixed peer data exchange setting, can the target instance be augmented in such a way that the source instance and the augmented target instance satisfy all constraints of the settingq We investigate the computational complexity of the problem for peer data exchange settings in which the constraints are given by tuple generating dependencies. We show that this problem is always in NP, and that it can be NP-complete even for “acyclic” peer data exchange settings. We also show that the data complexity of the certain answers of target conjunctive queries is in coNP, and that it can be coNP-complete even for “acyclic” peer data exchange settings.After this, we explore the boundary between tractability and intractability for deciding the existence of a solution and for computing the certain answers of target conjunctive queries. To this effect, we identify broad syntactic conditions on the constraints between the peers under which the existence-of-solutions problem is solvable in polynomial time. We also identify syntactic conditions between peer data exchange settings and target conjunctive queries that yield polynomial-time algorithms for computing the certain answers. For both problems, these syntactic conditions turn out to be tight, in the sense that minimal relaxations of them lead to intractability. Finally, we introduce the concept of a universal basis of solutions in peer data exchange and explore its properties.

142 citations


Book
01 Dec 2005
TL;DR: Finite model theory and its applications texts in theoretical computer science an eatcs series What to say and what to do when mostly your friends love reading?

73 citations


Patent
13 Jun 2005
TL;DR: In this paper, a multi-structural query system performs a high-level multi-dimensional query on a multidimensional database and enables a user to navigate a search by adding restrictions incrementally.
Abstract: A multi-structural query system performs a high-level multi-dimensional query on a multi-structural database. The query system enables a user to navigate a search by adding restrictions incrementally. The query system uses a schema to discover structure in a multi-structural database. The query system leaves a choice of nodes to return in response to a query as a constrained set of choices available to the algorithm. The query system further casts the selection of a set of nodes as an optimization. The query system uses pairwise-disjoint collections to capture a concise set of highlights of a data set within the allowed schema. The query system further comprises efficient algorithms that yield approximately optimal solutions for several classes of objective functions.

65 citations


Journal Article
TL;DR: It is shown that a so-called preferred representation of a relation over one of its bases can be computed as the minimal co-clone including a given relation, which solves some open structure identication problem as well as the open expressibility problem from database theory.
Abstract: We introduce the notion of a plain basis for a co-clone in Post’s lattice. Such a basis is a set of relations B such that every constraint C over a relation in the co-clone is logically equivalent to a conjunction of equalities and constraints over B and the same variables as C; this diers from the usual notion of a basis in that existential quantication of auxiliary variables is not allowed. We give such a basis for every co-clone and in particular for those in the innite part of the lattice; it turns out that most of these bases correspond to sets of propositional clauses, thus providing a strong link between classes of formulas dened for CSP and CNF representations. We then show that a so-called preferred representation of a relation over one of its bases can be computed ecien tly, as well as the minimal co-clone including a given relation, which solves some open structure identication problem as well as the open expressibility problem from database theory.

16 citations


Journal ArticleDOI
01 Sep 2005
TL;DR: The "ARISE/NISR Workshop on Exchange and Integration of Data" was held at the IBM Center for Advanced Studies, Toronto Lab.
Abstract: The "ARISE/NISR Workshop on Exchange and Integration of Data" was held at the IBM Center for Advanced Studies, Toronto Lab., between October 7-9, 2004.

8 citations