Proceedings Article
A framework for the parallel processing of Datalog queries
Sumit Ganguly, Avi Silberschatz, Shalom Tsur
- Vol. 19, Iss: 2, pp 143-152
TLDR
The notion of a discriminating predicate, based on hash functions, that partitions the computation between the processors in order to achieve parallelism is introduced and the trade-offs between redundancy and interprocessor-communication are demonstrated.
Abstract:
This paper presents several complementary methods for the parallel, bottom-up evaluation of Datalog queries. We introduce the notion of a discriminating predicate, based on hash functions, that partitions the computation between the processors in order to achieve parallelism. A parallelization scheme with the property of non-redundant computation (no duplication of computation by processors) is then studied in detail. The mapping of Datalog programs onto a network of processors, such that the result is a non-redundant computation, is also studied. The methods reported in this paper clearly demonstrate the trade-offs between redundancy and interprocessor-communication for this class of problems.
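As an illustration of the idea described in the abstract, the following is a minimal sketch of hash-based partitioning for the textbook ancestor program `anc(X,Y) :- par(X,Y).` and `anc(X,Y) :- par(X,Z), anc(Z,Y).` It is not the paper's algorithm: the function names (`discriminate`, `local_closure`, `parallel_closure`), the choice of `Y` as the discriminating variable, and the sequential loop standing in for real processors are all assumptions made for this example. Each simulated processor `i` evaluates the rules with the extra condition `h(Y) = i`. Because `Y` passes unchanged through the recursive rule, no tuple is derived twice and no tuples need to be exchanged, at the cost of every processor reading the full `par` relation.

```python
from collections import defaultdict


def discriminate(value, num_procs):
    """Hypothetical hash function h: maps a constant to a processor id."""
    return hash(value) % num_procs


def local_closure(par, proc_id, num_procs):
    """Semi-naive bottom-up evaluation restricted to tuples with h(Y) == proc_id."""
    # Exit rule: anc(X,Y) :- par(X,Y), h(Y) = proc_id.
    anc = {(x, y) for (x, y) in par if discriminate(y, num_procs) == proc_id}
    delta = set(anc)

    # Index par on its second column to join par(X,Z) with anc(Z,Y).
    parents_of = defaultdict(set)
    for x, z in par:
        parents_of[z].add(x)

    # Recursive rule: anc(X,Y) :- par(X,Z), anc(Z,Y), h(Y) = proc_id.
    # Y is unchanged by the rule, so every new tuple stays on this processor.
    while delta:
        new = set()
        for z, y in delta:
            for x in parents_of[z]:
                if (x, y) not in anc:
                    new.add((x, y))
        anc |= new
        delta = new
    return anc


def parallel_closure(par, num_procs):
    """Simulate the processors one after another; return one partition each."""
    return [local_closure(par, i, num_procs) for i in range(num_procs)]


if __name__ == "__main__":
    edges = {(1, 2), (2, 3), (3, 4)}
    for i, part in enumerate(parallel_closure(edges, 2)):
        print(i, sorted(part))
```

The partitions are pairwise disjoint and their union is the full ancestor relation. Discriminating on a variable that does change across the recursive rule would instead require processors to send derived tuples to one another, which is the kind of redundancy versus communication trade-off the abstract refers to.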
Citations
Journal Article
Optimizing Multiway Joins in a Map-Reduce Environment
Foto N. Afrati, Jeffrey D. Ullman
TL;DR: This work identifies the “map-key,” the set of attributes that identify the Reduce process to which a Map process must send a particular tuple, and studies the problem of optimizing the shares, given a fixed number of Reduce processes.
Journal Article
Parallel execution of prolog programs: a survey
TL;DR: The article describes the major techniques used for shared memory implementation of Or-parallelism, And-parallelism, and combinations of the two and explores some related issues, such as memory management, compile-time analysis, and execution visualization.
Proceedings Article
Parallel materialisation of datalog programs in centralised, main-memory RDF systems
TL;DR: This work presents a novel approach to parallel materialisation (i.e., fixpoint computation) of datalog programs in centralised, main-memory, multi-core RDF systems with an algorithm that evenly distributes the workload to cores, and an RDF indexing data structure that supports efficient, 'mostly' lock-free parallel updates.
Journal ArticleDOI
Distributed socialite: a datalog-based language for large-scale graph analysis
TL;DR: This work describes novel extensions and optimizations of SociaLite for parallel and distributed execution, supporting large-scale graph analysis and the creation of social apps that operate on large-scale distributed graphs.
Proceedings Article
Big Data Analytics with Datalog Queries on Spark
TL;DR: This work proposes compilation and optimization techniques that tackle the important problem of efficiently supporting recursion in Spark and performs an experimental comparison with other state-of-the-art large-scale Datalog systems to verify the efficacy of these techniques and the effectiveness of Spark in supporting Datalog-based analytics.
References
Journal Article
The Semantics of Predicate Logic as a Programming Language
TL;DR: In this paper the operational and fixpoint semantics of predicate logic programs are defined, and the connections with the proof theory and model theory of logic are investigated, and it is concluded that operational semantics is a part of proof theory and that fixpoint semantics is a special case of model-theoretic semantics.
Journal Article