scispace - formally typeset
Proceedings ArticleDOI

A framework for the parallel processing of Datalog queries

Reads0
Chats0
TLDR
The notion of a discriminating predicate, based on hash functions, that partitions the computation between the processors in order to achieve parallelism is introduced and the trade-offs between redundancy and interprocessor-communication are demonstrated.
Abstract
This paper presents several complementary methods for the parallel, bottom-up evaluation of Datalog queries. We introduce the notion of a discriminating predicate, based on hash functions, that partitions the computation between the processors in order to achieve parallelism. A parallelization scheme with the property of non-redundant computation (no duplication of computation by processors) is then studied in detail. The mapping of Datalog programs onto a network of processors, such that the results is a non-redundant computation, is also studied. The methods reported in this paper clearly demonstrate the trade-offs between redundancy and interprocessor-communication for this class of problems.

read more

Citations
More filters
Journal ArticleDOI

Optimizing Multiway Joins in a Map-Reduce Environment

TL;DR: This work identifies the “map-key,” the set of attributes that identify the Reduce process to which a Map process must send a particular tuple, and studies the problem of optimizing the shares, given a fixed number of Reduce processes.
Journal ArticleDOI

Parallel execution of prolog programs: a survey

TL;DR: The article describes the major techniques used for shared memory implementation of Or-par parallelism, And-parallelism, and combinations of the two and explores some related issues, such as memory management, compile-time analysis, and execution visualization.
Proceedings Article

Parallel materialisation of datalog programs in centralised, main-memory RDF systems

TL;DR: This work presents a novel approach to parallel materialisation (i.e., fixpoint computation) of datalog programs in centralised, main-memory, multi-core RDF systems with an algorithm that evenly distributes the workload to cores, and an RDF indexing data structure that supports efficient, 'mostly' lock-free parallel updates.
Journal ArticleDOI

Distributed socialite: a datalog-based language for large-scale graph analysis

TL;DR: Novel extensions and optimizations of SociaLite for parallel and distributed executions to support large-scale graph analysis to create many social apps that operate on large- scale distributed graphs are described.
Proceedings ArticleDOI

Big Data Analytics with Datalog Queries on Spark

TL;DR: This work proposes compilation and optimization techniques that tackle the important problem of efficiently supporting recursion in Spark and performs an experimental comparison with other state-of-the-art large-scale Datalog systems to verify the efficacy of these techniques and effectiveness of Spark in supporting Datalogs-based analytics.
References
More filters
Journal ArticleDOI

The Semantics of Predicate Logic as a Programming Language

TL;DR: In this paper the operational and fixpoint semantics of predicate logic programs are defined, and the connections with the proof theory and model theory of logic are investigated, and it is concluded that operational semantics is a part ofProof theory and that fixpoint semantic is a special case of model-theoretic semantics.