scispace - formally typeset
Book ChapterDOI

GraphREL: A Decomposition-Based and Selectivity-Aware Relational Framework for Processing Sub-graph Queries

TLDR
This paper proposes a novel, decomposition-based and selectivity-aware SQL translation mechanism of sub-graph search queries for relational database management systems, and carefully exploits existing database functionality such as partitioned B-trees indexes and influencing the relational query optimizers by selectivity annotations to reduce the access costs of the secondary storage to a minimum.
Abstract
Graphs are widely used for modelling complicated data such as: chemical compounds, protein interactions, XML documents and multimedia. Retrieving related graphs containing a query graph from a large graph database is a key issue in many graph-based applications such as drug discovery and structural pattern recognition. Relational database management systems (RDBMSs) have repeatedly been shown to be able to efficiently host different types of data which were not formerly anticipated to reside within relational databases such as complex objects and XML data.The key advantages of relational database systems are its well-known maturity and its ability to scale to handle vast amounts of data very efficiently. RDMBSs derive much of their performance from sophisticated optimizer components which makes use of physical properties that are specific to the relational model such as: sortedness, proper join ordering and powerful indexing mechanisms. In this paper, we study the problem of indexing and querying graph databases using the relational infrastructure. We propose a novel, decomposition-based and selectivity-aware SQL translation mechanism of sub-graph search queries. Moreover, we carefully exploit existing database functionality such as partitioned B-trees indexes and influencing the relational query optimizers by selectivity annotations to reduce the access costs of the secondary storage to a minimum. Finally, our experiments utilise an IBM DB2 RDBMS as a concrete example to confirm that relational database systems can be used as an efficient and very scalable processor for sub-graph queries.

read more

Citations
More filters
Patent

Graph query processing using plurality of engines

TL;DR: In this paper, a graph query submitted to a graph database which is modeled by an attributed graph is received, and the graph query is decomposed into a plurality of query components.
Journal ArticleDOI

Large scale graph processing systems: survey and an experimental evaluation

TL;DR: A comprehensive survey over the state-of-the-art of large scale graph processing platforms, namely, GraphChi, Apache Giraph, GPS, GraphLab and GraphX, and an extensive experimental study of five popular systems in this domain.
Proceedings ArticleDOI

SAHAD: Subgraph Analysis in Massive Networks Using Hadoop

TL;DR: SAHAD is the first such Hadoop based subgraph/subtree analysis algorithm, and performs significantly better than prior approaches for very large graphs and templates, and is also amenable to running quite easily on Amazon EC2, without needs for any system level optimization.
Proceedings ArticleDOI

A framework for querying graph-based business process models

TL;DR: The reusing framework is enhanced with a semantic query expander component that provides the users with the flexibility to get not only the perfectly matched process models to their queries but also the models with high similarity.
Proceedings ArticleDOI

G-SPARQL: a hybrid engine for querying large attributed graphs

TL;DR: An algebraic compilation mechanism for the proposed query language, G-SPARQL, which is extended from the relational algebra and based on the basic construct of building SPARQL queries, the Triple Pattern is described.
References
More filters
Proceedings ArticleDOI

Frequent subgraph discovery

TL;DR: The empirical results show that the algorithm scales linearly with the number of input transactions and it is able to discover frequent subgraphs from a set of graph transactions reasonably fast, even though it has to deal with computationally hard problems such as canonical labeling of graphs and subgraph isomorphism which are not necessary for traditional frequent itemset discovery.
Proceedings ArticleDOI

Graph indexing: a frequent structure-based approach

TL;DR: The gIndex approach not only provides and elegant solution to the graph indexing problem, but also demonstrates how database indexing and query processing can benefit form data mining, especially frequent pattern mining.
Journal Article

XRel : A path-based approach to storage and retrieval of XML documents using relational databases

TL;DR: XRel enables us to store XML documents using a fixed relational schema without any information about DTDs and also to utilize indices such as the B 1 -tree and the R-tree supported by database management systems.
Proceedings ArticleDOI

Accelerating XPath location steps

TL;DR: This work is a proposal for a database index structure that has been specifically designed to support the evaluation of XPath queries, capable to support all XPath axes and able to start traversals from arbitrary context nodes in an XML document.
Proceedings Article

A Fast Index for Semistructured Data

TL;DR: The Index Fabric is described, an indexing structure that provides the efficiency and flexibility needed to optimize ad hoc queries over semistructured data, and how "refined paths" optimize specific access paths.