scispace - formally typeset
Journal ArticleDOI

Keyword Search over Distributed Graphs with Compressed Signature

Reads0
Chats0
TLDR
A signature-based search algorithm is proposed that encodes the shortest-path distance from a vertex to any given keyword in the graph, and can find query answers by exploring fewer paths, so that the time and communication costs are low.
Abstract
Graph keyword search has drawn many research interests, since graph models can generally represent both structured and unstructured databases and keyword searches can extract valuable information for users without the knowledge of the underlying schema and query language. In practice, data graphs can be extremely large, e.g., a Web-scale graph containing billions of vertices. The state-of-the-art approaches employ centralized algorithms to process graph keyword searches, and thus they are infeasible for such large graphs, due to the limited computational power and storage space of a centralized server. To address this problem, we investigate keyword search for Web-scale graphs deployed in a distributed environment. We first give a naive search algorithm to answer the query efficiently. However, the naive search algorithm uses a flooding search strategy that incurs large time and network overhead. To remedy this shortcoming, we then propose a signature-based search algorithm. Specifically, we design a vertex signature that encodes the shortest-path distance from a vertex to any given keyword in the graph. As a result, we can find query answers by exploring fewer paths, so that the time and communication costs are low. Moreover, we reorganize the graph data in the cluster after its initial random partitioning so that the signature-based techniques are more effective. Finally, our experimental results demonstrate the feasibility of our proposed approach in performing keyword searches over Web-scale graph data.

read more

Citations
More filters
Journal ArticleDOI

A survey of community search over big graphs

TL;DR: A comprehensive review of existing community search works can be found in this paper, where the authors analyze and compare the quality of communities under their models, and the performance of different solutions.
Posted Content

A Survey of Community Search Over Big Graphs

TL;DR: This survey conducts a thorough review of existing community search works, analyzes and compares the quality of communities under their models, and the performance of different solutions, and points out new research directions.
Journal ArticleDOI

Deep fusion of multimodal features for social media retweet time prediction

TL;DR: An end-to-end deep learning model, namely DFMF (Deep Fusion of Multimodal Features), is presented to implicitly learn the latent features and interactions of tweets, social relationships, and the posting time to predict the followers’ retweet time on Twitter.
Journal ArticleDOI

A survey of typical attributed graph queries

TL;DR: This paper proposes a taxonomy of attributed graph queries based on query inputs and outputs and summarizes the definitions of queries that fall into each category and presents a fine-grained classification of queries within each category by analyzing the semantics and algorithmic motivations behind these queries.
Journal ArticleDOI

Advancing community detection using Keyword Attribute Search

TL;DR: A novel algorithm of online community detection using a technique of keyword search over the attributed graph, which makes it possible to search community of nodes in an online and computationally efficient manner compared to the existing techniques.
References
More filters
Proceedings ArticleDOI

Pregel: a system for large-scale graph processing

TL;DR: A model for processing large graphs that has been designed for efficient, scalable and fault-tolerant implementation on clusters of thousands of commodity computers, and its implied synchronicity makes reasoning about programs easier.
Book

Approximation Algorithms for NP-Hard Problems

TL;DR: This book reviews the design techniques for approximation algorithms and the developments in this area since its inception about three decades ago and the "closeness" to optimum that is achievable in polynomial time.
Journal ArticleDOI

Multilevelk-way Partitioning Scheme for Irregular Graphs

TL;DR: This paper presents and study a class of graph partitioning algorithms that reduces the size of the graph by collapsing vertices and edges, they find ak-way partitioning of the smaller graph, and then they uncoarsen and refine it to construct ak- way partitioning for the original graph.
Journal ArticleDOI

Distributed GraphLab: a framework for machine learning and data mining in the cloud

TL;DR: GraphLab as discussed by the authors extends the GraphLab framework to the substantially more challenging distributed setting while preserving strong data consistency guarantees to reduce network congestion and mitigate the effect of network latency in the shared-memory setting.
Related Papers (5)