scispace - formally typeset
Journal ArticleDOI

Scalable Graph Processing Frameworks: A Taxonomy and Open Challenges

TLDR
A taxonomy of graph processing systems is proposed and existing systems are mapped to this classification, which captures the diversity in programming and computation models, runtime aspects of partitioning and communication, both for in-memory and distributed frameworks.
Abstract
The world is becoming a more conjunct place and the number of data sources such as social networks, online transactions, web search engines, and mobile devices is increasing even more than had been predicted. A large percentage of this growing dataset exists in the form of linked data, more generally, graphs, and of unprecedented sizes. While today's data from social networks contain hundreds of millions of nodes connected by billions of edges, inter-connected data from globally distributed sensors that forms the Internet of Things can cause this to grow exponentially larger. Although analyzing these large graphs is critical for the companies and governments that own them, big data tools designed for text and tuple analysis such as MapReduce cannot process them efficiently. So, graph distributed processing abstractions and systems are developed to design iterative graph algorithms and process large graphs with better performance and scalability. These graph frameworks propose novel methods or extend previous methods for processing graph data. In this article, we propose a taxonomy of graph processing systems and map existing systems to this classification. This captures the diversity in programming and computation models, runtime aspects of partitioning and communication, both for in-memory and distributed frameworks. Our effort helps to highlight key distinctions in architectural approaches, and identifies gaps for future research in scalable graph systems.

read more

Citations
More filters

How to… home page

Challenger Tafe, +1 more
TL;DR: In this article, the authors developed a center to address state-of-the-art research, create innovating educational programs, and support technology transfers using commercially viable results to assist the Army Research Laboratory to develop the next generation Future Combat System in the telecommunications sector that assures prevention of perceived threats, and non-line of sight/Beyond line of sight lethal support.
Posted Content

The Laws of the Web: Patterns in the Ecology of Information

TL;DR: One of the foremost researchers in the field, Huberman has established, for example, that the surfing patterns of individuals are describable by a precise law, which can lead to more efficient Web design and use.
Journal ArticleDOI

A Survey on Graph Processing Accelerators: Challenges and Opportunities

TL;DR: In this article, the authors conduct a systematical survey regarding the design and implementation of graph processing accelerators and present and discuss several challenges in details, and further explore the opportunities for the future research.
Journal ArticleDOI

EnGN: A High-Throughput and Energy-Efficient Accelerator for Large Graph Neural Networks

TL;DR: EnGN as discussed by the authors proposes a specialized accelerator architecture to accelerate the three key stages of GNN propagation, which is abstracted as common computing patterns shared by typical GNNs, and uses graph tiling strategy to fit large graphs into EnGN and make good use of the hierarchical onchip buffers through adaptive computation reordering and tile scheduling.
Proceedings ArticleDOI

Slim graph: practical lossy graph compression for approximate graph processing, storage, and analytics

TL;DR: Slim Graph is proposed, the first programming model and framework for practical lossy graph compression that facilitates high-performance approximate graph processing, storage, and analytics and may become the common ground for developing, executing, and analyzing emerging lossygraph compression schemes.
References
More filters
Journal ArticleDOI

A note on two problems in connexion with graphs

TL;DR: A tree is a graph with one and only one path between every two nodes, where at least one path exists between any two nodes and the length of each branch is given.
Journal ArticleDOI

MapReduce: simplified data processing on large clusters

TL;DR: This paper presents the implementation of MapReduce, a programming model and an associated implementation for processing and generating large data sets that runs on a large cluster of commodity machines and is highly scalable.
Proceedings Article

The PageRank Citation Ranking : Bringing Order to the Web

TL;DR: This paper describes PageRank, a mathod for rating Web pages objectively and mechanically, effectively measuring the human interest and attention devoted to them, and shows how to efficiently compute PageRank for large numbers of pages.
Journal ArticleDOI

Data clustering: 50 years beyond K-means

TL;DR: A brief overview of clustering is provided, well known clustering methods are summarized, the major challenges and key issues in designing clustering algorithms are discussed, and some of the emerging and useful research directions are pointed out.
Journal ArticleDOI

Cloud computing and emerging IT platforms: Vision, hype, and reality for delivering computing as the 5th utility

TL;DR: This paper defines Cloud computing and provides the architecture for creating Clouds with market-oriented resource allocation by leveraging technologies such as Virtual Machines (VMs), and provides insights on market-based resource management strategies that encompass both customer-driven service management and computational risk management to sustain Service Level Agreement (SLA) oriented resource allocation.
Related Papers (5)