scispace - formally typeset
Proceedings ArticleDOI

Early experiences in using a domain-specific language for large-scale graph analysis

Reads0
Chats0
TLDR
The feasibility of using an intuitive Domain-Specific Language (DSL) for graph analysis, using a compiler to translate Green-Marl programs into an equivalent Giraph application, automatically bridging between very different programming models is discussed.
Abstract
Large-scale graph analysis has recently been drawing lots of attention from both industry and academia. Although there are already several frameworks designed for scalable graph analysis, e.g. Giraph [1], all these frameworks adopt non-traditional programming models and APIs. This can significantly lower the productivity of the framework user. This paper discusses the feasibility of using an intuitive Domain-Specific Language (DSL) for graph analysis. Specifically, we use a compiler to translate Green-Marl [5] programs into an equivalent Giraph application, automatically bridging between very different programming models. We observe that the DSL programs are concise and intuitive, and that the compiler generated Giraph implementations exhibit performance on par with that of hand-written ones. However, the DSL compilation cannot but fail if the algorithm is fundamentally not compatible with the target framework. Overall, we believe that the DSL-based approach will provide great productivity benefits once it matures.

read more

Content maybe subject to copyright    Report

Citations
More filters

What is Twitter

Proceedings ArticleDOI

Simplifying Scalable Graph Processing with a Domain-Specific Language

TL;DR: This paper uses Green-Marl, a Domain-Specific Language for graph analysis, to intuitively describe graph algorithms and extend its compiler to generate equivalent Pregel implementations, and shows that the P Regel programs generated by the Green-marl compiler perform similarly to manually coded PRegel implementations of the same algorithms.
Journal ArticleDOI

DPM: A novel distributed large-scale social graph processing framework for link prediction algorithms

TL;DR: A distributed graph processing framework called Distributed Partitioned Merge (DPM), which supports both types of algorithms and it is shown that in most tests DPM outperforms both Pregel and Fork-Join in terms of recommendation time, with a minor penalization in network usage in some scenarios.
Journal ArticleDOI

Using domain-specific languages for analytic graph databases

TL;DR: A new graph database system that integrates a robust graph storage with an efficient graph analytics engine that adopts two domain-specific languages (DSLs), one for describing graph analysis algorithms and the other for graph pattern matching queries.

Understanding and Improving Graph Algorithm Performance

Scott Beamer
TL;DR: This dissertation characterizes graph processing workloads on shared memory multiprocessors in order to understand graph algorithm performance and introduces the Graph Algorithm Iron Law (GAIL), a simple performance model that allows for reasoning about tradeoffs across layers by considering algorithmic efficiency, cache locality, and memory bandwidth utilization.
References
More filters
Proceedings ArticleDOI

What is Twitter, a social network or a news media?

TL;DR: In this paper, the authors have crawled the entire Twittersphere and found a non-power-law follower distribution, a short effective diameter, and low reciprocity, which all mark a deviation from known characteristics of human social networks.
Journal ArticleDOI

A bridging model for parallel computation

TL;DR: The bulk-synchronous parallel (BSP) model is introduced as a candidate for this role, and results quantifying its efficiency both in implementing high-level language features and algorithms, as well as in being implemented in hardware.
Proceedings ArticleDOI

Pregel: a system for large-scale graph processing

TL;DR: A model for processing large graphs that has been designed for efficient, scalable and fault-tolerant implementation on clusters of thousands of commodity computers, and its implied synchronicity makes reasoning about programs easier.
Proceedings ArticleDOI

Pig latin: a not-so-foreign language for data processing

TL;DR: A new language called Pig Latin is described, designed to fit in a sweet spot between the declarative style of SQL, and the low-level, procedural style of map-reduce, which is an open-source, Apache-incubator project, and available for general use.
Journal ArticleDOI

Hive: a warehousing solution over a map-reduce framework

TL;DR: Hadoop is a popular open-source map-reduce implementation which is being used as an alternative to store and process extremely large data sets on commodity hardware.