scispace - formally typeset
H

HyeongSik Kim

Researcher at North Carolina State University

Publications -  24
Citations -  255

HyeongSik Kim is an academic researcher from North Carolina State University. The author has contributed to research in topics: SPARQL & RDF. The author has an hindex of 8, co-authored 19 publications receiving 232 citations. Previous affiliations of HyeongSik Kim include Bosch.

Papers
More filters
Book ChapterDOI

An intermediate algebra for optimizing RDF graph pattern matching on MapReduce

TL;DR: This paper proposes an approach for optimizing graph pattern matching by reinterpreting certain join tree structures as grouping operations which enables a greater degree of parallelism in join processing resulting in more "bushy" like query execution plans with fewer Map-Reduce cycles.
Journal ArticleDOI

From SPARQL to MapReduce: the journey using a nested TripleGroup algebra

TL;DR: The goal of this demonstration is to show how a system RAPID+, an extension of Apache Pig, enables more efficient SPARQL query processing on MapReduce using an alternative query algebra called the Nested TripleGroup Algebra (NTGA).
Proceedings ArticleDOI

Scan-Sharing for Optimizing RDF Graph Pattern Matching on MapReduce

TL;DR: This work presents a scan-sharing technique that is used to optimize the processing of graph patterns with repeated properties that eliminates the need for repeated scanning of input relations when properties are used repeatedly in graph patterns.
Proceedings ArticleDOI

Optimizing RDF(S) queries on cloud platforms

TL;DR: RAPID+ is an extended Apache Pig system that uses an algebraic approach for optimizing queries on RDF data models including queries involving inferencing, and can reinterpret such queries in a way that leads to more concise execution workflows and small intermediate data footprints that minimize disk I/Os and network transfer overhead.
Proceedings ArticleDOI

Efficient processing of RDF graph pattern matching on MapReduce platforms

TL;DR: It is argued that some of the challenges of scalable data processing techniques for RDF can be overcome by rethinking the operators for graph pattern processing, as well as adopting dynamic optimization techniques that exploit information from the previous execution steps to eliminate intermediate results that are irrelevant in the context of future execution steps.