scispace - formally typeset
J

Jian Tan

Researcher at Ohio State University

Publications -  121
Citations -  2392

Jian Tan is an academic researcher from Ohio State University. The author has contributed to research in topics: Scheduling (computing) & Computer science. The author has an hindex of 27, co-authored 107 publications receiving 2147 citations. Previous affiliations of Jian Tan include Columbia University & IBM.

Papers
More filters
Proceedings ArticleDOI

SparkBench: a comprehensive benchmarking suite for in memory data analytic platform Spark

TL;DR: This paper presents SparkBench, a Spark specific benchmarking suite, which includes a comprehensive set of applications, including machine learning, graph computation, SQL query and streaming applications, and evaluates the performance impact of a key configuration parameter to guide the design and optimization of Spark data analytic platform.
Journal ArticleDOI

MapTask scheduling in mapreduce with data locality: throughput and heavy-traffic optimality

TL;DR: A new queueing architecture is presented and a map task scheduling algorithm constituted by the Join the Shortest Queue policy together with the MaxWeight policy is proposed that is heavy-traffic optimal, i.e., it asymptotically minimizes the number of backlogged tasks as the arrival rate vector approaches the boundary of the capacity region.
Proceedings ArticleDOI

MRONLINE: MapReduce online performance tuning

TL;DR: This work proposes an online performance tuning system that monitors a job's execution, tunes associated performance-tuning parameters based on collected statistics, and provides fine-grained control over parameter configuration, and designs a gray-box based smart hill climbing algorithm that can efficiently converge to a near-optimal configuration with high probability.
Patent

Resource aware scheduling in a distributed computing environment

TL;DR: In this article, the authors present a system and methods for resource aware scheduling of processes in a distributed computing environment and present a comparison of the current reward value and the prospective reward value.
Proceedings ArticleDOI

Delay tails in MapReduce scheduling

TL;DR: Coupling Scheduler is designed and implemented, which gradually launches reduce tasks depending on map task progresses, and a criticality phenomenon for Fair Scheduler, the delay under which can change from regularly varying of index -a to -a+1, depending on the maximum number of reduce tasks of a job.