J
Jian Tan
Researcher at Ohio State University
Publications - 121
Citations - 2392
Jian Tan is an academic researcher from Ohio State University. The author has contributed to research in topics: Scheduling (computing) & Computer science. The author has an hindex of 27, co-authored 107 publications receiving 2147 citations. Previous affiliations of Jian Tan include Columbia University & IBM.
Papers
More filters
Proceedings ArticleDOI
SparkBench: a comprehensive benchmarking suite for in memory data analytic platform Spark
TL;DR: This paper presents SparkBench, a Spark specific benchmarking suite, which includes a comprehensive set of applications, including machine learning, graph computation, SQL query and streaming applications, and evaluates the performance impact of a key configuration parameter to guide the design and optimization of Spark data analytic platform.
Journal ArticleDOI
MapTask scheduling in mapreduce with data locality: throughput and heavy-traffic optimality
TL;DR: A new queueing architecture is presented and a map task scheduling algorithm constituted by the Join the Shortest Queue policy together with the MaxWeight policy is proposed that is heavy-traffic optimal, i.e., it asymptotically minimizes the number of backlogged tasks as the arrival rate vector approaches the boundary of the capacity region.
Proceedings ArticleDOI
MRONLINE: MapReduce online performance tuning
TL;DR: This work proposes an online performance tuning system that monitors a job's execution, tunes associated performance-tuning parameters based on collected statistics, and provides fine-grained control over parameter configuration, and designs a gray-box based smart hill climbing algorithm that can efficiently converge to a near-optimal configuration with high probability.
Patent
Resource aware scheduling in a distributed computing environment
Xiaoqiao Meng,Jian Tan,Li Zhang +2 more
TL;DR: In this article, the authors present a system and methods for resource aware scheduling of processes in a distributed computing environment and present a comparison of the current reward value and the prospective reward value.
Proceedings ArticleDOI
Delay tails in MapReduce scheduling
Jian Tan,Xiaoqiao Meng,Li Zhang +2 more
TL;DR: Coupling Scheduler is designed and implemented, which gradually launches reduce tasks depending on map task progresses, and a criticality phenomenon for Fair Scheduler, the delay under which can change from regularly varying of index -a to -a+1, depending on the maximum number of reduce tasks of a job.