scispace - formally typeset
J

Jun Woo Park

Researcher at Carnegie Mellon University

Publications -  7
Citations -  1456

Jun Woo Park is an academic researcher from Carnegie Mellon University. The author has contributed to research in topics: Scheduling (computing) & Extreme weather. The author has an hindex of 6, co-authored 7 publications receiving 1314 citations.

Papers
More filters
Proceedings ArticleDOI

Scaling distributed machine learning with the parameter server

TL;DR: In this paper, the authors propose a parameter server framework for distributed machine learning problems, where both data and workloads are distributed over worker nodes, while the server nodes maintain globally shared parameters, represented as dense or sparse vectors and matrices.
Proceedings ArticleDOI

TetriSched: global rescheduling with adaptive plan-ahead in dynamic heterogeneous clusters

TL;DR: TetriSched is a scheduler that works in tandem with a calendaring reservation system to continuously re-evaluate the immediate-term scheduling plan for all pending jobs on each scheduling cycle, and is experimentally shown to achieve significantly higher SLO attainment and cluster utilization than the best-configured YARN reservation and CapacityScheduler stack deployed on a real 256 node cluster.
Proceedings Article

On the diversity of cluster workloads and its impact on research results

TL;DR: An analysis of the private and HPC cluster traces that spans job characteristics, workload heterogeneity, resource utilization, and failure rates shows that the private cluster workloads, consisting of data analytics jobs expected to be more closely related to the Google workload, display more similarity to the HPC Cluster workloads.
Proceedings ArticleDOI

Stratus: cost-aware container scheduling in the public cloud

TL;DR: Simulation experiments based on cluster workload traces from Google and TwoSigma show that Stratus reduces cost by 17-44% compared to state-of-the-art approaches to virtual cluster scheduling.
Proceedings ArticleDOI

3Sigma: distribution-based cluster scheduling for runtime uncertainty

TL;DR: Analysis of job traces from three different large-scale cluster environments shows that, while the runtimes of many jobs can be predicted well, even state-of-the-art predictors have wide error profiles, and the performance of 3Sigma approaches the end-to-end performance of a scheduler based on a hypothetical, perfect runtime predictor.