J
Jun Woo Park
Researcher at Carnegie Mellon University
Publications - 7
Citations - 1456
Jun Woo Park is an academic researcher from Carnegie Mellon University. The author has contributed to research in topics: Scheduling (computing) & Extreme weather. The author has an hindex of 6, co-authored 7 publications receiving 1314 citations.
Papers
More filters
Proceedings ArticleDOI
Scaling distributed machine learning with the parameter server
Mu Li,David G. Andersen,Jun Woo Park,Alexander J. Smola,Amr Ahmed,Vanja Josifovski,James Long,Eugene J. Shekita,Bor-Yiing Su +8 more
TL;DR: In this paper, the authors propose a parameter server framework for distributed machine learning problems, where both data and workloads are distributed over worker nodes, while the server nodes maintain globally shared parameters, represented as dense or sparse vectors and matrices.
Proceedings ArticleDOI
TetriSched: global rescheduling with adaptive plan-ahead in dynamic heterogeneous clusters
TL;DR: TetriSched is a scheduler that works in tandem with a calendaring reservation system to continuously re-evaluate the immediate-term scheduling plan for all pending jobs on each scheduling cycle, and is experimentally shown to achieve significantly higher SLO attainment and cluster utilization than the best-configured YARN reservation and CapacityScheduler stack deployed on a real 256 node cluster.
Proceedings Article
On the diversity of cluster workloads and its impact on research results
George Amvrosiadis,Jun Woo Park,Gregory R. Ganger,Garth A. Gibson,Elisabeth Baseman,Nathan DeBardeleben +5 more
TL;DR: An analysis of the private and HPC cluster traces that spans job characteristics, workload heterogeneity, resource utilization, and failure rates shows that the private cluster workloads, consisting of data analytics jobs expected to be more closely related to the Google workload, display more similarity to the HPC Cluster workloads.
Proceedings ArticleDOI
Stratus: cost-aware container scheduling in the public cloud
TL;DR: Simulation experiments based on cluster workload traces from Google and TwoSigma show that Stratus reduces cost by 17-44% compared to state-of-the-art approaches to virtual cluster scheduling.
Proceedings ArticleDOI
3Sigma: distribution-based cluster scheduling for runtime uncertainty
TL;DR: Analysis of job traces from three different large-scale cluster environments shows that, while the runtimes of many jobs can be predicted well, even state-of-the-art predictors have wide error profiles, and the performance of 3Sigma approaches the end-to-end performance of a scheduler based on a hypothetical, perfect runtime predictor.