A
Alexey Tumanov
Researcher at Georgia Institute of Technology
Publications - 51
Citations - 3702
Alexey Tumanov is an academic researcher from Georgia Institute of Technology. The author has contributed to research in topics: Computer science & Cloud computing. The author has an hindex of 24, co-authored 40 publications receiving 2654 citations. Previous affiliations of Alexey Tumanov include University of Toronto & Carnegie Mellon University.
Papers
More filters
Proceedings ArticleDOI
Heterogeneity and dynamicity of clouds at scale: Google trace analysis
TL;DR: Analysis of the first publicly available trace data from a sizable multi-purpose cluster finds that many longer-running jobs have relatively stable resource utilizations, which can help adaptive resource schedulers.
Proceedings ArticleDOI
Ray: a distributed framework for emerging AI applications
Philipp Moritz,Robert Nishihara,Stephanie Wang,Alexey Tumanov,Richard Liaw,Eric Liang,Melih Elibol,Zongheng Yang,William Paul,Michael I. Jordan,Ion Stoica +10 more
TL;DR: Ray as mentioned in this paper is a distributed system that implements a unified interface that can express both task-parallel and actor-based computations, supported by a single dynamic execution engine and employs a distributed scheduler and a distributed and fault-tolerant store to manage the control state.
Proceedings ArticleDOI
Morpheus: towards automated SLOs for enterprise clusters
Sangeetha Abdu Jyothi,Carlo Curino,Ishai Menache,Shravan Narayanamurthy,Alexey Tumanov,Jonathan Yaniv,Ruslan Mavlyutov,Íñigo Goiri,Subru Krishnan,Janardhan Kulkarni,Sriram Rao +10 more
TL;DR: Morpheus is a new system that codifies implicit user expectations as explicit Service Level Objectives (SLOs) inferred from historical data, enforces SLOs using novel scheduling techniques that isolate jobs from sharing-induced performance variability, and mitigates inherent performance variance by means of dynamic reprovisioning of jobs.
Proceedings Article
Serverless Computing: One Step Forward, Two Steps Back.
Joseph M. Hellerstein,Jose M. Faleiro,Joseph E. Gonzalez,Johann Schleier-Smith,Vikram Sreekanti,Alexey Tumanov,Chenggang Wu +6 more
TL;DR: This paper addresses critical gaps in first-generation serverless computing, which place its autoscaling potential at odds with dominant trends in modern computing: notably data-centric and distributed computing, but also open source and custom hardware.
Proceedings ArticleDOI
TetriSched: global rescheduling with adaptive plan-ahead in dynamic heterogeneous clusters
TL;DR: TetriSched is a scheduler that works in tandem with a calendaring reservation system to continuously re-evaluate the immediate-term scheduling plan for all pending jobs on each scheduling cycle, and is experimentally shown to achieve significantly higher SLO attainment and cluster utilization than the best-configured YARN reservation and CapacityScheduler stack deployed on a real 256 node cluster.