scispace - formally typeset
A

Alexey Tumanov

Researcher at Georgia Institute of Technology

Publications -  51
Citations -  3702

Alexey Tumanov is an academic researcher from Georgia Institute of Technology. The author has contributed to research in topics: Computer science & Cloud computing. The author has an hindex of 24, co-authored 40 publications receiving 2654 citations. Previous affiliations of Alexey Tumanov include University of Toronto & Carnegie Mellon University.

Papers
More filters
Proceedings ArticleDOI

Heterogeneity and dynamicity of clouds at scale: Google trace analysis

TL;DR: Analysis of the first publicly available trace data from a sizable multi-purpose cluster finds that many longer-running jobs have relatively stable resource utilizations, which can help adaptive resource schedulers.
Proceedings ArticleDOI

Ray: a distributed framework for emerging AI applications

TL;DR: Ray as mentioned in this paper is a distributed system that implements a unified interface that can express both task-parallel and actor-based computations, supported by a single dynamic execution engine and employs a distributed scheduler and a distributed and fault-tolerant store to manage the control state.
Proceedings ArticleDOI

Morpheus: towards automated SLOs for enterprise clusters

TL;DR: Morpheus is a new system that codifies implicit user expectations as explicit Service Level Objectives (SLOs) inferred from historical data, enforces SLOs using novel scheduling techniques that isolate jobs from sharing-induced performance variability, and mitigates inherent performance variance by means of dynamic reprovisioning of jobs.
Proceedings Article

Serverless Computing: One Step Forward, Two Steps Back.

TL;DR: This paper addresses critical gaps in first-generation serverless computing, which place its autoscaling potential at odds with dominant trends in modern computing: notably data-centric and distributed computing, but also open source and custom hardware.
Proceedings ArticleDOI

TetriSched: global rescheduling with adaptive plan-ahead in dynamic heterogeneous clusters

TL;DR: TetriSched is a scheduler that works in tandem with a calendaring reservation system to continuously re-evaluate the immediate-term scheduling plan for all pending jobs on each scheduling cycle, and is experimentally shown to achieve significantly higher SLO attainment and cluster utilization than the best-configured YARN reservation and CapacityScheduler stack deployed on a real 256 node cluster.